Re: netcdf4 parallel IO

To: netcdf-hdf@xxxxxxxxxxxxxxxx, netcdf-hdf@xxxxxxxxxxxxxxxx
Subject: Re: netcdf4 parallel IO
From: MuQun Yang <ymuqun@xxxxxxxxxxxx>
Date: Thu, 26 Apr 2007 11:06:51 -0500

Hi David,

NetCDF4 is using parallel HDF5 underneath to do parallel IO. As faras I know, I don't think HDF5 is doing anything like blocking in itslayer. Other experts, please correct me if I am wrong.

If seems that your application fits to what we called irregularselection of data. Parallel HDF5 does support irregular selection ofdata via collective IO.However, the current netcdf-4 API may not support this yet. Oneworkaround solution is see whether you can combine your cells intoseveral regular selections and then use NF90_PUT_VAR. From yourexample, you can put (1 2) of cell 1 and (3 4) of cell 2 into oneregular selection and call NF90_PUT_VAR.



Kent Yang

The HDF Group

At 10:39 AM 4/26/2007, David Stuebe wrote:

Hi NETCDF folks
I work on an unstructured finite volume coastal ocean model, FVCOM,which is parallel (using MPICH2). The Read Write is a major slowdown for our large cases. On our cluster, we have one large storagedevice, an emc raid array. The network is infini-band - the networkis much faster than the raid array.
For our model we need to read large initial condition data sets, andsingle frames of forcing data while running. We also need to writesingle frames of data for output (frequently), and large restartfiles (less frequently).
I am considering two options for recoding the IO from the model. Oneis based around the future F90 netcdf 4 parallel interface whichwould allow a symmetric code- every processor does the same thing.The other option is to use netcdf 3, let the master processorread/write the data and distribute it to each node, -an asymmetric coding.
What I need to know-  are netcdf 4 parallel IO operations blocking?
The problem - the order of cells and nodes in our data set does notallow for a simple start, count read format. A data array might havedimensions (time,layers,cells). As an example, in a 2 processorcase with 8 cells, proc1 has cells(1 2 5 7) while proc2 has cells (34 6 8) - write operations would have to be in a do loop to writeeach cell individually from the processor that owns it.
For a model with 300,000 cells on 30 processors, this would be10,000 calls to NF90_PUT_VAR on each processor. Even if the callsare non-blocking this seems dangerous.
Any thoughts?

David


==============================================================================
To unsubscribe netcdf-hdf, visit:
http://www.unidata.ucar.edu/mailing-list-delete-form.html
==============================================================================

References:
- netcdf4 parallel IO
  - From: David Stuebe