Re: [netcdfgroup] nf90_char size

One reason to use netcdf over HDF5 is the fact that the
netcdf API is much simpler than HDF5. The HDF5 API
is some 6 times larger than the netcdf API.

On 5/3/2020 1:42 AM, Davide Sangalli wrote:
Thanks again.

I'll have a look to pnetcdf.
Another reason why we moved towards HDF5 was that, according to what I know, they could be able to exploit different levels of memory hierarchy in HPC simulations. Could pnetcdf do that as well ?

Besides that I'd really like some hints. Why could netcdf better than HDF5, or viceversa. Please do your worst.

For the NF90_unlimited, we are already using it in time dependent simulations in a way similar to the one you suggest. For the present case instead I'm just filling a huge complex matrix. So the interruption usually happens because there are limits on the simulation time. I'd really need to check which elements were filled and which were not without having any clue on the status.

Since you mentioned it. I'm very interested in the storage of sparse matrices. My huge matrix is indeed quite sparse. How does that work ?


Best,
D



On Sun, May 3, 2020 at 12:48 AM +0200, "Wei-Keng Liao" <wkliao@xxxxxxxxxxxxxxxx <mailto:wkliao@xxxxxxxxxxxxxxxx>> wrote:

    Hi, Dave

    Thanks for following up with the correct information about the dimension 
objects.
    I admit that I am not familiar with the NetCDF4 dimension representation in 
HDF5.

    Wei-keng

    > On May 2, 2020, at 5:28 PM, Dave Allured - NOAA Affiliate wrote: > > 
Wei-king, thanks for the info on the latest release.
    Minor detail, I found that hidden dimension scales are still
    stored as arrays, but the arrays are left unpopulated. HDF5 stores
    these as sparse, which means no wasted space in arrays that are
    never written. > > For Davide, I concur with Wei-king that
    netcdf-C 4.7.4 is okay for your purpose, and should not store
    wasted space. Version 4.7.3 behaves the same as 4.7.4. > > I
    wonder when they changed that, some time between your 4.4.1.1 and
    4.7.3. Also you used HDF5 1.8.18, I used 1.10.5. That should not
    make any difference here, but perhaps it does. > > > On Sat, May
    2, 2020 at 1:01 PM Wei-Keng Liao wrote: > > If you used the latest
    NetCDF 4.7.4, the dimensions will be stored as scalars. > >
    Wei-keng > > > On May 2, 2020, at 1:42 PM, Davide Sangalli wrote:
    > > > > Yeah, but BS_K_linearized1 is just a dimension, how can it
    be 8 GB big ? > > Same for BS_K_linearized2, how can it be 3 GB
    big ? > > These two are just two numbers > > BS_K_linearized1 =
    2,025,000,000 > > (it was chosen has a maximum variable size in my
    code to avoid overflowing the maximum allowed integer in standard
    precision) > > BS_K_linearized2 = 781,887,360 > > > > D. > > > >
    On 02/05/20 19:06, Wei-Keng Liao wrote: > >> The dump information
    shows there are actually 8 datasets in the file. > >> Below is the
    start offsets, sizes, and end offsets of individual datasets. > >>
    There is not much padding space in between the datasets. > >>
    According to this, your file is expected to be of size 16 GB. > >>
    > >> dataset name start offset size end offset > >>
    BS_K_linearized1 2,379 8,100,000,000 8,100,002,379 > >>
    BSE_RESONANT_COMPRESSED1_DONE 8,100,002,379 2,025,000,000
    10,125,002,379 > >> BSE_RESONANT_COMPRESSED2_DONE 10,125,006,475
    2,025,000,000 12,150,006,475 > >> BS_K_linearized2 12,150,006,475
    3,127,549,440 15,277,555,915 > >> BSE_RESONANT_COMPRESSED3_DONE
    15,277,557,963 781,887,360 16,059,445,323 > >> complex
    16,059,447,371 8 16,059,447,379 > >> BS_K_compressed1
    16,059,447,379 99,107,168 16,158,554,547 > >>
    BSE_RESONANT_COMPRESSED1 16,158,554,547 198,214,336 16,356,768,883
    > >> > >> Wei-keng > >> > >>> On May 2, 2020, at 11:28 AM, Davide
    Sangalli wrote: > >>> > >>> h5dump -Hp ndb.BS_COMPRESS0.005000_Q1 > >


_______________________________________________
NOTE: All exchanges posted to Unidata maintained email lists are
recorded in the Unidata inquiry tracking system and made publicly
available through the web.  Users who post to any of the lists we
maintain are reminded to remove any personal information that they
do not want to be made public.


netcdfgroup mailing list
netcdfgroup@xxxxxxxxxxxxxxxx
For list information or to unsubscribe,  visit: 
https://www.unidata.ucar.edu/mailing_lists/


  • 2020 messages navigation, sorted by:
    1. Thread
    2. Subject
    3. Author
    4. Date
    5. ↑ Table Of Contents
  • Search the netcdfgroup archives: