<div class="socmaildefaultfont" dir="ltr" style="font-family:Arial, Helvetica, sans-serif;font-size:10pt" ><div dir="ltr" ><div><p>On behalf of my team, I’m happy to announce the first release of HDF5-UDF: user-defined functions for HDF5 (and NetCDF4, by extension). The project enables the embedding of Lua scripts in HDF5 files so that users can programmatically define a dataset whose data is generated on the fly each time that dataset is read.</p> <p>The primary motivation for this project is to dramatically reduce the disk space used by datasets that are a variation of existing data. We have successfully used HDF5-UDF to virtually eliminate the storage cost of derived data in a number of use cases; grids that used to take a few gigabytes on disk, uncompressed, now require just a couple of kilobytes.</p> <p>Underneath, the Lua source code is converted to a bytecode representation that LuaJIT executes when the application reads the dataset. Thanks to just-in-time compilation, the overhead of virtualization is barely noticeable: outputting grids that have no dependency on existing datasets can be an order of magnitude faster than reaching out to disk for I/O.</p> <div>HDF5-UDF is shipped as an HDF5 filter, which means that applications that read from NetCDF4 files need no modifications.</div> <div> </div> <div>The project is hosted at <a href="https://github.com/lucasvr/hdf5-udf" >https://github.com/lucasvr/hdf5-udf</a></div> <div> </div> <div>We invite everyone to try it out and to open pull requests. We hope you find it as useful as we do.</div> <p>Thanks,<br>Lucas</p></div></div></div><BR>
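The idea of a dataset whose values are generated each time it is read can be illustrated with a small, purely conceptual Python sketch. It does not use HDF5, Lua, or the HDF5-UDF API; `ToyFile`, `define`, `elevation_cm`, and the sample values are all hypothetical names chosen for illustration. The point is only that a derived grid can be kept as a function of existing data rather than as stored values, while readers use the same call for both kinds of dataset.

```python
from typing import Callable, Dict, List

Grid = List[List[float]]


class ToyFile:
    """Hypothetical in-memory container; not the HDF5 or HDF5-UDF API."""

    def __init__(self) -> None:
        self._stored: Dict[str, Grid] = {}
        self._computed: Dict[str, Callable[["ToyFile"], Grid]] = {}

    def write(self, name: str, grid: Grid) -> None:
        """Store a grid's values, as a regular dataset would."""
        self._stored[name] = grid

    def define(self, name: str, func: Callable[["ToyFile"], Grid]) -> None:
        """Register a dataset by its generating function; no values are stored."""
        self._computed[name] = func

    def read(self, name: str) -> Grid:
        """Readers use the same call for stored and computed datasets."""
        if name in self._computed:
            return self._computed[name](self)  # evaluated on every read
        return self._stored[name]

    def stored_value_count(self) -> int:
        """Number of values actually held in storage."""
        return sum(len(row) for grid in self._stored.values() for row in grid)


def elevation_cm(f: ToyFile) -> Grid:
    """Derived grid: the stored elevation in metres, converted to centimetres."""
    return [[v * 100.0 for v in row] for row in f.read("elevation_m")]


f = ToyFile()
f.write("elevation_m", [[1.5, 2.25], [3.0, 4.5]])
f.define("elevation_cm", elevation_cm)

print(f.read("elevation_cm"))    # derived values, computed at read time
print(f.stored_value_count())    # only the base grid occupies storage
```

In this sketch the derived grid costs no storage beyond its definition, which mirrors the disk-space motivation described in the announcement; the actual project achieves this inside the HDF5 filter pipeline rather than in application code.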