Re: [netcdf-java] ucar.nc2.FileWriter bug fix: copySome actually copies everything at once

Hello all,

So, I think I figured out what the original copySome() is actually doing. Apparently, nelems is the maximum dim0 extent that will be used in the shapes of the Arrays that are read and written; the extents of the other dimensions are always maxed out. That's, um, interesting. In that light, Robert Bridle's code attempts to write chunks of N elements, where N is the largest integer multiple of dim0's stride whose byte size still fits within maxSize. Unfortunately, N will be 0 for sufficiently large dim0 strides.
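
To make that concrete, here is what the old semantics imply (the variable shape below is hypothetical):

    // Hypothetical: oldVar has shape {10, 100, 200} and copySome() is called
    // with nelems = 3. Under the original semantics, the reads/writes are:
    //   origin {0, 0, 0}, shape {3, 100, 200}  -> 60,000 elements
    //   origin {3, 0, 0}, shape {3, 100, 200}  -> 60,000 elements
    //   origin {6, 0, 0}, shape {3, 100, 200}  -> 60,000 elements
    //   origin {9, 0, 0}, shape {1, 100, 200}  -> 20,000 elements
    // Only dim0 is ever chunked; dims 1 and 2 are always read in full.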

I mention this because I use nelems differently in my copySome() implementation: it is simply the maximum number of elements to stuff into each chunk. The sizes of the chunks *in bytes* will be no larger than nelems*oldVar.getElementSize().
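
Under those semantics, the natural way for the caller to pick nelems is from a byte budget. A minimal sketch of how copyVarData() could do it (this exact computation is my assumption, not necessarily what the attached patch does; maxSize is the existing 1 MB limit):

    // Assumption: cap each chunk at roughly maxSize bytes by dividing the
    // byte budget by the element size before calling copySome().
    int nelems = (int) (maxSize / oldVar.getElementSize());
    copySome(ncfile, oldVar, nelems);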

Regards,
Christian Ward-Garrison


On 4/9/2010 11:12 PM, Christian Ward-Garrison wrote:
Hello all,

In NJ 4.2.20100409.0054 the method ucar.nc2.FileWriter.copySome() is supposed to copy data for a large variable in a series of small chunks. As written, however, it actually attempts to copy everything at once. (Here's hoping that Thunderbird preserves the whitespace in my preformatted code samples, or else this post will be very tough to follow.)

private static void copySome(NetcdfFileWriteable ncfile, Variable oldVar, int nelems) throws IOException {
    String newName = N3iosp.makeValidNetcdfObjectName(oldVar.getName());

    int[] shape = oldVar.getShape();
    int[] origin = new int[oldVar.getRank()];
    int size = shape[0];

    for (int i = 0; i < size; i += nelems) {
      origin[0] = i;
      int left = size - i;
      shape[0] = Math.min(nelems, left);

      Array data;
      try {
        data = oldVar.read(origin, shape);
    ...


I'm not exactly sure what the intended logic was, but note how nelems is computed in copyVarData() (shown below): it is the variable's total size in bytes divided by maxSize, so it grows with the variable. For any sufficiently large variable, nelems >= shape[0], which means Math.min(nelems, left) never shrinks shape[0]. In the first iteration of the loop, then, origin is all zeroes (e.g. {0, 0, 0} if the rank of oldVar is 3) and shape is identical to oldVar's shape, so the code attempts to read all of oldVar's data at once, and an OutOfMemoryError results if oldVar is large.
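
A quick trace with a hypothetical variable makes the failure concrete:

    // Hypothetical: a float variable of shape {10, 1000, 1000} occupies
    // 10 * 1000 * 1000 * 4 bytes = 40 MB. With maxSize = 1 MB:
    //   nelems = 40,000,000 / 1,000,000 = 40
    // First iteration of the loop in copySome():
    //   origin   = {0, 0, 0}
    //   shape[0] = Math.min(40, 10) = 10    // unchanged: the full extent
    // so oldVar.read(origin, shape) tries to materialize all 40 MB at once.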

On 03/29/2010, Robert Bridle proposed some new code for FileWriter.copyVarData() (which calls copySome()) that would split the write job into chunks:

///////////// ORIGINAL CODE //////////////////////
      int nelems = (int) (size / maxSize);
      if (nelems <= 1)
        copyAll(ncfile, oldVar);
      else
        copySome(ncfile, oldVar, nelems);
////////////////////////////////////////////////////

////////////// PROPOSED CODE //////////////////////
/*    if (size > maxSize)
      {
        int[] shape = oldVar.getShape();

        // determine the size of all the dimensions, other than the first.
        long sizeOfOtherDimensions = 1;
        for (int i = 1; i < shape.length; i++) {
          if (shape[i] >= 0)
            sizeOfOtherDimensions *= shape[i];
        }

        // determine number of bytes in all the dimensions, other than the first.
        long bytesInOtherDimensions = sizeOfOtherDimensions * oldVar.getElementSize();

        // first dimension chunk-size that will fit within maxSize of memory.
        int firstDimensionChunkSize = (int) (maxSize / bytesInOtherDimensions);
        //System.out.println("We can fit: " + firstDimensionChunkSize + " chunks in: " + maxSize + " bytes of memory.");

        copySome(ncfile, oldVar, firstDimensionChunkSize);
      }
      else
      {
        copyAll(ncfile, oldVar);
      }    */
////////////////////////////////////////////////////


This splits the copy along the outermost dimension, writing firstDimensionChunkSize slices of dim0 at a time. But what about when the stride of the outer dimension is very large? For example, there's a variable from a massive aggregated dataset I'm working with that has the CDL:

   float pr(ensemble=8, time=1560, lat=128, lon=256);


which means an outer-most dimension stride of 1560*128*256 = 51,118,080. Using 32-bit floats, that would require 195 MB to store--quite a bit larger than the maxSize of 1 MB.
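
Plugging that variable into the proposed code makes the problem explicit (the arithmetic below just restates the code above):

    // For float pr(ensemble=8, time=1560, lat=128, lon=256):
    long sizeOfOtherDimensions = 1560L * 128 * 256;            // 51,118,080
    long bytesInOtherDimensions = sizeOfOtherDimensions * 4;   // ~195 MB per dim0 slice
    int firstDimensionChunkSize = (int) (maxSize / bytesInOtherDimensions);  // == 0 for maxSize = 1 MB

copySome() would then be invoked with nelems == 0, and its loop increment (i += nelems) would never advance past the first iteration.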

So, I propose a different algorithm:

    /**
     * An index that computes chunk shapes. It is intended to be used to compute the origins and shapes for a series
     * of contiguous writes to a multidimensional array.
     */
    public static class ChunkingIndex extends Index {
        public ChunkingIndex(int[] shape) {
            super(shape);
        }

        /**
         * Computes the shape of the largest possible <b>contiguous</b> chunk, starting at {@link #getCurrentCounter()}
         * and with {@code size <= maxChunkSize}.
         *
         * @param maxChunkSize the maximum size of the chunk shape. The actual size of the shape returned is likely
         *                     to be different, and can be found with {@link Index#computeSize}.
         * @return the shape of the largest possible contiguous chunk.
         */
        public int[] computeChunkShape(int maxChunkSize) {
            int[] chunkShape = new int[rank];

            for (int iDim = 0; iDim < rank; ++iDim) {
                chunkShape[iDim] = maxChunkSize / stride[iDim];
                chunkShape[iDim] = (chunkShape[iDim] == 0) ? 1 : chunkShape[iDim];
                chunkShape[iDim] = Math.min(chunkShape[iDim], shape[iDim] - current[iDim]);
            }

            return chunkShape;
        }
    }

    private static void copySome(NetcdfFileWriteable ncfile, Variable oldVar, int nelems) throws IOException {
        String newName = N3iosp.makeValidNetcdfObjectName(oldVar.getName());

        ChunkingIndex index = new ChunkingIndex(oldVar.getShape());
        while (index.currentElement() < index.getSize()) {
            try {
                int[] chunkOrigin = index.getCurrentCounter();
                int[] chunkShape  = index.computeChunkShape(nelems);
                Array data = oldVar.read(chunkOrigin, chunkShape);

                if (oldVar.getDataType() == DataType.STRING) {
                    data = convertToChar(ncfile.findVariable(newName), data);
                }

                if (data.getSize() > 0) { // zero when record dimension = 0
                    ncfile.write(newName, chunkOrigin, data);
                    if (debugWrite) {
                        System.out.println("write " + data.getSize() + " bytes");
                    }
                }

                index.setCurrentCounter(index.currentElement() + (int) Index.computeSize(chunkShape));
            } catch (InvalidRangeException e) {
                e.printStackTrace();
                throw new IOException(e.getMessage());
            }
        }
    }


This will result in chunks that are *never* larger than nelems elements, regardless of oldVar's size or shape. For example, if oldVar.getShape() == { 5, 16, 8 } and nelems = 100, the origins and shapes of the chunk read/writes will be:

     origin      shape       size
r/w: [0, 0, 0] , [1, 12, 8], 96
r/w: [0, 12, 0], [1, 4, 8] , 32
r/w: [1, 0, 0] , [1, 12, 8], 96
r/w: [1, 12, 0], [1, 4, 8] , 32
r/w: [2, 0, 0] , [1, 12, 8], 96
r/w: [2, 12, 0], [1, 4, 8] , 32
r/w: [3, 0, 0] , [1, 12, 8], 96
r/w: [3, 12, 0], [1, 4, 8] , 32
r/w: [4, 0, 0] , [1, 12, 8], 96
r/w: [4, 12, 0], [1, 4, 8] , 32


As you can see, none of the chunks is actually 100 elements in size, but given the constraints of the Netcdf API, I don't think it can be helped. We'd need to be able to read and write 1D Arrays of values from/to a specific offset in the 1D backing array.
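
If you want to regenerate that listing, a little driver like the following works (a sketch; printChunks is a name I made up, and it walks the index exactly the way the new copySome() does):

    // Sketch: print the chunk origins/shapes that ChunkingIndex produces
    // for a given variable shape and element budget.
    public static void printChunks(int[] varShape, int nelems) {
        ChunkingIndex index = new ChunkingIndex(varShape);
        while (index.currentElement() < index.getSize()) {
            int[] chunkOrigin = index.getCurrentCounter();
            int[] chunkShape = index.computeChunkShape(nelems);
            System.out.println("r/w: " + java.util.Arrays.toString(chunkOrigin) + ", "
                + java.util.Arrays.toString(chunkShape) + ", " + Index.computeSize(chunkShape));
            index.setCurrentCounter(index.currentElement() + (int) Index.computeSize(chunkShape));
        }
    }

Calling printChunks(new int[] {5, 16, 8}, 100) prints the ten chunks shown above.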

If you're interested, I've attached a patch containing the changes.

Regards,
Christian Ward-Garrison


_______________________________________________
netcdf-java mailing list
netcdf-java@xxxxxxxxxxxxxxxx
For list information or to unsubscribe, visit: 
http://www.unidata.ucar.edu/mailing_lists/