BTW, the default buffer size in Structure.Iterator is 500K, which gives better
performance (I think) when the structure is actually a "psuedo-structure", that
is, stored by column rather than row.
I could try modifying the default buffer size of RAF, that might improve
performance on the server.
The trick is to have a wide range of queries to optimize over, and not just get
one case right at the expense of the whole.
Do you want to put together an interesting "load" on the server, maybe based on
a bunch of IDV tests or something?
Jeff McWhirter wrote:
What experience do you all have with the performance characteristics of
with varying bufferesizes. I have found with the point obs that it
varies dramatically for remote
data sets. e.g, getting a data iterator with BUFFERSIZE using:
and reading this data set (~90000 obs,32 values)
buffer size:4096 Total time:41882
buffer size:8192 Total time:30348
buffer size:16384 Total time:25124
buffer size:32768 Total time:23919
buffer size:65536 Total time:26789
buffer size:131072 Total time:27540
It seems as though 32768 is a sweet spot. Any ideas why the higher
buffer sizes give worse performance?
Are you allocating the buffers repeatedly? (thus triggering worse GC
On the other hand reading a local file:
which has 23000 obs and 180 vars I get the exact opposite:
buffer size:4096 Total time:8557
buffer size:8192 Total time:8534
buffer size:16384 Total time:8307
buffer size:32768 Total time:11004
buffer size:65536 Total time:9247
buffer size:131072 Total time:9010
buffer size:262144 Total time:9032
To unsubscribe thredds-dev, visit: