GALEON Interoperability Experiment Progress Report

DataFed access to THREDDS (NetCDF) Datasets through WCS Protocol  

Washington University, CAPITA, Dec 6, 2005, rhusar@me.wustl.edu

 

Summary

This is a progress report on THREDDS/WCS-DataFed interoperability. The DataFed client can access the WCS datasets from the U. Florence and the UNIDATA THREDDS WCS test servers.

U. Florence WCS server: sst(time-lat-lon) dataset

http://athena.pin.unifi.it:8080/galeon/WCS-v1.0?version=1.0.0&service=WCS&request=GetCapabilities

UNIDATA WCS server: THREDDS_CDM dataset

http://motherlode.ucar.edu:8080/thredds/wcs/global/testdata/GFS_Global_0p5deg_20051128_1800.grib2?request=GetCapabilities&version=1.0.0

Spatial queries to both servers return the expected spatial grids. Within DataFed, the WCS (NetCDF) data received from both servers is easily transformed into data views and used in distributed web applications by chaining appropriate web services.

Our enthusiasm is tempered by the fact that substantial hurdles remain before we reach our desired level of interoperability:

 - the temporal aspects of the WCS queries need more work on both the server and the client side

 - we have not yet tested 3-4D datasets with Elevation and Forecast time dimensions with the WCS protocol (e.g. ETA model)

 - automatic (loosely coupled) registration of WCS services into the DataFed catalog needs work (using ncML-GML?)

 - we are in need of geo-re-projection services (preferably web services) to make many of the grids usable for us

We recognize that many of the current inadequacies are due to our own limited understanding of the current state of affairs. We are looking forward to pursuing these and related issues as part of this IE or through alternative venues.

Background

This progress report contains the findings on interoperability between the GALEON WCS test servers and the DataFed WCS client. The GALEON Interoperability Experiment (Geo-interface to Atmosphere, Land, Earth, Ocean netCDF) aims to aid the data flow between the GIS and Earth Science communities. It is a joint initiative between UNIDATA and OGC and involves an international group of collaborating institutions.

This brief report was prepared by the Center for Air Pollution Impact and Trend Analysis (CAPITA) group:

Stefan Falke, who pursued the general WMS/WFS/WCS client-server connectivity;

Kari Koijarvi, the programmer/implementer of the WCS client services; and

Rudy Husar, whose primary interest is in the access and applications of the THREDDS datasets for air quality analysis.

We see this IE as an opportunity to advance air quality research by accessing the rich virtual data holdings brokered through THREDDS. At the same time, this IE can advance the interoperability of Earth Science data systems, which is a specific goal of our NSF ITR grant (Final), the 2004-09 NASA REASoN grant and related projects. Finally, on a more personal level, we are eager to interact with a group with whom we share a belief... ...

DataFed as a WCS Client and Server

From the point of view of the GALEON IE, DataFed is a client of Earth Science datasets presented through the THREDDS delivery system using the OGC Web Coverage Service (WCS) protocol. DataFed itself is an application framework for accessing, processing and browsing distributed datasets using loosely coupled web service components. Within DataFed, the interface to distributed data is through data wrappers, which deal with the dataset-specific physical data access. The subsequent data access services use the wrappers and

(1) impose a semantically homogeneous global multi-dimensional data model (e.g. lat-lon-datetime-elevation) and 

(2) turn all data into strongly typed SOAP services that can be loosely coupled to downstream services.
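To make the idea of a dataset-independent data model more concrete, here is a minimal Python sketch. The `CubeQuery` type and the `sst_wrapper` function are hypothetical names invented for illustration; they are not DataFed's actual interfaces. The sketch only assumes that a wrapper translates a common lat-lon-datetime-elevation query into the parameters of one specific source, here the WCS TIME and BBOX parameters used for the Florence sst coverage.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Optional, Tuple


@dataclass(frozen=True)
class CubeQuery:
    """Hypothetical dataset-independent query over lat-lon-datetime-elevation."""
    lon_range: Tuple[float, float]
    lat_range: Tuple[float, float]
    time_range: Tuple[datetime, datetime]
    elev_range: Optional[Tuple[float, float]] = None  # None if the dataset has no elevation

    def __post_init__(self):
        # Every range must run from begin to end; a point is begin == end.
        for name in ("lon_range", "lat_range", "time_range", "elev_range"):
            rng = getattr(self, name)
            if rng is not None and rng[0] > rng[1]:
                raise ValueError(f"{name}: begin must not exceed end")


def sst_wrapper(query: CubeQuery) -> dict:
    """Hypothetical wrapper: map the common query onto WCS parameters for one coverage."""
    fmt = "%Y-%m-%dT%H:%M:%SZ"
    t_begin, t_end = query.time_range
    return {
        "COVERAGE": "sst(time-lat-lon)",
        "TIME": f"{t_begin.strftime(fmt)},{t_end.strftime(fmt)}",
        # BBOX order is min lon, min lat, max lon, max lat.
        "BBOX": "{:g},{:g},{:g},{:g}".format(
            query.lon_range[0], query.lat_range[0],
            query.lon_range[1], query.lat_range[1],
        ),
    }
```

Downstream services would then see only `CubeQuery`-style requests, while each wrapper hides the source-specific details.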

 

Along the processing chain, intermediate data can be extracted, transformed to other formats and delivered as services (SOAP, WCS, WMS, other forms of REST, etc.) to other consumers. Ideally, a subset of the air quality-related datasets mediated through DataFed could/should(?) be registered in and accessible through the THREDDS Catalog (Ideas?).

In the DataFed framework, web-applications are created in two stages: 

1. First, specific data views (e.g. MapView, TimeView) are created by chaining web services (e.g. DataAccess, Process, Render, Overlay, Annotate, etc.).

2. The views are then embedded in a web page which provides the frame, controllers and the glue code that links controllers to the chained services.
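The two stages above can be illustrated with a small Python sketch. It assumes that each service takes a state dictionary and returns an extended one, so that a view is simply the composition of services. The functions `data_access`, `process` and `render` are local stand-ins with placeholder data, not the real DataFed SOAP services.

```python
from functools import reduce
from typing import Any, Callable, Dict

State = Dict[str, Any]
Service = Callable[[State], State]


def chain(*services: Service) -> Service:
    """Compose services left to right into a single view function."""
    def view(state: State) -> State:
        return reduce(lambda acc, svc: svc(acc), services, state)
    return view


def data_access(state: State) -> State:
    # Stand-in for a wrapped data source; returns a tiny fixed 2x2 grid.
    return {**state, "grid": [[1.0, 2.0], [3.0, 4.0]]}


def process(state: State) -> State:
    # Stand-in processing step: scale every cell by the 'scale' setting.
    k = state.get("scale", 1.0)
    return {**state, "grid": [[k * v for v in row] for row in state["grid"]]}


def render(state: State) -> State:
    # Stand-in renderer: a text "image" with one line per grid row.
    lines = [" ".join(f"{v:.1f}" for v in row) for row in state["grid"]]
    return {**state, "image": "\n".join(lines)}


# Stage 1: a view is a chain of services.
map_view = chain(data_access, process, render)

# Stage 2: page controllers supply the settings and re-run the view on change.
result = map_view({"scale": 2.0})
```

In this sketch, a controller change amounts to calling `map_view` again with an updated settings dictionary.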

The main web app of DataFed is the generic Viewer of spatio-temporal datasets. Since all datasets registered in DataFed follow a global multidimensional data model, the same viewer can be used for all the federated datasets in DataFed. Further on DataFed: General Description; Mediation and Architecture.

       

UFlorence Server WCS Data Access, Processing and Viewing through DataFed

The easiest way to explain the WCS data consumption is by showing the two datasets through the Data Viewer:

http://webapps.datafed.net/dvoy_services/datafed.aspx?dataset_abbr=SeaTemp_OL

The data navigation is by clicking on the desired grid location or time. 

 

Map View: http://athena.pin.unifi.it:8080/galeon/WCS-v1.0?REQUEST=GetCoverage&VERSION=1.0.0&TIME=2001-01-01T00:00:00Z,2001-01-01T00:00:00Z&SERVICE=WCS&COVERAGE=sst(time-lat-lon)&RESPONSE_CRS=EPSG&CRS=WGS84(DD)&FORMAT=netCDF&BBOX=0,30,40,60

Time View: http://athena.pin.unifi.it:8080/galeon/WCS-v1.0?REQUEST=GetCoverage&VERSION=1.0.0&TIME=2001-08-01T00:00:00Z,2002-03-01T00:00:00Z&SERVICE=WCS&COVERAGE=sst(time-lat-lon)&RESPONSE_CRS=EPSG&CRS=WGS84(DD)&FORMAT=netCDF&BBOX=340,60,340,60

Sub-cube: http://athena.pin.unifi.it:8080/galeon/WCS-v1.0?REQUEST=GetCoverage&VERSION=1.0.0&TIME=2001-08-01T00:00:00Z,2002-03-01T00:00:00Z&SERVICE=WCS&COVERAGE=sst(time-lat-lon)&RESPONSE_CRS=EPSG&CRS=WGS84(DD)&FORMAT=netCDF&BBOX=0,30,40,60

Note that the form of the queries for the map and time views is identical. The difference is in the shape of the requested data cubes:

Map View: TIME=2001-01-01T00:00:00Z,2001-01-01T00:00:00Z ... BBOX=0,30,40,60. The time range is a point (beginPosition and endPosition are the same); the spatial range is set in BBOX.

Time View: TIME=2001-08-01T00:00:00Z,2002-03-01T00:00:00Z ... BBOX=340,60,340,60. Time is a range given by beginPosition and endPosition; the BBOX spatial range is a point.

Sub-cube: TIME=2001-08-01T00:00:00Z,2002-03-01T00:00:00Z ... BBOX=0,30,40,60. Both time and space are ranges.

So, as long as the WCS servers recognize 'point' queries, time-lat-lon datacubes can be meaningfully queried. We appreciate that the Florence server responds like that.
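The distinction between the three query shapes can be expressed as a short Python sketch. The function name `cube_shape` is our own illustration, not part of DataFed or of either server; it only inspects the TIME and BBOX parameters of a GetCoverage URL and reports whether each one collapses to a point.

```python
from urllib.parse import urlsplit, parse_qs


def cube_shape(url: str) -> str:
    """Classify a GetCoverage request as 'map', 'time', 'sub-cube' or 'point'."""
    # Parameter names are case-insensitive in practice, so normalise them.
    params = {k.upper(): v[0] for k, v in parse_qs(urlsplit(url).query).items()}

    # TIME is "begin,end"; a single value is treated as begin == end.
    times = params["TIME"].split(",")
    time_is_point = times[0] == times[-1]

    # BBOX is "min_x,min_y,max_x,max_y".
    min_x, min_y, max_x, max_y = (float(v) for v in params["BBOX"].split(","))
    space_is_point = (min_x, min_y) == (max_x, max_y)

    if time_is_point and space_is_point:
        return "point"
    if time_is_point:
        return "map"        # one time step, spatial extent
    if space_is_point:
        return "time"       # one location, time extent
    return "sub-cube"       # extent in both time and space
```

Applied to the three Florence queries listed above, the function would report "map", "time" and "sub-cube" respectively.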

THREDDS Server WCS Data Access, Processing and Viewing through DataFed

The THREDDS WCS server data can be viewed through the generic DataFed browser at:

http://webapps.datafed.net/dvoy_services/datafed.aspx?dataset_abbr=THREDDS_CDM

Example WCS GetCoverage query to the THREDDS server for the map view:

http://motherlode.ucar.edu:8080/thredds/wcs/global/testdata/GFS_Global_0p5deg_20051130_1200.grib2?service=WCS&version=1.0.0&request=GetCoverage&Coverage=Temperature_surface&time=2005-11-30T12:00:00Z&format=NetCDF3&bbox=0,-90,359,90

This returns a NetCDF file, which is translated to GML and rendered as a shaded grid.

The THREDDS server does not respond to WCS queries with a time range or a point BBOX, hence we could not create the Time View. Time navigation is still possible by changing the DateTime and clicking the GO button. Also, note that the list of parameters (variables) is derived from the GetCapabilities and DescribeCoverage queries (but not yet automatically). The multiple data layers in THREDDS_CDM can be browsed through the Analysts Console web application. The app renders a user-selectable set of 'Views' and facilitates time and spatial zoom navigation, image size and other settings.
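Deriving the parameter list automatically could start from something like the following Python sketch. It assumes the WCS 1.0.0 capabilities layout, in which each offered coverage appears as a CoverageOfferingBrief element with a name child in the http://www.opengis.net/wcs namespace. The `SAMPLE` document is a hand-written, abbreviated illustration, not an actual response from the THREDDS server, and the second coverage name in it is an assumption.

```python
import xml.etree.ElementTree as ET

# Namespace used by WCS 1.0.0 capabilities documents.
WCS_NS = {"wcs": "http://www.opengis.net/wcs"}

# Hand-written, abbreviated sample for illustration only (not a real server response).
SAMPLE = """
<WCS_Capabilities xmlns="http://www.opengis.net/wcs" version="1.0.0">
  <ContentMetadata>
    <CoverageOfferingBrief>
      <name>Temperature_surface</name>
      <label>Temperature at surface</label>
    </CoverageOfferingBrief>
    <CoverageOfferingBrief>
      <name>Total_ozone</name>
      <label>Total ozone</label>
    </CoverageOfferingBrief>
  </ContentMetadata>
</WCS_Capabilities>
"""


def coverage_names(capabilities_xml: str) -> list:
    """Return the coverage names listed in a WCS 1.0.0 capabilities document."""
    root = ET.fromstring(capabilities_xml.strip())
    path = ".//wcs:CoverageOfferingBrief/wcs:name"
    return [el.text.strip() for el in root.iterfind(path, WCS_NS) if el.text]


names = coverage_names(SAMPLE)
```

Each returned name could then be used as the Coverage parameter of a GetCoverage request, or as the argument of a DescribeCoverage request for more detail.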

http://www.datafed.net/consoles/user_consoles.asp?view_states=GALEO/THREDDS/Total_ozone_map,GALEO/THREDDS/Precipitable_water_map,GALEO/THREDDS/Precipitation_rate_map,GALEO/THREDDS/Pressure_reduced_to_MSL_map,GALEO/THREDDS/Temperature_high_cloud_top_map,GALEO/THREDDS/Temperature_surface_map&image_width=400&image_height=220&datetime=now-48&Title=GALEON%20IE:+THREEDS+WCS%20Client