DODS workshop observations

To: thredds <thredds@xxxxxxxxxxxxxxxx>, Staff <staff@xxxxxxxxxxxxxxxx>
Subject: DODS workshop observations
From: Ben Domenico <ben@xxxxxxxxxxxxxxxx>
Date: Fri, 18 Jan 2002 16:35:48 -0700

Hi,

After the DODS meetings last week and a few brief conversations at the AMS meetings this week, I thought it would be useful to summarize the issues that came up at the DODS meetings that I feel are important from my own (admittedly limited) THREDDS perspective.

When I get a chance, I'll try to capture this on a web page with all the relevant links, etc., but I wanted to get it out for discussion (especially for corrections by others who were at the DODS meetings) before I let it fall through the cracks.


Have a nice MLK weekend.

-- Ben

======================================================

Granularity:

Under this heading, I include the discussions regarding what comprises a dataset, what's an aggregation, what's a catalog, a collection, etc., and how these relate to files, data objects within files, inventories, lists, directories, etc. I came away from the meetings with the sense that there are clear definitions for only a few of these. Within THREDDS, we need to come up with some working definitions that allow us to work with the data hierarchy in a systematic fashion. This is somewhat complicated by the fact that the Digital Library community uses some of the terms, e.g., the term "collection", in its own fashion.

There is a related THREDDS issue that was not discussed much at the DODS meetings, namely, that we envision third-party metadata contributions in the form of "catalogs" that reference files on multiple data servers. This means that a given dataset or file can be a member of many hierarchies.
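To make the "member of many hierarchies" point concrete, here is a rough sketch of what it implies for bookkeeping. Everything in it is made up for illustration: the catalog names, the server URLs, and the memberships() helper are assumptions of mine, not anything that exists in DODS or THREDDS.

```python
from collections import defaultdict

# Hypothetical third-party catalogs. Each one lists dataset URLs that may
# live on different data servers (all names and URLs are invented).
catalogs = {
    "hurricanes": [
        "http://server-a.example/dods/sst_1998.nc",
        "http://server-b.example/dods/winds_1998.nc",
    ],
    "sea-surface-temperature": [
        "http://server-a.example/dods/sst_1998.nc",
    ],
}


def memberships(catalogs):
    """Invert catalog -> datasets into dataset -> sorted list of catalogs."""
    index = defaultdict(set)
    for name, urls in catalogs.items():
        for url in urls:
            index[url].add(name)
    return {url: sorted(names) for url, names in index.items()}


index = memberships(catalogs)
for url, names in sorted(index.items()):
    print(url, "is referenced by", ", ".join(names))
```

The same dataset URL shows up under more than one catalog, so a dataset cannot be assumed to have a single parent in a single tree.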


Metadata Schemas:

The DODS DDS (Dataset Descriptor Structure) and DAS (Dataset Attribute Structure) will not be sufficient for THREDDS. We have to determine how THREDDS fits in with externally defined "standards" such as those of ISO, FGDC, OpenGIS, GCMD, Dublin Core, ESML, etc. Recently we learned of another in the area of software metadata -- BIDM (Basic Interoperability Data Model). Our data provider sites are required to conform to some of these standards, and the DL community is adopting Dublin Core with some extensions.


Metadata Creation Tools:

These are needed in the form of crawlers, scanners, and tools to aid human input. This includes hybrid tools where some of the metadata common to many datasets is input by hand one time and is then combined automatically with metadata specific to individual datasets or files. It is important that such tools be able to traverse data holdings where the metadata (and perhaps the datasets themselves) are held in databases and generated on the fly as needed. Some of this work is going on in DODS, some in the DL community, and some at Unidata. So this is one where coordination of efforts is needed.
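As a minimal sketch of the hybrid idea described above, assume the hand-entered common metadata and the automatically scanned per-file metadata are both simple key/value records. The field names, file names, and the merge_metadata() helper are hypothetical and do not correspond to any existing tool.

```python
def merge_metadata(shared, specific):
    """Combine hand-entered shared metadata with per-file metadata.

    Per-file values win when both records define the same key.
    """
    merged = dict(shared)      # copy so the shared record is not modified
    merged.update(specific)    # per-file entries override shared ones
    return merged


# Entered by hand once for the whole collection (hypothetical fields).
shared = {"institution": "Example Data Center", "contact": "data@example.org"}

# Produced automatically by a scanner, one record per file (hypothetical).
per_file = {
    "sst_1998.nc": {"title": "SST 1998", "start": "1998-01-01"},
    "sst_1999.nc": {"title": "SST 1999", "start": "1999-01-01",
                    "contact": "sst-team@example.org"},
}

records = {name: merge_metadata(shared, md) for name, md in per_file.items()}
```

Letting the per-file value override the shared one is just one possible policy; a real tool might instead flag the conflict for a human to resolve.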


Metadata Presentation Tools:

Several approaches to making metadata available were discussed at the meeting: DBMS systems, LDAP, simple directory/file systems, full text indexing facilities. As noted above, it's important for metadata "harvesting" tools to be able to "traverse" all the metadata at a site -- even though it is made available in different ways.

Third-party Metadata Catalog Servers and the DODS Auxiliary Information Servers:

I believe these two concepts can be closely related. Whereas the AIS is currently viewed as a way of adding a "delta" of metadata to the main metadata source at the data provider's site, the concept could be extended to include sites which serve catalogs of metadata organized in a completely different fashion. For example, some of the catalogs might point to collections of datasets on different servers that illustrate different scientific concepts or collections of datasets on different servers that relate to certain events: hurricanes, major storms, floods, etc.
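The "delta" view of the AIS can be sketched roughly as follows. This is only an illustration under my own assumptions: the attributes are modeled as a nested dictionary keyed by variable name, and apply_delta() is a hypothetical helper, not the actual AIS mechanism or a DODS API.

```python
import copy


def apply_delta(base, delta):
    """Return base attributes with the delta layered on top.

    The provider's record (base) is left untouched; attributes in the
    delta are added to, or override, those of the matching variable.
    """
    result = copy.deepcopy(base)
    for variable, attrs in delta.items():
        result.setdefault(variable, {}).update(attrs)
    return result


# Attributes as served by the data provider (hypothetical content).
provider_attrs = {"sst": {"long_name": "sea surface temperature"}}

# Extra metadata held elsewhere, e.g. by a third party (hypothetical).
ais_delta = {
    "sst": {"units": "degC"},
    "global": {"event": "hurricane"},
}

combined = apply_delta(provider_attrs, ais_delta)
```

Because the provider's record is never modified, the same layering could in principle be applied by any number of third-party sites, each with its own delta.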

Follow-Ups:
- Re: DODS workshop observations
  - From: Peter Cornillon
