LDM and IDD Status Report

Steve Chiswell
Steve Emmerson
Mike Schmidt
John Stokes
Jeff Weber
Anne Wilson
Tom Yoksas

September 11, 2005

Real-time, self-managing data flows -- Unidata will foster and support the existence of real-time data flows that encompass a broad range of Earth-system phenomena, can be accessed with ease by all constituents, and are self managing in respect to changing contents and user needs.

--A goal of Unidata 2008: Shaping the Future of Data Use in the Geosciences

LDM Status Report

LDM Versions 6.4.0 and 6.4.1 have been released since the last report.
Highlights of the 6.4.x releases:
- Support for using a port other than 388.
- Automatic switching between primary and alternate transfer-mode by downstream LDM-s -- making failovers a thing of the past!
- Configurable filtering of data-products by upstream LDM-s.
- Support for more than 9 "back references" in string-substitutions of pqact(1) configuration-file.
- Encoding of MD5 signature in FEEDME requests to prevent data-loss.
- Reduced CPU utilization by about 75 percent by modifying the RPC library.
- Ported to Tiger version of MAX OS X.
Steve Emmerson led two sessions at the annual LDM training workshop in the UPC offices in July, 2005.

LDM Cluster Development

The toplevel IDD relay node operated by the UPC, idd.unidata.ucar.edu, is now the cluster that was reported on in the Spring 2005 status report.

Live stress testing of the cluster demonstrated that the limiting factor for data relay was the local network bandwidth which is 1 Gbps at UCAR! The cluster was operated at a sustained 500 Mbps output for a three day period in June. Tests were limited to this level since we ran out of downstream machines to which we could send data. During the stress testing, the cluster data backend machines were essentially idling.

The developers involved in the cluster effort are:

John Stokes cluster design and implementation

Steve Emmerson LDM-6 development

Mike Schmidt cluster design and system administration

Steve Chiswell IDD design and monitoring

Tom Yoksas configuration and stress testing

The design of our toplevel IDD relay node, idd.unidata.ucar.edu. is briefly described below (updated since the Spring 2005 status report).

In addition to atm.geo.nsf.gov the UPC operates the top level IDD relay node idd.unidata.ucar.edu. Instead of idd.unidata being a simple machine, it is part of a cluster that is composed of directors (machines that direct IDD feed requests to other machines) and data servers backends (machines that are fed requests by the director and service those requests). We are using the IP Virtual Server (IPVS) available in current versions of Linux to forward feed requests from directors to data servers.

Our cluster data backends currently run Fedora Core 3 64-bit Linux on identically configured Sun SunFire V20Z 1U rackmount servers:

Sun SunFire V20Z configuration

dual Opteron processors (64 bit)
12 GB RAM
2x36 GB 10K RPM SCSI
dual GB Ethernet interfaces
1U rackmount
$5500 cost (Sun educational discount program)

The cluster director is currently a Dell PowerEdge 2850 rackmount server. The 2850 is configured as follows:

Dell PowerEdge 2850 configuration

dual 64-bit Xeon processors (64 bit)
2 GB RAM
2x72 GB 10K RPM SCSI
dual GB Ethernet interfaces
2U rackmount
$2200 cost

The SunFire V20Z machines have proved to be stellar performers for IDD relay when running Fedora Core 3 64-bit Linux. We tested three operating systems side-by-side before settling on FC3:

Sun Solaris x86 10
FreeBSD 5.3
Fedora Core 3 Linux

All three operating systems are 64-bit. In our testing FC3 emerged as the clear winner; FreeBSD was second; and Solaris x86 10 was a distant third (this was very surprising). RedHat Enterprise WS 4 is FC3 with full RH support.

We will be testing the latest Fedora Core Linux release, version 4, for data backend use in the near future.

The following is a schematic view of what idd.unidata.ucar.edu:

                          |<-- director(s) -->|

                                +-------+ 
                                |       ^
                                V       |
                             +---------------+
    idd.unidata.ucar.edu ->  | LDM   | IPVS  |
                             +---------------+
                            /        |        \
                           /         |         \
                          /          |          \
                         /           |           \
                        /            |            \
                       /             |             \
                      /              |              \
                     /               |               \
        +---------------+    +---------------+    +---------------+
        |  'uni1' LDM   |    |  'uni2' LDM   |    |   'uni4' LDM  |
        +---------------+    +---------------+    +---------------+
    uni1.unidata.ucar.edu  uni2.unidata.ucar.edu  uni4.unidata.ucar.edu

      |<---------------------- data servers ---------------------->|

The top level indicates one director machine: idd.unidata.ucar.edu This machine is running IPVS and LDM 6.3.0 configured on a second interface (IP). The IPVS director software forwards port 388 requests received on a one interface configured as idd.unidata.ucar.edu on one machine and thelma.ucar.edu on the other. The set of data server backends are the same for both directors (at present).

When an IDD feed request is received by idd.unidata.ucar.edu is relayed by the IPVS software to one of the data servers. Those machines are configured to also be known internally as idd.unidata.ucar.edu, but they do not ARP, so they are not seen by the outside world/routers. The IPVS software keeps track of how many connections are on each of the data servers and forwards ("load levels") based on connection numbers (we will be changing this metric as we learn more about the setup). The data servers are all configured identically: same RAM, same LDM queue size (8 GB currently), same ldmd.conf contents, etc.

All connections from a downstream machine will always be sent to the same data server as long as its last connection did not die more than one minute in the past. This allows downstream LDMs to send an "are you alive" query to a server that they have not received data from in awhile. Once there have been no IDD request connections by a downstream host for one minute, a new request will be forwarded to the data server that is least loaded.

The design of the cluster allows for service on any data server without service interruption. When a data server goes out of service, the IPVS server is informed that the server is no longer available, and all downstream feed requests are sent to the other data servers that remain up.

LDM 6.3.0 was developed to allow running the LDM on a particular interface (IP). We are using this feature to run an LDM on the same box that is running the IPVS director. The IPVS listens on one interface (IP) and the LDM runs on another. The alternate interface does not need to represent a different Ethernet device; it can be a virtual interface configured in software. The ability to run LDMs on specific interfaces (IPs) allows us to run LDMs as either data collectors or as additional data servers on the same box running the director. (A data collector is an LDM that has multiple ldmd.conf requests that bring data to the cluster (e.g., CONDUIT from atm, UIUC, and/or, NEXRAD2 from Purdue, HDS from here, IDS|DDPLUS from there, etc.)). The data server LDMs request data redundantly from data collector LDMs. There is currently no directory redundancy; that will be added in the future.

The cluster setup is still new. Configurations will be modified as more is learned about how well the system performs. Stress tests run at the UPC demonstrated that one SunFire V20Z was able to handle 50% more downstream connections than the old SunFire 480R thelma.ucar.edu without introducing latency. With three data servers it is believed that the cluster can field literally every IDD feed request in the world if needed making the cluster the ultimate failover site. If the load on existing data servers ever becomes too high, more can easily be added. The ultimate limiting factor in this setup will be the routers and network bandwidth here in UCAR.

This cluster current relays an average of 140 Mbps (~1.4 TB/day) to approximately 250 downsteam connections. Peak rates routinely exceed 260 Mbps (2.6 TB/day).

Next Generation LDM Update

Steve Emmerson and Anne Wilson have been tasked with exploring the form and features of an ideal data relay system. Although still in draft form, an internal white paper has evolved to survey possible benefits of other relevant technologies, as well as to make recommendations regarding where Unidata should be in five years with respect to data delivery. The paper also includes some plans for describing how the IDD can be transitioned from one protocol to another.

IDD Status Report

289 machines at 157 sites are running LDM-6 and reporting real time statistics. Unidata staff routinely assist in the installation of LDM-6 at user sites to help evolve the IDD as rapidly as possible.
Real time LDM statistics can be found in the Real Time IDD Statistics web page. The topology and data flow information reflects changes in feed requests within a few minutes of changes. rtstats have proven to be most useful in diagnosing reception problems at sites, especially when universities have implemented restrictions on packets volume in and out of the campus without regard for their user needs.
NB: In order to correctly gauge real-time status of the IDD, it is important that all participating sites accurately maintain their system clocks. This is easily done through use of a Network Time Protocol daemon run on the local machine.
NWS continues to use LDM-6 to collect and relay NEXRAD Level II data. LDM-6.0.14 was used in the initial NWS build, but is slowly being replaced at participating nodes. Results of performance monitoring studies being conducted by NOAA personnel (Blanchard) are shared with the UPC, and information gained is being used to guide code modifications that will be included in future releases.
The IDD seamlessly transitioned to the use of DVB-S NOAAPORT ingest at the end of March. Currently, there are 8 sites known to be running the Unidata NOAAPORT reception software. 6 of these sites are acting as IDD injection nodes for NOAAPORT data.
Internet2 bandwidth usage by the LDM protocol reached a new high of approx. 21 TB/week the week of August 15. Weekly I2 usage statistics can be seen at Internet NetFlow: Weekly Reports. Search for 'UNIDATA LDM' usage statistics in the Advanced Applications portion of Table 7, Detailed Application Types (Full Data Set).

For previous information, please refer to the April 2005 Users Committee status report.

John Stokes	cluster design and implementation
Steve Emmerson	LDM-6 development
Mike Schmidt	cluster design and system administration
Steve Chiswell	IDD design and monitoring
Tom Yoksas	configuration and stress testing