
Re: 20050929: Problem with new ldm server



Robert,

I'm afraid I don't see a problem.  All of the "SVC_UNAVAIL"
error-messages occur after the top-level LDM server indicates it
received a SIGTERM signal -- after which I wouldn't expect an
ldmping(1) to succeed.
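
For what it's worth, the ordering can be checked mechanically. The following is only a rough sketch in Python, not part of the LDM package: the function name unexpected_unavail and the sample text are illustrative, and it assumes each log entry sits on a single line (the wrapped entries in the quoted log are joined here). It flags any "SVC_UNAVAIL" from ldmping(1) that is not preceded by a SIGTERM of rpc.ldmd with no intervening "Starting Up".

```python
import re

# Lines copied from the quoted logs/dvb-s excerpt, with each wrapped entry
# joined back onto one line (an assumption about how the real file looks).
SAMPLE_LOG = """\
Sep 28 21:16:12 mistral rpc.ldmd[5833]: SIGTERM
Sep 28 21:16:12 mistral rpc.ldmd[5833]: Terminating process group
Sep 28 21:16:13 mistral ldmping[5866]: SVC_UNAVAIL   0.000756    0   localhost    RPC: Program not registered
Sep 28 21:16:19 mistral ldmping[5871]: SVC_UNAVAIL   0.000747    0   localhost    RPC: Program not registered
Sep 28 21:16:19 mistral rpc.ldmd[5913]: Starting Up (version: 6.3.0; built: Sep 19 2005 12:20:49)
Sep 28 21:51:14 mistral rpc.ldmd[5913]: SIGTERM
Sep 28 21:51:15 mistral ldmping[5984]: SVC_UNAVAIL   0.000730    0   localhost    RPC: Program not registered
"""

# Server state changes: a SIGTERM marks it as going down, "Starting Up" as up.
SERVER_EVENT = re.compile(r"\brpc\.ldmd\[\d+\]: (SIGTERM|Starting Up)")
# A failed ldmping attempt.
PING_FAILURE = re.compile(r"\bldmping\[\d+\]: SVC_UNAVAIL")


def unexpected_unavail(lines):
    """Return SVC_UNAVAIL lines that do not follow a server SIGTERM.

    A failure is treated as expected only while the most recent rpc.ldmd
    event is a SIGTERM. If no server event has been seen yet, the state is
    unknown and the failure is reported as well.
    """
    server_up = None  # None = unknown, True = started, False = terminated
    unexpected = []
    for line in lines:
        event = SERVER_EVENT.search(line)
        if event:
            server_up = event.group(1) == "Starting Up"
        elif PING_FAILURE.search(line) and server_up is not False:
            unexpected.append(line)
    return unexpected


if __name__ == "__main__":
    flagged = unexpected_unavail(SAMPLE_LOG.splitlines())
    print(f"{len(flagged)} SVC_UNAVAIL line(s) outside a shutdown window")
```

Any line this reports would be a failed ldmping(1) while the server should have been up, which would be worth a closer look.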

The only program in the LDM package that generates a SIGTERM is the
ldmadmin(1) script when "stop" or "restart" is requested.  Of course,
the LDM user can always manually send a SIGTERM to the LDM server.

What, exactly, is the problem?

Regards,
Steve Emmerson

NOTE: All email exchanges with Unidata User Support are recorded in the
Unidata inquiry tracking system and then made publicly available
through the web.  If you do not want to have your interactions made
available in this way, you must let us know in each email you send to us.

------- Original Message

>To: Unidata Support <address@hidden>
>From: Robert Leche <address@hidden>
>Subject: Problem with new ldm server
>Organization: LSU/SRCC
>Keywords: 200509282205.j8SM5YG7021007 LDM NOAAPORT DVB-S

Guys,

I have included the dvb-s logs, a tcpdump, a netstat output, and the
ldmd.conf file to document a problem. This problem shows up in the dvb-s
file: "mistral ldmping[5866]: SVC_UNAVAIL   0.000756    0   localhost
RPC: Program not registered"

From the command line I am able to ldmping active ldm servers.

If you want to connect and have a look, the new computer is currently 
named: mistral.srcc.lsu.edu 130.39.188.225.  The ldm password is the 
same as the samoon ldm password.

I am drawing a blank at this point.

Bob


****************************************************************************
logs/dvb-s
****************************************************************************
Sep 28 21:16:12 mistral rpc.ldmd[5833]: SIGTERM
Sep 28 21:16:12 mistral rpc.ldmd[5833]: Terminating process group
Sep 28 21:16:12 mistral rtstats[5843]: Interrupt
Sep 28 21:16:12 mistral rtstats[5843]: Exiting
Sep 28 21:16:13 mistral ldmping[5866]: SVC_UNAVAIL   0.000756    0   localhost    RPC: Program not registered
Sep 28 21:16:19 mistral ldmping[5871]: SVC_UNAVAIL   0.000747    0   localhost    RPC: Program not registered
Sep 28 21:16:19 mistral pqcheck[5875]: Starting Up (5867)
Sep 28 21:16:19 mistral pqcheck[5875]: The writer-counter of the product-queue is 0
Sep 28 21:16:19 mistral pqcheck[5875]: Exiting
Sep 28 21:16:19 mistral rpc.ldmd[5913]: Starting Up (version: 6.3.0; built: Sep 19 2005 12:20:49)
Sep 28 21:16:19 mistral rpc.ldmd[5913]: Using local address 0.0.0.0:388
Sep 28 21:16:19 mistral rtstats[5923]: Starting Up (5913)
Sep 28 21:51:14 mistral rpc.ldmd[5913]: SIGTERM
Sep 28 21:51:14 mistral rpc.ldmd[5913]: Terminating process group
Sep 28 21:51:14 mistral rtstats[5923]: Interrupt
Sep 28 21:51:14 mistral rtstats[5923]: Exiting
Sep 28 21:51:15 mistral ldmping[5984]: SVC_UNAVAIL   0.000730    0   localhost    RPC: Program not registered
Sep 28 21:51:21 mistral ldmping[5989]: SVC_UNAVAIL   0.000761    0   localhost    RPC: Program not registered
Sep 28 21:51:21 mistral pqcheck[5993]: Starting Up (5985)
Sep 28 21:51:21 mistral pqcheck[5993]: The writer-counter of the product-queue is 0
Sep 28 21:51:21 mistral pqcheck[5993]: Exiting
Sep 28 21:51:21 mistral rpc.ldmd[6031]: Starting Up (version: 6.3.0; built: Sep 19 2005 12:20:49)
Sep 28 21:51:21 mistral rpc.ldmd[6031]: Using local address 0.0.0.0:388
Sep 28 21:51:21 mistral rtstats[6041]: Starting Up (6031)

******************************************************************************
[root@mistral rleche]# /usr/sbin/tcpdump -i eth1
tcpdump: verbose output suppressed, use -v or -vv for full protocol decode
listening on eth1, link-type EN10MB (Ethernet), capture size 96 bytes
16:55:33.412829 IP 10.0.9.51.32817 > AVIATOR.MCAST.NET.1205: UDP, length 272
16:55:33.413048 IP 10.0.9.51.32817 > AVIATOR.MCAST.NET.1205: UDP, length 272
16:55:33.423293 IP 10.0.9.51.32817 > AVIATOR.MCAST.NET.1205: UDP, length 272
16:55:33.666527 IP 10.0.9.51.32817 > AVIATOR.MCAST.NET.1205: UDP, length 272
16:55:33.666777 IP 10.0.9.51.32817 > AVIATOR.MCAST.NET.1205: UDP, length 272
16:55:33.677145 IP 10.0.9.51.32817 > AVIATOR.MCAST.NET.1205: UDP, length 272
16:55:33.749854 IP 192.168.200.3.6517 > 255.255.255.255.6516: UDP, length 138
16:55:33.924879 IP 10.0.9.51.32817 > AVIATOR.MCAST.NET.1205: UDP, length 272
16:55:33.925128 IP 10.0.9.51.32817 > AVIATOR.MCAST.NET.1205: UDP, length 272
16:55:33.935496 IP 10.0.9.51.32817 > AVIATOR.MCAST.NET.1205: UDP, length 272
16:55:34.164739 IP 10.0.9.51.32817 > AVIATOR.MCAST.NET.1205: UDP, length 272
16:55:34.164989 IP 10.0.9.51.32817 > AVIATOR.MCAST.NET.1205: UDP, length 272
16:55:34.175358 IP 10.0.9.51.32817 > AVIATOR.MCAST.NET.1205: UDP, length 272
16:55:34.262932 IP 192.168.200.3.6517 > 255.255.255.255.6516: UDP, length 138
16:55:34.405225 IP 10.0.9.51 > NTP.MCAST.NET: udp
16:55:34.406475 IP 10.0.9.51 > NTP.MCAST.NET: udp
16:55:34.413845 IP 10.0.9.51.32817 > AVIATOR.MCAST.NET.1205: UDP, length 272
16:55:34.414095 IP 10.0.9.51.32817 > AVIATOR.MCAST.NET.1205: UDP, length 272
16:55:34.417593 IP 10.0.9.51.32816 > NTP.MCAST.NET.1201: UDP, length 4068
16:55:34.419717 IP 10.0.9.51.32816 > NTP.MCAST.NET.1201: UDP, length 1443
16:55:34.419842 IP 10.0.9.51 > NTP.MCAST.NET: udp
16:55:34.421216 IP 10.0.9.51.32816 > NTP.MCAST.NET.1201: UDP, length 1896
16:55:34.421341 IP 10.0.9.51 > NTP.MCAST.NET: udp
16:55:34.424467 IP 10.0.9.51.32817 > AVIATOR.MCAST.NET.1205: UDP, length 272
16:55:34.427962 IP 10.0.9.51.32816 > NTP.MCAST.NET.1201: UDP, length 1707
16:55:34.432584 IP 10.0.9.51.32816 > NTP.MCAST.NET.1201: UDP, length 787
16:55:34.433459 IP 10.0.9.51 > NTP.MCAST.NET: udp
16:55:34.434708 IP 10.0.9.51 > NTP.MCAST.NET: udp
16:55:34.435833 IP 10.0.9.51.32816 > NTP.MCAST.NET.1201: UDP, length 4068
16:55:34.436208 IP 10.0.9.51 > NTP.MCAST.NET: udp
16:55:34.437457 IP 10.0.9.51.32816 > NTP.MCAST.NET.1201: UDP, length 1971
16:55:34.442829 IP 10.0.9.51 > NTP.MCAST.NET: udp
16:55:34.444078 IP 10.0.9.51 > NTP.MCAST.NET: udp
******************************************************************************
tcpdump dump of eth1 traffic
******************************************************************************

[root@mistral rleche]# netstat -tunap
Active Internet connections (servers and established)
Proto Recv-Q Send-Q Local Address               Foreign Address             State       PID/Program name
tcp        0      0 0.0.0.0:32768               0.0.0.0:*                   LISTEN      2408/rpc.statd
tcp        0      0 0.0.0.0:388                 0.0.0.0:*                   LISTEN      6031/rpc.ldmd
tcp        0      0 0.0.0.0:199                 0.0.0.0:*                   LISTEN      2895/snmpd
tcp        0      0 0.0.0.0:111                 0.0.0.0:*                   LISTEN      4306/portmap
tcp        0      0 127.0.0.1:631               0.0.0.0:*                   LISTEN      2511/cupsd
tcp        0      0 127.0.0.1:25                0.0.0.0:*                   LISTEN      2636/sendmail: acce
tcp        0      0 127.0.0.1:6010              0.0.0.0:*                   LISTEN      4779/0
tcp        0      0 127.0.0.1:199               127.0.0.1:32770             ESTABLISHED 2895/snmpd
tcp        0      0 127.0.0.1:32770             127.0.0.1:199               ESTABLISHED 2986/dcsnmp32d
tcp        0      0 130.39.188.225:32950        130.39.188.220:6000         ESTABLISHED 5603/xterm
tcp        0      0 :::8000                     :::*                        LISTEN      3058/omaws32
tcp        0      0 :::32774                    :::*                        LISTEN      3058/omaws32
tcp        0      0 :::22                       :::*                        LISTEN      2559/sshd
tcp        0      0 ::1:6010                    :::*                        LISTEN      4779/0
tcp        0      0 :::1311                     :::*                        LISTEN      3058/omaws32
tcp        0      0 ::ffff:130.39.188.225:22    ::ffff:130.39.188.220:39659 ESTABLISHED 4777/sshd: rleche [
udp        0      0 0.0.0.0:32768               0.0.0.0:*                               2408/rpc.statd
udp        0      0 0.0.0.0:32769               0.0.0.0:*                               3058/omaws32
udp        0      0 0.0.0.0:32770               0.0.0.0:*                               3058/omaws32
udp        0      0 0.0.0.0:32771               0.0.0.0:*                               3058/omaws32
udp        0      0 0.0.0.0:32772               0.0.0.0:*                               3058/omaws32
udp        0      0 0.0.0.0:161                 0.0.0.0:*                               2895/snmpd
udp        0      0 0.0.0.0:1201                0.0.0.0:*                               6033/dvbs_multicast
udp        0      0 0.0.0.0:1202                0.0.0.0:*                               6034/dvbs_multicast
udp        0      0 0.0.0.0:1203                0.0.0.0:*                               6035/dvbs_multicast
udp        0      0 0.0.0.0:1204                0.0.0.0:*                               6036/dvbs_multicast
udp        0      0 0.0.0.0:58086               0.0.0.0:*                               2340/procfgd
udp        0      0 0.0.0.0:111                 0.0.0.0:*                               4306/portmap
udp        0      0 0.0.0.0:631                 0.0.0.0:*                               2511/cupsd
udp        0      0 0.0.0.0:888                 0.0.0.0:*                               2408/rpc.statd
udp        0      0 192.168.200.2:123           0.0.0.0:*                               4670/ntpd
udp        0      0 130.39.188.225:123          0.0.0.0:*                               4670/ntpd
udp        0      0 127.0.0.1:123               0.0.0.0:*                               4670/ntpd
udp        0      0 0.0.0.0:123                 0.0.0.0:*                               4670/ntpd
udp        0      0 :::123                      :::*                                    4670/ntpd


#### DATOO LDMD.CONF
# $Id: ldmd.conf,v 1.9 1998/10/07 16:51:16 rkambic Exp $
# Sample ldmd.conf for ldm5
####
#
# Programs that share a queue with rpc.ldmd
# are started by it and are in the same process group.
#
# exec  "pqbinstats"
# exec  "pqact"

# dvbs shared memory ingest processes -- default fifo size (8 MB)
# exec  "dvbs_multicast -m 224.0.1.1"
# exec  "dvbs_multicast -m 224.0.1.2"
# exec  "dvbs_multicast -m 224.0.1.3"
# exec  "dvbs_multicast -m 224.0.1.4"
#
# dvbs shared memory ingest processes -- reduced fifo size (2 MB)
exec    "dvbs_multicast -m 224.0.1.1 -b 500 -p -10"
exec    "dvbs_multicast -m 224.0.1.2 -b 500 -p -10"
exec    "dvbs_multicast -m 224.0.1.3 -b 500 -p -10"
exec    "dvbs_multicast -m 224.0.1.4 -b 500 -p -10"
#
# readnoaaport shared memory readers
exec    "readnoaaport -m 224.0.1.1"
exec    "readnoaaport -m 224.0.1.2"
exec    "readnoaaport -m 224.0.1.3"
exec    "readnoaaport -m 224.0.1.4"

exec    "rtstats -h rtstats.unidata.ucar.edu"

#
###############################################################################
# ALLOW: Who we are willing to feed
#
# allow <feedset> <hostname pattern>
###############################################################################
#
allow ANY ^((localhost|loopback)|(127\.0\.0\.1\.?$)|([a-z].*\.unidata\.ucar\.edu\.?$))

# any LSU SRCC machine
allow ANY ^[a-z]*\.srcc\.lsu\.edu$
allow ANY sirocco.srcc.lsu.edu
allow ANY product1.srcc.lsu.edu
allow ANY mistral.srcc.lsu.edu
allow ANY samoon.srcc.lsu.edu

# Unidata IDD NOAAPORT top-level hosts
allow ANY ^[a-z]*\.ssec\.wisc\.edu$
allow ANY ^[a-z]*\.wunderground\.com$
allow ANY ^[a-z]*\.ucar\.edu$
allow ANY ^[a-z]*\.geo\.nsf\.gov$

# The following entry is OEP:


#
# Give permission to the Unidata Program Center
#ALLOW  ANY     ^[a-z].*\.unidata\.ucar\.edu\.?$        .*
ALLOW   ANY     ^[a-z].*\.unidata\.ucar\.edu\.?$
#
#Give all SRCC hosts access
#ALLOW  ANY     ^[a-z].*\.srcc\.lsu\.edu\.?$    .*
ALLOW   ANY     ^[a-z].*\.srcc\.lsu\.edu\.?$


# The following entry is OEP:
allow ANY ^204\.196\.102\..*$

allow   ANY     ^tornado\.geos\.ulm\.edu$
allow   ANY     ^w[a-z]*\.admin\.niu\.edu$

#
###############################################################################
# Accept Entries
###############################################################################
# Give permission to the Unidata Program Center
#ALLOW  ANY     ^[a-z].*\.unidata\.ucar\.edu\.?$        .*
ALLOW   ANY     ^[a-z].*\.unidata\.ucar\.edu\.?$
#
#Give all SRCC hosts access
#ALLOW  ANY     ^[a-z].*\.srcc\.lsu\.edu\.?$    .*
ALLOW   ANY     ^[a-z].*\.srcc\.lsu\.edu\.?$


# The following entry is OEP:
allow ANY ^204\.196\.102\..*$

allow   ANY     ^tornado\.geos\.ulm\.edu$
allow   ANY     ^w[a-z]*\.admin\.niu\.edu$

#
###############################################################################
# Accept Entries
###############################################################################
# ACCEPT: Who can feed us without being requested by a REQUEST entry, currently
# this action is ONLY needed for WSI data
#
# ACCEPT <feedset> <pattern> <hostname pattern>
#
# ACCEPT anything from yourself
#
ACCEPT ANY ".*" ^((localhost|loopback)|(127\.0\.0\.1\.?$))
#
# accept from your upstream site
#
# WSI is using ldm4 protocol so the accept is still required
#ACCEPT WSI
#    .*
#    ^[a-z].*\.uni\.wsicorp\.com$
#
###############################################################################
# End
###############################################################################









-- 
----------------------------------------------------------------
Robert Leche - System Administrator
Louisiana State University - Southern Regional Climate Center
East 328  Howe-Russell Building - Baton Rouge, La. 70803
address@hidden - 225 578 5023
----------------------------------------------------------------

------- End of Original Message