
[LDM #JCB-153899]: idd.cise-nsf.gov to *.fis.ua.pt failure



Hi Yoshihiro,

I apologize for not being able to get back to you sooner on your problem
(we are holding training workshops at the moment).

re: 
> Yes, the ldm is running on the atm77.fis.ua.pt but I am
> not receiving any data at all.

I just checked and see that atm77.fis.ua.pt is contacting idd.cise-nsf.gov
to request CONDUIT data:

Oct 23 18:34:08 atm.cise-nsf.gov atm77.fis.ua.pt(feed)[10827] NOTE: Starting Up(6.4.5/6): 20061023173404.789 TS_ENDT {{CONDUIT,  "prod/gfs.*pgrb2.*[16]$"}}, Primary
Oct 23 18:34:08 atm.cise-nsf.gov atm77.fis.ua.pt(feed)[10827] NOTE: topo:  atm77.fis.ua.pt {{CONDUIT, (.*)}}
Oct 23 18:34:12 atm.cise-nsf.gov atm77.fis.ua.pt(feed)[10861] NOTE: Starting Up(6.4.5/6): 20061023173411.629 TS_ENDT {{CONDUIT,  "prod/gfs.*pgrb2.*[27]$"}}, Primary
Oct 23 18:34:12 atm.cise-nsf.gov atm77.fis.ua.pt(feed)[10861] NOTE: topo:  atm77.fis.ua.pt {{CONDUIT, (.*)}}
Oct 23 18:34:13 atm.cise-nsf.gov atm77.fis.ua.pt(feed)[10862] NOTE: Starting Up(6.4.5/6): 20061023173411.678 TS_ENDT {{CONDUIT,  "prod/gfs.*pgrb2.*[38]$"}}, Primary
Oct 23 18:34:13 atm.cise-nsf.gov atm77.fis.ua.pt(feed)[10862] NOTE: topo:  atm77.fis.ua.pt {{CONDUIT, (.*)}}
Oct 23 18:34:13 atm.cise-nsf.gov atm77.fis.ua.pt(feed)[10863] NOTE: Starting Up(6.4.5/6): 20061023173411.720 TS_ENDT {{CONDUIT,  "prod/gfs.*pgrb2.*[49]$"}}, Primary
Oct 23 18:34:13 atm.cise-nsf.gov atm77.fis.ua.pt(feed)[10863] NOTE: topo:  atm77.fis.ua.pt {{CONDUIT, (.*)}}

Even though it looks like the data should flow to atm77, it is not.

Interesting observations:

1) An 'ldmping' from idd.cise-nsf.gov to atm77 does not work:

ldmping atm77.fis.ua.pt
Oct 23 18:53:56 INFO:      State    Elapsed Port   Remote_Host           rpc_stat
Oct 23 18:53:56 INFO: Resolving atm77.fis.ua.pt to 193.137.81.77 took 0.04078 seconds
Oct 23 18:54:06 ERROR: SVC_UNAVAIL  10.039241    0   atm77.fis.ua.pt  h_clnt_create(atm77.fis.ua.pt): Timed out while creating connection

2) a 'traceroute' from idd.cise-nsf.gov to atm77 seems to hang in geant2.net:

traceroute to atm77.fis.ua.pt (193.137.81.77), 30 hops max, 40 byte packets
 1  cise-7204-1 (192.12.209.1)  1 ms  0 ms  0 ms
 2  arlg-nsf.maxgigapop.net (206.196.177.137)  3 ms  5 ms  2 ms
 3  abilene-rtr.maxgigapop.net (206.196.177.2)  11 ms  17 ms  6 ms
 4  abilene-wash.rt1.fra.de.geant2.net (62.40.125.17)  141 ms  110 ms  108 ms
 5  so-6-2-0.rt1.gen.ch.geant2.net (62.40.112.21)  118 ms  116 ms  116 ms
 6  so-7-0-0.rt1.mad.es.geant2.net (62.40.112.26)  145 ms  140 ms  140 ms
 7  so-6-0-0.rt1.lis.pt.geant2.net (62.40.112.98)  180 ms  159 ms  189 ms

3) a 'traceroute' from idd.unidata.ucar.edu to atm77 works:

traceroute atm77.fis.ua.pt
traceroute to atm77.fis.ua.pt (193.137.81.77), 30 hops max, 40 byte packets
 1  flra-n140.unidata.ucar.edu (128.117.140.252)  1.173 ms   0.233 ms   0.264 ms
 2  gin-n243-80.ucar.edu (128.117.243.81)  0.472 ms   0.409 ms   0.415 ms
 3  frgp-gw-1.ucar.edu (128.116.254.250)  1.396 ms   1.427 ms   1.444 ms
 4  192.43.217.166 (192.43.217.166)  1.524 ms   1.566 ms   1.616 ms
 5  * * *
 6  * * *
 7  * * *
 8  * * *
 9  198.32.11.51 (198.32.11.51)  45.516 ms   45.398 ms   45.282 ms
10  so-7-0-0.rt1.ams.nl.geant2.net (62.40.112.133)  145.608 ms   145.675 ms   145.668 ms
11  so-1-0-0.rt1.lon.uk.geant2.net (62.40.112.138)  153.794 ms   153.787 ms   153.720 ms
12  so-5-0-0.rt1.lis.pt.geant2.net (62.40.112.145)  180.718 ms   180.762 ms   180.772 ms
13  * * *
14  ROUTER3.GE.Lisboa.fccn.pt (193.137.0.27)  180.719 ms   180.707 ms   180.714 ms
15  Router2.10GE.Porto.fccn.pt (193.136.1.222)  184.505 ms   184.481 ms   184.253 ms
16  UA.Aveiro.fccn.pt (193.136.1.194)  185.663 ms   185.642 ms   185.512 ms
17  fw1-ext.core.ua.pt (193.137.173.253)  185.416 ms   185.548 ms   185.369 ms
18  gt-cicua.core.ua.pt (193.136.86.193)  185.914 ms   186.067 ms   188.092 ms
19  * * *
20  atm77.fis.ua.pt (193.137.81.77)  185.620 ms   185.794 ms   185.762 ms

4) an 'ldmping' from idd.unidata.ucar.edu to atm77 does not work:

 ldmping atm77.fis.ua.pt
Oct 23 19:06:00 INFO:      State    Elapsed Port   Remote_Host           rpc_stat
Oct 23 19:06:00 INFO: Resolving atm77.fis.ua.pt to 193.137.81.77 took 0.005775 seconds
Oct 23 19:06:02 ERROR:  ADDRESSED   1.463389    0   atm77.fis.ua.pt  RPC: Unable to receive; errno = Connection reset by peer

5) a 'notifyme' from idd.unidata.ucar.edu to atm77 returns 'Access denied':

notifyme -vxl- -f ANY -o 10000 -h atm77.fis.ua.pt
Oct 23 19:07:19 notifyme[10336] NOTE: Starting Up: atm77.fis.ua.pt: 20061023162039.995 TS_ENDT {{ANY,  ".*"}}
Oct 23 19:07:19 notifyme[10336] NOTE: LDM-5 desired product-class: 20061023162039.995 TS_ENDT {{ANY,  ".*"}}
Oct 23 19:07:19 notifyme[10336] INFO: Resolving atm77.fis.ua.pt to 193.137.81.77 took 0.002277 seconds
Oct 23 19:07:20 notifyme[10336] ERROR: NOTIFYME(atm77.fis.ua.pt): 7: Access denied by remote server
Oct 23 19:07:45 notifyme[10336] NOTE: LDM-5 desired product-class: 20061023162039.995 TS_ENDT {{ANY,  ".*"}}
Oct 23 19:07:45 notifyme[10336] INFO: Resolving atm77.fis.ua.pt to 193.137.81.77 took 0.00205 seconds
Oct 23 19:07:45 notifyme[10336] ERROR: NOTIFYME(atm77.fis.ua.pt): 7: Access denied by remote server

6) the ~ldm/logs/ldmd.log messages on idd.unidata.ucar.edu show feed requests
from atm77:

Oct 23 16:33:44 uni1 atm77.fis.ua.pt(feed)[3769] NOTE: Starting Up(6.4.5/6): 20061023153343.078 TS_ENDT {{CONDUIT,  "MT.gfs_CY.(00|06|12|18).*[05]$"}}, Alternate
Oct 23 16:33:44 uni1 atm77.fis.ua.pt(feed)[3769] NOTE: topo:  atm77.fis.ua.pt {{CONDUIT, (.*)}}
Oct 23 16:33:44 uni1 atm77.fis.ua.pt(feed)[3770] NOTE: Starting Up(6.4.5/6): 20061023153343.080 TS_ENDT {{CONDUIT,  "MT.gfs_CY.(00|06|12|18).*[16]$"}}, Alternate
Oct 23 16:33:44 uni1 atm77.fis.ua.pt(feed)[3770] NOTE: topo:  atm77.fis.ua.pt {{CONDUIT, (.*)}}
Oct 23 16:33:44 uni1 atm77.fis.ua.pt(feed)[3771] NOTE: Starting Up(6.4.5/6): 20061023153343.083 TS_ENDT {{CONDUIT,  "MT.gfs_CY.(00|06|12|18).*[49]$"}}, Alternate
Oct 23 16:33:44 uni1 atm77.fis.ua.pt(feed)[3771] NOTE: topo:  atm77.fis.ua.pt {{CONDUIT, (.*)}}
Oct 23 16:33:44 uni1 atm77.fis.ua.pt(feed)[3772] NOTE: Starting Up(6.4.5/6): 20061023153343.085 TS_ENDT {{CONDUIT,  "MT.gfs_CY.(00|06|12|18).*[27]$"}}, Alternate
Oct 23 16:33:44 uni1 atm77.fis.ua.pt(feed)[3772] NOTE: topo:  atm77.fis.ua.pt {{CONDUIT, (.*)}}
Oct 23 16:33:44 uni1 atm77.fis.ua.pt(feed)[3773] NOTE: Starting Up(6.4.5/6): 20061023153343.087 TS_ENDT {{CONDUIT,  "MT.gfs_CY.(00|06|12|18).*[38]$"}}, Alternate
Oct 23 16:33:44 uni1 atm77.fis.ua.pt(feed)[3773] NOTE: topo:  atm77.fis.ua.pt {{CONDUIT, (.*)}}
Oct 23 16:34:42 uni1 atm77.fis.ua.pt(feed)[21534] ERROR: Couldn't flush connection; nullproc_6() failure to atm77.fis.ua.pt: RPC: Timed out
Oct 23 16:34:42 uni1 atm77.fis.ua.pt(feed)[21533] ERROR: Couldn't flush connection; nullproc_6() failure to atm77.fis.ua.pt: RPC: Timed out
Oct 23 16:34:42 uni1 atm77.fis.ua.pt(feed)[21532] ERROR: Couldn't flush connection; nullproc_6() failure to atm77.fis.ua.pt: RPC: Timed out
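
To compare the two traces in items 2 and 3 without reading them hop by hop,
a small script can report the last hop that answered. This is only a sketch:
the function name is illustrative, and the last answering hop is not
necessarily where packets are dropped, since routers and firewalls may simply
not reply to probes (hop 13 in the idd.unidata.ucar.edu trace is silent, yet
later hops answer).

```python
import re

# A traceroute hop line starts with the hop number, then a name and an
# address in parentheses. A hop that never answered is printed as "* * *"
# and deliberately does not match.
HOP_RE = re.compile(r"^\s*(\d+)\s+(\S+)\s+\((\d+\.\d+\.\d+\.\d+)\)")


def last_responding_hop(trace_output):
    """Return (hop_number, hostname, ip) of the last hop that answered, or None."""
    last = None
    for line in trace_output.splitlines():
        match = HOP_RE.match(line)
        if match:
            last = (int(match.group(1)), match.group(2), match.group(3))
    return last


# Hop lines copied from the idd.cise-nsf.gov trace in item 2.
cise_trace = """\
 5  so-6-2-0.rt1.gen.ch.geant2.net (62.40.112.21)  118 ms  116 ms  116 ms
 6  so-7-0-0.rt1.mad.es.geant2.net (62.40.112.26)  145 ms  140 ms  140 ms
 7  so-6-0-0.rt1.lis.pt.geant2.net (62.40.112.98)  180 ms  159 ms  189 ms
"""

print(last_responding_hop(cise_trace))
```

If the address returned is not the target (193.137.81.77 here), the trace did
not reach atm77 from that vantage point.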


> That´s what I am not
> undestanding because according with computer´s department
> they did no change at all.

It appears that something changed, either in a firewall somewhere or in some
network routing configuration.

> Is there any way to check all my conection with ldm ??
> best regards,

In addition to the checks above, I tried using 'rpcinfo' to interrogate your
machine:

/usr/sbin/rpcinfo -p atm77.fis.ua.pt

This got no response.
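
A plain TCP connection attempt is another way to check reachability from any
machine that has Python, independent of the LDM tools. The sketch below
assumes the LDM is listening on its registered port, 388, and that the
portmapper queried by 'rpcinfo -p' is on port 111; the function names are
illustrative. A "timeout" result usually points at packets being silently
dropped (firewall or routing), while "refused" suggests the host was reached
but nothing accepted the connection on that port.

```python
import socket


def probe_tcp(host, port, timeout=10.0):
    """Try one TCP connection and classify the result as a short string."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return "open"
    except socket.timeout:
        # No answer at all within the timeout.
        return "timeout"
    except ConnectionRefusedError:
        # The remote end actively rejected the connection.
        return "refused"
    except OSError as err:
        # Anything else, e.g. name resolution failure or no route to host.
        return "error: %s" % err


def check_ports(host, ports=(388, 111), timeout=10.0):
    """Probe each port in turn; 388 (LDM) and 111 (portmapper) are assumed defaults."""
    return {port: probe_tcp(host, port, timeout) for port in ports}
```

For example, check_ports("atm77.fis.ua.pt") run on an upstream host returns a
dictionary with one result per port, which can be compared with the same call
run from a machine inside the ua.pt network.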

Questions:

- what is the operating system on atm77 (please return results of 'uname -a')?
- was the operating system on atm77 upgraded recently?
- do you have an 'allow' line in your ~ldm/etc/ldmd.conf file for all machines
  from the unidata.ucar.edu domain?  It should look like:

# ALLOW anything to your own machine and all Unidata machines
allow   ANY     ^((localhost|loopback)|(127\.0\.0\.1\.?$)|([a-z].*\.unidata\.ucar\.edu\.?$))

- when you did your LDM installation, did you remember to change the mode/owner
  of ~ldm/bin/rpc.ldmd and ~ldm/bin/hupsyslog using the marked command:

  <as 'ldm'>
  cd ~ldm
  cd ldm-6.4.6.4/src
  make distclean
  ./configure
  make
  make install
  sudo make install_setuids         <- the line I am referring to
  cd ~
  rm runtime
  ln -s ldm-6.4.6.4 runtime
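
The host pattern in the 'allow' question above can be tried against a few
hostnames before editing ldmd.conf. This is a rough check only: Python's 're'
module stands in for the regular-expression matching done by the LDM, and
case-insensitive matching is an assumption here, not something confirmed for
the LDM. The names ALLOW_PATTERN and is_allowed are illustrative.

```python
import re

# Host pattern from the 'allow' question, written on one line with the
# outer group closed.
ALLOW_PATTERN = (
    r"^((localhost|loopback)|(127\.0\.0\.1\.?$)"
    r"|([a-z].*\.unidata\.ucar\.edu\.?$))"
)


def is_allowed(hostname, pattern=ALLOW_PATTERN):
    """Return True if the hostname matches the allow pattern."""
    # Assumption: case-insensitive search approximates the LDM's matching.
    return re.search(pattern, hostname, re.IGNORECASE) is not None


for name in ("idd.unidata.ucar.edu", "localhost", "127.0.0.1",
             "idd.cise-nsf.gov", "atm77.fis.ua.pt"):
    print(name, is_allowed(name))
```

Under these assumptions the first three names match and the last two do not.
A missing or non-matching 'allow' entry would be consistent with the "Access
denied by remote server" reply that 'notifyme' received from atm77.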

I am working with our system administrator to try and figure out why data
products are not flowing to you even though both idd.unidata.ucar.edu and
idd.cise-nsf.gov are receiving AND honoring feed requests.

Cheers,

Tom
****************************************************************************
Unidata User Support                                    UCAR Unidata Program
(303) 497-8642                                                 P.O. Box 3000
address@hidden                                   Boulder, CO 80307
----------------------------------------------------------------------------
Unidata HomePage                       http://www.unidata.ucar.edu
****************************************************************************


Ticket Details
===================
Ticket ID: JCB-153899
Department: Support LDM
Priority: Normal
Status: Open

