
[TIGGE #UDB-639046] Re: 20090721: dados do tigge [TIGGE data]



All,

I've had a separate discussion with NETS, and I think the problem is with dsspub.

When attempting to ldmping dsspub from dsspub I get similar errors to those observed by external users:

bash:ldmping
Jul 21 20:04:32 INFO: State Elapsed Port Remote_Host rpc_stat
Jul 21 20:04:32 INFO: Resolving localhost to 127.0.0.1 took 0.000291 seconds
Jul 21 20:04:42 ERROR: H_CLNTED 10.008459 388 localhost select: RPC: Timed out
Jul 21 20:05:17 ERROR: ADDRESSED 9.999722 0 localhost RPC: Timed out
Jul 21 20:05:52 ERROR: SVC_UNAVAIL 9.999705 0 localhost h_clnt_create(localhost): Timed out while creating connection
Jul 21 20:06:27 ERROR: SVC_UNAVAIL 9.999712 0 localhost h_clnt_create(localhost): Timed out while creating connection

When I do the same on dssingest (ldmping dssingest from dssingest) I get a good response:

bash:ldmping
Jul 21 20:05:34 INFO: State Elapsed Port Remote_Host rpc_stat
Jul 21 20:05:34 INFO: Resolving localhost to 127.0.0.1 took 0.00032 seconds
Jul 21 20:05:34 INFO: RESPONDING   0.002581  388   localhost
Jul 21 20:05:59 INFO: RESPONDING   0.000100  388   localhost
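
When comparing ldmping results from several sites, it can help to reduce each run to its result lines. The following is only a rough sketch, assuming the log layout matches the "State Elapsed Port Remote_Host rpc_stat" header shown in the output above; the names parse_ldmping and is_responding are made up for illustration and are not part of LDM.

```python
import re

# One ldmping result line, following the column header seen in this thread:
#   State  Elapsed  Port  Remote_Host  rpc_stat
# The layout is an assumption taken from the logs quoted here.
LINE_RE = re.compile(
    r"^(?P<stamp>\w{3}\s+\d+\s+[\d:]+)\s+"
    r"(?P<level>INFO|ERROR):\s+"
    r"(?P<state>[A-Z_]+)\s+"
    r"(?P<elapsed>\d+\.\d+)\s+"
    r"(?P<port>\d+)\s+"
    r"(?P<host>\S+)"
    r"(?:\s+(?P<rpc_stat>.*))?$"
)


def parse_ldmping(text):
    """Return one dict per result line; header and 'Resolving' lines are skipped."""
    results = []
    for line in text.splitlines():
        match = LINE_RE.match(line.strip())
        if match is None:
            continue  # not a result line (header, resolver message, blank)
        row = match.groupdict()
        row["elapsed"] = float(row["elapsed"])
        row["port"] = int(row["port"])
        results.append(row)
    return results


def is_responding(results):
    """True only if there is at least one result and all of them are RESPONDING."""
    return bool(results) and all(r["state"] == "RESPONDING" for r in results)


# Sample lines copied from the two runs quoted above.
BAD_RUN = """\
Jul 21 20:04:32 INFO: State Elapsed Port Remote_Host rpc_stat
Jul 21 20:04:32 INFO: Resolving localhost to 127.0.0.1 took 0.000291 seconds
Jul 21 20:04:42 ERROR: H_CLNTED 10.008459 388 localhost select: RPC: Timed out
Jul 21 20:05:17 ERROR: ADDRESSED 9.999722 0 localhost RPC: Timed out
"""

GOOD_RUN = """\
Jul 21 20:05:34 INFO: Resolving localhost to 127.0.0.1 took 0.00032 seconds
Jul 21 20:05:34 INFO: RESPONDING   0.002581  388   localhost
Jul 21 20:05:59 INFO: RESPONDING   0.000100  388   localhost
"""

bad = parse_ldmping(BAD_RUN)
good = parse_ldmping(GOOD_RUN)
print("dsspub responding:", is_responding(bad))
print("dssingest responding:", is_responding(good))
```

Feeding it the saved output of each site's ldmping run would give a quick yes/no per host, with the raw state and rpc_stat text kept for anyone who wants the detail.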

Can everyone attempt to ldmping dssingest.ucar.edu, and set up request commands for *.missing files to dssingest.ucar.edu if applicable?

I'd like to switch all operations over to dssingest until the dsspub problems are resolved.

Doug


On Jul 21, 2009, at 9:46 AM, Tom Yoksas wrote:


Hi Doug, Manuel, et al.,

re:
We're currently having connectivity issues between ECMWF
(tigge-ldm.ecmwf.int) and NCAR (dsspub.ucar.edu), CMA
(tigge-cma-ncar.cma.gov.cn) and NCAR (dsspub.ucar.edu), and ECMWF
(tigge-ldm.ecmwf.int) and Brazil (tigge-ldm.cptec.inpe.br).  If you have
any insight into the problem, it would be greatly appreciated.

While I don't have any insight into what is causing the problem,
I can offer my hunch:  it is a routing problem in NLR.  I say
this because a traceroute to a Unidata user site hangs at the
NLR node in Denver:

traceroute tornado.geos.ulm.edu
traceroute to tornado.geos.ulm.edu (198.202.242.22), 30 hops max, 40 byte packets
1  flra-n140 (128.117.140.253)  1 ms  0 ms  0 ms
2  tcom-gs-1-n243-80.ucar.edu (128.117.243.85)  1 ms  1 ms  1 ms
3  nlrb-frgp.frgp.net (192.43.217.113)  1 ms  1 ms  2 ms
4  frgp-nlr.frgp.net (192.43.217.138)  5 ms  2 ms  13 ms
5  hous-denv-82.layer3.nlr.net (216.24.186.27)  31 ms  31 ms  31 ms
<hang>

If I knew who to call, I would get on the horn to NLR (or I2?) to
report the problem.
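
A traceroute hang by itself does not show whether the LDM port is reachable, since intermediate routers may simply not answer probes. As a rough cross-check, one could try opening a plain TCP connection to port 388 (the port shown in the ldmping output in this thread). This is only a sketch: probe_tcp is a made-up helper name, and a successful connect says nothing about whether the LDM RPC service behind the port actually answers, which is what ldmping tests.

```python
import socket

LDM_PORT = 388  # port number taken from the ldmping output in this thread


def probe_tcp(host, port=LDM_PORT, timeout=10.0):
    """Try to open a TCP connection; return (reachable, detail).

    This checks only that the TCP handshake completes. It does not
    exercise the LDM RPC service itself.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True, "connected"
    except socket.timeout:
        return False, "timed out"
    except OSError as exc:
        # Covers refused connections, unreachable networks, DNS failures.
        return False, f"failed: {exc}"
```

For example, calling probe_tcp("dssingest.ucar.edu") and probe_tcp("dsspub.ucar.edu") from an affected site would show whether the two hosts differ at the TCP level. A timeout would be consistent with a routing or filtering problem, while a quick connect followed by an ldmping failure would point more toward the server itself.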

Cheers,

Tom

- ECMWF has been unable to ldmping or retrieve data from NCAR's LDM
server since ~July 10.  No recorded changes have been made to NCAR's
server during this period (although I did restart LDM multiple times
yesterday to see if that would help).

When I try to ldmping dsspub.ucar.edu I get:
Jul 20 20:47:17 INFO:      State    Elapsed Port   Remote_Host  rpc_stat
Jul 20 20:47:17 INFO: Resolving dsspub.ucar.edu to 192.43.244.212 took 0.119353 seconds
Jul 20 20:47:27 ERROR:   H_CLNTED   9.999577  388   dsspub.ucar.edu  select: RPC: Timed out

ECMWF sees the same ldmping problems to Brazil (but has been able to
retrieve data):

ldm@tigge-ldm:~/etc> ldmping tigge-ldm.cptec.inpe.br
Jul 21 14:06:20 INFO:      State    Elapsed Port   Remote_Host  rpc_stat
Jul 21 14:06:20 INFO: Resolving tigge-ldm.cptec.inpe.br to 150.163.141.243 took 0.014087 seconds
Jul 21 14:06:30 ERROR:   H_CLNTED  10.002102  388   tigge-ldm.cptec.inpe.br  select: RPC: Timed out


CMA has trouble when attempting to ldmping NCAR, but can retrieve data.
ldmping dsspub.ucar.edu
Jul 21 01:28:59 INFO:      State    Elapsed Port   Remote_Host  rpc_stat
Jul 21 01:28:59 INFO: Resolving dsspub.ucar.edu to 192.43.244.212 took 0.003704 seconds
Jul 21 01:29:09 ERROR:   H_CLNTED   9.999708  388   dsspub.ucar.edu  select: RPC: Timed out

I performed an ldmping test from dssingest.ucar.edu to dsspub.ucar.edu
and get a similar error message, yet have no problem retrieving data
from dsspub:

bash:ldmping dsspub
Jul 21 14:37:08 INFO:      State    Elapsed Port   Remote_Host  rpc_stat
Jul 21 14:37:08 INFO: Resolving dsspub to 192.43.244.212 took 0.01023 seconds
Jul 21 14:37:18 ERROR: SVC_UNAVAIL  10.001001    0   dsspub  h_clnt_create(dsspub): Timed out while creating connection

I have no problem ldmpinging dssingest, ECMWF, CMA, or Brazil from
dsspub:

bash:ldmping dssingest.ucar.edu
Jul 21 14:40:27 INFO:      State    Elapsed Port   Remote_Host  rpc_stat
Jul 21 14:40:27 INFO: Resolving dssingest.ucar.edu to 192.43.244.213 took 0.001078 seconds
Jul 21 14:40:27 INFO: RESPONDING   0.004333  388   dssingest.ucar.edu


Doug


On Jul 21, 2009, at 8:11 AM, Manuel Fuentes wrote:

Hi Alex,

We seem to still have problems.

ldm@tigge-ldm:~/etc> ldmping mopora.cptec.inpe.br
Jul 21 14:05:36 INFO:      State    Elapsed Port   Remote_Host  rpc_stat
Jul 21 14:05:36 INFO: Resolving mopora.cptec.inpe.br to 150.163.141.243 took 0.149712 seconds
Jul 21 14:05:45 ERROR:  ADDRESSED   9.542402    0   mopora.cptec.inpe.br  RPC: Timed out


ldm@tigge-ldm:~/etc> ldmping tigge-ldm.cptec.inpe.br
Jul 21 14:06:20 INFO:      State    Elapsed Port   Remote_Host  rpc_stat
Jul 21 14:06:20 INFO: Resolving tigge-ldm.cptec.inpe.br to 150.163.141.243 took 0.014087 seconds
Jul 21 14:06:30 ERROR:   H_CLNTED  10.002102  388   tigge-ldm.cptec.inpe.br  select: RPC: Timed out

I'm puzzled because we haven't missed any data from CPTEC since 14 July
(we are missing only 1 field for one cycle). Perhaps we manage to get the
data via other archive centres. Do other archive centres (Doug, Xiaofeng
Bian) have the same problems with CPTEC's LDM?



Cheers,
Manuel


Alex Almeida Fernandes wrote:
Hi Manuel,
Would you please do an ldmping on mopora.cptec.inpe.br?
We're still trying to figure out the possible problem in our link.
Again, we apologize for the inconvenience.
Thanks,
Alex
2009/7/15 Manuel Fuentes <address@hidden>:

  Hi Waldenio.
  I think your LDM is still unreachable from our machine:
  ldmping tigge-ldm.cptec.inpe.br
  Jul 15 21:18:57 INFO:      State    Elapsed Port   Remote_Host  rpc_stat
  Jul 15 21:18:57 INFO: Resolving tigge-ldm.cptec.inpe.br to 150.163.141.243 took 0.007823 seconds
  Jul 15 21:19:09 ERROR:      NAMED  12.254987    0   tigge-ldm.cptec.inpe.br  can't contact portmapper: RPC: Timed out
  Could you please include Baudouin in your e-mails? I'll be away
  for some days...
  Cheers,
  Manuel
  Waldenio wrote:
      Hello Manuel,
      Are you receiving CPTEC's TIGGE data? I can't see CPTEC
      on the LDM statistics page on the ECMWF machines...
      Thanks,
      Waldenio.

Cheers,

Tom
--
+----------------------------------------------------------------------------+
* Tom Yoksas                                             UCAR Unidata Program *
* (303) 497-8642 (last resort)                                  P.O. Box 3000 *
* address@hidden                                            Boulder, CO 80307 *
* Unidata WWW Service                            http://www.unidata.ucar.edu/*
+----------------------------------------------------------------------------+