
Re: No WSI feed from wsihcsn.unidata.ucar.edu to iita at 99082610 (fwd)




===============================================================================
Robb Kambic                                Unidata Program Center
Software Engineer III                      Univ. Corp for Atmospheric Research
address@hidden             WWW: http://www.unidata.ucar.edu/
===============================================================================

---------- Forwarded message ----------
Date: Thu, 26 Aug 1999 17:49:40 -0600 (MDT)
From: Celia Chen <address@hidden>
To: Robb Kambic <address@hidden>
    Peter Neilley <address@hidden>
Subject: Re: No WSI feed from wsihcsn.unidata.ucar.edu to iita at 99082610

Robb,

I have to tell you that iita didn't take the pq_size 
of 850MB too well. Please remember that iita is a linux 
box and has 1GB memory. It ran out of space quickly, and 
such a big pq_size can cause files to disappear. I 
have reduced the pq_size back to 650MB now.

Celia

> 
> Hiya,
> 
> I logged into iita, made some configuration changes, and also changed it
> back to run version 5.0.8. The configuration changes are: made the queue
> size 850 megabytes and changed pqexpire to run every 20 minutes instead of
> every 5 minutes. When pqexpire was running, the machine was 0% idle.
> 
> When I stopped the ldm, there were many rogue ldm processes running on
> the machine that were not affiliated with the running ldm. When stopping
> the ldm, care needs to be taken that all ldm processes are gone before
> restarting. Before the ldm was started, the queue was deleted and remade.
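
One way to make that check routine is to scan a process listing for anything still owned by the LDM account before restarting. The Python sketch below is only an illustration: it assumes the account is named `ldm` and that the listing comes from `ps -eo user,pid,comm`, neither of which is stated in this thread, and the helper names `leftover_processes` and `list_leftovers` are hypothetical.

```python
import subprocess


def leftover_processes(ps_output, owner="ldm"):
    """Return (pid, command) pairs for processes owned by `owner`.

    `ps_output` is text laid out like `ps -eo user,pid,comm`:
    a header line, then one "USER PID COMMAND" row per process.
    The default owner name "ldm" is an assumption.
    """
    found = []
    for line in ps_output.splitlines()[1:]:  # skip the header row
        fields = line.split(None, 2)
        if len(fields) == 3 and fields[0] == owner:
            found.append((int(fields[1]), fields[2].strip()))
    return found


def list_leftovers(owner="ldm"):
    # Runs ps on the local machine; assumes a ps that accepts -e and -o.
    result = subprocess.run(["ps", "-eo", "user,pid,comm"],
                            capture_output=True, text=True, check=True)
    return leftover_processes(result.stdout, owner)
```

If `list_leftovers()` is run from a login session of that same account, the session's own shell and the Python interpreter will appear in the result, so an empty list is only expected when checking from a different account.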
> 
> When trying to contact iita with ldmping from wsihcsn:
> 
> ldmping -i 5 -h iita.rap.ucar.edu
> Aug 26 22:09:19      State    Elapsed Port   Remote_Host  rpc_stat
> Aug 26 22:09:19  ADDRESSED   0.100148    0   iita.rap.ucar.edu  RPC: Unable to receive; errno = Connection reset by peer
> Aug 26 22:09:24 SVC_UNAVAIL   0.050834    0   iita.rap.ucar.edu  RPC: Unable to receive; errno = Connection reset by peer
> Aug 26 22:09:29 SVC_UNAVAIL   0.032411    0   iita.rap.ucar.edu  RPC: Unable to receive; errno = Connection reset by peer
> 
> rpcinfo on iita also had the same problem:
> 
> iita:~/logs> rpcinfo -n 388 -u iita 300029 4
> rpcinfo: RPC: Unable to receive; errno = Connection refused
> program 300029 version 4 is not available
> iita:~/logs> ^-u^-t
> rpcinfo -n 388 -t iita 300029 4
> rpcinfo: RPC: Unable to receive; errno = Connection reset by peer
> program 300029 version 4 is not available
> 
> 
> 
> wsihcsn is feeding 4 machines:
> 
> 2 machines are having latency problems: iita and torrent, both inside the
> security perimeter.
> 
> 2 machines are current: shemp and ldm.comet, both outside the security
> perimeter.
> 
> It's 4:30 and the WSI feed is about 31 minutes behind on iita.
> 
> Aug 26 22:29:36 pqutil:     2268 19990826215757.317     WSI 646 NEX/EWX/LREF1/199908262145
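
The lag can be read straight off a pqutil line like the one above by subtracting the product timestamp from the time the line was logged. The Python sketch below assumes that the sixth whitespace-separated field is the product timestamp in `YYYYMMDDhhmmss.mmm` form and that both times are in the same time zone; the function name `feed_latency_seconds` is hypothetical, and the year has to be supplied by the caller because the log prefix does not carry one.

```python
from datetime import datetime


def feed_latency_seconds(log_line, year):
    """Seconds between the log timestamp and the product timestamp.

    Assumes a line of the form
    "Mon DD HH:MM:SS pqutil: <n> <YYYYMMDDhhmmss.mmm> ..."
    with both times expressed in the same time zone.
    """
    fields = log_line.split()
    # The first three fields are the syslog-style month, day, and time.
    logged = datetime.strptime(
        f"{year} {fields[0]} {fields[1]} {fields[2]}", "%Y %b %d %H:%M:%S"
    )
    # The sixth field is taken to be the product timestamp (an assumption).
    created = datetime.strptime(fields[5], "%Y%m%d%H%M%S.%f")
    return (logged - created).total_seconds()


line = ("Aug 26 22:29:36 pqutil:     2268 19990826215757.317     "
        "WSI 646 NEX/EWX/LREF1/199908262145")
latency = feed_latency_seconds(line, 1999)
print(f"{latency / 60:.1f} minutes behind")
```

For the line quoted above this works out to 1898.683 seconds, a little under 32 minutes.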
> 
> Robb...
> 