errors on squall?



Hi David,

In motherlode's logs, mostly before the Saturday outage, squall was
returning a lot of messages like this one:

ldmd.log:Nov 05 16:25:32 motherlode.ucar.edu squall(feed)[19992]:
85x11_eta4panel_12Z_36h.ps.gz: RPC: Remote system error

which would then cause an exit.  Also, Daryl reported seeing lots of
these messages on ldmarchive during the motherlode outage.  (I appended
his message at the bottom.)

This error means "Remote system had a major failure trying to execute
the selected program."  Is your queue ok?  Is there enough space for
it?  (This used to be a problem until around version 5.1.2.)  Are
problems showing up in your log?  I would think (maybe 'hope' is a better
word) that for each error message in my log there would be a
corresponding problem reported in yours...
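
If it helps with matching up the two logs, here is a rough sketch (not
anything that ships with the LDM) that tallies "RPC: Remote system
error" entries per hour and per process.  It assumes each log entry sits
on a single line in the syslog-style layout shown in the excerpts in
this thread; the function name and the regular expression are only
illustrative.

```python
import re
from collections import Counter

# Assumed layout, taken from the excerpts in this thread:
#   "Nov 04 14:44:14 pircsl4 squall(feed)[19358]: <message>"
# A leading "ldmd.log:" prefix (grep output) is tolerated because we search
# for the pattern rather than anchoring it at the start of the line.
LINE_RE = re.compile(
    r"(?P<month>[A-Z][a-z]{2}) (?P<day>\d{2}) (?P<hour>\d{2}):\d{2}:\d{2} "
    r"\S+ (?P<proc>[^\[\s]+)\[(?P<pid>\d+)\]: (?P<msg>.*)"
)

ERROR_TEXT = "RPC: Remote system error"


def count_rpc_errors(lines):
    """Count matching error entries keyed by (month, day, hour, process)."""
    counts = Counter()
    for line in lines:
        m = LINE_RE.search(line)
        if m and ERROR_TEXT in m.group("msg"):
            key = (m.group("month"), m.group("day"),
                   m.group("hour"), m.group("proc"))
            counts[key] += 1
    return counts


if __name__ == "__main__":
    import sys

    # Usage: python count_rpc_errors.py ldmd.log
    with open(sys.argv[1]) as fh:
        for key, n in sorted(count_rpc_errors(fh).items()):
            print(" ".join(key), n)
```

Running it over both logs and comparing the hourly counts would show
whether each error on my side lines up with something on yours.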

Anne
-- 
***************************************************
Anne Wilson                     UCAR Unidata Program            
address@hidden                 P.O. Box 3000
                                  Boulder, CO  80307
----------------------------------------------------
Unidata WWW server       http://www.unidata.ucar.edu/
****************************************************

Daryl Herzmann wrote:
> 
> Hiya,
>         I was looking over the LDM logs this weekend and noticed many
> connects and disconnects from squall to ldmarchive.  It happened three
> times (GMT):
> 
> 1.  Nov 03 08:31:29 till Nov 03 09:23:12
> 2.  Nov 03 15:54:05 till Nov 03 17:00:13
> 3.  Nov 04 14:44:44 till Nov 04 17:08:14
> 
>         Here is an example
> Nov 04 14:44:14 pircsl4 squall(feed)[19358]: YVUF86 KWBE 041200 /mETA_84: 
> RPC: Remote system error
> Nov 04 14:44:14 pircsl4 squall(feed)[19358]: pq_sequence failed: Input/output 
> error (errno = 5)
> Nov 04 14:44:14 pircsl4 squall(feed)[19358]: Exiting
> Nov 04 14:44:20 pircsl4 rpc.ldmd[19342]: child 19358 exited with status 1
> Nov 04 14:44:44 pircsl4 squall[21359]: Connection from squall.atmos.uiuc.edu
> Nov 04 14:44:44 pircsl4 squall(feed)[21359]: Starting Up: 20011104143831.585 
> TS_ENDT {{DIFAX|FSL2|WMO,  ".*"}}
> Nov 04 14:44:44 pircsl4 squall(feed)[21359]: topo:  squall.atmos.uiuc.edu 
> DIFAX|FSL2|WMO
> Nov 04 14:44:44 pircsl4 squall(feed)[21359]: YVUF86 KWBE 041200 /mETA_84: 
> RPC: Remote system error
> Nov 04 14:44:44 pircsl4 squall(feed)[21359]: pq_sequence failed: Input/output 
> error (errno = 5)
> Nov 04 14:44:44 pircsl4 squall(feed)[21359]: Exiting
> 
>         Do you know what may have been causing these?  I am curious since I
> really wanted to verify that ldmarchive was a stable node for the IDD.
> squall was the only machine to exhibit this behavior.  Ideas?
>