[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

20011203: LDM dying



Robert,

I'll cc Jeff Weber since he is now handling LDM questions.

From your log snippet below, I would guess that process 853 is
pqact, but it would help to check the start of the ldmd.log 
to verify pqact's process id.

I haven't seen a sig 30 from pqact before. 

The code for signal 30 is:
SIGXCPU      30      Core      CPU time limit exceeded [see getrlimit(2)]

You should have a core file dumped from pqact according to the above.
You can verify what program caused the core dump with "file core".

The CPU time limit exceeded condition is strange. Do you have limits imposed
on your system for cpu? Run the command "limit" and check for "cputime" in
the output. If it is not "unlimited" then that may be the cause of
the program exiting after using a certain amount of cpu time.

Steve Chiswell
Unidata User SUpport



>From: Robert Mullenax <address@hidden>
>Organization: UCAR/Unidata
>Keywords: 200112031433.fB3EXbN04455

>We are having a strange problem with the LDM on our
>SPARC (Ultra 10 clone Solaris 8) machine.  The LDM
>is only running for a few hours before it is
>being killed.  I have never seen this before.
>I have attached the output from ldmd.log
>below.  Nobody is manually killing it.
>Thanks,
>
>Robert Mullenax--NSBF Meteorology
>
>
>
>
>
>
>Dec 03 08:11:14 proftomd[24533]: Exiting
>Dec 03 08:13:24 psnldm rpc.ldmd[851]: child 853 terminated by signal 30
>Dec 03 08:13:24 psnldm rpc.ldmd[851]: Killing (SIGINT) process group
>Dec 03 08:13:24 psnldm rpc.ldmd[851]: Interrupt
>Dec 03 08:13:24 psnldm rpc.ldmd[851]: Exiting
>Dec 03 08:13:24 psnldm wxmcidas(feed)[14731]: Interrupt
>Dec 03 08:13:24 psnldm wxmcidas(feed)[14731]: Exiting
>Dec 03 08:13:24 psnldm ldm[857]: Interrupt
>Dec 03 08:13:24 psnldm 140.172.240.73[856]: Interrupt
>Dec 03 08:13:24 psnldm 140.172.240.73[856]: Exiting
>Dec 03 08:13:24 psnldm ldm[857]: Exiting
>Dec 03 08:13:24 psnldm pqbinstats[854]: Interrupt
>Dec 03 08:13:25 psnldm pqbinstats[854]: Exiting
>Dec 03 08:13:25 psnldm cirrus[855]: Interrupt
>Dec 03 08:13:25 psnldm cirrus[855]: Exiting
>Dec 03 08:13:25 psnldm rpc.ldmd[851]: Terminating process group
>Dec 03 08:13:25 psnldm rpc.ldmd[851]: child 852 terminated by signal 15
>Dec 03 09:02:07 psnldm dcgrib[29269]:  Using gds block for projection
>id:  240
>
>
>