
Re: 20030202: LDM failure on motherlode



Tom,

Here's what I see from the console log:

> Feb  2 00:05:58 motherlode.ucar.edu rdriver: ID[RAIDarray.rdaemon.1001]
> volume management starting.
> Starting mysqld daemon with databases from /usr/local/mysql/data
> Starting Big Brother...
> Big Brother 1.6e1 started
> Starting LDM (/usr/local/ldm/bin/rpc.ldmd) via boot script.
> /usr/local/apache/bin/apachectl start: httpd started
> ...
> Feb  2 02:30:02 motherlode.ucar.edu last message repeated 1 time
> Feb  2 02:32:36 motherlode.ucar.edu adm: ********** SYSTEM ACCOUNTING
> Queue appears corrupt, deleting and rebuilding.

and the web server, which started after the LDM, appeared to be working fine,
even though Caron said otherwise?  First off, the queue sanity check via
pqcat ran for at least 2.5 hours?  Something was very hosed.
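
One way to keep a boot-time check like this from hanging for hours would be
to put a time limit on it.  What follows is only a rough sketch, not what the
boot script or ldmadmin actually does: run_check and its return strings are
made-up names, and the command to run (for example, whatever pqcat invocation
the script uses) is passed in by the caller rather than assumed here.

```python
import subprocess


def run_check(cmd, timeout_s):
    """Run a check command with a time limit (hypothetical helper).

    cmd is an argument list supplied by the caller; nothing about the
    real queue-check invocation is assumed here.
    Returns "ok", "failed", or "timeout".
    """
    try:
        result = subprocess.run(
            cmd,
            stdout=subprocess.DEVNULL,  # only the exit status matters
            stderr=subprocess.DEVNULL,
            timeout=timeout_s,          # child is killed when this expires
        )
    except subprocess.TimeoutExpired:
        return "timeout"
    return "ok" if result.returncode == 0 else "failed"
```

A caller could treat "timeout" the same as "failed" and go straight to
rebuilding the queue, instead of waiting on a check that may never finish.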

mike

On Feb 2,  9:22am, Tom Yoksas wrote:
> Subject: 20030202: LDM failure on motherlode
>
> Hi guys,
>
> When I logged on this morning at 9 am, I saw the ldmping failure to
> motherlode attached at the end of this message.  I logged on, and sure
> enough the LDM was not running.  It appears that motherlode was down
> since uptime showed it was only up for 9 minutes:
>
> % uptime
>   9:14am  up  9:12,  1 user,  load average: 9.58, 5.75, 3.64
>
> Before noticing that the machine had been rebooted, I tried starting
> it, but nothing happened.  I assumed that this was caused by the
> development environment switch, so I rebuilt and reinstalled
> ldm-6.0.0.9.  When I tried to start the LDM still nothing happened.  I
> then did a 'ps' looking for ldmadmin and saw that it was running
> 'ldmadmin mkqueue'.  After a couple of minutes, the queue was rebuilt
> and the LDM started.
>
> Tom