[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[LDM #NJD-255826]: ldmadmin problem on startup



Dan,

> Full Name: Dan Fredette
> Email Address: address@hidden
> Organization: WSI
> Package Version: 6.10.1
> Operating System: Centos 6
> Hardware:
> Description of problem: Hi,
> 
> I came across something today about ldmadmin on startup.  I search in
> the archives and couldn't find anything on it, so I thought I would post
> what I saw.  We run a multi-queue setup and one of my queues ran into
> a problem when it tried to startup.  Here is my log from when it tried:
> 
> 20:15:33  [INFO] ldm_start_and_monitor: Starting LDM, command = ldmadmin 
> start -v
> 20:15:45  The product-queue is OK.
> 20:15:45  Checking pqact(1) configuration-file(s)...
> 20:15:46  ulog: Couldn't connect(2) to UNIX socket "/dev/log": No such file 
> or directory
> 20:15:46  ulog: Couldn't connect(2) to UNIX socket "/dev/log": No such file 
> or directory
> 20:15:46  Checking LDM configuration-file (/home/ldm_sat_raw/etc/ldmd.conf)...
> 20:15:46  Starting the LDM server...
> 20:17:52  start(): Server not started.
> 20:18:08  start(): Server not started.
> 20:18:26  start(): Server not started.
> 20:18:44  start(): Server not started.
> 20:19:03  start(): Server not started.
> 20:19:23  start(): Server not started.
> 20:19:44  start(): Server not started.
> 20:20:06  start(): Server not started.
> 20:20:29  start(): Server not started.
> 20:20:53  start(): Server not started.
> 20:21:18  start(): Server not started.
> 20:21:44  start(): Server not started.
> 20:22:11  start(): Server not started.
> 20:22:39  start(): Server not started.
> 20:23:08  start(): Server not started.
> 20:23:38  start(): Server not started.
> 20:24:09  start(): Server not started.
> 20:24:41  start(): Server not started.
> 20:25:14  start(): Server not started.
> 20:25:48  start(): Server not started.
> 20:26:23  start(): Server not started.
> 20:26:59  start(): Server not started.
> 20:27:36  start(): Server not started.
> 20:28:14  start(): Server not started.
> 20:28:53  start(): Server not started.
> 20:29:33  start(): Server not started.
> 20:30:14  start(): Server not started.
> 
> This "Server not started" continues on until this morning, when I finally
> killed the processes.  Once I did that it restarted fine.  I looked in
> the ldmadmin script and saw that it should break out after 15 tries,
> but it did not appear that's what it did.
> 
> print "Starting the LDM server...\n";
> system("$cmd_line $ldmd_conf > $pid_file");
> if ($?) {
> unlink($pid_file);
> errmsg("start(): Could not start LDM server");
> $status = 1;
> }
> else {
> # Check to make sure the LDM is running
> my($loopcount) = 1;
> while(!isRunning($pid_file, $ip_addr)) {
> if($loopcount > 15) {
> errmsg("start(): ".
> "Server not started.");
> $status = 1;        # failure
> break;
> }
> sleep($loopcount);
> $loopcount++;
> }
> }
> 
> I don't see any other loops that it might be stuck in, in the ldmadmin
> script.  Have you seen anything like this before?

I've never seen this before. Given the code, I don't see how it could happen.

If you start it manually via the command "ldmadmin start", does it work?

I'm very concerned about the error-messages from ulog(3). It appears that 
you're not running the LDM on the system on which it was built. If so, then you 
might encounter other problems.

> Thanks
> Dan

Regards,
Steve Emmerson

Ticket Details
===================
Ticket ID: NJD-255826
Department: Support LDM
Priority: Normal
Status: Closed