[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[LDM #NJD-255826]: ldmadmin problem on startup



Dan,

> Full Name: Dan Fredette
> Email Address: address@hidden
> Organization: WSI
> Package Version: 6.10.1
> Operating System: Centos 6
> Hardware:
> Description of problem: Hi,
> 
> I came across something today about ldmadmin on startup.  I search in
> the archives and couldn't find anything on it, so I thought I would post
> what I saw.  We run a multi-queue setup and one of my queues ran into
> a problem when it tried to startup.  Here is my log from when it tried:
> 
> 20:15:33  [INFO] ldm_start_and_monitor: Starting LDM, command = ldmadmin 
> start -v
> 20:15:45  The product-queue is OK.
> 20:15:45  Checking pqact(1) configuration-file(s)...
> 20:15:46  ulog: Couldn't connect(2) to UNIX socket "/dev/log": No such file 
> or directory
> 20:15:46  ulog: Couldn't connect(2) to UNIX socket "/dev/log": No such file 
> or directory
> 20:15:46  Checking LDM configuration-file (/home/ldm_sat_raw/etc/ldmd.conf)...
> 20:15:46  Starting the LDM server...
> 20:17:52  start(): Server not started.
> 20:18:08  start(): Server not started.
> 20:18:26  start(): Server not started.
> 20:18:44  start(): Server not started.
> 20:19:03  start(): Server not started.
> 20:19:23  start(): Server not started.
> 20:19:44  start(): Server not started.
> 20:20:06  start(): Server not started.
> 20:20:29  start(): Server not started.
> 20:20:53  start(): Server not started.
> 20:21:18  start(): Server not started.
> 20:21:44  start(): Server not started.
> 20:22:11  start(): Server not started.
> 20:22:39  start(): Server not started.
> 20:23:08  start(): Server not started.
> 20:23:38  start(): Server not started.
> 20:24:09  start(): Server not started.
> 20:24:41  start(): Server not started.
> 20:25:14  start(): Server not started.
> 20:25:48  start(): Server not started.
> 20:26:23  start(): Server not started.
> 20:26:59  start(): Server not started.
> 20:27:36  start(): Server not started.
> 20:28:14  start(): Server not started.
> 20:28:53  start(): Server not started.
> 20:29:33  start(): Server not started.
> 20:30:14  start(): Server not started.
> 
> This "Server not started" continues on until this morning, when I finally
> killed the processes.  Once I did that it restarted fine.  I looked in
> the ldmadmin script and saw that it should break out after 15 tries,
> but it did not appear that's what it did.
> 
> print "Starting the LDM server...\n";
> system("$cmd_line $ldmd_conf > $pid_file");
> if ($?) {
> unlink($pid_file);
> errmsg("start(): Could not start LDM server");
> $status = 1;
> }
> else {
> # Check to make sure the LDM is running
> my($loopcount) = 1;
> while(!isRunning($pid_file, $ip_addr)) {
> if($loopcount > 15) {
> errmsg("start(): ".
> "Server not started.");
> $status = 1;        # failure
> break;
> }
> sleep($loopcount);
> $loopcount++;
> }
> }
> 
> I don't see any other loops that it might be stuck in, in the ldmadmin
> script.  Have you seen anything like this before?

I've never seen this before. Given the code, I don't see how it could happen.

If you start it manually via the command "ldmadmin start", does it work?

I'm very concerned about the error-messages from ulog(3). It appears that 
you're not running the LDM on the system on which it was built. If so, then you 
might encounter other problems.

> Thanks
> Dan

Regards,
Steve Emmerson

Ticket Details
===================
Ticket ID: NJD-255826
Department: Support LDM
Priority: Normal
Status: Closed


NOTE: All email exchanges with Unidata User Support are recorded in the Unidata inquiry tracking system and then made publicly available through the web. If you do not want to have your interactions made available in this way, you must let us know in each email you send to us.