[LDM #NET-131221]: LDM is crashing my machine



Gilbert,

It seems likely that the problem with the kernel is causing
the problem with the LDM (rather than the other way around).
I've asked our O/S expert to weigh in.  In the meantime, you
might consider whether anything you've done could have made
the O/S unstable.

--Steve

> Getting a kernel "oops" every other day or so. Look at the process:
> 
> Feb 10 14:52:31 weather kernel: BUG: unable to handle kernel paging
> request at virtual address ffafef40
> Feb 10 14:52:31 weather kernel:  printing eip:
> Feb 10 14:52:31 weather kernel: c046241b
> Feb 10 14:52:31 weather kernel: *pde = 00005067
> Feb 10 14:52:31 weather kernel: *pte = 00000000
> Feb 10 14:52:31 weather kernel: Oops: 0000 [#1]
> Feb 10 14:52:31 weather kernel: SMP
> Feb 10 14:52:31 weather kernel: Modules linked in: autofs4 hidp rfcomm
> l2cap bluetooth sunrpc dm_multipath video output sbs battery ac ipv6
> kvm_intel kvm snd_hda_intel snd_emu10$
> Feb 10 14:52:31 weather kernel: CPU:    1
> Feb 10 14:52:31 weather kernel: EIP:    0060:[<c046241b>]    Not tainted
> VLI
> Feb 10 14:52:31 weather kernel: EFLAGS: 00210282   (2.6.23.14-78.fc7 #1)
> Feb 10 14:52:31 weather kernel: EIP is at sync_page+0x27/0x41
> Feb 10 14:52:31 weather kernel: eax: 8000007d   ebx: f4016dd8   ecx:
> c2129020   edx: ffafef08
> Feb 10 14:52:31 weather kernel: esi: f4016dd8   edi: c3008bb4   ebp:
> c04623f4   esp: f4016dbc
> Feb 10 14:52:31 weather kernel: ds: 007b   es: 007b   fs: 00d8  gs: 0033
> ss: 0068
> Feb 10 14:52:31 weather kernel: Process rpc.ldmd (pid: 25871, ti=f4016000
> task=de838610 task.ti=f4016000)
> Feb 10 14:52:31 weather kernel: Stack: c061bdd2 f4016dd8 c2129020 f4016df4
> 00048005 c04623e6 00000002 c2129020
> Feb 10 14:52:31 weather kernel:        00000000 00000001 de838610 c043d402
> c3008bb8 c3008bb8 f7afef18 f7afef08
> Feb 10 14:52:31 weather kernel:        c04624b2 00000000 f7afee00 ffffffe5
> 00000000 c0479890 de83865c c04f32cf
> Feb 10 14:52:31 weather kernel: Call Trace:
> Feb 10 14:52:31 weather kernel:  [<c061bdd2>] __wait_on_bit_lock+0x2a/0x52
> Feb 10 14:52:31 weather kernel:  [<c04623e6>] __lock_page+0x58/0x5e
> Feb 10 14:52:31 weather kernel:  [<c043d402>] wake_bit_function+0x0/0x3c
> Feb 10 14:52:31 weather kernel:  [<c04624b2>] find_lock_page+0x5a/0x90
> Feb 10 14:52:31 weather kernel:  [<c0479890>] shmem_getpage+0x59/0x5ae
> Feb 10 14:52:31 weather kernel:  [<c04f32cf>] rb_erase+0x176/0x22f
> Feb 10 14:52:31 weather kernel:  [<c046625d>]
> get_page_from_freelist+0x25d/0x2db
> Feb 10 14:52:31 weather kernel:  [<c0479eb3>] shmem_fault+0x65/0x95
> Feb 10 14:52:31 weather kernel:  [<c046c3a4>] __do_fault+0x59/0x394
> Feb 10 14:52:31 weather kernel:  [<c046ea26>] handle_mm_fault+0x3a0/0x78b
> Feb 10 14:52:31 weather kernel:  [<c046a9fa>]
> vma_prio_tree_insert+0x17/0x2a
> Feb 10 14:52:31 weather kernel:  [<c0471891>] mmap_region+0x31c/0x3d8
> Feb 10 14:52:31 weather kernel:  [<c061e434>] do_page_fault+0x26a/0x5ef
> Feb 10 14:52:31 weather kernel:  [<c0458fd2>]
> audit_syscall_exit+0x2aa/0x2c6
> Feb 10 14:52:31 weather kernel:  [<c061e1ca>] do_page_fault+0x0/0x5ef
> Feb 10 14:52:31 weather kernel:  [<c061ceb2>] error_code+0x72/0x78
> Feb 10 14:52:31 weather kernel:  [<c0610000>] xfrm_netlink_rcv+0x2a/0x38
> Feb 10 14:52:31 weather kernel:  =======================
> Feb 10 14:52:31 weather kernel: Code: 00 31 c0 c3 89 c1 0f ae f0 89 f6 8b
> 50 10 8b 00 66 85 c0 79 07 ba 40 0d 70 c0 eb 0f 8b 01 84 c0 78 1b f6 c2 01
> 75 16 85 d2 74 12 <8b> 42 38$
> Feb 10 14:52:31 weather kernel: EIP: [<c046241b>] sync_page+0x27/0x41
> SS:ESP 0068:f4016dbc
> 
> 
> --------------------------------------------------------------------------
> This was in my ldmd.log:
> 
> Feb 10 14:52:31 weather rpc.ldmd[3250] NOTE: child 25871 terminated by
> signal 11
> Feb 10 14:52:31 weather rpc.ldmd[3250] NOTE: Killing (SIGTERM) process
> group
> Feb 10 14:52:31 weather rpc.ldmd[3250] NOTE: Exiting
> Feb 10 14:52:31 weather pqact[3304] NOTE: Exiting
> Feb 10 14:52:31 weather pqact[3257] NOTE: Exiting
> Feb 10 14:52:31 weather pqact[3254] NOTE: Exiting
> Feb 10 14:52:31 weather weather3.ca.uky.edu(feed)[30532] ERROR: Couldn't
> flush connection; nullproc_6() failure to weather3.ca.uky.edu: RPC: Unable
> to receive; errno = Bad file $
> Feb 10 14:52:31 weather pqact[3255] NOTE: Exiting
> Feb 10 14:52:31 weather weather2.admin.niu.edu[3264] NOTE: Exiting
> Feb 10 14:52:31 weather weather2.admin.niu.edu[3263] NOTE: Exiting
> Feb 10 14:52:31 weather pqact[3260] NOTE: Exiting
> Feb 10 14:52:31 weather pqact[3261] NOTE: Exiting
> Feb 10 14:52:31 weather pqsurf[3262] NOTE: Exiting
> Feb 10 14:52:31 weather weather3.admin.niu.edu[3272] NOTE: Exiting
> Feb 10 14:52:31 weather idd.unidata.ucar.edu[3271] NOTE: Exiting
> Feb 10 14:52:31 weather flood-1.atmos.uiuc.edu[3269] NOTE: Exiting
> Feb 10 14:52:31 weather idd.unidata.ucar.edu[3268] NOTE: Exiting
> Feb 10 14:52:31 weather bigbird.tamu.edu[3267] NOTE: Exiting
> Feb 10 14:52:31 weather idd.aos.wisc.edu[3266] NOTE: Exiting
> Feb 10 14:52:31 weather pqact[3259] NOTE: Exiting
> Feb 10 14:52:31 weather weather2.admin.niu.edu[3270] NOTE: Exiting
> Feb 10 14:52:31 weather pqact[3256] NOTE: Exiting
> Feb 10 14:52:31 weather weather3.admin.niu.edu[3265] NOTE: Exiting
> Feb 10 14:52:31 weather rtstats[3252] NOTE: Exiting
> Feb 10 14:52:31 weather rpc.ldmd[3250] NOTE: Terminating process group
> Feb 10 14:52:31 weather pqact[3304] NOTE: Behind by 5.61988 s
> Feb 10 14:52:31 weather pqact[3254] NOTE: Behind by 0.102237 s
> Feb 10 14:52:31 weather pqact[3257] NOTE: Behind by 0.103043 s
> Feb 10 14:52:31 weather weather3.ca.uky.edu(feed)[30532] NOTE: Exiting
> 
> 
> ------------------------------------------------------------------------------
> And what is process 25871?
> 
> Feb 10 10:06:03 weather weather3.ca.uky.edu(feed)[25871] NOTE: Starting
> Up(6.7.0.0/6): 20080210145648.656 TS_ENDT {{UNIWISC,  ".*"}},
> SIG=47b727221f2679fd6ef0ca5876ce080c, Primary
> 
> *******************************************************************************
> Gilbert Sebenste                                                     ********
> (My opinions only!)                                                  ******
> Staff Meteorologist, Northern Illinois University                      ****
> E-mail: address@hidden                                  ***
> web: http://weather.admin.niu.edu                                      **
> *******************************************************************************
> 
> 
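
For what it's worth, the lookup you did by hand (finding the child that
was killed by signal 11 and then locating its startup entry) can be
scripted. Below is a minimal sketch in Python, assuming log lines in the
format quoted above with wrapped lines rejoined. The function names are
hypothetical and are not part of the LDM.

```python
import re

# Matches the parent's report of a child killed by a signal, as seen in the
# ldmd.log excerpt quoted above. \s+ tolerates a line wrap before "signal".
CHILD_DIED = re.compile(r"child (\d+) terminated by\s+signal (\d+)")


def find_crashed_children(log_text):
    """Return (pid, signal) pairs for each child reported as killed by a signal."""
    return [(int(pid), int(sig)) for pid, sig in CHILD_DIED.findall(log_text)]


def lines_for_pid(log_text, pid):
    """Return the log lines whose process tag contains [pid]."""
    tag = f"[{pid}]"
    return [line for line in log_text.splitlines() if tag in line]


# Two entries taken from the excerpt above, with wrapped lines rejoined.
SAMPLE = "\n".join([
    "Feb 10 10:06:03 weather weather3.ca.uky.edu(feed)[25871] NOTE: Starting "
    'Up(6.7.0.0/6): 20080210145648.656 TS_ENDT {{UNIWISC,  ".*"}}, '
    "SIG=47b727221f2679fd6ef0ca5876ce080c, Primary",
    "Feb 10 14:52:31 weather rpc.ldmd[3250] NOTE: child 25871 terminated by "
    "signal 11",
])

if __name__ == "__main__":
    for pid, sig in find_crashed_children(SAMPLE):
        print(f"child {pid} was terminated by signal {sig}")
        for line in lines_for_pid(SAMPLE, pid):
            print("  " + line)
```

Run against the whole ldmd.log, this would list every child the LDM
reported as killed by a signal along with that child's own log entries,
which may make it easier to see whether the same feed is involved each
time.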

Regards,
Steve Emmerson

Ticket Details
===================
Ticket ID: NET-131221
Department: Support LDM
Priority: Normal
Status: Closed