[Nut-upsuser] Kernel crash when using usbhid-ups driver.
Rod Roark
rod at sunsetsystems.com
Mon Jul 1 04:00:51 UTC 2013
Paul Whittaker <paul.whittaker <at> drisq.com> writes:
>
>
> Hi Arnaud,
> Great to hear this is an isolated case; hopefully it means we've
> found a bug we can help solve.
> On 27/11/12 22:50, Arnaud Quette wrote:please send the log here, in
compressed form.
> are the photos taken with a still cam?
> please put these on your website (or any other public
> storage) and point the link
>
>
>
> For each of the four crashes I've got syslog excerpts (which
> include some messages from just before and just after the crash)
> and photos of the screen. I couldn't take the whole screen at
> once clearly, so I took several photos that overlap. The photos
> were pretty bad (low end smartphone - sorry) but they should be
> legible; I've treated them to improve clarity and image size.
> Crashes 1 and 2 used the same (fairly basic) setup. After crash 2
> I added the 'pollonly' driver flag, and after crash 3 I removed
> that and was running the driver with -D instead.
> Crash 1 - A.png
> B.png
> (syslog.1.gz attached)
> Crash 2 - A.png
> B.png C.png D.png
> (syslog.2.gz attached)
> Crash 3 - A.png
> B.png
> (syslog.3.gz attached)
> Crash 4 - A.png
> B.png C.png
> (syslog.4.gz attached)
> There should be most kernel details you need in the screenshots.
> This is running the latest nut-server package from Ubuntu Precise
> 64-bit (version 2.6.3-1ubuntu1.1 at present) on a Dell PowerEdge
> R210 II.
>
> a dump file would be
preferable:https://help.ubuntu.com/12.04/serverguide/kernel-crash-dump.html
>
>
>
> I'll see what I can do; this is our main development server, so
> I'll probably be limited to weekends.
> I'm not very familiar with what data crash dumps record, but do
> know
> that I can only post one if I'm sure it doesn't contain company
> file data, encryption keys, VM state, etc. (for obvious reasons).
> Since I know in advance I'll be taking a crash dump, are there any
> steps I can take steps to
> avoid such data? If not, I can still run tools on the dump
> locally (out of hours) and send backtraces or other output you
> might need.
> Let me know if any other details, config files, etc., would help.
> Cheers,
>
>
> Paul.
>
>
> PS I've replied only to the list; do you prefer that I reply to
> you and CC: the list instead?
>
>
> Attachment (syslog.1.gz): application/x-gzip, 919 bytes
> Attachment (syslog.2.gz): application/x-gzip, 1669 bytes
> Attachment (syslog.3.gz): application/x-gzip, 2707 bytes
> Attachment (syslog.4.gz): application/x-gzip, 769 bytes
>
> _______________________________________________
> Nut-upsuser mailing list
> Nut-upsuser <at> lists.alioth.debian.org
> http://lists.alioth.debian.org/cgi-bin/mailman/listinfo/nut-upsuser
I found this thread on a google search and just wanted to add that I seem to
have encountered a very similar problem: general protection faults once
every 1-3 days after installing nut for my CyberPower 1000PFCLCD.
Using Ubuntu 12.04 LTS 64-bit, stock kernel 3.5.0-34-generic. ASRock H77M
motherboard, Core i3-3225 CPU. A syslog dump is below in case it may be
useful. This one's a bit special in that there seems to be 3 successive
GPFs. I don't follow this list so please contact me separately if I can be
helpful.
Jun 30 10:29:24 ns1 kernel: [90099.127943] general protection fault: 0000
[#1] SMP
Jun 30 10:29:24 ns1 kernel: [90099.127984] CPU 3
Jun 30 10:29:24 ns1 kernel: [90099.127996] Modules linked in: veth
xt_multiport iptable_filter ip_tables x_tables bnep rfcomm bluetooth
snd_hda_codec_hdmi snd_hda_codec_realtek coretemp bridge kvm_intel kvm stp
llc ghash_clmulni_intel cryptd i915 drm_kms_helper drm snd_hda_intel
snd_hda_codec joydev mei serio_raw i2c_algo_bit snd_hwdep snd_pcm ppdev
video lpc_ich mac_hid snd_seq_midi snd_rawmidi parport_pc snd_seq_midi_event
snd_seq microcode snd_timer snd_seq_device snd soundcore snd_page_alloc lp
parport hid_logitech ff_memless hid_generic usbhid hid raid10 raid456
async_pq async_xor xor async_memcpy async_raid6_recov r8169 raid6_pq
async_tx ahci libahci raid1 raid0 multipath linear
Jun 30 10:29:24 ns1 kernel: [90099.128430]
Jun 30 10:29:24 ns1 kernel: [90099.128436] Pid: 2634, comm: usbhid-ups Not
tainted 3.5.0-34-generic #55~precise1-Ubuntu To Be Filled By O.E.M. To Be
Filled By O.E.M./H77M
Jun 30 10:29:24 ns1 kernel: [90099.128500] RIP: 0010:[<ffffffff811740db>]
[<ffffffff811740db>] __kmalloc+0x7b/0x1a0
Jun 30 10:29:24 ns1 kernel: [90099.128545] RSP: 0018:ffff8800c188dd68
EFLAGS: 00010282
Jun 30 10:29:24 ns1 kernel: [90099.128571] RAX: 0000000000000000 RBX:
ffff8800c188de60 RCX: 0000000000031bda
Jun 30 10:29:24 ns1 kernel: [90099.128606] RDX: 0000000000031bd9 RSI:
0000000000000000 RDI: 0000000000016860
Jun 30 10:29:24 ns1 kernel: [90099.128640] RBP: ffff8800c188ddb8 R08:
ffff88011f396860 R09: ffff88008eaf9780
Jun 30 10:29:24 ns1 kernel: [90099.128674] R10: 0000000000000000 R11:
0000000000000001 R12: ffff88011ac02a00
Jun 30 10:29:24 ns1 kernel: [90099.128708] R13: ffff012c11947808 R14:
00000000000000d0 R15: ffffffff814caa00
Jun 30 10:29:24 ns1 kernel: [90099.128742] FS: 00007fd56798c700(0000)
GS:ffff88011f380000(0000) knlGS:0000000000000000
Jun 30 10:29:24 ns1 kernel: [90099.128780] CS: 0010 DS: 0000 ES: 0000 CR0:
000000008005003b
Jun 30 10:29:24 ns1 kernel: [90099.128808] CR2: 000000000804f7c8 CR3:
00000000c1896000 CR4: 00000000001407e0
Jun 30 10:29:24 ns1 kernel: [90099.128842] DR0: 0000000000000000 DR1:
0000000000000000 DR2: 0000000000000000
Jun 30 10:29:24 ns1 kernel: [90099.128876] DR3: 0000000000000000 DR6:
00000000ffff0ff0 DR7: 0000000000000400
Jun 30 10:29:24 ns1 kernel: [90099.128910] Process usbhid-ups (pid: 2634,
threadinfo ffff8800c188c000, task ffff8800c6bec500)
Jun 30 10:29:24 ns1 kernel: [90099.128950] Stack:
Jun 30 10:29:24 ns1 kernel: [90099.128963] ffff8800c188df08
ffff8800c188de60 0000000000000000 0000000000000200
Jun 30 10:29:24 ns1 kernel: [90099.129007] ffff880115599280
ffff8800c188de60 0000000000000000 ffff880036b176c0
Jun 30 10:29:24 ns1 kernel: [90099.129050] ffff880115599280
ffff88008eaf9780 ffff8800c188de38 ffffffff814caa00
Jun 30 10:29:24 ns1 kernel: [90099.129094] Call Trace:
Jun 30 10:29:24 ns1 kernel: [90099.129112] [<ffffffff814caa00>]
proc_do_submiturb+0x4c0/0x9f0
Jun 30 10:29:24 ns1 kernel: [90099.129143] [<ffffffff814cbb4c>]
usbdev_do_ioctl+0x3cc/0xc50
Jun 30 10:29:24 ns1 kernel: [90099.129172] [<ffffffff814cc3fe>]
usbdev_ioctl+0xe/0x20
Jun 30 10:29:24 ns1 kernel: [90099.129202] [<ffffffff8119a10a>]
do_vfs_ioctl+0x8a/0x340
Jun 30 10:29:24 ns1 kernel: [90099.129231] [<ffffffff8119a451>]
sys_ioctl+0x91/0xa0
Jun 30 10:29:24 ns1 kernel: [90099.129259] [<ffffffff816a5669>]
system_call_fastpath+0x16/0x1b
Jun 30 10:29:24 ns1 kernel: [90099.129288] Code: 00 4d 8b 04 24 65 4c 03 04
25 48 dc 00 00 49 8b 50 08 4d 8b 28 4d 85 ed 0f 84 ec 00 00 00 49 63 44 24
20 49 8b 3c 24 48 8d 4a 01 <49> 8b 5c 05 00 4c 89 e8 65 48 0f c7 0f 0f 94 c0
84 c0 74 c2 49
Jun 30 10:29:24 ns1 kernel: [90099.129540] RIP [<ffffffff811740db>]
__kmalloc+0x7b/0x1a0
Jun 30 10:29:24 ns1 kernel: [90099.131778] RSP <ffff8800c188dd68>
Jun 30 10:29:24 ns1 kernel: [90099.206651] ---[ end trace 6002a33ab3b18572 ]---
Jun 30 10:29:39 ns1 upsd[2636]: Data for UPS [cyber] is stale - check driver
Jun 30 10:29:42 ns1 upsmon[2640]: Poll UPS [cyber at localhost] failed - Data stale
Jun 30 10:29:42 ns1 upsmon[2640]: Communications with UPS cyber at localhost lost
Jun 30 10:29:42 ns1 kernel: [90117.515437] general protection fault: 0000
[#2] SMP
Jun 30 10:29:42 ns1 kernel: [90117.517705] CPU 3
Jun 30 10:29:42 ns1 kernel: [90117.517717] Modules linked in: veth
xt_multiport iptable_filter ip_tables x_tables bnep rfcomm bluetooth
snd_hda_codec_hdmi snd_hda_codec_realtek coretemp bridge kvm_intel kvm stp
llc ghash_clmulni_intel cryptd i915 drm_kms_helper drm snd_hda_intel
snd_hda_codec joydev mei serio_raw i2c_algo_bit snd_hwdep snd_pcm ppdev
video lpc_ich mac_hid snd_seq_midi snd_rawmidi parport_pc snd_seq_midi_event
snd_seq microcode snd_timer snd_seq_device snd soundcore snd_page_alloc lp
parport hid_logitech ff_memless hid_generic usbhid hid raid10 raid456
async_pq async_xor xor async_memcpy async_raid6_recov r8169 raid6_pq
async_tx ahci libahci raid1 raid0 multipath linear
Jun 30 10:29:42 ns1 kernel: [90117.527368]
Jun 30 10:29:42 ns1 kernel: [90117.529789] Pid: 12632, comm: upsmon Tainted:
G D 3.5.0-34-generic #55~precise1-Ubuntu To Be Filled By O.E.M. To
Be Filled By O.E.M./H77M
Jun 30 10:29:42 ns1 kernel: [90117.532309] RIP: 0010:[<ffffffff811740db>]
[<ffffffff811740db>] __kmalloc+0x7b/0x1a0
Jun 30 10:29:42 ns1 kernel: [90117.534906] RSP: 0018:ffff880117eb3d08
EFLAGS: 00010282
Jun 30 10:29:42 ns1 kernel: [90117.537427] RAX: 0000000000000000 RBX:
ffff8800a63eae00 RCX: 0000000000031bda
Jun 30 10:29:42 ns1 kernel: [90117.539956] RDX: 0000000000031bd9 RSI:
0000000000000000 RDI: 0000000000016860
Jun 30 10:29:42 ns1 kernel: [90117.542476] RBP: ffff880117eb3d58 R08:
ffff88011f396860 R09: 0000000000000000
Jun 30 10:29:42 ns1 kernel: [90117.544993] R10: 0000000000000002 R11:
fefefefefefefeff R12: ffff88011ac02a00
Jun 30 10:29:42 ns1 kernel: [90117.547510] R13: ffff012c11947808 R14:
00000000000000d0 R15: ffffffff811da892
Jun 30 10:29:42 ns1 kernel: [90117.550026] FS: 00007f224c00b700(0000)
GS:ffff88011f380000(0000) knlGS:0000000000000000
Jun 30 10:29:42 ns1 kernel: [90117.552550] CS: 0010 DS: 0000 ES: 0000 CR0:
000000008005003b
Jun 30 10:29:42 ns1 kernel: [90117.555093] CR2: 0000000000408068 CR3:
000000011791a000 CR4: 00000000001407e0
Jun 30 10:29:42 ns1 kernel: [90117.557620] DR0: 0000000000000000 DR1:
0000000000000000 DR2: 0000000000000000
Jun 30 10:29:42 ns1 kernel: [90117.560144] DR3: 0000000000000000 DR6:
00000000ffff0ff0 DR7: 0000000000000400
Jun 30 10:29:42 ns1 kernel: [90117.562661] Process upsmon (pid: 12632,
threadinfo ffff880117eb2000, task ffff8800b6cd0000)
Jun 30 10:29:42 ns1 kernel: [90117.565169] Stack:
Jun 30 10:29:42 ns1 kernel: [90117.567667] ffff880117eb3fd8
ffffffff811da779 0000000000000080 00000000000001f8
Jun 30 10:29:42 ns1 kernel: [90117.570202] ffff880117eb3fd8
ffff8800a63eae00 ffff880082132080 00000000fffffff4
Jun 30 10:29:42 ns1 kernel: [90117.572749] ffff880117eb3f58
00000000000001f8 ffff880117eb3e38 ffffffff811da892
Jun 30 10:29:42 ns1 kernel: [90117.575278] Call Trace:
Jun 30 10:29:42 ns1 kernel: [90117.577795] [<ffffffff811da779>] ?
load_elf_binary+0x49/0xe20
Jun 30 10:29:42 ns1 kernel: [90117.580354] [<ffffffff811da892>]
load_elf_binary+0x162/0xe20
Jun 30 10:29:42 ns1 kernel: [90117.582876] [<ffffffff81151cb2>] ?
get_user_pages+0x52/0x60
Jun 30 10:29:42 ns1 kernel: [90117.585381] [<ffffffff8118e92a>] ?
get_arg_page+0xaa/0xe0
Jun 30 10:29:42 ns1 kernel: [90117.587866] [<ffffffff813540d6>] ?
strnlen_user+0x36/0xf0
Jun 30 10:29:42 ns1 kernel: [90117.590356] [<ffffffff8118eb61>] ?
get_user_arg_ptr.isra.23+0x31/0x70
Jun 30 10:29:42 ns1 kernel: [90117.592846] [<ffffffff8118dd10>]
search_binary_handler+0x110/0x350
Jun 30 10:29:42 ns1 kernel: [90117.595357] [<ffffffff811da730>] ?
load_elf_library+0x230/0x230
Jun 30 10:29:42 ns1 kernel: [90117.597844] [<ffffffff8119025d>]
do_execve_common.isra.30+0x2bd/0x350
Jun 30 10:29:42 ns1 kernel: [90117.600328] [<ffffffff8119030b>]
do_execve+0x1b/0x20
Jun 30 10:29:42 ns1 kernel: [90117.602732] [<ffffffff8101d707>]
sys_execve+0x47/0x70
Jun 30 10:29:42 ns1 kernel: [90117.605068] [<ffffffff816a5aac>]
stub_execve+0x6c/0xc0
Jun 30 10:29:42 ns1 kernel: [90117.607325] Code: 00 4d 8b 04 24 65 4c 03 04
25 48 dc 00 00 49 8b 50 08 4d 8b 28 4d 85 ed 0f 84 ec 00 00 00 49 63 44 24
20 49 8b 3c 24 48 8d 4a 01 <49> 8b 5c 05 00 4c 89 e8 65 48 0f c7 0f 0f 94 c0
84 c0 74 c2 49
Jun 30 10:29:42 ns1 kernel: [90117.609856] RIP [<ffffffff811740db>]
__kmalloc+0x7b/0x1a0
Jun 30 10:29:42 ns1 kernel: [90117.612226] RSP <ffff880117eb3d08>
Jun 30 10:29:42 ns1 kernel: [90117.614581] ---[ end trace 6002a33ab3b18573 ]---
Jun 30 10:29:42 ns1 kernel: [90117.614745] BUG: Bad rss-counter state
mm:ffff88006d861880 idx:1 val:2
Jun 30 10:29:47 ns1 upsmon[2640]: Poll UPS [cyber at localhost] failed - Data stale
Jun 30 10:29:52 ns1 upsmon[2640]: Poll UPS [cyber at localhost] failed - Data stale
Jun 30 10:29:57 ns1 kernel: [90131.585402] general protection fault: 0000
[#3] SMP
Jun 30 10:29:57 ns1 kernel: [90131.587750] CPU 3
Jun 30 10:29:57 ns1 kernel: [90131.587762] Modules linked in: veth
xt_multiport iptable_filter ip_tables x_tables bnep rfcomm bluetooth
snd_hda_codec_hdmi snd_hda_codec_realtek coretemp bridge kvm_intel kvm stp
llc ghash_clmulni_intel cryptd i915 drm_kms_helper drm snd_hda_intel
snd_hda_codec joydev mei serio_raw i2c_algo_bit snd_hwdep snd_pcm ppdev
video lpc_ich mac_hid snd_seq_midi snd_rawmidi parport_pc snd_seq_midi_event
snd_seq microcode snd_timer snd_seq_device snd soundcore snd_page_alloc lp
parport hid_logitech ff_memless hid_generic usbhid hid raid10 raid456
async_pq async_xor xor async_memcpy async_raid6_recov r8169 raid6_pq
async_tx ahci libahci raid1 raid0 multipath linear
Jun 30 10:29:57 ns1 kernel: [90131.597760]
Jun 30 10:29:57 ns1 kernel: [90131.600309] Pid: 2037, comm: postgres
Tainted: G D 3.5.0-34-generic #55~precise1-Ubuntu To Be Filled By
O.E.M. To Be Filled By O.E.M./H77M
Jun 30 10:29:57 ns1 kernel: [90131.602946] RIP: 0010:[<ffffffff81176fcd>]
[<ffffffff81176fcd>] __kmalloc_node_track_caller+0x13d/0x1f0
Jun 30 10:29:57 ns1 kernel: [90131.605596] RJun 30 11:03:33 ns1 kernel:
imklog 5.8.6, log source = /proc/kmsg started.
More information about the Nut-upsuser
mailing list