Bug#1004501: kernel: BUG: unable to handle page fault (RIP drm_stub_open)

Nikolay Kyx knv418 at gmail.com
Sat Jan 29 16:36:13 GMT 2022


Package: nvidia-legacy-340xx-kernel-dkms
Version: 340.108-12
Severity: important

Steps to reproduce on an Xfce desktop:
1. Boot, then load and unload the nvidia kernel module (in my case the
stack is optirun + bbswitch-dkms + the nvidia driver);
2. Run xflock4.
For example, this one-liner is enough to reproduce the bug quite frequently:
$ optirun mpv --version && sleep 1 ; xflock4

Visible effect:
The screen blanks and then freezes; the session becomes unrecoverable. It is
still possible to switch to another tty and log in.

The following kernel errors appear in the journal, once or several times
in a row (usually two Oopses):
[   89.688357] kernel: BUG: unable to handle page fault for address: ffffffffc1c1ded0
[   89.688367] kernel: #PF: supervisor read access in kernel mode
[   89.688370] kernel: #PF: error_code(0x0000) - not-present page
[   89.688373] kernel: PGD 6d014067 P4D 6d014067 PUD 6d016067 PMD 13dc58067 PTE 0
[   89.688380] kernel: Oops: 0000 [#1] SMP PTI
[   89.688385] kernel: CPU: 2 PID: 3934 Comm: Xorg Tainted: P OE     5.15.0-3-amd64 #1  Debian 5.15.15-1
[   89.688392] kernel: RIP: 0010:drm_stub_open+0x56/0x130 [drm]
[   89.688446] kernel: Code: e7 ff ff 0f 00 e8 5a fe ff ff 48 89 c5 41 89 c6 48 3d 00 f0 ff ff 0f 87 86 00 00 00 48 8b 58 10 41 be ed ff ff ff 48 8b 43 30 <48> 8b 80 d0 00 00 00 48 85 c0 74 50 48 8b 38 e8 76 e5 e8 d7 48 8b
[   89.688450] kernel: RSP: 0018:ffffb3c080ee3c48 EFLAGS: 00010283
[   89.688454] kernel: RAX: ffffffffc1c1de00 RBX: ffff9adca0d0c800 RCX: 0000000000000000
[   89.688457] kernel: RDX: 000000000003bd40 RSI: 0000000000000000 RDI: ffffffffc0114f00
[   89.688459] kernel: RBP: ffff9adcada0e0a8 R08: 0000000000000000 R09: ffffffffc01213b0
[   89.688461] kernel: R10: 0000000000000000 R11: 0000000000000000 R12: ffff9adca647b8e0
[   89.688464] kernel: R13: ffff9adcada12900 R14: 00000000ffffffed R15: 0000000000000000
[   89.688467] kernel: FS:  00007fda74004a40(0000) GS:ffff9adcc3e80000(0000) knlGS:0000000000000000
[   89.688470] kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   89.688473] kernel: CR2: ffffffffc1c1ded0 CR3: 000000013cef0001 CR4: 00000000000206e0
[   89.688476] kernel: Call Trace:
[   89.688480] kernel:  <TASK>
[   89.688484] kernel:  chrdev_open+0xf3/0x240
[   89.688492] kernel:  ? cdev_default_release+0x20/0x20
[   89.688497] kernel:  do_dentry_open+0x14e/0x370
[   89.688503] kernel:  path_openat+0xaeb/0x1070
[   89.688508] kernel:  ? generic_write_end+0xeb/0x160
[   89.688514] kernel:  ? balance_dirty_pages_ratelimited+0x199/0x3d0
[   89.688520] kernel:  do_filp_open+0xb2/0x150
[   89.688526] kernel:  ? __check_object_size+0x136/0x150
[   89.688530] kernel:  do_sys_openat2+0x96/0x160
[   89.688535] kernel:  __x64_sys_openat+0x53/0x90
[   89.688539] kernel:  do_syscall_64+0x3b/0xc0
[   89.688545] kernel:  entry_SYSCALL_64_after_hwframe+0x44/0xae
[   89.688552] kernel: RIP: 0033:0x7fda74569827
[   89.688556] kernel: Code: 25 00 00 41 00 3d 00 00 41 00 74 47 64 8b 04 25 18 00 00 00 85 c0 75 6b 44 89 e2 48 89 ee bf 9c ff ff ff b8 01 01 00 00 0f 05 <48> 3d 00 f0 ff ff 0f 87 95 00 00 00 48 8b 4c 24 28 64 48 2b 0c 25
[   89.688559] kernel: RSP: 002b:00007ffc44848140 EFLAGS: 00000246 ORIG_RAX: 0000000000000101
[   89.688563] kernel: RAX: ffffffffffffffda RBX: 00000000ffffffff RCX: 00007fda74569827
[   89.688566] kernel: RDX: 0000000000080002 RSI: 0000556740d9abf0 RDI: 00000000ffffff9c
[   89.688568] kernel: RBP: 0000556740d9abf0 R08: 0000000000000031 R09: 0000000000000000
[   89.688571] kernel: R10: 0000000000000000 R11: 0000000000000246 R12: 0000000000080002
[   89.688573] kernel: R13: 0000556740d9abf0 R14: 0000556740d9abf0 R15: 0000556740d9a1c0
[   89.688577] kernel:  </TASK>
[   89.688579] kernel: Modules linked in: xt_CHECKSUM xt_MASQUERADE
xt_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp nft_compat
nft_chain_nat nf_nat nf_conntrack bbswitch(OE) nf_defrag_ipv6
nf_defrag_ipv4 nft_counter nf_tables libcrc32c nfnetlink bridge stp
llc intel_powerclamp r8169 ath9k coretemp ath9k_common ath3k
sparse_keymap at24 ath9k_hw kvm_intel ath kvm mac80211 bluetooth
jitterentropy_rng sha512_ssse3 sha512_generic libarc4 ctr drbg mxm_wmi
ansi_cprng irqbypass ecdh_generic cfg80211 iTCO_wdt intel_pmc_bxt
iTCO_vendor_support watchdog rfkill ecc intel_cstate realtek
snd_hda_codec_realtek mdio_devres snd_hda_codec_generic
snd_hda_codec_hdmi ledtrig_audio libphy intel_uncore snd_hda_intel
snd_intel_dspcfg snd_intel_sdw_acpi bfq snd_hda_codec snd_hda_core
sr_mod cdrom i2c_i801 snd_hwdep sg snd_pcm mei_me snd_timer mei wmi
i2c_smbus button battery ac snd acpi_cpufreq lpc_ich intel_ips
soundcore parport_pc ppdev lp parport fuse configfs ip_tables x_tables
autofs4 ext4 crc16 mbcache
[   89.688668] kernel:  jbd2 crc32c_generic zstd zstd_compress
zsmalloc dm_mod hid_generic usbhid hid sd_mod t10_pi crc_t10dif
crct10dif_generic crct10dif_common i915 i2c_algo_bit ttm ahci
drm_kms_helper libahci ehci_pci ehci_hcd libata cec usbcore rc_core
scsi_mod drm psmouse evdev crc32c_intel scsi_common serio_raw
usb_common video [last unloaded: nvidia]
[   89.688705] kernel: CR2: ffffffffc1c1ded0
[   89.688709] kernel: ---[ end trace 6b4637486e7d6a9a ]---
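
A note on reading the first Oops: the marked instruction bytes
<48> 8b 80 d0 00 00 00 decode to a 64-bit load from RAX + 0xd0, and
RAX + 0xd0 equals the faulting address in CR2. RAX lies in the module
address range and the trace ends with "last unloaded: nvidia", so one
plausible reading (an assumption, not confirmed in this report) is that
drm_stub_open followed a pointer into memory of the already unloaded
module. The Python sketch below only re-checks that arithmetic with
values copied from the log; the helper name is made up for illustration.

```python
def decode_mov_rax_disp32(insn: bytes) -> int:
    """Return the disp32 of `mov disp32(%rax),%rax` (bytes 48 8b 80 <disp32>).

    Hypothetical helper: it recognises only this one encoding.
    """
    if len(insn) < 7 or insn[:3] != bytes.fromhex("488b80"):
        raise ValueError("not a `mov disp32(%rax),%rax` instruction")
    # x86 stores the 32-bit displacement little-endian.
    return int.from_bytes(insn[3:7], "little")


# Values copied from the Oops above.
RAX = 0xFFFFFFFFC1C1DE00
CR2 = 0xFFFFFFFFC1C1DED0
FAULTING_INSN = bytes.fromhex("48 8b 80 d0 00 00 00")

disp = decode_mov_rax_disp32(FAULTING_INSN)
print(f"displacement = {disp:#x}")
print(f"RAX + disp   = {RAX + disp:#x}")
print(f"matches CR2  = {RAX + disp == CR2}")
```

The match only shows that the fault came from dereferencing the pointer
held in RAX; it does not by itself prove which module owned that memory.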

I am using Debian GNU/Linux bookworm/sid with
linux-image-5.15.0-3-amd64=5.15.15-1 and libc6=2.33-3.

Related package versions:
bumblebee-nvidia/testing,unstable,now 3.2.1-27 amd64
glx-alternative-nvidia/testing,unstable,now 1.2.1 amd64
libegl1-nvidia-legacy-340xx/unstable,now 340.108-12 amd64
libgl1-nvidia-legacy-340xx-glx/unstable,now 340.108-12 amd64
libgles1-nvidia-legacy-340xx/unstable,now 340.108-12 amd64
libgles2-nvidia-legacy-340xx/unstable,now 340.108-12 amd64
libnvidia-legacy-340xx-compiler/unstable,now 340.108-12 amd64
libnvidia-legacy-340xx-cuda1/unstable,now 340.108-12 amd64
libnvidia-legacy-340xx-eglcore/unstable,now 340.108-12 amd64
libnvidia-legacy-340xx-glcore/unstable,now 340.108-12 amd64
libnvidia-legacy-340xx-ml1/unstable,now 340.108-12 amd64
nvidia-installer-cleanup/testing,unstable,now 20151021+13 amd64
nvidia-kernel-common/testing,unstable,now 20151021+13 amd64
nvidia-legacy-340xx-alternative/unstable,now 340.108-12 amd64
nvidia-legacy-340xx-driver-bin/unstable,now 340.108-12 amd64
nvidia-legacy-340xx-driver-libs/unstable,now 340.108-12 amd64
nvidia-legacy-340xx-driver/unstable,now 340.108-12 amd64
nvidia-legacy-340xx-kernel-dkms/unstable,now 340.108-12 amd64
nvidia-legacy-340xx-kernel-support/unstable,now 340.108-12 amd64
nvidia-legacy-340xx-opencl-icd/unstable,now 340.108-12 amd64
nvidia-legacy-340xx-vdpau-driver/unstable,now 340.108-12 amd64
nvidia-modprobe/testing,unstable,now 470.94-1 amd64
nvidia-opencl-common/unstable,now 470.94-1 amd64
nvidia-support/testing,unstable,now 20151021+13 amd64
xserver-xorg-video-nvidia-legacy-340xx/unstable,now 340.108-12 amd64

GPUs in system:
00:02.0 VGA compatible controller: Intel Corporation Core Processor Integrated Graphics Controller (rev 02) (prog-if 00 [VGA controller])
    DeviceName:  Onboard IGD
    Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
    Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
    Latency: 0
    Interrupt: pin A routed to IRQ 16
    Region 0: Memory at d3400000 (64-bit, non-prefetchable) [size=4M]
    Region 2: Memory at b0000000 (64-bit, prefetchable) [size=256M]
    Region 4: I/O ports at e080 [size=8]
    Expansion ROM at 000c0000 [virtual] [disabled] [size=128K]
    Kernel driver in use: i915
    Kernel modules: i915
01:00.0 VGA compatible controller: NVIDIA Corporation GT218M [GeForce 310M] (rev ff) (prog-if ff)
    !!! Unknown header type 7f
    Kernel modules: nvidia
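
The "rev ff", "prog-if ff" and "Unknown header type 7f" values for the
NVIDIA device are what lspci shows when every config-space read returns
0xff, which typically happens when the device is powered down (here
presumably by bbswitch; that cause is an assumption). lspci masks off the
multi-function bit (0x80) of the header type byte before printing it,
hence 7f rather than ff. A minimal Python sketch of that masking, with a
hypothetical helper name:

```python
MULTIFUNCTION_BIT = 0x80  # bit 7 of the PCI header type byte


def header_layout(header_type_byte: int) -> int:
    """Return the header layout field (bits 6:0) of the header type byte."""
    return header_type_byte & ~MULTIFUNCTION_BIT & 0xFF


# An all-ones read, as from a device that does not respond.
print(f"{header_layout(0xFF):02x}")
```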


