Bug#976901: nvidia-tesla-450-kernel-dkms: Fails to build DKMS kernel module on ppc64le 450.80.02
Konstantinos Margaritis
markos at freevec.org
Thu Dec 10 09:37:13 GMT 2020
On 10/12/20 1:19 π.μ., Andreas Beckmann wrote:
> but I'm not sure whether it is worth backporting them,
> since you most likely will be affected by
> #973729 - nvidia-uvm does not work with Linux 5.9
> which is fixed in 455.45.01
Well, I did the replace you suggested below and even though the modules
load, I don't get a display, here is what dmesg gives:
[ 15.889326] NVRM: GPU 0000:01:00.0: DMA address not in addressable
range of device (0x80020054de80000-0x80020054de8ffff,
0x800000000000000-0x80000ffffffffff)
[ 15.889341] NVRM: GPU 0000:01:00.0: DMA address not in addressable
range of device (0x80020054de80000-0x80020054de8ffff,
0x800000000000000-0x80000ffffffffff)
[ 15.890377] NVRM: GPU 0000:01:00.0: DMA address not in addressable
range of device (0x80020054de80000-0x80020054de8ffff,
0x800000000000000-0x80000ffffffffff)
[ 15.890564] NVRM: GPU 0000:01:00.0: RmInitAdapter failed!
(0x24:0x1e:1224)
[ 15.890601] NVRM: GPU 0000:01:00.0: rm_init_adapter failed, device
minor number 0
[ 15.995590] NVRM: GPU 0030:01:00.0: DMA address not in addressable
range of device (0x80020054a310000-0x80020054a31ffff,
0x800000000000000-0x80000ffffffffff)
[ 15.995601] NVRM: GPU 0030:01:00.0: DMA address not in addressable
range of device (0x80020054a310000-0x80020054a31ffff,
0x800000000000000-0x80000ffffffffff)
[ 15.996482] NVRM: GPU 0030:01:00.0: DMA address not in addressable
range of device (0x80020054a310000-0x80020054a31ffff,
0x800000000000000-0x80000ffffffffff)
[ 15.996650] NVRM: GPU 0030:01:00.0: RmInitAdapter failed!
(0x24:0x1e:1224)
[ 15.996705] NVRM: GPU 0030:01:00.0: rm_init_adapter failed, device
minor number 1
[ 34.850800] ------------[ cut here ]------------
[ 34.850801] remap_4k_pfn called with wrong pfn value
[ 34.850966] WARNING: CPU: 5 PID: 1584 at
arch/powerpc/include/asm/book3s/64/hash-64k.h:166
nvidia_mmap_helper+0x6bc/0x800 [nvidia]
[ 34.850967] Modules linked in: xt_conntrack(E)
nf_conntrack_netlink(E) xfrm_user(E) xfrm_algo(E) xt_addrtype(E)
br_netfilter(E) overlay(E) xt_CHECKSUM(E) nft_chain_nat(E)
xt_MASQUERADE(E) nf_nat(E) nf_conntrack(E) nf_defrag_ipv6(E)
nf_defrag_ipv4(E) libcrc32c(E) nft_counter(E) xt_tcpudp(E) nft_compat(E)
bridge(E) stp(E) llc(E) nf_tables(E) nfnetlink(E) rfkill(E)
nvidia_drm(POE) drm_kms_helper(E) syscopyarea(E) sysfillrect(E)
sysimgblt(E) fb_sys_fops(E) nvidia_modeset(POE) nvidia(POE)
binfmt_misc(E) evdev(E) joydev(E) snd_hda_codec_hdmi(E) snd_hda_intel(E)
snd_intel_dspcfg(E) snd_hda_codec(E) snd_hda_core(E) snd_hwdep(E)
snd_pcm(E) snd_timer(E) ctr(E) cbc(E) snd(E) vmx_crypto(E) soundcore(E)
gf128mul(E) ofpart(E) ipmi_powernv(E) powernv_flash(E) ipmi_devintf(E)
mtd(E) ipmi_msghandler(E) opal_prd(E) at24(E) regmap_i2c(E)
parport_pc(E) lp(E) drm(E) parport(E) sunrpc(E) fuse(E) configfs(E)
drm_panel_orientation_quirks(E) ip_tables(E) x_tables(E) autofs4(E)
ext4(E) crc16(E) mbcache(E) jbd2(E)
[ 34.850994] crc32c_generic(E) ecb(E) aes_generic(E) libaes(E) xts(E)
hid_generic(E) usbhid(E) hid(E) dm_crypt(E) dm_mod(E) xhci_pci(E)
xhci_hcd(E) tg3(E) usbcore(E) nvme(E) libphy(E) nvme_core(E) ptp(E)
pps_core(E) usb_common(E) t10_pi(E) crc_t10dif(E) crct10dif_generic(E)
crct10dif_common(E)
[ 34.851008] CPU: 5 PID: 1584 Comm: Xorg Tainted: P OE
5.9.0-4-powerpc64le #1 Debian 5.9.11-1
[ 34.851009] NIP: c00800000e44c664 LR: c00800000e44c660 CTR:
0000000000000000
[ 34.851010] REGS: c000000007493750 TRAP: 0700 Tainted: P
OE (5.9.0-4-powerpc64le Debian 5.9.11-1)
[ 34.851011] MSR: 9000000000029033 <SF,HV,EE,ME,IR,DR,RI,LE> CR:
28042222 XER: 00000000
[ 34.851014] CFAR: c0000000001314e4 IRQMASK: 0
GPR00: c00800000e44c660 c0000000074939e0
c00800000f09ec00 0000000000000028
GPR04: 0000000000000001 0000000000000004
0000000000000027 c0000005ff6cbf90
GPR08: 0000000000000023 ffffffffffffffd8
0000000000000027 0000000000000000
GPR12: 0000000000002000 c0000005fffea600
00000001473aaad0 00007fffeac1ac14
GPR16: 0000000000000000 0000000000000000
0000000000000013 0000000000000008
GPR20: 0000000000000000 0000000000010000
0000000000001000 0060000002400000
GPR24: c0000005f7085e08 c0000005f7085800
0000000000002000 0000000000000003
GPR28: 0006000000240000 c0000005f7085800
c0000005fa441800 c0000005f38fbb80
[ 34.851105] NIP [c00800000e44c664] nvidia_mmap_helper+0x6bc/0x800
[nvidia]
[ 34.851187] LR [c00800000e44c660] nvidia_mmap_helper+0x6b8/0x800 [nvidia]
[ 34.851188] Call Trace:
[ 34.851270] [c0000000074939e0] [c00800000e44c660]
nvidia_mmap_helper+0x6b8/0x800 [nvidia] (unreliable)
[ 34.851353] [c000000007493ac0] [c00800000e44c814]
nvidia_mmap+0x6c/0xc0 [nvidia]
[ 34.851434] [c000000007493b00] [c00800000e4400ec]
nvidia_frontend_mmap+0x54/0x80 [nvidia]
[ 34.851438] [c000000007493b20] [c0000000003bf51c] mmap_region+0x4cc/0x840
[ 34.851439] [c000000007493c00] [c0000000003bfcac] do_mmap+0x41c/0x650
[ 34.851441] [c000000007493c90] [c0000000003871f4]
vm_mmap_pgoff+0x124/0x190
[ 34.851442] [c000000007493d20] [c0000000003bce70]
ksys_mmap_pgoff+0x240/0x2b0
[ 34.851444] [c000000007493da0] [c000000000013b88] sys_mmap+0x48/0x90
[ 34.851446] [c000000007493dc0] [c00000000002f54c]
system_call_exception+0x14c/0x230
[ 34.851447] [c000000007493e20] [c00000000000d948]
system_call_common+0xe8/0x218
[ 34.851448] Instruction dump:
[ 34.851449] 4bfffe68 60000000 60420000 eae10098 4bfffaf0 f9210070
39210070 4bffffb8
[ 34.851453] 3d220000 e8698690 48bebdd5 e8410018 <0fe00000> 7f03c378
3bc0fff5 48bed289
[ 34.851457] ---[ end trace 0fa4d523132d0cdc ]---
[ 35.320351] ------------[ cut here ]------------
[ 35.320352] remap_4k_pfn called with wrong pfn value
[ 35.320524] WARNING: CPU: 4 PID: 1584 at
arch/powerpc/include/asm/book3s/64/hash-64k.h:166
nvidia_mmap_helper+0x6bc/0x800 [nvidia]
[ 35.320524] Modules linked in: xt_conntrack(E)
nf_conntrack_netlink(E) xfrm_user(E) xfrm_algo(E) xt_addrtype(E)
br_netfilter(E) overlay(E) xt_CHECKSUM(E) nft_chain_nat(E)
xt_MASQUERADE(E) nf_nat(E) nf_conntrack(E) nf_defrag_ipv6(E)
nf_defrag_ipv4(E) libcrc32c(E) nft_counter(E) xt_tcpudp(E) nft_compat(E)
bridge(E) stp(E) llc(E) nf_tables(E) nfnetlink(E) rfkill(E)
nvidia_drm(POE) drm_kms_helper(E) syscopyarea(E) sysfillrect(E)
sysimgblt(E) fb_sys_fops(E) nvidia_modeset(POE) nvidia(POE)
binfmt_misc(E) evdev(E) joydev(E) snd_hda_codec_hdmi(E) snd_hda_intel(E)
snd_intel_dspcfg(E) snd_hda_codec(E) snd_hda_core(E) snd_hwdep(E)
snd_pcm(E) snd_timer(E) ctr(E) cbc(E) snd(E) vmx_crypto(E) soundcore(E)
gf128mul(E) ofpart(E) ipmi_powernv(E) powernv_flash(E) ipmi_devintf(E)
mtd(E) ipmi_msghandler(E) opal_prd(E) at24(E) regmap_i2c(E)
parport_pc(E) lp(E) drm(E) parport(E) sunrpc(E) fuse(E) configfs(E)
drm_panel_orientation_quirks(E) ip_tables(E) x_tables(E) autofs4(E)
ext4(E) crc16(E) mbcache(E) jbd2(E)
[ 35.320549] crc32c_generic(E) ecb(E) aes_generic(E) libaes(E) xts(E)
hid_generic(E) usbhid(E) hid(E) dm_crypt(E) dm_mod(E) xhci_pci(E)
xhci_hcd(E) tg3(E) usbcore(E) nvme(E) libphy(E) nvme_core(E) ptp(E)
pps_core(E) usb_common(E) t10_pi(E) crc_t10dif(E) crct10dif_generic(E)
crct10dif_common(E)
[ 35.320560] CPU: 4 PID: 1584 Comm: Xorg Tainted: P W OE
5.9.0-4-powerpc64le #1 Debian 5.9.11-1
[ 35.320561] NIP: c00800000e44c664 LR: c00800000e44c660 CTR:
0000000000000000
[ 35.320562] REGS: c000000007493750 TRAP: 0700 Tainted: P W
OE (5.9.0-4-powerpc64le Debian 5.9.11-1)
[ 35.320562] MSR: 9000000000029033 <SF,HV,EE,ME,IR,DR,RI,LE> CR:
22042222 XER: 00000000
[ 35.320565] CFAR: c0000000001314e4 IRQMASK: 0
GPR00: c00800000e44c660 c0000000074939e0
c00800000f09ec00 0000000000000028
GPR04: 0000000000000001 0000000000000004
0000000000000027 c0000005ff61bf90
GPR08: 0000000000000023 ffffffffffffffd8
0000000000000027 0000000000000000
GPR12: 0000000000002000 c0000005fffeb800
00000001473b1c70 00007fffeac1ac14
GPR16: 0000000000000000 0000000000000000
0000000000000012 0000000000000008
GPR20: 0000000000000000 0000000000010000
0000000000001000 0062000002400000
GPR24: c00020055901de08 c00020055901d800
0000000000002000 0000000000000003
GPR28: 0006200000240000 c00020055901d800
c0000005fa311800 c0000005f754d610
[ 35.320661] NIP [c00800000e44c664] nvidia_mmap_helper+0x6bc/0x800
[nvidia]
[ 35.320747] LR [c00800000e44c660] nvidia_mmap_helper+0x6b8/0x800 [nvidia]
[ 35.320748] Call Trace:
[ 35.320834] [c0000000074939e0] [c00800000e44c660]
nvidia_mmap_helper+0x6b8/0x800 [nvidia] (unreliable)
[ 35.320921] [c000000007493ac0] [c00800000e44c814]
nvidia_mmap+0x6c/0xc0 [nvidia]
[ 35.321006] [c000000007493b00] [c00800000e4400ec]
nvidia_frontend_mmap+0x54/0x80 [nvidia]
[ 35.321009] [c000000007493b20] [c0000000003bf51c] mmap_region+0x4cc/0x840
[ 35.321010] [c000000007493c00] [c0000000003bfcac] do_mmap+0x41c/0x650
[ 35.321012] [c000000007493c90] [c0000000003871f4]
vm_mmap_pgoff+0x124/0x190
[ 35.321013] [c000000007493d20] [c0000000003bce70]
ksys_mmap_pgoff+0x240/0x2b0
[ 35.321016] [c000000007493da0] [c000000000013b88] sys_mmap+0x48/0x90
[ 35.321017] [c000000007493dc0] [c00000000002f54c]
system_call_exception+0x14c/0x230
[ 35.321019] [c000000007493e20] [c00000000000d948]
system_call_common+0xe8/0x218
[ 35.321019] Instruction dump:
[ 35.321021] 4bfffe68 60000000 60420000 eae10098 4bfffaf0 f9210070
39210070 4bffffb8
[ 35.321025] 3d220000 e8698690 48bebdd5 e8410018 <0fe00000> 7f03c378
3bc0fff5 48bed289
[ 35.321029] ---[ end trace 0fa4d523132d0cdd ]---
> PPS: the first time I hear that someone is actually trying to use
> the ppc64el packages ;-)
I had the talos II for a few years and was using it for VSX work, but as
it's still a very powerful machine, I decided to use it for some ML/DL
work and got myself a pair of Titan X cards to use. Hopefully it should
work, I'd hate to have to revert to using an x86 box just for this :-/
Regards
Konstantinos
More information about the pkg-nvidia-devel
mailing list