Bug#976901: nvidia-tesla-450-kernel-dkms: Fails to build DKMS kernel module on ppc64le 450.80.02

Konstantinos Margaritis markos at freevec.org
Thu Dec 10 09:37:13 GMT 2020


On 10/12/20 1:19 π.μ., Andreas Beckmann wrote:
> but I'm not sure whether it is worth backporting them,
> since you most likely will be affected by
> #973729 - nvidia-uvm does not work with Linux 5.9
> which is fixed in 455.45.01

Well, I did the replace you suggested below and even though the modules 
load, I don't get a display, here is what dmesg gives:

[   15.889326] NVRM: GPU 0000:01:00.0: DMA address not in addressable 
range of device (0x80020054de80000-0x80020054de8ffff, 
0x800000000000000-0x80000ffffffffff)
[   15.889341] NVRM: GPU 0000:01:00.0: DMA address not in addressable 
range of device (0x80020054de80000-0x80020054de8ffff, 
0x800000000000000-0x80000ffffffffff)
[   15.890377] NVRM: GPU 0000:01:00.0: DMA address not in addressable 
range of device (0x80020054de80000-0x80020054de8ffff, 
0x800000000000000-0x80000ffffffffff)
[   15.890564] NVRM: GPU 0000:01:00.0: RmInitAdapter failed! 
(0x24:0x1e:1224)
[   15.890601] NVRM: GPU 0000:01:00.0: rm_init_adapter failed, device 
minor number 0
[   15.995590] NVRM: GPU 0030:01:00.0: DMA address not in addressable 
range of device (0x80020054a310000-0x80020054a31ffff, 
0x800000000000000-0x80000ffffffffff)
[   15.995601] NVRM: GPU 0030:01:00.0: DMA address not in addressable 
range of device (0x80020054a310000-0x80020054a31ffff, 
0x800000000000000-0x80000ffffffffff)
[   15.996482] NVRM: GPU 0030:01:00.0: DMA address not in addressable 
range of device (0x80020054a310000-0x80020054a31ffff, 
0x800000000000000-0x80000ffffffffff)
[   15.996650] NVRM: GPU 0030:01:00.0: RmInitAdapter failed! 
(0x24:0x1e:1224)
[   15.996705] NVRM: GPU 0030:01:00.0: rm_init_adapter failed, device 
minor number 1
[   34.850800] ------------[ cut here ]------------
[   34.850801] remap_4k_pfn called with wrong pfn value
[   34.850966] WARNING: CPU: 5 PID: 1584 at 
arch/powerpc/include/asm/book3s/64/hash-64k.h:166 
nvidia_mmap_helper+0x6bc/0x800 [nvidia]
[   34.850967] Modules linked in: xt_conntrack(E) 
nf_conntrack_netlink(E) xfrm_user(E) xfrm_algo(E) xt_addrtype(E) 
br_netfilter(E) overlay(E) xt_CHECKSUM(E) nft_chain_nat(E) 
xt_MASQUERADE(E) nf_nat(E) nf_conntrack(E) nf_defrag_ipv6(E) 
nf_defrag_ipv4(E) libcrc32c(E) nft_counter(E) xt_tcpudp(E) nft_compat(E) 
bridge(E) stp(E) llc(E) nf_tables(E) nfnetlink(E) rfkill(E) 
nvidia_drm(POE) drm_kms_helper(E) syscopyarea(E) sysfillrect(E) 
sysimgblt(E) fb_sys_fops(E) nvidia_modeset(POE) nvidia(POE) 
binfmt_misc(E) evdev(E) joydev(E) snd_hda_codec_hdmi(E) snd_hda_intel(E) 
snd_intel_dspcfg(E) snd_hda_codec(E) snd_hda_core(E) snd_hwdep(E) 
snd_pcm(E) snd_timer(E) ctr(E) cbc(E) snd(E) vmx_crypto(E) soundcore(E) 
gf128mul(E) ofpart(E) ipmi_powernv(E) powernv_flash(E) ipmi_devintf(E) 
mtd(E) ipmi_msghandler(E) opal_prd(E) at24(E) regmap_i2c(E) 
parport_pc(E) lp(E) drm(E) parport(E) sunrpc(E) fuse(E) configfs(E) 
drm_panel_orientation_quirks(E) ip_tables(E) x_tables(E) autofs4(E) 
ext4(E) crc16(E) mbcache(E) jbd2(E)
[   34.850994]  crc32c_generic(E) ecb(E) aes_generic(E) libaes(E) xts(E) 
hid_generic(E) usbhid(E) hid(E) dm_crypt(E) dm_mod(E) xhci_pci(E) 
xhci_hcd(E) tg3(E) usbcore(E) nvme(E) libphy(E) nvme_core(E) ptp(E) 
pps_core(E) usb_common(E) t10_pi(E) crc_t10dif(E) crct10dif_generic(E) 
crct10dif_common(E)
[   34.851008] CPU: 5 PID: 1584 Comm: Xorg Tainted: P OE     
5.9.0-4-powerpc64le #1 Debian 5.9.11-1
[   34.851009] NIP:  c00800000e44c664 LR: c00800000e44c660 CTR: 
0000000000000000
[   34.851010] REGS: c000000007493750 TRAP: 0700   Tainted: P           
OE      (5.9.0-4-powerpc64le Debian 5.9.11-1)
[   34.851011] MSR:  9000000000029033 <SF,HV,EE,ME,IR,DR,RI,LE>  CR: 
28042222  XER: 00000000
[   34.851014] CFAR: c0000000001314e4 IRQMASK: 0
                GPR00: c00800000e44c660 c0000000074939e0 
c00800000f09ec00 0000000000000028
                GPR04: 0000000000000001 0000000000000004 
0000000000000027 c0000005ff6cbf90
                GPR08: 0000000000000023 ffffffffffffffd8 
0000000000000027 0000000000000000
                GPR12: 0000000000002000 c0000005fffea600 
00000001473aaad0 00007fffeac1ac14
                GPR16: 0000000000000000 0000000000000000 
0000000000000013 0000000000000008
                GPR20: 0000000000000000 0000000000010000 
0000000000001000 0060000002400000
                GPR24: c0000005f7085e08 c0000005f7085800 
0000000000002000 0000000000000003
                GPR28: 0006000000240000 c0000005f7085800 
c0000005fa441800 c0000005f38fbb80
[   34.851105] NIP [c00800000e44c664] nvidia_mmap_helper+0x6bc/0x800 
[nvidia]
[   34.851187] LR [c00800000e44c660] nvidia_mmap_helper+0x6b8/0x800 [nvidia]
[   34.851188] Call Trace:
[   34.851270] [c0000000074939e0] [c00800000e44c660] 
nvidia_mmap_helper+0x6b8/0x800 [nvidia] (unreliable)
[   34.851353] [c000000007493ac0] [c00800000e44c814] 
nvidia_mmap+0x6c/0xc0 [nvidia]
[   34.851434] [c000000007493b00] [c00800000e4400ec] 
nvidia_frontend_mmap+0x54/0x80 [nvidia]
[   34.851438] [c000000007493b20] [c0000000003bf51c] mmap_region+0x4cc/0x840
[   34.851439] [c000000007493c00] [c0000000003bfcac] do_mmap+0x41c/0x650
[   34.851441] [c000000007493c90] [c0000000003871f4] 
vm_mmap_pgoff+0x124/0x190
[   34.851442] [c000000007493d20] [c0000000003bce70] 
ksys_mmap_pgoff+0x240/0x2b0
[   34.851444] [c000000007493da0] [c000000000013b88] sys_mmap+0x48/0x90
[   34.851446] [c000000007493dc0] [c00000000002f54c] 
system_call_exception+0x14c/0x230
[   34.851447] [c000000007493e20] [c00000000000d948] 
system_call_common+0xe8/0x218
[   34.851448] Instruction dump:
[   34.851449] 4bfffe68 60000000 60420000 eae10098 4bfffaf0 f9210070 
39210070 4bffffb8
[   34.851453] 3d220000 e8698690 48bebdd5 e8410018 <0fe00000> 7f03c378 
3bc0fff5 48bed289
[   34.851457] ---[ end trace 0fa4d523132d0cdc ]---
[   35.320351] ------------[ cut here ]------------
[   35.320352] remap_4k_pfn called with wrong pfn value
[   35.320524] WARNING: CPU: 4 PID: 1584 at 
arch/powerpc/include/asm/book3s/64/hash-64k.h:166 
nvidia_mmap_helper+0x6bc/0x800 [nvidia]
[   35.320524] Modules linked in: xt_conntrack(E) 
nf_conntrack_netlink(E) xfrm_user(E) xfrm_algo(E) xt_addrtype(E) 
br_netfilter(E) overlay(E) xt_CHECKSUM(E) nft_chain_nat(E) 
xt_MASQUERADE(E) nf_nat(E) nf_conntrack(E) nf_defrag_ipv6(E) 
nf_defrag_ipv4(E) libcrc32c(E) nft_counter(E) xt_tcpudp(E) nft_compat(E) 
bridge(E) stp(E) llc(E) nf_tables(E) nfnetlink(E) rfkill(E) 
nvidia_drm(POE) drm_kms_helper(E) syscopyarea(E) sysfillrect(E) 
sysimgblt(E) fb_sys_fops(E) nvidia_modeset(POE) nvidia(POE) 
binfmt_misc(E) evdev(E) joydev(E) snd_hda_codec_hdmi(E) snd_hda_intel(E) 
snd_intel_dspcfg(E) snd_hda_codec(E) snd_hda_core(E) snd_hwdep(E) 
snd_pcm(E) snd_timer(E) ctr(E) cbc(E) snd(E) vmx_crypto(E) soundcore(E) 
gf128mul(E) ofpart(E) ipmi_powernv(E) powernv_flash(E) ipmi_devintf(E) 
mtd(E) ipmi_msghandler(E) opal_prd(E) at24(E) regmap_i2c(E) 
parport_pc(E) lp(E) drm(E) parport(E) sunrpc(E) fuse(E) configfs(E) 
drm_panel_orientation_quirks(E) ip_tables(E) x_tables(E) autofs4(E) 
ext4(E) crc16(E) mbcache(E) jbd2(E)
[   35.320549]  crc32c_generic(E) ecb(E) aes_generic(E) libaes(E) xts(E) 
hid_generic(E) usbhid(E) hid(E) dm_crypt(E) dm_mod(E) xhci_pci(E) 
xhci_hcd(E) tg3(E) usbcore(E) nvme(E) libphy(E) nvme_core(E) ptp(E) 
pps_core(E) usb_common(E) t10_pi(E) crc_t10dif(E) crct10dif_generic(E) 
crct10dif_common(E)
[   35.320560] CPU: 4 PID: 1584 Comm: Xorg Tainted: P        W OE     
5.9.0-4-powerpc64le #1 Debian 5.9.11-1
[   35.320561] NIP:  c00800000e44c664 LR: c00800000e44c660 CTR: 
0000000000000000
[   35.320562] REGS: c000000007493750 TRAP: 0700   Tainted: P        W  
OE      (5.9.0-4-powerpc64le Debian 5.9.11-1)
[   35.320562] MSR:  9000000000029033 <SF,HV,EE,ME,IR,DR,RI,LE>  CR: 
22042222  XER: 00000000
[   35.320565] CFAR: c0000000001314e4 IRQMASK: 0
                GPR00: c00800000e44c660 c0000000074939e0 
c00800000f09ec00 0000000000000028
                GPR04: 0000000000000001 0000000000000004 
0000000000000027 c0000005ff61bf90
                GPR08: 0000000000000023 ffffffffffffffd8 
0000000000000027 0000000000000000
                GPR12: 0000000000002000 c0000005fffeb800 
00000001473b1c70 00007fffeac1ac14
                GPR16: 0000000000000000 0000000000000000 
0000000000000012 0000000000000008
                GPR20: 0000000000000000 0000000000010000 
0000000000001000 0062000002400000
                GPR24: c00020055901de08 c00020055901d800 
0000000000002000 0000000000000003
                GPR28: 0006200000240000 c00020055901d800 
c0000005fa311800 c0000005f754d610
[   35.320661] NIP [c00800000e44c664] nvidia_mmap_helper+0x6bc/0x800 
[nvidia]
[   35.320747] LR [c00800000e44c660] nvidia_mmap_helper+0x6b8/0x800 [nvidia]
[   35.320748] Call Trace:
[   35.320834] [c0000000074939e0] [c00800000e44c660] 
nvidia_mmap_helper+0x6b8/0x800 [nvidia] (unreliable)
[   35.320921] [c000000007493ac0] [c00800000e44c814] 
nvidia_mmap+0x6c/0xc0 [nvidia]
[   35.321006] [c000000007493b00] [c00800000e4400ec] 
nvidia_frontend_mmap+0x54/0x80 [nvidia]
[   35.321009] [c000000007493b20] [c0000000003bf51c] mmap_region+0x4cc/0x840
[   35.321010] [c000000007493c00] [c0000000003bfcac] do_mmap+0x41c/0x650
[   35.321012] [c000000007493c90] [c0000000003871f4] 
vm_mmap_pgoff+0x124/0x190
[   35.321013] [c000000007493d20] [c0000000003bce70] 
ksys_mmap_pgoff+0x240/0x2b0
[   35.321016] [c000000007493da0] [c000000000013b88] sys_mmap+0x48/0x90
[   35.321017] [c000000007493dc0] [c00000000002f54c] 
system_call_exception+0x14c/0x230
[   35.321019] [c000000007493e20] [c00000000000d948] 
system_call_common+0xe8/0x218
[   35.321019] Instruction dump:
[   35.321021] 4bfffe68 60000000 60420000 eae10098 4bfffaf0 f9210070 
39210070 4bffffb8
[   35.321025] 3d220000 e8698690 48bebdd5 e8410018 <0fe00000> 7f03c378 
3bc0fff5 48bed289
[   35.321029] ---[ end trace 0fa4d523132d0cdd ]---


> PPS: the first time I hear that someone is actually trying to use
> the ppc64el packages ;-)

I had the talos II for a few years and was using it for VSX work, but as 
it's still a very powerful machine, I decided to use it for some ML/DL 
work and got myself a pair of Titan X cards to use. Hopefully it should 
work, I'd hate to have to revert to using an x86 box just for this :-/

Regards

Konstantinos



More information about the pkg-nvidia-devel mailing list