Bug#818763: nvidia-kernel-dkms: fail to load nvidia-uvm (again)

Luca Boccassi luca.boccassi at gmail.com
Sun Mar 20 20:06:10 UTC 2016


Control: tag -1 moreinfo

On Sun, 2016-03-20 at 16:07 +0100, Michael Below wrote:
> Package: nvidia-kernel-dkms
> Version: 352.79-5
> Severity: normal
> 
> Dear Maintainer,
> 
> after some updates to my Debian testing installation I noticed that OpenCL
> stopped working with my graphics card. For darktable, the debug log looked like:
> 
> [opencl_init] found opencl runtime library 'libOpenCL'
> [opencl_init] opencl library 'libOpenCL' found on your system and loaded
> modprobe: ERROR: could not insert 'nvidia_current_uvm': Operation not permitted
> modprobe: ERROR: ../libkmod/libkmod-module.c:977 command_do() Error running install command for nvidia_uvm
> modprobe: ERROR: could not insert 'nvidia_uvm': Operation not permitted
> [opencl_init] could not get platforms: -1001
> [opencl_init] FINALLY: opencl is NOT AVAILABLE on this system.
> [opencl_init] initial status of opencl enabled flag is OFF.
> 
> This seems to be the same bug as in #812396, so I tried the same fix as
> proposed there (sudo modprobe --force-modversion nvidia-current-uvm), and now
> it works:
> 
> [opencl_init] found opencl runtime library 'libOpenCL'
> [opencl_init] opencl library 'libOpenCL' found on your system and loaded
> [opencl_init] found 1 platform
> [opencl_init] found 1 device
> [opencl_init] device 0 `GeForce GTX 750 Ti' has sm_20 support.
> [opencl_init] device 0 `GeForce GTX 750 Ti' supports image sizes of 16384 x 16384
> [opencl_init] device 0 `GeForce GTX 750 Ti' allows GPU memory allocations of up to 511MB
> [opencl_init] device 0: GeForce GTX 750 Ti 
>      GLOBAL_MEM_SIZE:          2047MB
>      MAX_WORK_GROUP_SIZE:      1024
>      MAX_WORK_ITEM_DIMENSIONS: 3
>      MAX_WORK_ITEM_SIZES:      [ 1024 1024 64 ]
>      DRIVER_VERSION:           352.79
>      DEVICE_VERSION:           OpenCL 1.2 CUDA
> [opencl_init] compiling program `demosaic_ppg.cl' ..
> [opencl_load_program] could not load cached binary program, trying to compile source
> [opencl_load_program] successfully loaded program from `/usr/share/darktable/kernels/demosaic_ppg.cl'
> [opencl_build_program] successfully built program
> [opencl_build_program] BUILD STATUS: 0

Hi,

Sorry for your problems.

Unfortunately I cannot reproduce this with the same driver version on
my amd64 Jessie desktop. There, opencl-demo works fine and nvidia-uvm
loads:

./cl-demo 10 5
Choose platform:
[0] NVIDIA Corporation
Enter choice: 
Choose device:
[0] GeForce GTX 780
Enter choice: 
---------------------------------------------------------------------
NAME: GeForce GTX 780
VENDOR: NVIDIA Corporation
PROFILE: FULL_PROFILE
VERSION: OpenCL 1.2 CUDA
EXTENSIONS: cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing
cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll
cl_nv_copy_opts  cl_khr_global_int32_base_atomics
cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics
cl_khr_local_int32_extended_atomics cl_khr_fp64 
DRIVER_VERSION: 352.79

<SNIP>

0.000265 s
0.000453 GB/s
GOOD

$ lsmod | grep nvidia
nvidia_uvm             73728  0 
nvidia               8540160  71 nvidia_uvm
drm                   352256  3 nvidia


But I noticed something strange in your Xorg log:

[  2634.192] (II) NVIDIA GLX Module  340.93  Wed Aug 19 16:23:51 PDT 2015
[  2634.192] (II) LoadModule: "nvidia"
[  2634.192] (II) Loading /usr/lib/xorg/modules/drivers/nvidia_drv.so
[  2634.193] (II) Module nvidia: vendor="NVIDIA Corporation"
[  2634.193] 	compiled for 4.0.2, module version = 1.0.0
[  2634.193] 	Module class: X.Org Video Driver
[  2634.193] (II) NVIDIA dlloader X Driver  340.93  Wed Aug 19 16:01:53 PDT 2015

It looks like the older 340.93 X driver and GLX module are being
loaded, even though all the symlinks and package versions on your
system indicate that 352.79 is installed. Any idea why this might be
happening?
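
A minimal sketch for checking this kind of mismatch from the log itself.
It assumes the log lives at /var/log/Xorg.0.log and that the version
follows the "NVIDIA GLX Module" / "NVIDIA dlloader X Driver" strings as
in the excerpt above. The function names are hypothetical helpers, not
part of any package:

```shell
#!/bin/sh
# Print the distinct NVIDIA driver versions mentioned in an Xorg log.
# The default log path is an assumption; pass another path as $1.
xorg_nvidia_versions() {
    log="${1:-/var/log/Xorg.0.log}"
    # Matches lines such as:
    #   (II) NVIDIA GLX Module  340.93  Wed Aug 19 ...
    #   (II) NVIDIA dlloader X Driver  340.93  Wed Aug 19 ...
    sed -n \
        -e 's/.*NVIDIA GLX Module  *\([0-9][0-9.]*\).*/\1/p' \
        -e 's/.*NVIDIA dlloader X Driver  *\([0-9][0-9.]*\).*/\1/p' \
        "$log" | sort -u
}

# Compare every version found in the log against the expected one.
# Returns 1 on the first mismatch, 0 otherwise (including when the
# log contains no NVIDIA version lines at all).
check_driver_version() {
    expected="$1"
    log="${2:-/var/log/Xorg.0.log}"
    for v in $(xorg_nvidia_versions "$log"); do
        if [ "$v" != "$expected" ]; then
            echo "mismatch: X loaded $v, expected $expected"
            return 1
        fi
    done
    return 0
}
```

For example, "check_driver_version 352.79" would report a mismatch for
the log excerpt above. On a running system the version of the loaded
kernel module can be read from /proc/driver/nvidia/version for
comparison.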

If you run:

glxgears -info

What's the reported version?

Finally, if you run manually:

sudo modprobe -v nvidia-uvm

What's the output?
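
The modprobe error quoted above mentions an "install command" for
nvidia_uvm, which means modprobe is following an install directive from
its configuration rather than loading the module file directly. A small
sketch for listing those directives from the output of "modprobe -c"
(the helper name install_rules_for is hypothetical):

```shell
#!/bin/sh
# Read modprobe-style configuration on stdin and print the "install"
# directives that apply to the given module. Dashes and underscores in
# module names are treated as equivalent, as modprobe does.
install_rules_for() {
    mod=$(printf '%s' "$1" | tr '-' '_')
    awk -v m="$mod" '
        $1 == "install" {
            n = $2
            gsub(/-/, "_", n)   # normalise the name from the config line
            if (n == m) print
        }'
}

# Typical use on a live system:
#   modprobe -c | install_rules_for nvidia-uvm
```

The matching lines show which commands are actually run when
nvidia-uvm is requested, which complements the "modprobe -v" output.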

Kind regards,
Luca Boccassi