Bug#816283: nvidia-cuda-toolkit: cuda broken after upgrade (x86_64, Jessie, GTX980)

Alois Schloegl alois.schloegl at ist.ac.at
Tue Mar 29 11:23:00 UTC 2016



To fix the installation of glx-alternative-nvidia and some other
CUDA-related issues, I tried to update CUDA to the version in
jessie-backports. The update did not work out of the box.

After uninstalling nvidia-cuda and reinstalling it with
   apt-get -t jessie-backports install nvidia-cuda-toolkit
there were still problems.


nvidia-smi fails with this message:
Failed to initialize NVML: GPU access blocked by the operating system

dmesg shows "API mismatch: the client has the version 352.41".


After the update, I noticed that the symlink still pointed to the old
version:
  /usr/lib/x86_64-linux-gnu/libEGL.so.1 -> libEGL.so.352.41

Running
  dpkg -i libegl1-nvidia_352.79-1~bpo8+1_amd64.deb
did not fix this, so I manually extracted
  /usr/lib/x86_64-linux-gnu/libEGL.so.352.79
and pointed the above link at it.
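
A check like the one described above can be scripted. The following is a minimal sketch, not part of the original report: the function name egl_link_version is hypothetical, and it assumes GNU readlink (for the -f option) and a link target named libEGL.so.<version>, as in the paths quoted above.

```shell
# Sketch (hypothetical helper): print the driver version that a libEGL
# symlink finally resolves to, e.g. "352.41" for
#   libEGL.so.1 -> libEGL.so.352.41
egl_link_version() {
    # Follow the whole symlink chain; GNU readlink -f is assumed here.
    target=$(readlink -f "$1") || return 1
    base=${target##*/}                    # strip the directory part
    printf '%s\n' "${base#libEGL.so.}"    # strip the "libEGL.so." prefix
}

# Possible use on the system from this report:
#   egl_link_version /usr/lib/x86_64-linux-gnu/libEGL.so.1
```

If the printed version differs from the version of the loaded kernel module, the link is one candidate for a stale driver component.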




The full dmesg output containing the "API mismatch" message is:


# dmesg |grep -i "nvid\|nvrm"
[   16.782287] nvidia: module license 'NVIDIA' taints kernel.
[   16.813559] nvidia 0000:03:00.0: enabling device (0000 -> 0003)
[   16.829365] nvidia 0000:82:00.0: enabling device (0000 -> 0003)
[   16.845105] nvidia 0000:83:00.0: enabling device (0000 -> 0003)
[   16.861001] [drm] Initialized nvidia-drm 0.0.0 20150116 for 0000:02:00.0 on minor 0
[   16.868905] [drm] Initialized nvidia-drm 0.0.0 20150116 for 0000:03:00.0 on minor 1
[   16.876789] [drm] Initialized nvidia-drm 0.0.0 20150116 for 0000:82:00.0 on minor 2
[   16.884678] [drm] Initialized nvidia-drm 0.0.0 20150116 for 0000:83:00.0 on minor 3
[   16.892467] NVRM: loading NVIDIA UNIX x86_64 Kernel Module  352.79 Wed Jan 13 16:17:53 PST 2016
[   18.702943] input: HDA NVidia HDMI/DP,pcm=3 as /devices/pci0000:80/0000:80:02.0/0000:82:00.1/sound/card2/input7
[   18.711066] input: HDA NVidia HDMI/DP,pcm=3 as /devices/pci0000:00/0000:00:02.0/0000:02:00.1/sound/card0/input8
[   18.711188] input: HDA NVidia HDMI/DP,pcm=7 as /devices/pci0000:00/0000:00:02.0/0000:02:00.1/sound/card0/input12
[   18.711330] input: HDA NVidia HDMI/DP,pcm=8 as /devices/pci0000:00/0000:00:02.0/0000:02:00.1/sound/card0/input17
[   18.711668] input: HDA NVidia HDMI/DP,pcm=9 as /devices/pci0000:00/0000:00:02.0/0000:02:00.1/sound/card0/input21
[   18.754668] input: HDA NVidia HDMI/DP,pcm=7 as /devices/pci0000:80/0000:80:02.0/0000:82:00.1/sound/card2/input11
[   18.766633] input: HDA NVidia HDMI/DP,pcm=3 as /devices/pci0000:80/0000:80:03.0/0000:83:00.1/sound/card3/input6
[   18.767994] input: HDA NVidia HDMI/DP,pcm=3 as /devices/pci0000:00/0000:00:03.0/0000:03:00.1/sound/card1/input9
[   18.770893] input: HDA NVidia HDMI/DP,pcm=7 as /devices/pci0000:00/0000:00:03.0/0000:03:00.1/sound/card1/input13
[   18.771620] input: HDA NVidia HDMI/DP,pcm=8 as /devices/pci0000:00/0000:00:03.0/0000:03:00.1/sound/card1/input16
[   18.771805] input: HDA NVidia HDMI/DP,pcm=9 as /devices/pci0000:00/0000:00:03.0/0000:03:00.1/sound/card1/input20
[   18.820540] input: HDA NVidia HDMI/DP,pcm=8 as /devices/pci0000:80/0000:80:02.0/0000:82:00.1/sound/card2/input14
[   18.832036] input: HDA NVidia HDMI/DP,pcm=9 as /devices/pci0000:80/0000:80:02.0/0000:82:00.1/sound/card2/input19
[   18.843980] input: HDA NVidia HDMI/DP,pcm=7 as /devices/pci0000:80/0000:80:03.0/0000:83:00.1/sound/card3/input10
[   18.855326] input: HDA NVidia HDMI/DP,pcm=8 as /devices/pci0000:80/0000:80:03.0/0000:83:00.1/sound/card3/input15
[   18.867117] input: HDA NVidia HDMI/DP,pcm=9 as /devices/pci0000:80/0000:80:03.0/0000:83:00.1/sound/card3/input18
[  731.363046] NVRM: API mismatch: the client has the version 352.41, but
NVRM: this kernel module has the version 352.79.  Please
NVRM: make sure that this kernel module and all NVIDIA driver
NVRM: components have the same version.
[  731.387767] NVRM: nvidia_frontend_ioctl: minor 255, module->ioctl failed, error -22
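
The two versions named in the "API mismatch" message can be pulled out of the kernel log mechanically. This is only a sketch under stated assumptions: the function name nvrm_mismatch_versions is hypothetical, and it assumes the message is split over separate "NVRM:" lines exactly as in the log above.

```shell
# Sketch (hypothetical helper): read kernel log text on stdin and print
# "client=<version> module=<version>" for each NVRM API mismatch.
# Assumes the client and module versions appear on separate lines,
# as in the dmesg output quoted above.
nvrm_mismatch_versions() {
    awk '
        /API mismatch: the client has the version/ {
            for (i = 1; i < NF; i++)
                if ($i == "version") { client = $(i + 1); break }
            sub(/,$/, "", client)      # drop the trailing comma
        }
        /this kernel module has the version/ {
            for (i = 1; i < NF; i++)
                if ($i == "version") { module = $(i + 1); break }
            sub(/\.$/, "", module)     # drop the trailing full stop
            print "client=" client " module=" module
        }
    '
}

# Possible use:
#   dmesg | nvrm_mismatch_versions
```

For the log above this would report client 352.41 against module 352.79, which suggests that some user-space library from the old driver version is still being loaded.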


Here is the list of current packages:


# dpkg -l|grep -i nvidia
ii  glx-alternative-nvidia                 0.7.1~bpo8+1       amd64  allows the selection of NVIDIA as GLX provider
ii  libcublas6.5:amd64                     6.5.19-3~bpo8+1    amd64  NVIDIA cuBLAS Library
ii  libcuda1:amd64                         352.79-1~bpo8+1    amd64  NVIDIA CUDA Driver Library
ii  libcudart6.5:amd64                     6.5.19-3~bpo8+1    amd64  NVIDIA CUDA Runtime Library
ii  libcufft6.5:amd64                      6.5.19-3~bpo8+1    amd64  NVIDIA cuFFT Library
ii  libcufftw6.5:amd64                     6.5.19-3~bpo8+1    amd64  NVIDIA cuFFTW Library
ii  libcuinj64-6.5:amd64                   6.5.19-3~bpo8+1    amd64  NVIDIA CUINJ Library (64-bit)
ii  libcurand6.5:amd64                     6.5.19-3~bpo8+1    amd64  NVIDIA cuRAND Library
ii  libcusparse6.5:amd64                   6.5.19-3~bpo8+1    amd64  NVIDIA cuSPARSE Library
ii  libegl1-nvidia:amd64                   352.79-1~bpo8+1    amd64  NVIDIA binary EGL libraries
ii  libgl1-nvidia-glx:amd64                352.79-1~bpo8+1    amd64  NVIDIA binary OpenGL libraries
ii  libgles1-nvidia:amd64                  352.79-1~bpo8+1    amd64  NVIDIA binary OpenGL|ES 1.x libraries
ii  libgles2-nvidia:amd64                  352.79-1~bpo8+1    amd64  NVIDIA binary OpenGL|ES 2.x libraries
ii  libnppc6.5:amd64                       6.5.19-3~bpo8+1    amd64  NVIDIA Performance Primitives core runtime library
ii  libnppi6.5:amd64                       6.5.19-3~bpo8+1    amd64  NVIDIA Performance Primitives for image processing runtime library
ii  libnpps6.5:amd64                       6.5.19-3~bpo8+1    amd64  NVIDIA Performance Primitives for signal processing runtime library
ii  libnvcuvid1:amd64                      352.79-1~bpo8+1    amd64  NVIDIA CUDA Video Decoder runtime library
ii  libnvidia-eglcore:amd64                352.79-1~bpo8+1    amd64  NVIDIA binary EGL core libraries
ii  libnvidia-ml1:amd64                    352.79-1~bpo8+1    amd64  NVIDIA Management Library (NVML) runtime library
ii  libnvtoolsext1:amd64                   6.5.19-3~bpo8+1    amd64  NVIDIA Tools Extension Library
ii  libnvvm2:amd64                         6.5.19-3~bpo8+1    amd64  NVIDIA NVVM Library
ii  nvidia-alternative                     352.79-1~bpo8+1    amd64  allows the selection of NVIDIA as GLX provider
ii  nvidia-cuda-dev                        6.5.19-3~bpo8+1    amd64  NVIDIA CUDA development files
ii  nvidia-cuda-doc                        6.5.19-3~bpo8+1    all    NVIDIA CUDA and OpenCL documentation
ii  nvidia-cuda-gdb                        6.5.19-3~bpo8+1    amd64  NVIDIA CUDA Debugger (GDB)
ii  nvidia-cuda-toolkit                    6.5.19-3~bpo8+1    amd64  NVIDIA CUDA development toolkit
ii  nvidia-driver                          352.79-1~bpo8+1    amd64  NVIDIA metapackage
ii  nvidia-driver-bin                      352.79-1~bpo8+1    amd64  NVIDIA driver support binaries
ii  nvidia-installer-cleanup               20151021+1~bpo8+1  amd64  cleanup after driver installation with the nvidia-installer
ii  nvidia-kernel-common                   20151021+1~bpo8+1  amd64  NVIDIA binary kernel module support files
ii  nvidia-kernel-dkms                     352.79-1~bpo8+1    amd64  NVIDIA binary kernel module DKMS source
ii  nvidia-kernel-support                  352.79-1~bpo8+1    amd64  NVIDIA binary kernel module support files
ii  nvidia-modprobe                        358.09-1~bpo8+1    amd64  utility to load NVIDIA kernel modules and create device nodes
ii  nvidia-persistenced                    358.09-1~bpo8+1    amd64  daemon to maintain persistent software state in the NVIDIA driver
ii  nvidia-profiler                        6.5.19-3~bpo8+1    amd64  NVIDIA Profiler for CUDA and OpenCL
ii  nvidia-smi                             352.79-1~bpo8+1    amd64  NVIDIA System Management Interface
ii  nvidia-support                         20151021+1~bpo8+1  amd64  NVIDIA binary graphics driver support files
ii  nvidia-vdpau-driver:amd64              352.79-1~bpo8+1    amd64  Video Decode and Presentation API for Unix - NVIDIA driver
ii  xserver-xorg-video-nvidia              352.79-1~bpo8+1    amd64  NVIDIA binary Xorg driver
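
To see at a glance which upstream versions are installed, the package list can be grouped by version. A minimal sketch, assuming "package version" pairs on stdin such as dpkg-query can produce; the function name nvidia_versions_summary is hypothetical and not part of any package.

```shell
# Sketch (hypothetical helper): read "package version" pairs on stdin and
# print each upstream version with the number of packages carrying it.
nvidia_versions_summary() {
    awk 'NF >= 2 {
             v = $2
             sub(/-[^-]*$/, "", v)     # drop the Debian revision, if any
             count[v]++
         }
         END { for (v in count) print v, count[v] }' | sort
}

# Possible use (the name patterns are only examples):
#   dpkg-query -W -f='${Package} ${Version}\n' '*nvidia*' 'libcuda*' \
#       | nvidia_versions_summary
```

In the list above, nvidia-modprobe and nvidia-persistenced are at 358.09 while the rest of the driver packages are at 352.79. Whether that difference matters here is not established by this report. The 352.41 reported by the kernel does not appear in the package list at all.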



It seems to me that the nvidia-cuda-toolkit in jessie-backports is
broken. Do you have any suggestions on how to fix this?



