Bug#889097: nvidia-libopencl1: Missing NVidia OpenCL platform

Krzysztof Marczak buddhi1980 at gmail.com
Fri Feb 2 16:19:20 UTC 2018


Thank you for quick reply.
You were right. It's look like it's the same problem as reported in #888952
When after reboot I don't run clinfo as a root, the NVidia OpenCL platform
is not visible. After running 'sudo clinfo' it starts to work properly.
It's reproducible all the time.

It looks like nvidia_uvm kernel module is loaded when clinfo is used
when I run folowing commands:
$ lsmod >modules_before.txt
$ sudo clinfo
$ lsmod >modules_after.txt
$ diff modules_before.txt modules_after.txt

the output if diff is following:
1a2
> nvidia_uvm            765952  0
79c80
< nvidia              13168640  1057 nvidia_modeset
---
> nvidia              13168640  1055 nvidia_modeset,nvidia_uvm


This is the output of clinfo run as a root:

X server found. dri2 connection failed!
DRM_IOCTL_I915_GEM_APERTURE failed: Invalid argument
Assuming 131072kB available aperture size.
May lead to reduced performance or incorrect rendering.
get chip id failed: -1 [22]
param: 4, val: 0
Number of platforms                               2
 Platform Name                                   Intel Gen OCL Driver
 Platform Vendor                                 Intel
 Platform Version                                OpenCL 2.0 beignet 1.3
 Platform Profile                                FULL_PROFILE
 Platform Extensions
cl_khr_global_int32_base_atomics
cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics
cl_khr_local_int32_extended_atomics cl_khr_byte_addressable
_store cl_khr_3d_image_writes cl_khr_image2d_from_buffer
cl_khr_depth_images cl_khr_spir cl_khr_icd cl_intel_accelerator
cl_intel_subgroups cl_intel_subgroups_short cl_khr_gl_sharing
 Platform Extensions function suffix             Intel
X server found. dri2 connection failed!
DRM_IOCTL_I915_GEM_APERTURE failed: Invalid argument
Assuming 131072kB available aperture size.
May lead to reduced performance or incorrect rendering.
get chip id failed: -1 [22]
param: 4, val: 0
X server found. dri2 connection failed!
DRM_IOCTL_I915_GEM_APERTURE failed: Invalid argument
Assuming 131072kB available aperture size.
May lead to reduced performance or incorrect rendering.
get chip id failed: -1 [22]
param: 4, val: 0
X server found. dri2 connection failed!
DRM_IOCTL_I915_GEM_APERTURE failed: Invalid argument
Assuming 131072kB available aperture size.
May lead to reduced performance or incorrect rendering.
get chip id failed: -1 [22]
param: 4, val: 0
X server found. dri2 connection failed!
DRM_IOCTL_I915_GEM_APERTURE failed: Invalid argument
Assuming 131072kB available aperture size.
May lead to reduced performance or incorrect rendering.
get chip id failed: -1 [22]
param: 4, val: 0
X server found. dri2 connection failed!



DRM_IOCTL_I915_GEM_APERTURE failed: Invalid argument



Assuming 131072kB available aperture size.



May lead to reduced performance or incorrect rendering.



get chip id failed: -1 [22]



param: 4, val: 0







 Platform Name                                   NVIDIA CUDA



 Platform Vendor                                 NVIDIA Corporation



 Platform Version                                OpenCL 1.2 CUDA 9.0.282



 Platform Profile                                FULL_PROFILE



 Platform Extensions
cl_khr_global_int32_base_atomics
cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics
cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte
_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options
cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_copy_opts
cl_nv_create_buffer
 Platform Extensions function suffix             NV







 Platform Name                                   Intel Gen OCL Driver



Number of devices                                 1



X server found. dri2 connection failed!



DRM_IOCTL_I915_GEM_APERTURE failed: Invalid argument



Assuming 131072kB available aperture size.



May lead to reduced performance or incorrect rendering.



get chip id failed: -1 [22]



param: 4, val: 0



X server found. dri2 connection failed!
DRM_IOCTL_I915_GEM_APERTURE failed: Invalid argument
Assuming 131072kB available aperture size.
May lead to reduced performance or incorrect rendering.
get chip id failed: -1 [22]
param: 4, val: 0
 Device Name                                     Intel(R) HD Graphics
Haswell GT2 Desktop
 Device Vendor                                   Intel
 Device Vendor ID                                0x8086
 Device Version                                  OpenCL 1.2 beignet 1.3
 Driver Version                                  1.3
 Device OpenCL C Version                         OpenCL C 1.2 beignet 1.3
 Device Type                                     GPU
 Device Available                                Yes
 Device Profile                                  FULL_PROFILE
 Max compute units                               20
 Max clock frequency                             1000MHz
 Device Partition                                (core)
   Max number of sub-devices                     1
   Supported partition types                     None, None, None
 Max work item dimensions                        3
 Max work item sizes                             512x512x512
 Max work group size                             512
 Compiler Available                              Yes
 Linker Available                                Yes
X server found. dri2 connection failed!
DRM_IOCTL_I915_GEM_APERTURE failed: Invalid argument
Assuming 131072kB available aperture size.
May lead to reduced performance or incorrect rendering.
get chip id failed: -1 [22]
param: 4, val: 0
X server found. dri2 connection failed!
DRM_IOCTL_I915_GEM_APERTURE failed: Invalid argument
Assuming 131072kB available aperture size.
May lead to reduced performance or incorrect rendering.
get chip id failed: -1 [22]
param: 4, val: 0
X server found. dri2 connection failed!
DRM_IOCTL_I915_GEM_APERTURE failed: Invalid argument
Assuming 131072kB available aperture size.
May lead to reduced performance or incorrect rendering.
get chip id failed: -1 [22]
param: 4, val: 0
 Preferred work group size multiple              16
 Preferred / native vector sizes
   char                                                16 / 8
   short                                                8 / 8
   int                                                  4 / 4
   long                                                 2 / 2
   half                                                 0 / 8        (n/a)
   float                                                4 / 4
   double                                               0 / 2        (n/a)
 Half-precision Floating-point support           (n/a)
 Single-precision Floating-point support         (core)
   Denormals                                     No
   Infinity and NANs                             Yes
   Round to nearest                              Yes
   Round to zero                                 No
   Round to infinity                             No
   IEEE754-2008 fused multiply-add               No
   Support is emulated in software               No
   Correctly-rounded divide and sqrt operations  No
 Double-precision Floating-point support         (n/a)
 Address bits                                    32, Little-Endian
 Global memory size                              2147483648
<(214)%20748-3648> (2GiB)
 Error Correction support                        No
 Max memory allocation                           1610612736 (1.5GiB)
 Unified memory for Host and Device              Yes
 Minimum alignment for any data type             128 bytes
 Alignment of base address                       1024 bits (128 bytes)
 Global Memory cache type                        Read/Write
 Global Memory cache size                        8192 (8KiB)
 Global Memory cache line size                   64 bytes
 Image support                                   Yes
   Max number of samplers per kernel             16
   Max size for 1D images from buffer            65536 pixels
   Max 1D or 2D image array size                 2048 images
   Base address alignment for 2D image buffers   4096 bytes
   Pitch alignment for 2D image buffers          1 pixels
   Max 2D image size                             8192x8192 pixels
   Max 3D image size                             8192x8192x2048 pixels
   Max number of read image args                 128
   Max number of write image args                8
 Local memory type                               Local
 Local memory size                               65536 (64KiB)
 Max constant buffer size                        134217728 (128MiB)
 Max number of constant args                     8
 Max size of kernel argument                     1024
 Queue properties
   Out-of-order execution                        No
   Profiling                                     Yes
 Prefer user sync for interop                    Yes
 Profiling timer resolution                      80ns
 Execution capabilities
   Run OpenCL kernels                            Yes
   Run native kernels                            Yes
   SPIR versions                                 1.2
 printf() buffer size                            1048576 (1024KiB)
 Built-in kernels
 __cl_copy_region_align4;__cl_copy_region_align16;__cl_cpy_
region_unalign_same_offset;__cl_copy_region_unalign_dst_
offset;__cl_copy_region_unalign_src_offset;__cl
_copy_buffer_rect;__cl_copy_image_1d_to_1d;__cl_copy_
image_2d_to_2d;__cl_copy_image_3d_to_2d;__cl_copy_image_2d_to_3d;__cl_copy_
image_3d_to_3d;__cl_copy_image_2d_to_buffer;__cl_copy_
image_3d_to_buffer;__cl_copy_
buffer_to_image_2d;__cl_copy_buffer_to_image_3d;__cl_fill_
region_unalign;__cl_fill_region_align2;__cl_fill_region_align4;__cl_fill_
region_align8_2;__cl_fill_region_align8_4;__cl_fill_
region_align8_8;__cl_fill_re
gion_align8_16;__cl_fill_region_align128;__cl_fill_
image_1d;__cl_fill_image_1d_array;__cl_fill_image_2d;__cl_
fill_image_2d_array;__cl_fill_image_3d;
 Device Extensions
cl_khr_global_int32_base_atomics
cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics
cl_khr_local_int32_extended_atomics cl_khr_byte_addressable
_store cl_khr_3d_image_writes cl_khr_image2d_from_buffer
cl_khr_depth_images cl_khr_spir cl_khr_icd cl_intel_accelerator
cl_intel_subgroups cl_intel_subgroups_short cl_khr_gl_sharing

 Platform Name                                   NVIDIA CUDA
Number of devices                                 1
 Device Name                                     GeForce GTX 1050
 Device Vendor                                   NVIDIA Corporation
 Device Vendor ID                                0x10de
 Device Version                                  OpenCL 1.2 CUDA
 Driver Version                                  384.111
 Device OpenCL C Version                         OpenCL C 1.2
 Device Type                                     GPU
 Device Available                                Yes
 Device Profile                                  FULL_PROFILE
 Device Topology (NV)                            PCI-E, 01:00.0
 Max compute units                               5
 Max clock frequency                             1455MHz
 Compute Capability (NV)                         6.1
 Device Partition                                (core)
   Max number of sub-devices                     1
   Supported partition types                     None
 Max work item dimensions                        3
 Max work item sizes                             1024x1024x64
 Max work group size                             1024
 Compiler Available                              Yes
 Linker Available                                Yes
 Preferred work group size multiple              32
 Warp size (NV)                                  32
 Preferred / native vector sizes
   char                                                 1 / 1
   short                                                1 / 1
   int                                                  1 / 1
   long                                                 1 / 1
   half                                                 0 / 0        (n/a)
   float                                                1 / 1
   double                                               1 / 1
       (cl_khr_fp64)
 Half-precision Floating-point support           (n/a)
 Single-precision Floating-point support         (core)
   Denormals                                     Yes
   Infinity and NANs                             Yes
   Round to nearest                              Yes
   Round to zero                                 Yes
   Round to infinity                             Yes
   IEEE754-2008 fused multiply-add               Yes
   Support is emulated in software               No
   Correctly-rounded divide and sqrt operations  Yes
 Double-precision Floating-point support         (cl_khr_fp64)
   Denormals                                     Yes
   Infinity and NANs                             Yes
   Round to nearest                              Yes
   Round to zero                                 Yes
   Round to infinity                             Yes
   IEEE754-2008 fused multiply-add               Yes
   Support is emulated in software               No
 Address bits                                    64, Little-Endian
 Global memory size                              2092892160
<(209)%20289-2160> (1.949GiB)
 Error Correction support                        No
 Max memory allocation                           523223040 (499MiB)
 Unified memory for Host and Device              No
 Integrated memory (NV)                          No
 Minimum alignment for any data type             128 bytes
 Alignment of base address                       4096 bits (512 bytes)
 Global Memory cache type                        Read/Write
 Global Memory cache size                        81920 (80KiB)
 Global Memory cache line size                   128 bytes
 Image support                                   Yes
   Max number of samplers per kernel             32
   Max size for 1D images from buffer            134217728 pixels
   Max 1D or 2D image array size                 2048 images
   Max 2D image size                             16384x32768 pixels
   Max 3D image size                             16384x16384x16384 pixels
   Max number of read image args                 256
   Max number of write image args                16
 Local memory type                               Local
 Local memory size                               49152 (48KiB)
 Registers per block (NV)                        65536
 Max constant buffer size                        65536 (64KiB)
 Max number of constant args                     9
 Max size of kernel argument                     4352 (4.25KiB)
 Queue properties
   Out-of-order execution                        Yes
   Profiling                                     Yes
 Prefer user sync for interop                    No
 Profiling timer resolution                      1000ns
 Execution capabilities
   Run OpenCL kernels                            Yes
   Run native kernels                            No
   Kernel execution timeout (NV)                 Yes
 Concurrent copy and kernel execution (NV)       Yes
   Number of async copy engines                  2
 printf() buffer size                            1048576 (1024KiB)
 Built-in kernels
 Device Extensions
cl_khr_global_int32_base_atomics
cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics
cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte
_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options
cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_copy_opts
cl_nv_create_buffer

NULL platform behavior
 clGetPlatformInfo(NULL, CL_PLATFORM_NAME, ...)  No platform
 clGetDeviceIDs(NULL, CL_DEVICE_TYPE_ALL, ...)   No platform
 clCreateContext(NULL, ...) [default]            No platform
X server found. dri2 connection failed!
DRM_IOCTL_I915_GEM_APERTURE failed: Invalid argument
Assuming 131072kB available aperture size.
May lead to reduced performance or incorrect rendering.
get chip id failed: -1 [22]
param: 4, val: 0
X server found. dri2 connection failed!
DRM_IOCTL_I915_GEM_APERTURE failed: Invalid argument
Assuming 131072kB available aperture size.
May lead to reduced performance or incorrect rendering.
get chip id failed: -1 [22]
param: 4, val: 0
X server found. dri2 connection failed!
DRM_IOCTL_I915_GEM_APERTURE failed: Invalid argument
Assuming 131072kB available aperture size.
May lead to reduced performance or incorrect rendering.
get chip id failed: -1 [22]
param: 4, val: 0
 clCreateContext(NULL, ...) [other]              Success [Intel]
 clCreateContextFromType(NULL, CL_DEVICE_TYPE_DEFAULT)  No platform
 clCreateContextFromType(NULL, CL_DEVICE_TYPE_CPU)  No platform
 clCreateContextFromType(NULL, CL_DEVICE_TYPE_GPU)  No platform
 clCreateContextFromType(NULL, CL_DEVICE_TYPE_ACCELERATOR)  No platform
 clCreateContextFromType(NULL, CL_DEVICE_TYPE_CUSTOM)  No platform
 clCreateContextFromType(NULL, CL_DEVICE_TYPE_ALL)  No platform


2018-02-02 0:24 GMT+01:00 Andreas Beckmann <anbe at debian.org>:

> On 2018-02-01 22:30, Krzysztof Marczak wrote:
> > After lastest update to version nvidia-graphics-drivers 384.111-3 the
> OpenCL
> > platform in no longer available in the system. clinfo command lists only
> other
> > platforms (e.g. Intel).
> > Just before this update everything worked properly.
> >
> > I have tried to unistall (purge) all opencl-icd packages and install
> again, but
> > it did not help.
> >
> > In actual state there is not possible to run any software using OpenCL
> with
> > nvidia proprietary drivers.
>
> Looks like a duplicate of #888952
> Running clinfo as root helped there - is some module not loaded?
>
> Which package version combination was working previously?
>
>
> Andreas
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.alioth.debian.org/pipermail/pkg-nvidia-devel/attachments/20180202/758578d3/attachment-0001.html>


More information about the pkg-nvidia-devel mailing list