Bug#1030336: (no subject)

Pierre Aussaguel pierre_aussaguel at yahoo.fr
Wed Feb 8 10:50:20 GMT 2023


I upgraded today to v. 520.


I dont know if it is related to the previous problem or not, but I am 
not able to enable OpenCl anymore in Darktable.


"darktable -d opencl" gives the following output

[dt_get_sysresource_level] switched to 2 as `large'
   total mem:       48101MB
   mipmap cache:    6012MB
   available mem:   32882MB
   singlebuff:      751MB
   OpenCL tune mem: OFF
   OpenCL pinned:   OFF
[opencl_init] opencl related configuration options:
[opencl_init] opencl: ON
[opencl_init] opencl_scheduling_profile: 'default'
[opencl_init] opencl_library: 'default path'
[opencl_init] opencl_device_priority: '*/!0,*/*/*'
[opencl_init] opencl_mandatory_timeout: 200
[opencl_init] opencl library 'libOpenCL.so.1' found on your system and 
loaded
[opencl_init] found 1 platform
[opencl_init] found 1 device

[dt_opencl_device_init]
    DEVICE:                   0: 'NVIDIA T400'
    CANONICAL NAME:           nvidiat400
    PLATFORM NAME & VENDOR:   NVIDIA CUDA, NVIDIA Corporation
    DRIVER VERSION:           520.56.06
    DEVICE VERSION:           OpenCL 3.0 CUDA, SM_20 SUPPORT
    DEVICE_TYPE:              GPU
    GLOBAL MEM SIZE:          1868 MB
    MAX MEM ALLOC:            467 MB
    MAX IMAGE SIZE:           32768 x 32768
    MAX WORK GROUP SIZE:      1024
    MAX WORK ITEM DIMENSIONS: 3
    MAX WORK ITEM SIZES:      [ 1024 1024 64 ]
    ASYNC PIXELPIPE:          NO
    PINNED MEMORY TRANSFER:   NO
    MEMORY TUNING:            NO
    FORCED HEADROOM:          400
    AVOID ATOMICS:            NO
    MICRO NAP:                250
    ROUNDUP WIDTH:            16
    ROUNDUP HEIGHT:           16
    CHECK EVENT HANDLES:      128
    PERFORMANCE:              7.964
    TILING ADVANTAGE:         0.000
    DEFAULT DEVICE:           NO
    KERNEL DIRECTORY:         /usr/share/darktable/kernels
    CL COMPILER OPTION:       -cl-fast-relaxed-math
0.161706 [opencl_fopen_stat] could not open file 
`/home/pierre/.cache/darktable/cached_kernels_for_NVIDIAT400_5205606/demosaic_ppg.cl.bin'!
0.161715 [opencl_load_program] could not load cached binary program, 
trying to compile source
0.161829 [opencl_build_program] could not build program: 
CL_BUILD_PROGRAM_FAILURE
0.161832 [dt_opencl_device_init] failed to compile program 
`demosaic_ppg.cl'!
[opencl_init] no suitable devices found.
[opencl_init] FINALLY: opencl is NOT AVAILABLE on this system.
[opencl_init] initial status of opencl enabled flag is OFF.


And "clinfo" gives

Number of platforms                               1
   Platform Name                                   NVIDIA CUDA
   Platform Vendor                                 NVIDIA Corporation
   Platform Version                                OpenCL 3.0 CUDA 11.8.87
   Platform Profile                                FULL_PROFILE
   Platform Extensions cl_khr_global_int32_base_atomics 
cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics 
cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_3d_image_writes 
cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing 
cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll 
cl_nv_copy_opts cl_khr_gl_event cl_nv_create_buffer 
cl_khr_int64_base_atomics cl_khr_int64_extended_atomics 
cl_nv_kernel_attribute cl_khr_device_uuid cl_khr_pci_bus_info 
cl_khr_external_semaphore cl_khr_external_memory 
cl_khr_external_semaphore_opaque_fd cl_khr_external_memory_opaque_fd
   Platform Extensions with Version cl_khr_global_int32_base_atomics 
0x400000 (1.0.0)
cl_khr_global_int32_extended_atomics 0x400000 (1.0.0)
cl_khr_local_int32_base_atomics 0x400000 (1.0.0)
cl_khr_local_int32_extended_atomics 0x400000 (1.0.0)
cl_khr_fp64 0x400000 (1.0.0)
cl_khr_3d_image_writes 0x400000 (1.0.0)
cl_khr_byte_addressable_store 0x400000 (1.0.0)
cl_khr_icd 0x400000 (1.0.0)
cl_khr_gl_sharing 0x400000 (1.0.0)
cl_nv_compiler_options 0x400000 (1.0.0)
cl_nv_device_attribute_query 0x400000 (1.0.0)
cl_nv_pragma_unroll 0x400000 (1.0.0)
cl_nv_copy_opts 0x400000 (1.0.0)
cl_khr_gl_event 0x400000 (1.0.0)
cl_nv_create_buffer 0x400000 (1.0.0)
cl_khr_int64_base_atomics 0x400000 (1.0.0)
cl_khr_int64_extended_atomics 0x400000 (1.0.0)
cl_nv_kernel_attribute 0x400000 (1.0.0)
cl_khr_device_uuid 0x400000 (1.0.0)
cl_khr_pci_bus_info 0x400000 (1.0.0)
cl_khr_external_semaphore 0x9000 (0.9.0)
cl_khr_external_memory 0x9000 (0.9.0)
cl_khr_external_semaphore_opaque_fd 0x9000 (0.9.0)
cl_khr_external_memory_opaque_fd 0x9000 (0.9.0)
   Platform Numeric Version                        0xc00000 (3.0.0)
   Platform Extensions function suffix             NV
   Platform Host timer resolution                  0ns

   Platform Name                                   NVIDIA CUDA
Number of devices                                 1
   Device Name                                     NVIDIA T400
   Device Vendor                                   NVIDIA Corporation
   Device Vendor ID                                0x10de
   Device Version                                  OpenCL 3.0 CUDA
   Device UUID 8d697e0e-da41-3544-60f5-cab4da7779eb
   Driver UUID 8d697e0e-da41-3544-60f5-cab4da7779eb
   Valid Device LUID                               No
   Device LUID 6d69-637300000000
   Device Node Mask                                0
   Device Numeric Version                          0xc00000 (3.0.0)
   Driver Version                                  520.56.06
   Device OpenCL C Version                         OpenCL C 1.2
   Device OpenCL C all versions                    OpenCL 
C                                                         0x400000 (1.0.0)
                                                   OpenCL 
C                                                         0x401000 (1.1.0)
                                                   OpenCL 
C                                                         0x402000 (1.2.0)
                                                   OpenCL 
C                                                         0xc00000 (3.0.0)
   Device OpenCL C features __opencl_c_fp64 0xc00000 (3.0.0)
__opencl_c_images 0xc00000 (3.0.0)
__opencl_c_int64 0xc00000 (3.0.0)
__opencl_c_3d_image_writes 0xc00000 (3.0.0)
   Latest comfornace test passed                   v2021-02-01-00
   Device Type                                     GPU
   Device Topology (NV)                            PCI-E, 0000:05:00.0
   Device Profile                                  FULL_PROFILE
   Device Available                                Yes
   Compiler Available                              Yes
   Linker Available                                Yes
   Max compute units                               6
   Max clock frequency                             1425MHz
   Compute Capability (NV)                         7.5
   Device Partition                                (core)
     Max number of sub-devices                     1
     Supported partition types                     None
     Supported affinity domains                    (n/a)
   Max work item dimensions                        3
   Max work item sizes                             1024x1024x64
   Max work group size                             1024
   Preferred work group size multiple (device)     32
=== CL_PROGRAM_BUILD_LOG ===
   Preferred work group size multiple (kernel) <getWGsizes:1504: create 
kernel : error -45>
   Warp size (NV)                                  32
   Max sub-groups per work group                   0
   Preferred / native vector sizes
     char                                                 1 / 1
     short                                                1 / 1
     int                                                  1 / 1
     long                                                 1 / 1
     half                                                 0 / 0        (n/a)
     float                                                1 / 1
     double                                               1 / 1        
(cl_khr_fp64)
   Half-precision Floating-point support           (n/a)
   Single-precision Floating-point support         (core)
     Denormals                                     Yes
     Infinity and NANs                             Yes
     Round to nearest                              Yes
     Round to zero                                 Yes
     Round to infinity                             Yes
     IEEE754-2008 fused multiply-add               Yes
     Support is emulated in software               No
     Correctly-rounded divide and sqrt operations  Yes
   Double-precision Floating-point support         (cl_khr_fp64)
     Denormals                                     Yes
     Infinity and NANs                             Yes
     Round to nearest                              Yes
     Round to zero                                 Yes
     Round to infinity                             Yes
     IEEE754-2008 fused multiply-add               Yes
     Support is emulated in software               No
   Address bits                                    64, Little-Endian
   Global memory size                              1959264256 (1.825GiB)
   Error Correction support                        No
   Max memory allocation                           489816064 (467.1MiB)
   Unified memory for Host and Device              No
   Integrated memory (NV)                          No
   Shared Virtual Memory (SVM) capabilities        (core)
     Coarse-grained buffer sharing                 Yes
     Fine-grained buffer sharing                   No
     Fine-grained system sharing                   No
     Atomics                                       No
   Minimum alignment for any data type             128 bytes
   Alignment of base address                       4096 bits (512 bytes)
   Preferred alignment for atomics
     SVM                                           0 bytes
     Global                                        0 bytes
     Local                                         0 bytes
   Atomic memory capabilities                      relaxed, work-group scope
   Atomic fence capabilities                       relaxed, 
acquire/release, work-group scope
   Max size for global variable                    0
   Preferred total size of global vars             0
   Global Memory cache type                        Read/Write
   Global Memory cache size                        196608 (192KiB)
   Global Memory cache line size                   128 bytes
   Image support                                   Yes
     Max number of samplers per kernel             32
     Max size for 1D images from buffer            268435456 pixels
     Max 1D or 2D image array size                 2048 images
     Max 2D image size                             32768x32768 pixels
     Max 3D image size 16384x16384x16384 pixels
     Max number of read image args                 256
     Max number of write image args                32
     Max number of read/write image args           0
   Pipe support                                    No
   Max number of pipe args                         0
   Max active pipe reservations                    0
   Max pipe packet size                            0
   Local memory type                               Local
   Local memory size                               49152 (48KiB)
   Registers per block (NV)                        65536
   Max number of constant args                     9
   Max constant buffer size                        65536 (64KiB)
   Generic address space support                   No
   Max size of kernel argument                     4352 (4.25KiB)
   Queue properties (on host)
     Out-of-order execution                        Yes
     Profiling                                     Yes
   Device enqueue capabilities                     (n/a)
   Queue properties (on device)
     Out-of-order execution                        No
     Profiling                                     No
     Preferred size                                0
     Max size                                      0
   Max queues on device                            0
   Max events on device                            0
   Prefer user sync for interop                    No
   Profiling timer resolution                      1000ns
   Execution capabilities
     Run OpenCL kernels                            Yes
     Run native kernels                            No
     Non-uniform work-groups                       No
     Work-group collective functions               No
     Sub-group independent forward progress        No
     Kernel execution timeout (NV)                 Yes
   Concurrent copy and kernel execution (NV)       Yes
     Number of async copy engines                  3
     IL version                                    (n/a)
     ILs with version <printDeviceInfo:186: get 
CL_DEVICE_ILS_WITH_VERSION : error -30>
   printf() buffer size                            1048576 (1024KiB)
   Built-in kernels                                (n/a)
   Built-in kernels with version <printDeviceInfo:190: get 
CL_DEVICE_BUILT_IN_KERNELS_WITH_VERSION : error -30>
   Device Extensions cl_khr_global_int32_base_atomics 
cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics 
cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_3d_image_writes 
cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing 
cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll 
cl_nv_copy_opts cl_khr_gl_event cl_nv_create_buffer 
cl_khr_int64_base_atomics cl_khr_int64_extended_atomics 
cl_nv_kernel_attribute cl_khr_device_uuid cl_khr_pci_bus_info 
cl_khr_external_semaphore cl_khr_external_memory 
cl_khr_external_semaphore_opaque_fd cl_khr_external_memory_opaque_fd
   Device Extensions with Version cl_khr_global_int32_base_atomics 
0x400000 (1.0.0)
cl_khr_global_int32_extended_atomics 0x400000 (1.0.0)
cl_khr_local_int32_base_atomics 0x400000 (1.0.0)
cl_khr_local_int32_extended_atomics 0x400000 (1.0.0)
cl_khr_fp64 0x400000 (1.0.0)
cl_khr_3d_image_writes 0x400000 (1.0.0)
cl_khr_byte_addressable_store 0x400000 (1.0.0)
cl_khr_icd 0x400000 (1.0.0)
cl_khr_gl_sharing 0x400000 (1.0.0)
cl_nv_compiler_options 0x400000 (1.0.0)
cl_nv_device_attribute_query 0x400000 (1.0.0)
cl_nv_pragma_unroll 0x400000 (1.0.0)
cl_nv_copy_opts 0x400000 (1.0.0)
cl_khr_gl_event 0x400000 (1.0.0)
cl_nv_create_buffer 0x400000 (1.0.0)
cl_khr_int64_base_atomics 0x400000 (1.0.0)
cl_khr_int64_extended_atomics 0x400000 (1.0.0)
cl_nv_kernel_attribute 0x400000 (1.0.0)
cl_khr_device_uuid 0x400000 (1.0.0)
cl_khr_pci_bus_info 0x400000 (1.0.0)
cl_khr_external_semaphore 0x9000 (0.9.0)
cl_khr_external_memory 0x9000 (0.9.0)
cl_khr_external_semaphore_opaque_fd 0x9000 (0.9.0)
cl_khr_external_memory_opaque_fd 0x9000 (0.9.0)

NULL platform behavior
   clGetPlatformInfo(NULL, CL_PLATFORM_NAME, ...)  NVIDIA CUDA
   clGetDeviceIDs(NULL, CL_DEVICE_TYPE_ALL, ...)   Success [NV]
   clCreateContext(NULL, ...) [default]            Success [NV]
   clCreateContextFromType(NULL, CL_DEVICE_TYPE_DEFAULT)  No platform
   clCreateContextFromType(NULL, CL_DEVICE_TYPE_CPU)  No devices found 
in platform
   clCreateContextFromType(NULL, CL_DEVICE_TYPE_GPU)  No platform
   clCreateContextFromType(NULL, CL_DEVICE_TYPE_ACCELERATOR)  No devices 
found in platform
   clCreateContextFromType(NULL, CL_DEVICE_TYPE_CUSTOM)  Invalid device 
type for platform
   clCreateContextFromType(NULL, CL_DEVICE_TYPE_ALL)  No platform

ICD loader properties
   ICD loader Name                                 OpenCL ICD Loader
   ICD loader Vendor                               OCL Icd free software
   ICD loader Version                              2.3.1
   ICD loader Profile                              OpenCL 3.0





More information about the pkg-nvidia-devel mailing list