Bug#1074350: nvidia-kernel-dkms: Trying to modprobe nvidia-peermem to use NCCL/RDMA/Infiniband with GPUs

Andreas Beckmann anbe at debian.org
Sat Jul 6 00:33:09 BST 2024


On 04/07/2024 06.19, Jeffrey Mark Siskind wrote:
> I figured it out. doca-ofed aka MLNX_OFED needs to have
> openibd.service running. It failed because opensmd.service was
> running. For some reason, it hung when I tried to stop opensmd.service.
> I rebooted and then nvidia-peermem loaded.

Great!

If you have more hints for other people trying to get that running, too, 
you coul dleave them here in this bug.


Andreas



More information about the pkg-nvidia-devel mailing list