Bug#1074350: nvidia-kernel-dkms: Trying to modprobe nvidia-peermem to use NCCL/RDMA/Infiniband with GPUs
Andreas Beckmann
anbe at debian.org
Sat Jul 6 00:33:09 BST 2024
On 04/07/2024 06.19, Jeffrey Mark Siskind wrote:
> I figured it out. doca-ofed aka MLNX_OFED needs to have
> openibd.service running. It failed because opensmd.service was
> running. For some reason, it hung when I tried to stop opensmd.service.
> I rebooted and then nvidia-peermem loaded.
Great!
If you have more hints for other people trying to get that running, too,
you coul dleave them here in this bug.
Andreas
More information about the pkg-nvidia-devel
mailing list