...nccl-test on a single machine with multiple GPUs (H800...
I'm using NCCL 2.21.5+cuda12.4 with nvidia-driver 550.54.15 and nvidia-fabricmanager at the same version. When I run nccl-test on a single machine, it fails with an "Invalid argument" error. The command I ran:
NCCL_DEBUG=INFO ./build/all_reduce_perf -b 8 -e 128M -f 2 -g 8 ...