Tony
Tony
I have the same error. Did you fix it?
Bugs still appear. No idea how to do with it.
> Your NIC IP address may not be configured correctly. Thanks! Could you please tell me how to check if the IP address is correct? Both node can access to...
> I’m not an expert in this area, but I think you could try using `ib_write_bw` to test connectivity. I tried 'ib_write_bw' on both nodes and I got these: ```log...
I solved this problem by seting env: ```shell export NVSHMEM_DISABLE_CUDA_VMM=1 ``` but I don't know why this could work.
> > I solved this problem by seting env: > > export NVSHMEM_DISABLE_CUDA_VMM=1 > > but I don't know why this could work. > > Memory fragmentation of NVSHMEM, did...