Anurag Kumar Singh
Anurag Kumar Singh
I checked `import horovod.tensorflow` it's working and I followed all the steps mentioned in the documentation but still I am getting same error. I am doing it all this in...
As I mentioned above I was doing it in a virtual environment now I installed horovod again globally now it is working in virtual environment also.But Now I am getting...
NVIDIA driver is installed this error come whenever I try to run deepmd-kit with more than 1 process.
Solution -- given by [benmenadue](https://github.com/benmenadue) unable to understand his solution. If you can help me out what changes do I have to make. System -- NCCL - 2.12.12 Workstation with...
@njzjz The link you mentioned I tried that link I was able to run that nccl-test via cudart (tensorflow) anurag1@hp-HP-Z8-G4-Workstation:/nccl-tests\$ NCCL_DEBUG=WARN LD_LIBRARY_PATH=~/.local/nccl/lib/ ./src/build-shared/all_gather_perf nThread 1 nGpus 1 minBytes 33554432 maxBytes...
@njzjz I tried using conda but still error is same Output -- [0] DEEPMD rank:0 INFO built training [0] DEEPMD rank:0 INFO initialize model from scratch [0] DEEPMD rank:0 INFO...
@njzjz Pardon me for asking questions on this issue after a long time. Could you please tell me in which file I need to make a change - CUDARTLIB="cuda". I...
@njzjz Thanks for the prompt response. When I try to run I got the following error --  I think I am missing some steps as it requires Makefile as...