rccl
rccl copied to clipboard
ROCm Communication Collectives Library (RCCL)
## Details ___Do not mention proprietary info or link to internal work items in this PR.___ **Work item:** _"Internal", or link to GitHub issue (if applicable)._ SWDEV-456817 **What were the...
### Problem Description Want to check the Algorithm and protocol being selected for all_reduce_perf. Uncommented line 1520 in enqueue.cc, but none print statements were seen. How do we check the...
Adding opt-in experimental XCC affinity mode for MI300 This should be considered experimental and not guaranteed to work in current/future versions of ROCm.
## Details **Work item:** Internal **What were the changes?** Enable the use of `amdclang++` instead of `hipcc` for building RCCL. **Why were the changes made?** - Update `CXX` and `C`...
Define APIs that can be implemented by a dynamic plugin to export flow info. Co-authored with Tom Emmons . Export flow information for ring and tree topologies currently, covering both...
- Added Dockerfile - Updated README.md with instructions for using Dockerfile
I found a bug in the NIC selection in rccl, which I have detailed in the following documentation. I've made a few improvements to make better use of the NIC...