Sourav Chakraborty
Sourav Chakraborty
Hi @jedbrown We fixed several IPC issues in ROCm 3.7 release. Can you give it a try? It's recommended to uninstall any older ROCm version before installing 3.7.
Hi @jedbrown, I believe it's a ROCR issue and not UCX. I would like to send you some test programs to debug it further. Is the email @jedbrown.org good to...
The root cause was confirmed to be with ROCR support for Radeon VII and not UCX. An internal issue has been raised to resolve this.
The 44.5 GB pytorch.bin file came from the google/t5-v1_1-xxl model: https://huggingface.co/google/t5-v1_1-xxl/tree/main The other dependencies are: openai/clip-vit-large-patch1 and Falconsai/nsfw_image_detection