mamba icon indicating copy to clipboard operation
mamba copied to clipboard

Fix mamba installation on AMD GPUs: clang++: error: cannot determine amdgcn architecture: /opt/rocm/lib/llvm/bin/amdgpu-arch: ; consider passing it via '--offload-arch'

Open rodosingh opened this issue 1 year ago • 0 comments

Hi Folks (especially AMD GPUs' trainer),

While installing/building mamba-ssm from repository, if you have faced issues with cannot determine amdgcn architecture and the solution suggested being consider passing it via '--offload-arch'. Then here is the solution.

[NOTE]: This have been tried with the rocm/pytorch-training docker image.

First of all, try to checkout to a particular commit of mamba repo to avoid possible errors like NameError: name 'bare_metal_version' is not defined and then export a variable that overrides the possibility of setting --offload_arch=native as done in setup.py of mamba repo.

Steps:

git clone https://github.com/state-spaces/mamba.git mamba_ssm
cd mamba_ssm
git checkout 014c094
export HIP_ARCHITECTURES="gfx942" # For MI300 only. Replace it with your architecture(s)
pip install --no-cache-dir --verbose .

I tried searching for the HIP_ARCHITECTURES variable by loading docker image and os.getenv(), but couldn't find any.

rodosingh avatar Apr 03 '25 14:04 rodosingh