Update A1111 to v1.10.1 and add ROCm support on A1111
This pull request introduces significant updates to the Docker setup for Stable Diffusion web UI services, with a focus on improving hardware compatibility, updating dependencies, and refactoring service definitions. The changes include upgrading base images and software versions, adding support for AMD ROCm devices, and restructuring the docker-compose.yml file to better organize service configurations for CUDA, ROCm, and CPU environments.
Hardware compatibility and service configuration:
- Added ROCm (AMD GPU) support with a new
auto-rocmservice and correspondingDockerfile-rocm, including device, environment, and build settings for ROCm hardware. [1] [2] - Refactored
docker-compose.ymlto split service definitions by hardware type (CUDA, ROCm, CPU) and grouped CLI arguments for each, improving maintainability and clarity. [1] [2]
Dependency and version updates:
- Updated base images and Python dependencies for all services, including the upgrade of PyTorch to 2.5.1 (CUDA) and 2.6.0 (ROCm), and bumping the Stable Diffusion web UI to version 1.10.1. [1] [2] [3] [4]
- Upgraded the Alpine Git and Bash images used for downloads and utility containers to newer versions for improved security and compatibility. [1] [2]
General improvements and bug fixes:
- Improved robustness of Gradio patching by searching for
routes.pyfiles dynamically and applying changes, with added error handling. [1] [2] - Updated image tags and build contexts to reflect new versions and configurations for each service profile.
i'm getting this error on my machine:
$ docker compose --profile auto-rocm up --build -d
...
[+] Running 1/2
✔ sd-auto-rocm:79 Built 0.0s
⠙ Container webui-docker-auto-rocm-1 Starting 0.1s
DEBU[0000] otel error error="<nil>"
DEBU[0000] otel error error="<nil>"
Error response from daemon: could not select device driver "amd" with capabilities: [[gpu]]
any thoughts?
i'm getting this error on my machine:
$ docker compose --profile auto-rocm up --build -d ... [+] Running 1/2 ✔ sd-auto-rocm:79 Built 0.0s ⠙ Container webui-docker-auto-rocm-1 Starting 0.1s DEBU[0000] otel error error="<nil>" DEBU[0000] otel error error="<nil>" Error response from daemon: could not select device driver "amd" with capabilities: [[gpu]]any thoughts?
That is a weird error. I'll try to look into it.
Meanwhile, did you update the env vars to match your system:
- ROCm_VERSION=6.4
- HIP_VISIBLE_DEVICES=0
- HSA_OVERRIDE_GFX_VERSION=11.0.0