train_dreambooth_inpaint.py throwing "returned non-zero exit status 1" error
Describe the bug
I've been running train_dreambooth_inpaint.py from https://github.com/huggingface/diffusers/tree/main/examples/research_projects/dreambooth_inpaint for the last few days with the same environment configurations. Today I ran it again and got:
Traceback (most recent call last):
File "train_dreambooth_inpaint.py", line 825, in
Reproduction
- SSH into GPU
- Create a conda environment: conda create -n YOUR_ENV_NAME python=3.8
- Activate the session: source activate YOUR_ENV_NAME
- CD into https://github.com/huggingface/diffusers/tree/main/examples/research_projects/dreambooth_inpaint
- pip install -r requirements.txt
- pip install git+https://github.com/huggingface/diffusers
- Run the first code snippet with custom inputs (my code is below)
`export MODEL_NAME="runwayml/stable-diffusion-inpainting" export INSTANCE_DIR="/workspace/diffusers/examples/research_projects/dreambooth_inpaint/modern_512_images" export OUTPUT_DIR="/workspace/diffusers/examples/research_projects/dreambooth_inpaint/modern_512_400steps_model"
accelerate launch train_dreambooth_inpaint.py
--pretrained_model_name_or_path=$MODEL_NAME
--instance_data_dir=$INSTANCE_DIR
--output_dir=$OUTPUT_DIR
--instance_prompt="a photo of mmodern furniture"
--resolution=512
--train_batch_size=1
--gradient_accumulation_steps=1
--learning_rate=5e-6
--lr_scheduler="constant"
--lr_warmup_steps=0
--max_train_steps=400`
Logs
No response
System Info
-
diffusersversion: 0.14.0.dev0 - Platform: Linux-5.15.0-47-generic-x86_64-with-glibc2.17
- Python version: 3.8.16
- PyTorch version (GPU?): 1.13.1+cu117 (True)
- Huggingface_hub version: 0.12.1
- Transformers version: 4.26.1
- Accelerate version: 0.16.0
- xFormers version: not installed
- Using GPU in script?: yes
- Using distributed or parallel set-up in script?: no