train_dreambooth_inpaint.py throwing "returned non-zero exit status 1" error

Open lakshman111 opened this issue 3 years ago • 0 comments

Describe the bug

I've been running train_dreambooth_inpaint.py from https://github.com/huggingface/diffusers/tree/main/examples/research_projects/dreambooth_inpaint for the last few days with the same environment configurations. Today I ran it again and got:

Traceback (most recent call last): File "train_dreambooth_inpaint.py", line 825, in main() File "train_dreambooth_inpaint.py", line 421, in main accelerator = Accelerator( TypeError: init() got an unexpected keyword argument 'accelerator_project_config' Traceback (most recent call last): File "/usr/local/envs/laksh/bin/accelerate", line 8, in sys.exit(main()) File "/usr/local/envs/laksh/lib/python3.8/site-packages/accelerate/commands/accelerate_cli.py", line 45, in main args.func(args) File "/usr/local/envs/laksh/lib/python3.8/site-packages/accelerate/commands/launch.py", line 1097, in launch_command simple_launcher(args) File "/usr/local/envs/laksh/lib/python3.8/site-packages/accelerate/commands/launch.py", line 552, in simple_launcher raise subprocess.CalledProcessError(returncode=process.returncode, cmd=cmd) subprocess.CalledProcessError: Command '['/usr/local/envs/laksh/bin/python', 'train_dreambooth_inpaint.py', '--pretrained_model_name_or_path=runwayml/stable-diffusion-inpainting', '--instance_data_dir=/workspace/diffusers/examples/research_projects/dreambooth_inpaint/modern_512_images', '--output_dir=/workspace/diffusers/examples/research_projects/dreambooth_inpaint/modern_512_400steps_model', '--instance_prompt=a photo of mmodern furniture', '--resolution=512', '--train_batch_size=1', '--gradient_accumulation_steps=1', '--learning_rate=5e-6', '--lr_scheduler=constant', '--lr_warmup_steps=0', '--max_train_steps=400']' returned non-zero exit status 1.

Reproduction

SSH into GPU
Create a conda environment: conda create -n YOUR_ENV_NAME python=3.8
Activate the session: source activate YOUR_ENV_NAME
CD into https://github.com/huggingface/diffusers/tree/main/examples/research_projects/dreambooth_inpaint
pip install -r requirements.txt
pip install git+https://github.com/huggingface/diffusers
Run the first code snippet with custom inputs (my code is below)

`export MODEL_NAME="runwayml/stable-diffusion-inpainting" export INSTANCE_DIR="/workspace/diffusers/examples/research_projects/dreambooth_inpaint/modern_512_images" export OUTPUT_DIR="/workspace/diffusers/examples/research_projects/dreambooth_inpaint/modern_512_400steps_model"

accelerate launch train_dreambooth_inpaint.py
--pretrained_model_name_or_path=$MODEL_NAME
--instance_data_dir=$INSTANCE_DIR
--output_dir=$OUTPUT_DIR
--instance_prompt="a photo of mmodern furniture"
--resolution=512
--train_batch_size=1
--gradient_accumulation_steps=1
--learning_rate=5e-6
--lr_scheduler="constant"
--lr_warmup_steps=0
--max_train_steps=400`

Logs

No response

System Info

diffusers version: 0.14.0.dev0
Platform: Linux-5.15.0-47-generic-x86_64-with-glibc2.17
Python version: 3.8.16
PyTorch version (GPU?): 1.13.1+cu117 (True)
Huggingface_hub version: 0.12.1
Transformers version: 4.26.1
Accelerate version: 0.16.0
xFormers version: not installed
Using GPU in script?: yes
Using distributed or parallel set-up in script?: no

Feb 21 '23 06:02 lakshman111