e4t-diffusion
Implementation of Encoder-based Domain Tuning for Fast Personalization of Text-to-Image Models
**How many images and resources are needed for pre-training?** Thanks! I use a single A100 GPU (40 GB), and pre-training on one image takes 24 hours.
This line is `assert ckpt_path in MODELS, f"Choose from {list(MODELS.keys())}"`, and "e4t-diffusion-ffhq-celebahq-v1" is the only key of MODELS. So, in the function load_e4t_unet, if os.path.exists(ckpt_path) is False, you WILL get an assert...
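For context, here is a minimal sketch (not the repository's actual code) of the control flow described above, assuming MODELS maps release names to checkpoint locations: when ckpt_path is neither an existing local file nor a key of MODELS, the assert fires.

```python
# Sketch only: the MODELS contents and the loading step are assumptions;
# only the assert mirrors the line quoted above.
import os

MODELS = {
    "e4t-diffusion-ffhq-celebahq-v1": "<download-url>",  # hypothetical entry
}

def load_e4t_unet(ckpt_path):
    if not os.path.exists(ckpt_path):
        # Any value that is not a local path must be a known release name,
        # otherwise this assert raises.
        assert ckpt_path in MODELS, f"Choose from {list(MODELS.keys())}"
        ckpt_path = MODELS[ckpt_path]
    # ... load the UNet weights from ckpt_path
```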
Hello, I really admire your implementation and I am planning to use the e4t code, but unfortunately I can't run it. The main problem is at https://github.com/mkshing/e4t-diffusion/blob/main/pretrain_e4t.py#L238, where we...
the accelerate yaml file
```yaml
compute_environment: LOCAL_MACHINE
distributed_type: MULTI_GPU
downcast_bf16: 'no'
gpu_ids: all
machine_rank: 0
main_training_function: main
mixed_precision: fp8
num_machines: 1
num_processes: 2
rdzv_backend: static
same_network: true
tpu_env: []
tpu_use_cluster: ...
```
Thank you for your implementation! However, I have some questions about the settings and results. I use your **pretrained encoder.pt and weight_offsets.pt** on CelebA-HQ and FFHQ. I had to use the...
In the fine-tuning code, there is an assert that hard-codes the special token as {placeholder_token}.
```python
assert (
    "{placeholder_token}" in args.prompt_template
), "You must specify the location of placeholder token...
```
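As a hedged illustration of why the assert checks for the literal string, here is a small sketch (the argument names follow the issue; the template handling itself is an assumption): the "{placeholder_token}" marker in --prompt_template is validated first and only later replaced with the actual token.

```python
# Sketch only: the literal "{placeholder_token}" marker is required in the template,
# then substituted with the real token via str.format.
import argparse

parser = argparse.ArgumentParser()
parser.add_argument("--prompt_template", default="a photo of {placeholder_token}")
parser.add_argument("--placeholder_token", default="*s")
args = parser.parse_args()

assert (
    "{placeholder_token}" in args.prompt_template
), "You must specify the location of the placeholder token with '{placeholder_token}'"

# The marker is later filled in with the actual placeholder token:
prompt = args.prompt_template.format(placeholder_token=args.placeholder_token)
print(prompt)  # e.g. "a photo of *s"
```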
Hi. Your implementation is working well for me, but in some cases the output is less than ideal. Would it be possible to upgrade this to work with Stable Diffusion...
I ran the pretraining script
```
CUDA_VISIBLE_DEVICES=1 accelerate launch pretrain_e4t.py \
  --pretrained_model_name_or_path="CompVis/stable-diffusion-v1-4" \
  --clip_model_name_or_path="ViT-H-14::laion2b_s32b_b79k" \
  --domain_class_token="cat" \
  --placeholder_token="*s" \
  --prompt_template=normal \
  --save_sample_prompt="a photo of the *s, a photo of the *s in monet style" \
  --reg_lambda=0.01 \
  --domain_embed_scale=0.1 \
  --output_dir="pretrained-cat" \
  ...
```