wyyfff

Results: 4 comments by wyyfff

> [@wyyfffff](https://github.com/wyyfffff) Please reduce the learning rate. 1e-6 is safe.

@Artiprocher Thank you for your reply! I have already reduced the learning rate to 1e-8, with learning rate warmup...
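For readers unfamiliar with learning rate warmup, here is a minimal plain-Python sketch of a linear warmup schedule. The function name `warmup_lr`, the `warmup_steps` value, and the choice of a linear ramp are assumptions made for illustration; the comment above does not say which schedule or step count was used.

```python
def warmup_lr(step: int, base_lr: float = 1e-6, warmup_steps: int = 500) -> float:
    """Return the learning rate for a 0-indexed training step.

    Hypothetical helper: ramps linearly up to base_lr over warmup_steps,
    then holds base_lr constant.
    """
    if warmup_steps <= 0:
        return base_lr
    # Fraction of the warmup completed, capped at 1.0 once warmup is over.
    scale = min(1.0, (step + 1) / warmup_steps)
    return base_lr * scale
```

In a PyTorch training loop, the same scaling factor could be passed to `torch.optim.lr_scheduler.LambdaLR` as the `lr_lambda` callable.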

> [@wyyfffff](https://github.com/wyyfffff) I encountered a similar issue. I reduced the learning rate to 1e-8 and applied gradient clipping with accelerator.clip_grad_norm_(model.trainable_modules(), 1.0). My training uses DeepSpeed ZeRO-2 on four GPUs, and...
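To make the effect of clipping with a maximum norm of 1.0 concrete, here is a minimal plain-Python sketch of global gradient-norm clipping. It is an illustration of the technique on a flat list of numbers, not Accelerate's implementation; the function name `clip_grad_norm` is hypothetical, and the small epsilon mirrors the one PyTorch's `torch.nn.utils.clip_grad_norm_` adds to the norm.

```python
import math


def clip_grad_norm(grads: list[float], max_norm: float = 1.0) -> tuple[list[float], float]:
    """Scale grads so their global L2 norm is at most max_norm.

    Hypothetical helper for illustration. Returns the (possibly scaled)
    gradients and the norm measured before clipping.
    """
    total_norm = math.sqrt(sum(g * g for g in grads))
    # Scale down only when the norm exceeds max_norm; never scale up.
    scale = min(1.0, max_norm / (total_norm + 1e-6))
    return [g * scale for g in grads], total_norm
```

Logging the returned pre-clipping norm at each step is a cheap way to see whether gradients are already exploding before the loss diverges.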

For vae and text_encoder, you can still use `fetch_model`. But if you have modified the layers in dit, I think it’s better to try my method.

> [@wyyfffff](https://github.com/wyyfffff) I encountered...

@zzhang2816 Below is my modified code; maybe you can try it. In my case, I removed the WAN safetensor path from `model_configs` to create a new `wan_paths: list[str]`:

```
@staticmethod...
```