zzhang2816
zzhang2816
Besides this problem, with the apple m1 chip, there will be an "illegal hardware instruction" error.
Thank you! Sorry for the later reply. I am not working in the field of time-series data now. The version I used is fedot == 0.3.0, I haven't tried the...
Add these lines to inference code ``` if lora_folder != "": unet.load_attn_procs(lora_folder) unet.to(torch.float16) unet.requires_grad_(False) ```
@wyyfffff I encountered a similar issue. I reduced the learning rate to 1e-8 and applied gradient clipping with accelerator.clip_grad_norm_(model.trainable_modules(), 1.0). My training uses DeepSpeed ZeRO-2 on four GPUs, and I’ve...
I will try it soon, thanks a lot! @wyyfffff
I have rewritten the model loading logic using traditional initialization method, but the problem persist. Not sure what is wrong. ``` def load_dit(file_path, model_class, torch_dtype, device): state_dict = {} for...
Thanks for sharing! I’ll give it a try. @wyyfffff