zzhang2816

Results 7 comments of zzhang2816

Besides this problem, with the apple m1 chip, there will be an "illegal hardware instruction" error.

Thank you! Sorry for the later reply. I am not working in the field of time-series data now. The version I used is fedot == 0.3.0, I haven't tried the...

Add these lines to inference code ``` if lora_folder != "": unet.load_attn_procs(lora_folder) unet.to(torch.float16) unet.requires_grad_(False) ```

@wyyfffff I encountered a similar issue. I reduced the learning rate to 1e-8 and applied gradient clipping with accelerator.clip_grad_norm_(model.trainable_modules(), 1.0). My training uses DeepSpeed ZeRO-2 on four GPUs, and I’ve...

I have rewritten the model loading logic using traditional initialization method, but the problem persist. Not sure what is wrong. ``` def load_dit(file_path, model_class, torch_dtype, device): state_dict = {} for...

Thanks for sharing! I’ll give it a try. @wyyfffff