Jeonghyeon Park

2 comments by Jeonghyeon Park

I encountered a similar issue using PEFT LoRA, load_in_8bit, and DeepSpeed ZeRO stage 3 (optimizer and parameter offload) with Hugging Face Accelerate. On a single GPU, training ran fine, as expected. If anyone...

I was searching the internet for an answer and found this thread. Is it possible, though? I think the latter arg probably overrides the former. If...