david
david
I also met the problem, and I handled this with leannmlindsey's method.
I also meet this problem too, but I can't solve it, who can give a solution, thanks a lot.
I have solved the problem, my reason is the data can't be transfer to other gpu automatically, so I revise evo's code to manually combinate it, and then it works.
I don't know why transformers can't be automatic train by ``` model = AutoModelForCausalLM.from_pretrained( model_path, config=model_config, trust_remote_code=True, device_map="auto", } ``` so accordding to the debug information I find that the...