Hi-archer

Results 5 comments of Hi-archer

I have the same error as you, whether you used the author's cleaning code and then generated sharegpt_split.json, and reported the error during training with sharegpt_split.json data?

> > I have the same error as you, whether you used the author's cleaning code and then generated sharegpt_split.json, and reported the error during training with sharegpt_split.json data? >...

> > > > I have the same error as you, whether you used the author's cleaning code and then generated sharegpt_split.json, and reported the error during training with sharegpt_split.json...

> I have set scaled_dot_product_attention as default when the torch 2.0 was installed. It should be as efficient as original. I used this code. It worked very well and also...