Hi-archer
I have the same error as you. Did you use the author's cleaning code to generate sharegpt_split.json, and did the error appear during training with the sharegpt_split.json data?
> I have set scaled_dot_product_attention as the default when torch 2.0 is installed. It should be as efficient as the original. I used this code. It worked very well and also...
> I also got a segmentation fault with flashinfer=0.0.8 after some requests. > > > One should install FlashInfer manually @orderer0001 > > But currently it can only use one GPU....