hl0929 comments

Results: 3 comments of hl0929

After training the model, I want to input a paragraph and a question and get the answer as output. How do I do this?

> If you want an interactive or data-streaming way to run inference with a trained model, unfortunately there is no boilerplate code yet. A simple idea to implement it is...

[Bug]: NCCL watchdog thread terminated with exception: CUDA error: an illegal memory access was encountered

When I load the LLaMA model, some GPUs hit this error while others work fine.

Evaluation error in eval_mixtral.py

me too