Ye Yuan comments

Results 10 comments of


                                            Ye Yuan

Evaluating Arc Easy Got NonMatchingSplitsSizesError

I updated my `datasets` package from 2.12.0 to 2.16.0 and the issue disappeared. Perhaps this should be added to the dependencies. Thanks anyway.

Error when reproducing mistral results

> Hi, you can add `repeat_kv` from `inf_llm/attention/utils.py` before the qk computation. I've found it. Thanks a lot!

Different versions of NVILA

I have the same question. The paper didn't describe what is NVILA-Lite model. I'm also wondering what's the difference among all different types of models.

Different versions of NVILA

Thanks for the explanation. Looking forward to the new version of your paper!

Llama 3.1 Load Fail

Hi, I got the same error. Did you fix it?

Llama 3.1 Load Fail

> > Hi, I got the same error. Did you fix it? > > Yes, the API seemed changed, but it still didn't work well for me. I've changed the...

[Usage]: vllm infer with 2 * Nvidia-L20, output repeat !!!!

I've encountered the same problem on 2*L20 and Qwen-2.5-32B-Instruct, exactly same with this post. The refered similar issue is about 4-bit GPTQ, but I used BF16 and got the same...

[Usage]: vllm infer with 2 * Nvidia-L20, output repeat !!!!

> > I've encountered the same problem on 2*L20 and Qwen-2.5-32B-Instruct, exactly same with this post. The refered similar issue is about 4-bit GPTQ, but I used BF16 and got...

[Bug] 无法使用豆包大模型

any updates on this?

[Bug] 无法使用豆包大模型

> > any updates on this? > > Thank you reply > > Already solve it, I was just only modify docker config. > > ``` > services: > chatgpt-next-web:...