CEfanmin

Results 15 comments of CEfanmin

你好,我也不知道怎么训练的conv数据,麻烦能答疑解惑一下不?

好,谢谢,我也研究研究看,知道了我就回复你。

> You can remove that argument in `Trainer`, or change where you want to report your logs, e.g.` report_to=["tensorboard"]`. In this case you will have to make sure that it...

> How to use multiple GPUs?I have try CUDA_VISIBLE_DEVICES, no effect do you success?now

> I am trying to fine tune bigcode/starcoderbase model on compute A100 with 8 GPUs 80Gb VRAM. My initial steps are to adjust parameters. I get some impression that it...

返回的中文也是乱码

> Hi @CEfanmin, thanks for your feedback. Since LLMLingua uses token-level prompt compression, it can indeed cause garbled text in some languages. You can try using LLMLingua-2. Thanks!

> v0.15.2 v0.15.1版本支持Qwen2.5-72B-ins的工具调用,但不支持流式返回

File "/home/fanmin/qianwen/code/qwenLLM.py", line 56, in _call generated_ids = self.model.generate(model_inputs.input_ids, ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/home/fanmin/qianwen/lib64/python3.11/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context return func(*args, **kwargs) ^^^^^^^^^^^^^^^^^^^^^ File "/home/fanmin/qianwen/lib64/python3.11/site-packages/transformers/generation/utils.py", line 1592, in generate return self.sample( ^^^^^^^^^^^^ File...