piekey2022 issues

Results 16 issues of


                                            piekey2022

多进程初始化的时候读取缓存文件会报错

会报这个错误json.decoder.JSONDecodeError: Expecting ',' delimiter: line 112577 column 19 (char 2131822)

分词结果不一致

我有个文件，逐行进行分词，刚好第1595个句子分词后的list长度是48个词如果我直接读第1595个句子进行分词，长度就是50个词打印彼此的结果，会发现直接对这个句子进行分词的话，有个人名莱迪格会被分词成三个字。但逐行逐行的进行分词，到那一句的时候，莱迪格就不会被拆分成三个字，所以会少两个词。为什么分词的结果会有出入呢

tritonclient.utils.InferenceServerException: [StatusCode.INTERNAL] onnx runtime error 2: not enough space: expected 270080, got 261760

**Description** When I enabled max_queue_delay_microseconds to improve the response speed of the model, I found that there were occasional errors. I set max_queue_delay_microseconds to 70000. Then I sent three tensor...

关于如何做微调的一些疑问

这份代码的tokenizer和之前glm的tokenizer代码似乎不太一样，特别是没有build_inputs_for_generation这个函数，请问forward的时候input_ids和position_ids的构造方式和以前是一样的吗。以前似乎原文中必须包含mask，现在这个代码，我看generate函数的输入好像没有要求要有mask_token

documentation

[BUG/Help] 显存占用感觉比10b的大

### Is there an existing issue for this? - [X] I have searched the existing issues ### Current Behavior 在一张80g显存的卡上训练，之前训练10b的glm模型可以开到batchsize4，这个模型开到2就很容易爆显存。有用amp，不知道什么原因。直接用huggingface的代码没办法做模型并行，不知道有什么好的办法 ### Expected Behavior _No response_ ### Steps To Reproduce...

Can deepspeed support lora in peft, especially the future multi-adapter version?

A recent branch of peft is about to support multiple lora adapters. This implementation feels very suitable for the training in ppo stage. An sft model can be used as...

question

deespeed chat

piekey2022

多进程初始化的时候读取缓存文件会报错

分词结果不一致

tritonclient.utils.InferenceServerException: [StatusCode.INTERNAL] onnx runtime error 2: not enough space: expected 270080, got 261760

关于如何做微调的一些疑问

[BUG/Help] 显存占用感觉比10b的大

Can deepspeed support lora in peft, especially the future multi-adapter version?

Deepspeed seems to be easier to OOM after 0.18.0.

[Question]: 请问aquilachat支持多轮吗？

RuntimeError: shape '[1, 1, 1, 32, 128]' is invalid for input of size 16384

When will the ninth generation pokemon be added?