Guoheng Sun comments

Results 15 comments of


                                            Guoheng Sun

想请教下模型输入如果维度大于3时的情况

同问，有什么解决方法吗？ lol

单机8张3090，运行后exits with return code = -9，错误码信息在那里可以查询？

可能是因为内存不足

A100 80 G fine tune llama-65b-hf got CUDAout of Memory

@PeiqinSun Hello, I have encountered the same issue. The loss dramatically decreases after each epoch and then gradually increases. I have also observed this phenomenon in a [paper](https://arxiv.org/abs/2304.14454). After conducting...

The loss curve exhibits a stair-step pattern of descent.

> Same issue here.Have you solved it? I did not solve this problem. Recently, I have some new discoveries: when I use the default code of FastChat, the loss curve...

The loss curve exhibits a stair-step pattern of descent.

@FHL1998

[BUG] try to finetune a llama 33b on 8*A100 40G, 600G RAM. But always OOM on RAM.

@LuJunru Hi, does this mean you have successfully finetuned a 33-B-parameter model using zero stage3 + offload optimizer & param on A100 40G * 8 + 600G CPU RAM? I...

[BUG] try to finetune a llama 33b on 8*A100 40G, 600G RAM. But always OOM on RAM.

@LuJunru Many Thanks! Would you mind sharing your Deepspeed script, please? I have tried other scripts from this issue and Deepspeed's official default script, but I am hoping to rule...

[BUG] try to finetune a llama 33b on 8*A100 40G, 600G RAM. But always OOM on RAM.

@LuJunru I understand your situation. Thanks again.

微调之后加载权重发现输出停不下来

在生成阶段可以设置一下repetition_penalty，如： generation_config = GenerationConfig( temperature=temperature, top_p=top_p, top_k=top_k, num_beams=num_beams, repetition_penalty=1.2, **kwargs, ) generate_params = { "input_ids": input_ids, "generation_config": generation_config, "return_dict_in_generate": True, "output_scores": True, "max_new_tokens": max_new_tokens, }

微调之后加载权重发现输出停不下来

> Repetition Penalty 在web界面上可以调整吧默认设置是2 。 repetition_penalty：控制生成的文本中重复标记的惩罚力度。你为什么还调整小了。。我用的是alpaca-lora的代码，alpaca-lora默认是没设置repetition_penalty的