Haixia
It hangs at this point and never proceeds further. I suspect it is an optimizer problem.
The same puzzle!! When I set "lora_rank" to 0, I can run successfully, but the saved model file is very big, about 13 GB!!!
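For what it's worth, ~13 GB is roughly what a full (non-LoRA) save of a 7B-parameter model in fp16 comes to, so the file size itself is expected. A quick back-of-the-envelope check; the parameter count and dtype below are assumptions, not values reported in this thread:

```python
# Rough checkpoint-size estimate for a full (lora_rank=0) save.
# Assumption: the base model is LLaMA-7B (~6.7e9 parameters) stored in fp16 (2 bytes/param).
num_params = 6.7e9
bytes_per_param = 2  # fp16
size_gb = num_params * bytes_per_param / 1024**3
print(f"expected full checkpoint size ≈ {size_gb:.1f} GB")  # ≈ 12.5 GB
```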
> When you set lora_rank to 0, you are training the model without lora training, which is described here:

Yes, I know that setting "lora_rank" to 0 means...
> > When you set lora_rank to 0, you are training the model without lora training, which is described here:
> >
> > yes, i know that when i set "lora_rank"...
> @Camille7777 Hello, I found that the code you submitted did not solve the problem of saving model parameters when using LoRA training. I found that after the training, the...
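One common way to make a LoRA run produce a standalone, loadable checkpoint is to fold the low-rank update back into the base weights before saving. A minimal sketch follows; the attribute names (`lora_A`, `lora_B`, `scaling`) are assumptions about a typical LoRA linear layer, not the exact names used in this repo:

```python
import torch

def merge_lora_linear(layer):
    """Fold the LoRA update into the base weight: W <- W + scaling * (B @ A)."""
    with torch.no_grad():
        delta = layer.lora_B @ layer.lora_A          # (out_features, in_features)
        layer.weight.data.add_(layer.scaling * delta)

def merge_and_save(model, path):
    """Merge every LoRA-augmented linear layer, then save a plain state_dict."""
    for module in model.modules():
        if hasattr(module, "lora_A") and hasattr(module, "lora_B"):
            merge_lora_linear(module)
    torch.save(model.state_dict(), path)
```

After merging, the saved state_dict can be loaded like an ordinary full checkpoint, at the cost of losing the small-adapter-only file size.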
Me too!! Why? When I run the code on 7 or 8 GPUs, it hits the same error as you, but when I run on 6 GPUs it succeeds! I am...
Met the same issue:

AttributeError: 'LlamaRM' object has no attribute 'resize_token_embeddings'
WARNING:torch.distributed.elastic.multiprocessing.api:Sending process 526441 closing signal SIGTERM
ERROR:torch.distributed.elastic.multiprocessing.api:failed (exitcode: 1) local_rank: 0 (pid: 526440) of binary: /data/anaconda3/bin/python
Traceback (most recent...
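The traceback suggests `resize_token_embeddings` is being called on the reward-model wrapper (`LlamaRM`) rather than on the Hugging Face model it wraps. A possible workaround sketch; the `.model` attribute name is a guess about the wrapper's internals, not something confirmed in this thread:

```python
# Workaround sketch, not an official fix.
# Assumption: LlamaRM stores the underlying Hugging Face Llama model as `model.model`.
if hasattr(model, "resize_token_embeddings"):
    model.resize_token_embeddings(len(tokenizer))
else:
    # Resize the wrapped HF model's embeddings instead of the wrapper.
    model.model.resize_token_embeddings(len(tokenizer))
```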
> I met this error too, but if you are training stage 2, you should change the pretrain to Coati7B (the model you trained in stage 1) instead of the LLaMA7B that is provided...
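To make the quoted advice concrete: stage 2 should start from the checkpoint produced by stage 1 rather than from the original LLaMA-7B weights. A minimal loading sketch, assuming the stage-1 script wrote a Hugging Face-format directory (the path below is a placeholder):

```python
from transformers import AutoTokenizer, LlamaForCausalLM

# Placeholder path; point `pretrain` at your stage-1 (Coati7B) output, not raw LLaMA-7B.
pretrain = "path/to/coati-7b-stage1-output"
model = LlamaForCausalLM.from_pretrained(pretrain)
tokenizer = AutoTokenizer.from_pretrained(pretrain)
```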
Have you solved this problem? I'm also curious how to construct the prompt!