ColossalAI icon indicating copy to clipboard operation
ColossalAI copied to clipboard

[BUG]: LoRA still does not support the training of reward models

Open young-chao opened this issue 2 years ago β€’ 2 comments

πŸ› Describe the bug

Although ht-zhou said that the LoRA problem has been fixed, according to the latest code and experimental tests of ColossalAI, LoRA still does not support the training of the reward model. I read his modifications, I don't think those modifications can fix the problem, and I noticed that the latest main version of ColossalAI removed the use of LoRA during rm model training, and now I think ColossalAI does not support LoRA at all, right? image

Environment

env

OS:ubuntu 20.04 GPU:4 x A10 python==3.9.0 torch==1.13.1-cu116 colossalai==0.2.5

command

python train_reward_model.py --pretrain "bigscience/bloom-560m" --lora_rank 16

young-chao avatar Mar 05 '23 14:03 young-chao

@ht-zhou

young-chao avatar Mar 05 '23 14:03 young-chao

Bot detected the issue body's language is not English, translate it automatically. πŸ‘―πŸ‘­πŸ»πŸ§‘β€πŸ€β€πŸ§‘πŸ‘«πŸ§‘πŸΏβ€πŸ€β€πŸ§‘πŸ»πŸ‘©πŸΎβ€πŸ€β€πŸ‘¨πŸΏπŸ‘¬πŸΏ


@ht-zhou

Issues-translate-bot avatar Mar 05 '23 14:03 Issues-translate-bot

Thank you for your feed back, it is fixed and we use lora_rank when initialize model. image

ht-zhou avatar Mar 07 '23 02:03 ht-zhou