Adib Mosharrof

Results 15 comments of Adib Mosharrof

I have the same error in V100 32 GB. I have tried with other GPUs and they seem to work. It would be great if it could work on V100

thank you very much for your response. I will make the changes you suggested and get back to you.

@pacman100 I had tried using what u suggested before but that did not seem to work. I will try this new suggestion. P.s. I have been traveling, so I am...

@pacman100 Thank you very much for looking into this issue. In your example notebook, I can see that you no longer use `model = prepare_model_for_int8_training(model)` Initially I had that in...

@pacman100 I ran the code from the notebook you shared in #337. The only changes I made was to load the model in 8bit `model_name = "EleutherAI/gpt-j-6B" model = AutoModelForCausalLM.from_pretrained(...

I was able to make things work and all my requirements have been fulfilled. @pacman100 Thank you very much for supporting me with this issue. I am really grateful for...

When you add extra tokens, the embedding dimensions of some layers change. You have to add those layers in modules_to_save. Depending on your model, the modules_to_save can change. Please take...

> New idea: Now the training finally works. Setting fp16=False would make the training be super slow and not mem-friendly. > > To avoid "ValueError: Attempting to unscale FP16 gradients",...

what GPU are you using @iMountTai please take a look into this issue https://github.com/TimDettmers/bitsandbytes/issues/240