Srinivasan Nandakumar

2 issues by Srinivasan Nandakumar

Hi, I tried replicating the BERT pretraining script, and when I ran it with the YAML script I got the following error: Value bf16 is not available in Precision. I...

Hi, I am fine-tuning TinyLlama on a T4 with FP16. When I use packing, the loss seems to be okay. But when I set it to false, the grad_norm goes...

currently fixing