Srinivasan Nandakumar
Results: 2 issues of Srinivasan Nandakumar
Hi, I tried replicating the BERT pretraining script, and when I ran it with the YAML script I got the following error: Value bf16 is not available in Precision. I...
Hi, I am fine-tuning TinyLlama on a T4 with FP16. When I use packing, the loss seems to be okay. But when I set it to false, the grad_norm goes...
currently fixing