Saeed Najafi

Results 2 comments of Saeed Najafi

Hi, I am getting some cache errors while doing generation with llama3 and fsdp. I am using flash_attention_2, and the use_cache=True in the generate function. Latest transformer from the repo...