Ravi Ghadia
Hi @muellerzr, I am able to reproduce the same error in a dummy case where I import nothing other than the accelerate library. Cell:0 `from accelerate import Accelerator`...
Works for me too, thanks a lot @muellerzr for the prompt response!
Hi, probably somewhat related to this: having a forced_decoder_ids argument in the policy.generate() function might help with the offline RL setting, so is there a specific reason to not have...
Got it, thanks!
This cache is very helpful (for low-memory long-context inference), thanks for adding it!