jdchang1
jdchang1
Hi @takuseno, I have been trying to reproduce the MOPO results using your library and I have been having trouble. I have been following your MOPO script in the reproduce...
Hi, I have been trying to reproduce the win rate results from the paper for summarization and I'm struggling to get similar values. I wonder if you've experienced this as...
Hi I was wondering if there were efforts to support Llama 4 Scout/Maverick. Thank you!
# What does this PR do? Transformers recently added in `mean_resizing` to `resize_token_embeddings`. This is breaking with mixed initialization in downstream training tasks that requires adding tokens to Composer Huggingface...