jdchang1

Results 4 issues of jdchang1

Hi @takuseno, I have been trying to reproduce the MOPO results using your library and I have been having trouble. I have been following your MOPO script in the reproduce...

Hi, I have been trying to reproduce the win rate results from the paper for summarization and I'm struggling to get similar values. I wonder if you've experienced this as...

Hi I was wondering if there were efforts to support Llama 4 Scout/Maverick. Thank you!

model-request

# What does this PR do? Transformers recently added in `mean_resizing` to `resize_token_embeddings`. This is breaking with mixed initialization in downstream training tasks that requires adding tokens to Composer Huggingface...