jdchang1 issues

Repositories
Issues
Comments

Results 4 issues of


                                            jdchang1

Reproduce MOPO results

Hi @takuseno, I have been trying to reproduce the MOPO results using your library and I have been having trouble. I have been following your MOPO script in the reproduce...

Reproducing Win Rate inference for TL;DR

Hi, I have been trying to reproduce the win rate results from the paper for summarization and I'm struggling to get similar values. I wonder if you've experienced this as...

Support for Llama 4

Hi I was wondering if there were efforts to support Llama 4 Scout/Maverick. Thank you!

model-request

mean_resizing = True does not work with mixed/meta initialization

# What does this PR do? Transformers recently added in `mean_resizing` to `resize_token_embeddings`. This is breaking with mixed initialization in downstream training tasks that requires adding tokens to Composer Huggingface...