Rulin Shao issues

Repositories
Issues
Comments

Results 4 issues of


                                            Rulin Shao

Can't reproduce UniT results with gradient accumulation

## Instructions To Reproduce the Issue: Hi, thanks for the codes! I tried to reproduce the UniT vqa2 single task training example as given in the doc: https://mmf.sh/docs/projects/unit/ The default...

Is the deduplication scope separate or global when deduplicating multiple files?

Thanks for sharing the great codes!! They have been very useful for me! I'm new to Rust and bloom filter and I have one question regarding the deduplication scope in...

api

[WIP] add long-form rl-rag reward

# Reward Design ### Global Rewards * Rubric rewards. LLM-as-a-judge * For each training/test sample, we generate per-sample rubrics for evaluation. For example, the rubric could be “the starting of...