Rulin Shao
Rulin Shao
## Instructions To Reproduce the Issue: Hi, thanks for the codes! I tried to reproduce the UniT vqa2 single task training example as given in the doc: https://mmf.sh/docs/projects/unit/ The default...
Thanks for sharing the great codes!! They have been very useful for me! I'm new to Rust and bloom filter and I have one question regarding the deduplication scope in...
# Reward Design ### Global Rewards * Rubric rewards. LLM-as-a-judge * For each training/test sample, we generate per-sample rubrics for evaluation. For example, the rubric could be “the starting of...