RLHF-Reward-Modeling
Update eval_bench_mark.py to allow using bf16 or f32
Some environments do not support bfloat16 well, so this adds a new argument that works similarly to the bf16 parameter in gemma_rm.py.
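For illustration, here is a minimal sketch of how such a switch could look. It is not the actual diff: the `--bf16` flag name, its default of true, and the helpers `str2bool`, `build_parser`, and `select_dtype_name` are assumptions made for this example, and the real parameter in gemma_rm.py may be declared differently.

```python
import argparse


def str2bool(value: str) -> bool:
    """Parse a command-line string such as 'true' or 'false' into a bool."""
    lowered = value.lower()
    if lowered in ("true", "1", "yes"):
        return True
    if lowered in ("false", "0", "no"):
        return False
    raise argparse.ArgumentTypeError(f"expected a boolean, got {value!r}")


def build_parser() -> argparse.ArgumentParser:
    """Build a parser with a hypothetical --bf16 flag (assumed default: True)."""
    parser = argparse.ArgumentParser(description="Precision switch sketch")
    parser.add_argument(
        "--bf16",
        type=str2bool,
        default=True,
        help="Use bfloat16 if true, otherwise fall back to float32.",
    )
    return parser


def select_dtype_name(use_bf16: bool) -> str:
    """Return the name of the dtype to load the model with."""
    return "bfloat16" if use_bf16 else "float32"
```

In a script that imports PyTorch, the returned name could be turned into a dtype with `getattr(torch, select_dtype_name(args.bf16))`, which resolves to `torch.bfloat16` or `torch.float32`. Running with `--bf16 false` would then select float32 on hardware where bfloat16 is poorly supported.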