Zizheng Yang

Results 2 issues of Zizheng Yang

Use len(names) instead of 13 allows to run part of the evaluation benchmark each time, for machine does not have that much g-ram, this could be helpful.

Some environment does support bfloat16 that good, so adding a new argument, works similarly to bf16 parameter in gemma_rm.py