Hunter Lang

Results 2 comments of Hunter Lang

> Because when we service the Bradley Terry RM with pipeline, it will automatically add a bos_token inside the pipeline when tokenizing. I don't fully understand...if the inference-time pipeline adds...

Thanks for the reply! Just to clarify: If I remove those `.replace(tokenizer.bos_token, "")` calls, then training should match inference, because the inference pipeline adds BOS automatically. If I modify the...