Hunter Lang
Results
2
comments of
Hunter Lang
> Because when we service the Bradley Terry RM with pipeline, it will automatically add a bos_token inside the pipeline when tokenizing. I don't fully understand...if the inference-time pipeline adds...
Thanks for the reply! Just to clarify: If I remove those `.replace(tokenizer.bos_token, "")` calls, then training should match inference, because the inference pipeline adds BOS automatically. If I modify the...