litsh
litsh
Thanks for your great solution. May I ask why is the estimated degree of freedom so far from the one in theory?
Thank you for your reply! I will read the paper. Thaison ***@***.*** Original Email Sender:"szcf-weiya"< ***@***.*** >; Sent Time:2023/12/5 23:16 To:"szcf-weiya/ESL-CN"< ***@***.*** >; Cc recipient:"litsh"< ***@***.*** >;"Mention"< ***@***.*** >; Subject:Re:...
I have the same question. Is there anyone who have solved this ?
> Hi litsh, the tagger is trained with vanilla prompts and a default system prompt "You are a helpful assistant.", so no other template prompts are used for inference. For...
> > `at {"tag": str, "explanation": str}. Query: How are you? assistant` > > Hi, have you figured out this? @SefaZeng Use the former one should be fine.
您好,请问可以和您交流一下奖励模型的训练吗?方便的话可以留一下联系方式。
Same issue, would you please tell me if you have solved this?