litsh

Results 8 comments of litsh

Thanks for your great solution. May I ask why is the estimated degree of freedom so far from the one in theory?

Thank you for your reply! I will read the paper. Thaison ***@***.*** Original Email Sender:"szcf-weiya"< ***@***.*** &gt;; Sent Time:2023/12/5 23:16 To:"szcf-weiya/ESL-CN"< ***@***.*** &gt;; Cc recipient:"litsh"< ***@***.*** &gt;;"Mention"< ***@***.*** &gt;; Subject:Re:...

I have the same question. Is there anyone who have solved this ?

> Hi litsh, the tagger is trained with vanilla prompts and a default system prompt "You are a helpful assistant.", so no other template prompts are used for inference. For...

> > `at {"tag": str, "explanation": str}. Query: How are you? assistant` > > Hi, have you figured out this? @SefaZeng Use the former one should be fine.

您好,请问可以和您交流一下奖励模型的训练吗?方便的话可以留一下联系方式。

Same issue, would you please tell me if you have solved this?