Jiachen Li

Results: 3 comments by Jiachen Li

> > Hi @yingShen-ys
> >
> > That sounds like a reasonable result. I will leave the issue open however, so that we can see if others are able to reproduce...

Hi @aishwaryap, Thank you so much for your timely reply! This plan makes sense to me. Currently I am working on accessing the `last_event` of the controller at each step....

Hi @rajcscw, Any update on this issue? I'm wondering if Q-Learning methods can work for LLM training 🤔 I would be extremely grateful if you could share your experience on this.