richard28039
Results: 2 issues of richard28039
Hello, can anyone tell me the meaning of `lambda_ambient`? During training, the loss increases slowly and does not stop, as shown below. Thanks.
> Hi @xyzhang626, thank you for the support!
>
> We use `claude-3-7-sonnet-20250219` as our manager and worker, and use `bytedance-research/UI-TARS-72B-DPO` from HuggingFace as our grounding model. The hyperparams...