Vivek

Results 5 comments of Vivek

I was trying RL using trl on T5 (Seq2Seq) Model with PEFT, and facing this issue with zero stage 3. It was working fine with stage 2. Can anyone help...

> @loadams - please see if this is connected to #3735 Yes it is connected to this issue. Can you help me with this issue?

Has anyone been able to determine the correct injection_policy for Flan-UL2, or is it confirmed whether this policy is supported or unsupported for Flan-UL2?