Wawa
Results
2
comments of
Wawa
Thank you for the prompt response! Is the Table 1 result coming form the `ngram_13_0.8` as well?
> Some tips (might be helpful) > > 1. Decrease `actor_rollout_ref.rollout.n` > 2. Ensure the setting `export VLLM_ATTENTION_BACKEND=XFORMERS` > 3. Decrease `actor_rollout_ref.actor.ppo_micro_batch_size` > 4. Decrease `actor_rollout_ref.rollout.log_prob_micro_batch_size` and `actor_rollout_ref.ref.log_prob_micro_batch_size` > 5....