verl
verl copied to clipboard
[misc] feat: Add `actor_rollout_ref.actor.calculate_entropy` for entropy fwd
Currently, entropys is only calculated in non-bypass when calculating old_log_prob