Shashank Verma
# What does this PR do ? Add a one line overview of what this PR aims to accomplish. **Collection**: [Note which collection this PR will affect] # Changelog -...
# What does this PR do ? * The forward pass in Gemma 2 passes runtime_gather_output * Leads to an unexpected keyword arg error while fine-tuning Gemma 2 2B without it...
# What does this PR do ? Provides clarification to use the NGC Personal API Key # GitHub Actions CI The Jenkins CI system has been replaced by GitHub Actions...
**Is your feature request related to a problem? Please describe.** Automodel has been validated on the Databricks platform with Torch Distributor on a Spark cluster. Requesting to add documentation to guide...
**Describe the bug** The documentation / tutorial does not pin a specific commit of RL and Gym. Main branches for both are fast moving and introduce breaking changes often. Simple...
**Describe the bug** NeMo RL GRPO training scripts save rollout results in wandb as run.summary logs. In datasets that have multiple tools, it appears as if it's saving redundant attribute...