Pramodith Ballapuram issues

Results 6 issues of


                                            Pramodith Ballapuram

Fix for issue #12182 to ensure that the tutorial for zero shot distillation works

# What does this PR do? The code for training models via zero shot distillation was breaking because the .map() function was removing the _labels_ field from the dataset object....

map() function removes columns when input_columns is not None

## Describe the bug The map function, removes features from the dataset that are not present in the _input_columns_ list of columns, despite the columns being removed not mentioned in...

bug

Running the EdgeLM model on XSum

Hi I am trying to use the EdgeLM model and was wondering 1. how I can run the model with pre-trained weights in interactive mode, I'd like to run it...

[WIP]: Autotune Chunk Size

## Summary Our chunked loss functions currently statically set the chunk size to 1. However, this might lead to underutilized gpu memory resources. In this PR we show how the...

Fix GRPO Reasoning Advanced Reward Tutorial

# What does this PR do? Fixes the notebook to disable `bf16` because the model and lora weights are configured to load in `bf16` Fixes # (issue) ## Who can...

RLOO supports async rewards.

# What does this PR do? Similar to #4567 but for RLOO. Fixes # (issue) ## Before submitting - [X] This PR fixes a typo or improves the docs (you...