Pramodith Ballapuram

Results 6 issues of Pramodith Ballapuram

# What does this PR do? The code for training models via zero-shot distillation was breaking because the `.map()` function was removing the _labels_ field from the dataset object....

## Describe the bug The map function removes features from the dataset that are not present in the _input_columns_ list of columns, despite the columns being removed not mentioned in...

bug

Hi, I am trying to use the EdgeLM model and was wondering 1. how I can run the model with pre-trained weights in interactive mode, I'd like to run it...

## Summary Our chunked loss functions currently hard-code the chunk size to 1. However, this might leave GPU memory underutilized. In this PR we show how the...
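
To illustrate why the chunk size can be treated as a tunable setting, here is a minimal pure-Python sketch of a chunked loss. It is not the PR's implementation: `chunked_nll`, `log_probs`, and `targets` are hypothetical names, and the loss is a plain mean negative log-likelihood chosen for simplicity. It only shows that the result does not depend on the chunk size, so a larger chunk trades more memory per step for fewer steps.

```python
import math


def chunked_nll(log_probs, targets, chunk_size):
    """Mean negative log-likelihood, accumulated one chunk of rows at a time.

    Hypothetical helper for illustration only; assumes `log_probs` is a
    non-empty list of per-class log-probability rows and `targets` holds
    one class index per row.
    """
    if chunk_size < 1:
        raise ValueError("chunk_size must be >= 1")
    if not targets or len(targets) != len(log_probs):
        raise ValueError("log_probs and targets must be non-empty and equal length")

    total = 0.0
    for start in range(0, len(targets), chunk_size):
        # Only this slice of rows needs to be held in memory at once.
        rows = log_probs[start:start + chunk_size]
        tgts = targets[start:start + chunk_size]
        total += sum(-row[t] for row, t in zip(rows, tgts))
    return total / len(targets)


# Toy data (made up for the example): four rows, two classes.
log_probs = [
    [math.log(0.7), math.log(0.3)],
    [math.log(0.2), math.log(0.8)],
    [math.log(0.5), math.log(0.5)],
    [math.log(0.9), math.log(0.1)],
]
targets = [0, 1, 1, 0]

loss_chunk_1 = chunked_nll(log_probs, targets, chunk_size=1)
loss_chunk_4 = chunked_nll(log_probs, targets, chunk_size=4)
```

Both calls return the same value up to floating-point rounding; only the number of loop iterations and the size of each slice differ.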

# What does this PR do? Fixes the notebook to disable `bf16` because the model and LoRA weights are configured to load in `bf16` Fixes # (issue) ## Who can...

# What does this PR do? Similar to #4567 but for RLOO. Fixes # (issue) ## Before submitting - [X] This PR fixes a typo or improves the docs (you...