Asha Anoosheh
Asha Anoosheh
Hi, Can someone confirm if the model is known to work in mixed-precision by simply using torch's autocast function? (or if there's a more proper way intended) Thanks
> [!IMPORTANT] > The `Update branch` button must only be pressed in very rare occassions. > An outdated branch is never blocking the merge of a PR. > Please reach...
> [!IMPORTANT] > The `Update branch` button must only be pressed in very rare occassions. > An outdated branch is never blocking the merge of a PR. > Please reach...
> [!IMPORTANT] > The `Update branch` button must only be pressed in very rare occassions. > An outdated branch is never blocking the merge of a PR. > Please reach...
> [!IMPORTANT] > The `Update branch` button must only be pressed in very rare occassions. > An outdated branch is never blocking the merge of a PR. > Please reach...
> [!IMPORTANT] > The `Update branch` button must only be pressed in very rare occassions. > An outdated branch is never blocking the merge of a PR. > Please reach...
# What does this PR do ? Fixes a bug where models that have specs which depend on number of layers would be problematic between student and teacher models if...
# What does this PR do ? Prepare a refactor of the Model Optimizer product name and urls. :warning: For major changes (either in lines of code or in its...
> [!IMPORTANT] > The `Update branch` button must only be pressed in very rare occassions. > An outdated branch is never blocking the merge of a PR. > Please reach...