Callum McLean
Callum McLean
**System information** - Have I written custom code (as opposed to using a stock example script provided in TensorFlow): Yes - OS Platform and Distribution (e.g., Linux Ubuntu 16.04): Linux,...
Hello! First of all, thank you for doing such great work and making it so accessible. I'm looking at using `mup` for a project but I'm a bit confused about...
# What does this PR do? Adds `run_speech_recognition_seq2seq.py` for training/fine-tuning Seq2Seq speech recognition models, such as Whisper, on the IPU. ## Before submitting - [ ] This PR fixes a...
## Changelogs - Allow having 0 GNN layers (i.e. only the task heads) on the last IPU in a pipeline split --- _Checklist:_ - [ ] _Was this PR discussed...
Appendix D of the Tensor Programs V paper contains a number of practical suggestions for using muP which we would do well to consider, such as: - fixing the dimension...
So that users/developers can add it to their own layers/models
## Changelogs - _enumerate the changes of that PR._ --- _Checklist:_ - [ ] _Was this PR discussed in an issue? It is recommended to first discuss a new feature...