Callum McLean

Results 7 issues of Callum McLean

**System information** - Have I written custom code (as opposed to using a stock example script provided in TensorFlow): Yes - OS Platform and Distribution (e.g., Linux Ubuntu 16.04): Linux,...

stat:awaiting response
type:bug
stalled
comp:xla
2.6.0

Hello! First of all, thank you for doing such great work and making it so accessible. I'm looking at using `mup` for a project but I'm a bit confused about...

# What does this PR do? Adds `run_speech_recognition_seq2seq.py` for training/fine-tuning Seq2Seq speech recognition models, such as Whisper, on the IPU. ## Before submitting - [ ] This PR fixes a...

## Changelogs - Allow having 0 GNN layers (i.e. only the task heads) on the last IPU in a pipeline split --- _Checklist:_ - [ ] _Was this PR discussed...

Appendix D of the Tensor Programs V paper contains a number of practical suggestions for using muP which we would do well to consider, such as: - fixing the dimension...

enhancement
mup

So that users/developers can add it to their own layers/models

documentation
Low priority
mup

## Changelogs - _enumerate the changes of that PR._ --- _Checklist:_ - [ ] _Was this PR discussed in an issue? It is recommended to first discuss a new feature...