Add `run_speech_recognition_seq2seq.py`

Open callumm-graphcore opened this issue 2 years ago • 5 comments

What does this PR do?

Adds `run_speech_recognition_seq2seq.py` for training/fine-tuning Seq2Seq speech recognition models, such as Whisper, on the IPU.

[ ] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
[ ] Did you make sure to update the documentation with your changes?
[ ] Did you write any new necessary tests?

Jun 20 '23 15:06 callumm-graphcore

This should be considered WIP; I still need to test it with, e.g., whisper-tiny.

Jun 20 '23 15:06 callumm-graphcore

Sorry, should have clarified: this is no longer WIP

Jul 11 '23 10:07 callumm-graphcore

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

Jul 11 '23 11:07 HuggingFaceDocBuilderDev

The failing test is the one that compares the diff file. Is the current one up to date?

Jul 17 '23 12:07 katalinic-gc

> The failing test is the one that compares the diff file. Is the current one up to date?

I thought it was, but with the new changes, I'll redo it

Jul 17 '23 14:07 callumm-graphcore