optimum-graphcore

Blazing fast training of 🤗 Transformers on Graphcore IPUs

Results 18 optimum-graphcore issues

Graphcore's contract with Paperspace has expired. This PR removes the "Run on Gradient" links in the examples.

Bumps [scikit-learn](https://github.com/scikit-learn/scikit-learn) from 0.24.2 to 1.0.1. Release notes Sourced from scikit-learn's releases. scikit-learn 1.0.1 We're happy to announce the 1.0.1 release with several bugfixes: You can see the changelog here:...

dependencies

PR docs are now deleted automatically after 30 days: https://github.com/huggingface/doc-builder/blob/main/.github/workflows/delete_old_pr_documentations.yml. As done in optimum: https://github.com/huggingface/optimum/pull/1565 cc @regisss

# What does this PR do? Fixes # (issue) ## Before submitting - [ ] This PR fixes a typo or improves the docs (you can dismiss the other checks...

# What does this PR do? Adds `run_speech_recognition_seq2seq.py` for training/fine-tuning Seq2Seq speech recognition models, such as Whisper, on the IPU. ## Before submitting - [ ] This PR fixes a...

# What does this PR do? Given that the kv cache is on device, there is no need to return `past_key_values`. However, this would require overriding the forward methods of...

What would it require to support LLaMA-based models as well?

# What does this PR do? This PR is a WIP ## Before submitting - [ ] This PR fixes a typo or improves the docs (you can dismiss the...

# What does this PR do? - Provides a custom tracer, built upon `transformers.utils.fx.HFTracer`, allowing us to trace and transform the models we support in `optimum-graphcore` - A set of pipelining...

Currently, many LayerNorm `eps` values are smaller than 6.1e-5 (the smallest positive normal fp16 value), which might cause underflow when they are cast to fp16.
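
To make the underflow concern concrete, here is a minimal sketch using NumPy's `float16` as a stand-in for fp16 arithmetic. The helper names (`eps_as_fp16`, `is_below_fp16_normal`) and the sample `eps` values are hypothetical and chosen only for illustration; they are not taken from `optimum-graphcore` or from any specific model config.

```python
import numpy as np

# Smallest positive normal fp16 value, 2**-14 (about 6.1e-5).
FP16_SMALLEST_NORMAL = float(np.finfo(np.float16).tiny)


def eps_as_fp16(eps: float) -> float:
    """Return the value eps takes after being cast to fp16."""
    return float(np.float16(eps))


def is_below_fp16_normal(eps: float) -> bool:
    """True if eps is under the fp16 normal range (subnormal or rounded to zero)."""
    return eps < FP16_SMALLEST_NORMAL


if __name__ == "__main__":
    # Illustrative eps values only (assumption, not read from a real config).
    for eps in (1e-3, 1e-5, 1e-12):
        print(eps, eps_as_fp16(eps), is_below_fp16_normal(eps))
```

In this sketch, an `eps` slightly below the normal range still survives as a subnormal with reduced precision, while a much smaller one rounds to exactly zero, at which point it no longer protects the normalization against a zero variance. How a given device handles fp16 subnormals is not covered here.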