Michael Benayoun
# What does this PR do? This is the second PR in the process of adding PyTorch-based quantization features to `optimum`. This PR continues what was started in the...
Following what was done by @ChainYo in Transformers in the [ONNXConfig: Add a configuration for all available models](https://github.com/huggingface/transformers/issues/16308) issue, the idea is to add support for exporting new models in...
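In Transformers, each exportable architecture declares an ONNX configuration that names its inputs and marks which axes are dynamic. A minimal stdlib-only sketch of that shape (the function name and axis labels here are illustrative assumptions, not the real `OnnxConfig` API):

```python
# Hypothetical sketch of what an ONNX export configuration declares:
# named model inputs mapped to their dynamic axes. This mirrors the
# spirit of an OnnxConfig `inputs` property, not its actual API.
from collections import OrderedDict

def encoder_onnx_inputs():
    # Axis 0 is the batch dimension, axis 1 the sequence dimension;
    # both are dynamic so the exported graph accepts any input shape.
    return OrderedDict(
        [
            ("input_ids", {0: "batch", 1: "sequence"}),
            ("attention_mask", {0: "batch", 1: "sequence"}),
        ]
    )

inputs = encoder_onnx_inputs()
```

Keeping the inputs in an ordered mapping matters because the exporter binds them to the graph positionally.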
### Feature request Add support for [Speech Encoder Decoder Models](https://huggingface.co/docs/transformers/v4.25.1/en/model_doc/speech-encoder-decoder#speech-encoder-decoder-models) ### Your contribution I or other members can implement it (cc @mht-sharma @fxmarty )
# What does this PR do? - Provides a custom tracer, built upon `transformers.utils.fx.HFTracer`, that allows tracing and transforming the models we support in `optimum-graphcore` - A set of pipelining...
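The trace-then-transform idea can be illustrated with a toy tracer (stdlib only; all names are hypothetical and unrelated to the real `HFTracer` API): calls are recorded as graph nodes, and a separate pass rewrites them.

```python
# Toy illustration of trace-and-transform (hypothetical, stdlib only):
# record each call as a node in a flat graph, then run a rewrite pass.

class ToyTracer:
    def __init__(self):
        self.graph = []  # list of (op_name, args) nodes

    def call(self, op_name, *args):
        # Record the op instead of executing it, returning a symbolic
        # handle so later ops can reference earlier results.
        self.graph.append((op_name, args))
        return f"%{len(self.graph) - 1}"

def rewrite(graph, old_op, new_op):
    # A transform pass: replace every node running `old_op` with `new_op`,
    # e.g. swapping a layer for its pipelined counterpart.
    return [(new_op if op == old_op else op, args) for op, args in graph]

tracer = ToyTracer()
h = tracer.call("embedding", "input_ids")
tracer.call("linear", h)
pipelined = rewrite(tracer.graph, "linear", "pipelined_linear")
```

A real fx tracer records tensor operations into a typed graph IR rather than a flat list, but the record-then-rewrite structure is the same.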
- [ ] Greedy search - [ ] Sample - [ ] Beam search - [ ] Beam sample - [ ] Group beam search - [ ] Constrained beam...
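The difference between the first and third items, greedy search and beam search, can be sketched over a toy two-step next-token scorer (stdlib only; this is not the Transformers generation API, and the probabilities are made up):

```python
import math

# Toy log-prob tables: FIRST scores the first token; SECOND scores the
# second token conditioned on the first. Chosen so that greedy search
# and beam search disagree.
FIRST = {"a": math.log(0.6), "b": math.log(0.4)}
SECOND = {
    "a": {"x": math.log(0.2), "y": math.log(0.1)},
    "b": {"x": math.log(0.9), "y": math.log(0.05)},
}

def greedy():
    # Greedy search: commit to the single best token at every step.
    first = max(FIRST, key=FIRST.get)
    second = max(SECOND[first], key=SECOND[first].get)
    return [first, second]

def beam_search(k=2):
    # Beam search: keep the k best partial sequences, then pick the
    # complete sequence with the highest total log-prob.
    beams = sorted(FIRST.items(), key=lambda kv: kv[1], reverse=True)[:k]
    expanded = [
        ([tok, nxt], lp + s)
        for tok, lp in beams
        for nxt, s in SECOND[tok].items()
    ]
    return max(expanded, key=lambda b: b[1])[0]
```

Here greedy locks in "a" (0.6) and ends at probability 0.12, while the beam keeps "b" alive and finds the better sequence "b x" at 0.36, which is exactly why beam search is listed separately from greedy search.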
Currently, during generation, the encoder outputs can only be computed after having compiled the encoder separately. It would be nice to be able to compute the encoder outputs directly from...
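The pattern described, computing encoder outputs once inside generation instead of requiring a separately compiled encoder pass, can be sketched with stand-in objects (all names and the arithmetic are hypothetical):

```python
class TinyEncoderDecoder:
    # Stand-in for a compiled encoder-decoder model (hypothetical names).
    def __init__(self):
        self.encoder_calls = 0

    def encode(self, inputs):
        self.encoder_calls += 1
        return [x * 2 for x in inputs]  # pretend hidden states

    def generate(self, inputs, steps=3, encoder_outputs=None):
        # If the caller did not precompute encoder outputs, compute them
        # once here rather than demanding a separate encoder pass.
        if encoder_outputs is None:
            encoder_outputs = self.encode(inputs)
        tokens = []
        for _ in range(steps):
            # Every decoding step reuses the same cached encoder outputs.
            tokens.append(sum(encoder_outputs) % 7)
        return tokens

model = TinyEncoderDecoder()
out = model.generate([1, 2, 3])
```

The point of the sketch is the counter: however many decoding steps run, the encoder executes exactly once.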
Many tests were temporarily disabled, either because they did not pass and were not critical, or because they could not be enabled until other PRs were merged...
- [ ] Add `test_generation_beam_search.py` - [ ] Add `test_generation_beam_constraints.py`
This PR enables hyperparameter optimization with Optuna to optimize for latency and/or throughput. Both single- and multi-objective optimization are supported. There are 3 tuning modes: - **Latency**:...
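In the multi-objective case there is no single best trial, only a Pareto front trading latency against throughput. A stdlib-only sketch of that selection (the config names and benchmark numbers are made up; Optuna computes this front for you via its `best_trials`):

```python
# Illustrative multi-objective selection (stdlib only): each trial
# yields (latency_ms, throughput_qps); minimize latency, maximize
# throughput.
TRIALS = [
    ("cfg_a", 12.0, 800.0),
    ("cfg_b", 9.0, 650.0),
    ("cfg_c", 15.0, 820.0),
    ("cfg_d", 9.5, 640.0),
]

def dominates(t, u):
    # t dominates u if it is no worse on both objectives
    # and strictly better on at least one.
    return (t[1] <= u[1] and t[2] >= u[2]) and (t[1] < u[1] or t[2] > u[2])

def pareto_front(trials):
    # Keep every trial not dominated by some other trial.
    return [
        t for t in trials
        if not any(dominates(u, t) for u in trials if u is not t)
    ]

front = sorted(name for name, *_ in pareto_front(TRIALS))
```

Here `cfg_d` drops out because `cfg_b` is both faster and higher-throughput; the remaining three are incomparable, which is the trade-off the **Both** tuning mode surfaces to the user.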