nimble
nimble copied to clipboard
There should be documentation of how to use nimble on Huggingface transformers
🚀 Feature
In much the same way that there is documentation of wrapping Nimble around a torchvision model, there should be documentation (and benchmarks?) around wrapping Nimble around 🤗 language models. Unclear: whether this is a docs-only change.
Motivation
Nimble looks interesting and I am interesting in speeding up my NLP runs and I only see docs for torchvision
Pitch
- Adapt Nimble to ingest some 🤗 transformers or other models. (This may be a no-op)
- Write it up on the readme