fastembed Sentence Transformers Candidate Models

Models

[ ] https://huggingface.co/sentence-transformers/all-MiniLM-L12-v2
[ ] https://huggingface.co/sentence-transformers/multi-qa-MiniLM-L6-cos-v1
[ ] https://huggingface.co/sentence-transformers/bert-base-nli-mean-tokens
[ ] https://huggingface.co/sentence-transformers/paraphrase-MiniLM-L6-v2
[ ] https://huggingface.co/sentence-transformers/all-mpnet-base-v2
[ ] https://huggingface.co/sentence-transformers/multi-qa-mpnet-base-dot-v1

Please add in comments if you would like a model from this list to be added!

Feb 23 '24 05:02 NirantK

@NirantK Hey, I think fastembed is dependent on the huggingface's tokenizer, right? Why does it not support any of the models mentioned in the issue description by default?

Feb 26 '24 18:02 Tanmaypatil123

We export the models to ONNX and quantize them to improve compute times, while preserving as much performance as we can — that means each model needs a bit of work and testing.

I am not sure how the tokenizer is relevant though. We don't use transformers or torch libs.

Feb 29 '24 19:02 NirantK

Hey! I have used this https://huggingface.co/sentence-transformers/paraphrase-MiniLM-L6-v2 in few chat-bots that i have built. And this really works well even with the large sized llms. You can also check in one where i used it: https://github.com/Ya-shh/Custom-Ai-ChatBot.git

Mar 04 '24 13:03 Ya-shh