fastembed icon indicating copy to clipboard operation
fastembed copied to clipboard

Sentence Transformers Candidate Models

Open NirantK opened this issue 1 year ago • 3 comments

Models

  • [ ] https://huggingface.co/sentence-transformers/all-MiniLM-L12-v2
  • [ ] https://huggingface.co/sentence-transformers/multi-qa-MiniLM-L6-cos-v1
  • [ ] https://huggingface.co/sentence-transformers/bert-base-nli-mean-tokens
  • [ ] https://huggingface.co/sentence-transformers/paraphrase-MiniLM-L6-v2
  • [ ] https://huggingface.co/sentence-transformers/all-mpnet-base-v2
  • [ ] https://huggingface.co/sentence-transformers/multi-qa-mpnet-base-dot-v1

Please add in comments if you would like a model from this list to be added!

NirantK avatar Feb 23 '24 05:02 NirantK

@NirantK Hey, I think fastembed is dependent on the huggingface's tokenizer, right? Why does it not support any of the models mentioned in the issue description by default?

Tanmaypatil123 avatar Feb 26 '24 18:02 Tanmaypatil123

We export the models to ONNX and quantize them to improve compute times, while preserving as much performance as we can — that means each model needs a bit of work and testing.

I am not sure how the tokenizer is relevant though. We don't use transformers or torch libs.

NirantK avatar Feb 29 '24 19:02 NirantK

Hey! I have used this https://huggingface.co/sentence-transformers/paraphrase-MiniLM-L6-v2 in few chat-bots that i have built. And this really works well even with the large sized llms. You can also check in one where i used it: https://github.com/Ya-shh/Custom-Ai-ChatBot.git

Ya-shh avatar Mar 04 '24 13:03 Ya-shh