fastembed
Fast, accurate, lightweight Python library for state-of-the-art embeddings
Hi, I am experiencing high memory usage, which caused my pod to be killed for exceeding its limits. After some experiments I found it's related to text length that...
Introduces a new sparse model based on attention weights. Consider it an extension of BM25 for short documents.
# Downloading the model files (model.onnx, tokenizer.json, vocab.txt) from Hugging Face. Now I want to pass this local path and don't want fastembed to download from HF. Trying the below, but...
Canonical vector values: https://colab.research.google.com/drive/1OTZcLxWWAmJqV1ZDS57W5x6zJPRzku4n?usp=sharing ONNX model: https://huggingface.co/yashvardhan7/bge-m3-onnx/tree/main Contributed ONNX weights to the BAAI/bge-m3 HF model repo: https://huggingface.co/BAAI/bge-m3/tree/main/onnx
Fixes #174
@michaelfeil had mentioned this in https://github.com/qdrant/fastembed/pull/54 — but I kind of missed the bus on that. Let's do that now. Primarily, need to do two things: 1. Port this into...
The nomic-embed-text-v1.5 model supports variable dimensions. Is there a way to set the model dimensions/size? embeddings_model = TextEmbedding("nomic-ai/nomic-embed-text-v1.5")
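One possible workaround for the variable-dimension question, sketched here under assumptions rather than as a fastembed feature: since the model is said to support variable dimensions, the full-size vectors could be shortened after embedding by keeping the leading components and re-normalizing. The helper name `truncate_embeddings` is hypothetical, it uses only NumPy, and the model card may describe extra steps (such as a layer normalization before slicing) that this sketch leaves out.

```python
import numpy as np


def truncate_embeddings(embeddings, dim):
    """Hypothetical helper (not part of fastembed): keep the first `dim`
    components of each vector and L2-normalize the result."""
    arr = np.asarray(embeddings, dtype=np.float64)
    if arr.ndim != 2:
        raise ValueError("expected a 2-D array of shape (n_vectors, n_dims)")
    if not 0 < dim <= arr.shape[1]:
        raise ValueError("dim must be between 1 and the original dimension")

    # Keep only the leading components of every vector.
    truncated = arr[:, :dim]

    # Re-normalize so cosine similarity and dot product agree again.
    norms = np.linalg.norm(truncated, axis=1, keepdims=True)
    return truncated / np.clip(norms, 1e-12, None)


if __name__ == "__main__":
    # Random stand-in data; real vectors would come from the embedding model.
    rng = np.random.default_rng(0)
    full = rng.normal(size=(3, 768))
    short = truncate_embeddings(full, 256)
    print(short.shape)
```

In practice the input would be the vectors produced by `embeddings_model.embed(...)`, collected into a list first, since that call yields results lazily. Whether truncated vectors keep enough quality for a given task is something to measure, not assume.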