
Support multi-pass RAG search

Open juntao opened this issue 1 year ago • 0 comments

The current approach of searching only the last user message for RAG content is too simplistic, especially in multi-turn conversations or in agentic apps where the agent automatically adds to or rephrases the last user message.

I think we need to combine the last 3 to 5 user messages into a single query and perform a second search pass with it, in addition to the existing search on the last user message. The highest-scoring vectors across both searches will be selected for the context.
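A rough sketch of the idea, in Python for brevity rather than in the server's own code. Everything named here is hypothetical: `multi_pass_search`, the `search` callback (standing in for whatever embeds a query and asks the vector store for its nearest chunks), the `(chunk_id, score, text)` hit shape, and the `window` and `top_k` defaults. It also assumes that scores from the two passes are directly comparable and that a higher score means a better match.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical hit shape: (chunk_id, score, text); higher score = more relevant.
Hit = Tuple[str, float, str]
# Hypothetical search callback: (query, limit) -> hits from the vector store.
SearchFn = Callable[[str, int], List[Hit]]


def multi_pass_search(
    messages: List[dict],
    search: SearchFn,
    window: int = 3,
    top_k: int = 5,
) -> List[Hit]:
    """Search with the last user message, then with the last `window`
    user messages combined, and keep the best-scoring chunks overall."""
    user_texts = [m["content"] for m in messages if m["role"] == "user"]
    if not user_texts:
        return []

    # Pass 1: the last user message on its own (the current behavior).
    queries = [user_texts[-1]]
    # Pass 2: the last few user messages joined into one query.
    if len(user_texts) > 1:
        queries.append("\n".join(user_texts[-window:]))

    # Merge both result sets, keeping the best score seen for each chunk.
    best: Dict[str, Hit] = {}
    for query in queries:
        for hit in search(query, top_k):
            chunk_id, score, _ = hit
            if chunk_id not in best or score > best[chunk_id][1]:
                best[chunk_id] = hit

    # Highest-scoring chunks from both passes go into the context.
    return sorted(best.values(), key=lambda h: h[1], reverse=True)[:top_k]
```

De-duplicating by chunk id keeps a chunk that matches both queries from taking two context slots. If scores from the two passes turn out not to be comparable, the merge step would need some form of normalization or a per-pass quota instead.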

juntao, Oct 03 '24 08:10