Able to chat images with multimodal apis/models

Open bjwswang opened this issue 2 years ago • 0 comments

user uploads a imiage, call a multimodal service(qwen-vl) to identify it (maybe with documentloader) and embedding the description message to vectorstore with extra image info.
Based on user's question in chat, we should return a special image reference

Feb 29 '24 08:02 bjwswang