MindSearch icon indicating copy to clipboard operation
MindSearch copied to clipboard

Multi-modal Support

Open ZackBradshaw opened this issue 1 year ago • 1 comments

Support for Video and Image understanding in search. I'm looking to use this with VILA. I'd love to use lmdeploy but as vila is not supported I'm wondering how feasible it would be to swap out inference engine to some thing like tiny chat

ZackBradshaw avatar Aug 09 '24 15:08 ZackBradshaw

nvm found https://huggingface.co/OpenGVLab/InternVL2-40B Still wondering if this works with mllm

ZackBradshaw avatar Aug 09 '24 16:08 ZackBradshaw