supermemory Images & Multimodal support

When asked about a specific text from the database, it only recognizes text, but the text from the image isn't recognized: Here the question:

Here the tweet mentioning the image with the text

Jul 24 '24 21:07 tomipardinas

There's ways to fix this right now, but I think that at scale it doesn't make sense to use a vision model or even OCR for that matter on every single image. We currently are reaching into 60,000 tweets imported into supermemory. a lot of them also have images. would get expeensive

Jul 24 '24 21:07 Dhravya

so you have any other method method ?

Jul 25 '24 14:07 CodeTorso

@Dhravya for the time being, if we really want the feature, we can use some OCR API? which are free/extremely cheap?

Jul 25 '24 17:07 krakenftw

https://ocr.space/ocrapi here is one

Jul 25 '24 17:07 krakenftw

This should be tackled eventually. If we're importing tweets into Supermemory, providing a first-class experience to the user is important.

Jul 25 '24 23:07 Welding-Torch

Yes. If the objective is truly to create a "super" memory, it needs to be able to extract useful information from all types of unstructured data in your tweets, including images containing text. This is especially important because tweets with images are more likely to be retweeted, which increases their chances of being valuable and, consequently, the probability of being bookmarked.

Jul 25 '24 23:07 tomipardinas

Hmmm. this is gonna be a long discussion. We did create endpoints and stuff to make this work long ago but nothing after that

Jul 26 '24 14:07 Dhravya

yep that endpoint uses llava-1.5-7b-hf,

nowhere on cloudflare's docs we can get an idea of how much it will cost

but since we are already looking forward to adding pro plan, why not make it part of that ?

Jul 27 '24 23:07 CodeTorso

@CodeTorso As a user I would definitely expect this to be part of the free plan since importing tweets right now is basically our USP

Jul 28 '24 07:07 Welding-Torch

Hmmmmm, llava right now is free. We can probably look into this and do a dual sort of import, should be not that hard when we have the image URL that we can fetch

but we can only do this after the queues PR is merged, and then add an extra step in the add workflow to fetch any images and embed those too

Aug 04 '24 17:08 Dhravya

@Dhravya we can add paid version for saving images. Can integrate stripe. What say?

Aug 05 '24 16:08 ameeetgaikwad

Again, As a user I would definitely expect recognizing content in saved tweet images to be part of the free plan since importing tweets right now is basically our USP

Aug 05 '24 16:08 Welding-Torch

Hmm, that's right.

Aug 06 '24 14:08 ameeetgaikwad

this feature is already live with file import support

Aug 20 '25 05:08 MaheshtheDev