Is it possible to OCR playing video using OCR backend like Tesseract

Open gunir opened this issue 1 year ago • 2 comments

I'm trying to make mpv OCR a video to detect subtitles and translate to another window, mostly for watching movies that I don't understand and language learning, but the issue is I can't find API to capture screen to memory, or even more important, capture region to memory (to make OCR more accurate and cost less resource, because smaller = faster ? Is there APIs like that ?

Apr 12 '24 06:04 gunir

There are already projects that allow you to do this.

Apr 12 '24 15:04 fideliochan

It'd be better to translate subtitles beforehand via whisper. https://github.com/abb128/LiveCaptions Also exists.

Apr 18 '24 09:04 cold-nagai