vlm-ocr topic

List vlm-ocr repositories

ImageFromTextGenerator

Stars: 20 | Forks: 1 | Watchers: 20

IFTG (ImageFromTextGenerator) is a Python package that simplifies creating robust datasets for OCR models. Generate images from text, apply over 10 built-in noise effects, and customize fonts and layo...

ocr-benchmark

Stars: 45 | Forks: 4 | Watchers: 45

Benchmarking Vision-Language Models on OCR tasks in Dynamic Video Environments