EasyOCR icon indicating copy to clipboard operation
EasyOCR copied to clipboard

Fine tuning g2 latin / spanish model on my own dataset

Open emigomez opened this issue 3 years ago • 2 comments

Hi,

I see your instructions in https://github.com/JaidedAI/EasyOCR/blob/master/custom_model.md to train a model, but I'm not sure if these instructions can be used to fine tuning an existing easyocr model, or just to train it from models provided here https://github.com/clovaai/deep-text-recognition-benchmark

I want to fine tune the existing easyocr spanish model (used in reader = easyocr.Reader(['es'])) with my own dataset.

How can I do it? how can I indicate the easyocr model as a starting point in the https://github.com/clovaai/deep-text-recognition-benchmark train? or is better to use these scripts https://github.com/JaidedAI/EasyOCR/tree/master/trainer?

Thanks!

emigomez avatar May 30 '22 10:05 emigomez

I'm having the same doubts!

CamiloSaboA-csv avatar May 31 '22 19:05 CamiloSaboA-csv

You can download the latin model from the official website and configure it in the .yaml as a starting point

Model hub: https://www.jaided.ai/easyocr/modelhub/

in your config.yaml saved_model: "saved_models/latin/latin_g2.pth" (or whatever path you have)

iblub1 avatar Jun 13 '22 13:06 iblub1

but how do you set the language and charset for training the latin_g2.pth?

SkygirlLuna avatar Jan 24 '23 01:01 SkygirlLuna