linkstatic12
Please elaborate more
I might be wrong here but the table transformer doesn't do OCR.
Yes, neither function performs OCR.
https://colab.research.google.com/drive/1lLRyBr7WraGdUJm-urUm_utArw6SkoCJ?usp=sharing Take a look at this
I ran extractable, thinking it would be a good alternative to my implementation, but sadly it's not that good.
Can you share the FinTabNet model here?
I would advise using pdftools, which is available in R and can also be used from Python. pdftools is much more accurate when it comes to PDF manipulation.
You can convert the PDF pages to images.
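One possible way to do the page-to-image step, as a rough sketch only: this assumes the Poppler command-line tool `pdftoppm` is installed and on your PATH, which nobody in this thread has confirmed using. The helper names `build_pdftoppm_command` and `pdf_to_images` and the `page` output prefix are made up for illustration.

```python
import shutil
import subprocess
from pathlib import Path


def build_pdftoppm_command(pdf_path, out_prefix, dpi=150):
    """Return the argument list that renders every page of a PDF to PNG."""
    # -png selects PNG output; -r sets the rendering resolution in DPI.
    return ["pdftoppm", "-png", "-r", str(dpi), str(pdf_path), str(out_prefix)]


def pdf_to_images(pdf_path, out_dir, dpi=150):
    """Render each page into out_dir and return the PNG paths sorted by name."""
    # Assumption: Poppler is installed; fail early with a clear message if not.
    if shutil.which("pdftoppm") is None:
        raise RuntimeError("pdftoppm (Poppler) was not found on PATH")
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    prefix = out_dir / "page"  # hypothetical prefix, files become page-<n>.png
    subprocess.run(build_pdftoppm_command(pdf_path, prefix, dpi), check=True)
    return sorted(out_dir.glob("page-*.png"))
```

The returned PNG files could then be opened as images and passed to the table detection step. A higher `dpi` gives sharper pages at the cost of larger files.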
`pipe = TableExtractionPipeline(det_config_path='detection_config.json', det_model_path='../pubtables1m_detection_detr_r18.pth', det_device='cuda', str_config_path='structure_config.json', str_model_path='../TATR-v1.1-Fin-msft.pth', str_device='cuda')` is an example of using only the FinTabNet weights.
@bsmock It did. I was training on the FinTabNet dataset, but you guys had already done it.