layout-parser
A Unified Toolkit for Deep Learning Based Document Image Analysis
Code: import layoutparser as lp model = lp.Detectron2LayoutModel( config_path ='lp://MFD/faster_rcnn_R_50_FPN_3x/config', # In model catalog label_map ={0: "Text", 1: "Title", 2: "List", 3:"Table", 4:"Figure"}, # In model`label_map` extra_config=["MODEL.ROI_HEADS.SCORE_THRESH_TEST", 0.7] # Optional...
Using `lp.Detectron2LayoutModel` currently leads to an error. This change corrects it to `lp.models.Detectron2LayoutModel`.
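As an illustration of the rename described above, here is a minimal sketch of looking a class up under either name. It assumes the error is an `AttributeError` caused by the name missing at the top level, which the report does not state. The helper `resolve_attr` and the stand-in object `fake_lp` are hypothetical and are not part of layoutparser.

```python
from types import SimpleNamespace


def resolve_attr(root, *dotted_paths):
    """Return the first attribute reachable on `root` among the dotted paths."""
    for path in dotted_paths:
        obj = root
        try:
            for name in path.split("."):
                obj = getattr(obj, name)
        except AttributeError:
            continue  # this path does not exist, try the next one
        return obj
    raise AttributeError(f"none of {dotted_paths} found on {root!r}")


# Hypothetical stand-in for the imported module, so the sketch runs without
# layoutparser installed. Only the nested name exists on it.
fake_lp = SimpleNamespace(
    models=SimpleNamespace(Detectron2LayoutModel="model-class")
)

found = resolve_attr(
    fake_lp, "Detectron2LayoutModel", "models.Detectron2LayoutModel"
)
```

With the real package, the same helper could be called on the imported module in place of `fake_lp`, so calling code works whichever of the two names is available.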
Subject: GitHub Message - Continued Issues Despite Training with 2500 Images Dear GitHub Community, I hope this message finds you well. I am reaching out with a concern regarding a...
The documentation example at [this link](https://github.com/Layout-Parser/layout-parser/blob/main/docs/example/deep_layout_parsing/index.rst) throws an error. ``` TypeError: 'inplace' is an invalid keyword argument for sort() ``` The line that throws this error is...
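Python's built-in `list.sort` raises a `TypeError` of this kind when given an `inplace` keyword, because it accepts only `key` and `reverse`. The sketch below reproduces that behavior on a plain list. Whether the object in the documentation example is a plain list is an assumption, and the `blocks` data is hypothetical.

```python
# Hypothetical stand-ins for layout blocks: (label, top y-coordinate).
blocks = [("Figure", 320), ("Title", 40), ("Text", 150)]

try:
    # list.sort accepts only `key` and `reverse`, so this call fails.
    blocks.sort(key=lambda b: b[1], inplace=True)
    raised = False
except TypeError:
    raised = True

# list.sort already sorts the list in place, so dropping the keyword is enough.
blocks.sort(key=lambda b: b[1])
labels = [label for label, _ in blocks]
```

If the original list must stay untouched, `sorted(blocks, key=...)` returns a new sorted list instead.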
I have been using Layout Parser to analyze different types of documents. I get the expected results on true PDFs but not on scanned...
The link to the official Detectron2 [installation instructions](https://github.com/facebookresearch/detectron2/blob/master/INSTALL) leads to a "page not found" (404) error. It should be changed to the new Read the Docs [installation instructions](https://detectron2.readthedocs.io/en/latest/tutorials/install.html).
This request adds support for EfficientOCR (https://arxiv.org/abs/2304.02737) to LayoutParser as an available OCR engine. EffOCR has been demonstrated to provide quick, accurate OCR of historical English and Japanese documents in...
**Motivation** I hope layout-parser can support the open standard HTML OCR (hOCR) file format that represents document layouts. It would allow easier creation of OCR'ed PDFs and allow for interoperability...