Make use of Parsr's heading detection

Open bogdankostic opened this issue 3 years ago • 0 comments

Describe the solution you'd like

Parsr has built-in heading detection. We should use it to add heading information from PDFs to the converted Documents. As far as I know, Parsr detects headings but not their hierarchy. We could infer the hierarchy with a heuristic based on the font size of each heading.
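A minimal sketch of what such a font-size heuristic might look like. It assumes the detected headings are already available as `(text, font_size)` pairs; that input shape, the function name `assign_heading_levels`, and the `tolerance` parameter are all hypothetical and not part of Parsr's or Haystack's API. The largest font size becomes level 1, and sizes within `tolerance` points of a tier's largest size share that tier's level.

```python
from typing import Dict, List, Tuple


def assign_heading_levels(
    headings: List[Tuple[str, float]], tolerance: float = 0.5
) -> List[Tuple[str, int]]:
    """Map (text, font_size) pairs to (text, level); the largest font gets level 1.

    Hypothetical helper: the input format is an assumption, not Parsr's output schema.
    """
    # Distinct font sizes, largest first.
    sizes = sorted({size for _, size in headings}, reverse=True)

    size_to_level: Dict[float, int] = {}
    tier_top = None  # largest font size in the current tier
    level = 0
    for size in sizes:
        # Start a new tier when the size drops more than `tolerance`
        # below the largest size of the current tier.
        if tier_top is None or tier_top - size > tolerance:
            level += 1
            tier_top = size
        size_to_level[size] = level

    # Preserve the original document order of the headings.
    return [(text, size_to_level[size]) for text, size in headings]


example = [
    ("Introduction", 18.0),
    ("Background", 14.0),
    ("Details", 12.0),
    ("Methods", 18.0),
    ("Setup", 13.8),
]
levels = assign_heading_levels(example)
```

The tolerance is there because extracted font sizes can vary slightly for headings that are visually the same size. A heuristic like this only ranks sizes relative to each other within one document, so it would not notice skipped levels or headings distinguished by weight rather than size.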

bogdankostic, Aug 17 '22 12:08