layout-parser icon indicating copy to clipboard operation
layout-parser copied to clipboard

Is there any way to preserve heading and content together.

Open SAIVENKATARAJU opened this issue 4 years ago • 3 comments

Hi,

Thanks for your library. I have below screenshot with headers and text, is there any way to get the heading together with text. output

SAIVENKATARAJU avatar Oct 26 '21 12:10 SAIVENKATARAJU

Hi @SAIVENKATARAJU !

Could you please provide more context to your questions?

  1. Is it a PDF or just scanned document?
  2. What do you mean by "get the heading together with text"? You mean you'd like to get structured data like the following?
    [
        {
            "heading": "..",
            "text": "..."
        }, ...
    ]
    

lolipopshock avatar Oct 31 '21 23:10 lolipopshock

Hi @lolipopshock Thanks for your reply. My documents are PDF's. and yes I just want like you mentioned above.

SAIVENKATARAJU avatar Nov 08 '21 14:11 SAIVENKATARAJU

@SAIVENKATARAJU I have the same case :) Somebody solve this problem ? @lolipopshock

ciepielajan avatar Aug 19 '22 14:08 ciepielajan