Enhanced page layout extraction

Open shihanmax opened this issue 3 years ago • 1 comments

Added new function get_multi_page_layouts() based on existing get_page_layout().

The new function returns a list of PDFMiner LTPage objects and page dimensions for each page of a multi-page PDF file. This is achieved by iterating through each page of the PDF file and extracting the LTPage object and page dimension for that page.

Mar 21 '23 04:03 shihanmax

Hey!

As camelot is dead, we try to build a maintained fork at pypdf_table_extraction.

Do you want to open the PR against that branch so that we can merge your improvement?

Feb 25 '24 11:02 MartinThoma