python-docx2txt icon indicating copy to clipboard operation
python-docx2txt copied to clipboard

How to differentiate between header text vs paragraph text?

Open tbell511 opened this issue 8 years ago • 1 comments

Hi! I absolutely love this project. Quick question though.

After processing a document and printing the result, is there a way to see what is header text vs what is just a paragraph?

Right now all text, including headers, are being printed with the paragraph text.

Thanks!

tbell511 avatar Jul 12 '17 21:07 tbell511

This should be possible with small changes in the code. If you look at the code, text from header, main document and footer are appended to a single 'text' variable. You can collect these text in different variables and print as per your need.

Feel free to send a PR for this.

ankushshah89 avatar Jul 14 '17 04:07 ankushshah89