markitdown icon indicating copy to clipboard operation
markitdown copied to clipboard

Invalid readme - it's not able to convert PDF to markdown

Open ol-loginov opened this issue 1 year ago • 4 comments

And it wasn't going to do it. It just take output from pdfminer, which is declared as "a text extraction tool for PDF documents".

Markdown is something different

ol-loginov avatar Dec 20 '24 09:12 ol-loginov

I was very excited that Microsoft has open-sourced its document-to-markdown tool, but when I tried it with a PDF, it only converted to text, not Markdown, and headings and tables couldn't be converted.

yutaixi avatar Dec 20 '24 09:12 yutaixi

same issue

lilu-trustplus avatar Dec 30 '24 08:12 lilu-trustplus

It is just converting the pdf data into text format, not giving the proper structured markdown format as excel

ashish40108686 avatar Jan 06 '25 07:01 ashish40108686

+1, markitdown better than other similar tool?

GallonDeng avatar Jan 06 '25 11:01 GallonDeng