Maxim Lysak

Results 47 comments of Maxim Lysak

Thanks for the example @aborruso!

@AdBaWa, can you please try converting your tables with this option: `TableFormerMode.ACCURATE` as described here: [control-pdf-table-extraction-options](https://ds4sd.github.io/docling/usage/#control-pdf-table-extraction-options) This is to use the version of our TableFormer that has more layers /...

I see, the header is misaligned with content of the table (text of a header from one column is above the content of another column). Thanks for the input, we...

Hello @AdBaWa, I believe we have a fix for this issue, here is a PR: https://github.com/docling-project/docling/pull/1238

I checked on `Windows 11 Pro 64bit x86` bare metal machine. Both commands (with OCR and without OCR) worked fine, and produced MD files

Same issue: https://github.com/DS4SD/docling/issues/795

Thank you @mattmalcher, @MiguelAngelTorres, @asvintheguy, this PR should resolve this issue: https://github.com/DS4SD/docling/pull/842

@jkindahood, small update, I'm working on PR to resolve image identification in DOCX: https://github.com/DS4SD/docling/pull/330

Thanks @EngineerChao! We've been looking into this issue with more generic approach that would treat not only rotated tables, but also any other element (rotated paragraphs, illustrations), it is not...

Hey @duongkstn , here is a PR that should fix the issue, thanks for the provided example, I used it to check the fix: https://github.com/docling-project/docling/pull/1238