Maxim Lysak

47 comments by Maxim Lysak

We had issues matching predicted table cells with the underlying PDF cells obtained from the PDF backend. With the newest improvements to the PDF backend, we have more granular...

@MahmoudAtef999 Please try Docling v2.26.0. We updated the table model with new weights, which should address your issue.

Thanks for the examples @MahmoudAtef4499 !

@JTCorrin, any chance you could share an example of a problematic PPTX with us? I would expect that it does not appear with every PPTX, but only with the ones that have WMF...

@pwab Indeed, we have to add this functionality to all the other backends (apart from PDF). I'll look into it.

@kurtgdl Please try Docling v2.26.0. We updated the table model with new weights, which should address your issue.

@pwab There are some bugs going on with this file, I'm looking into it!

> Hi @maxmnemonic I am facing a similar issue but the behavior is different for markdown export and text export, while converting the following document... > Could you please explain...

Quick update: the issue described here, where text converted to vector images is skipped, can also be handled with forced full-page OCR, which is being prepared in this PR: [feat(OCR):...