Maxim Lysak
Maxim Lysak
We need to add parameter to enable or disable empty column / row removal from Table Model post processing. Details in the discussion: https://github.com/DS4SD/docling/discussions/201 See if there are some other...
### Requested feature Our users encounter from time to time documents that instead of text have vector path's representing themselves as text. Because of the vector nature of it, we...
**Issue resolved by this Pull Request:** Resolves #273 **Checklist:** - [ ] **Commit Message Formatting**: Commit titles and messages follow guidelines in the [conventional commits](https://www.conventionalcommits.org/en/v1.0.0/#summary). - [ ] Documentation has...
This PR populates image data in docling documents by PPTX backend, also introduces basic PPTX tests. **Checklist:** - [ ] Documentation has been updated, if necessary. - [ ] Examples...
- Fixes for referencing drawing blip in wordx - Added safety try-except when trying to load pillow image from a docx blob. - Added explicit dependency on lxml. **Issue resolved...
Using style id instead of style names, which should be localization agnostic **Issue resolved by this Pull Request:** Resolves #489 **Checklist:** - [ ] Documentation has been updated, if necessary....
old wip, need to consider if we want to use such approach **Checklist:** - [ ] Documentation has been updated, if necessary. - [ ] Examples have been added, if...
Preliminary integration with SmolDocling model and VLM Pipeline: - SmolDocling inference model - New VLM Pipeline that uses SmolDocling model - Assembly code that builds Docling document from Doc-tags format...
### Requested feature Currently Docling Document doesn't support images inside table cells, it's happening across input formats. Example with DOCX: https://github.com/DS4SD/docling/issues/674 ### Alternatives ...