Incorrect text positions for some font

Open JorjMcKie opened this issue 1 year ago • 3 comments

Description of the bug

The attached PDF page uses a font whose ascender value is incorrectly determined by MuPDF. This post is based on Discussions post #3772.

How to reproduce the bug

The command that reveals the problem is: mutool draw -o test.xml -F stext test.pdf (attached file: test.pdf)

The problematic font is "AdvOT144a03c1" (embedded subset). Note that this error happens in MuPDF, not PyMuPDF, so we assign the label "upstream bug". However, PyMuPDF's Page method get_text("dict") does not correctly compensate for the wrong computation when TOOLS.set_small_glyph_heights(True) is in effect. So we also assign the (PyMuPDF) label "enhancement".
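For anyone who wants to look at the affected spans from the PyMuPDF side, here is a minimal sketch. It is not taken from the issue itself: the helper names (spans_for_font, height_ratio, report) are made up for illustration, and it assumes the problematic text is on the first page of test.pdf. It only prints the ascender, descender and bbox height that get_text("dict") reports for the font, so the values can be compared with the stext output of mutool.

```python
def spans_for_font(page_dict, font_name):
    """Yield the spans of a get_text("dict") result whose font name contains font_name."""
    for block in page_dict.get("blocks", []):
        # Image blocks carry no "lines" key, so fall back to an empty list.
        for line in block.get("lines", []):
            for span in line["spans"]:
                if font_name in span["font"]:
                    yield span


def height_ratio(span):
    """Return the span's bbox height divided by its font size."""
    x0, y0, x1, y1 = span["bbox"]
    return (y1 - y0) / span["size"]


def report(pdf_path="test.pdf", font_name="AdvOT144a03c1"):
    """Print ascender, descender and bbox height ratio for each span of font_name.

    Assumption: the affected text is on the first page of pdf_path.
    """
    import pymupdf  # imported here so the helpers above work without PyMuPDF

    # Ask PyMuPDF to base bbox heights on the font size instead of
    # the ascender / descender values reported by MuPDF.
    pymupdf.TOOLS.set_small_glyph_heights(True)
    with pymupdf.open(pdf_path) as doc:
        page_dict = doc[0].get_text("dict")
    for span in spans_for_font(page_dict, font_name):
        print(
            repr(span["text"]),
            span["ascender"],
            span["descender"],
            round(height_ratio(span), 3),
        )
```

Calling report() with the attached file in the working directory lists the spans in question. With small glyph heights switched on, one would expect a height ratio close to 1; a clearly different value for this font would point to the missing repair described above.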

PyMuPDF version

1.24.9

Operating system

Windows

Python version

3.12

JorjMcKie avatar Aug 14 '24 11:08 JorjMcKie

MuPDF bug id: https://bugs.ghostscript.com/show_bug.cgi?id=707947.

JorjMcKie avatar Aug 14 '24 11:08 JorjMcKie

@JorjMcKie This has been fixed upstream, so it can be marked "fix developed", right?

sebras avatar Sep 27 '24 21:09 sebras

This issue also needs a fix in PyMuPDF itself, which I don't think has been done yet, so "fix developed" shouldn't be used yet.