PyMuPDF
PyMuPDF copied to clipboard
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
### Description of the bug Want to split a pdf file into multiple pdf by 5 pages per file. When insert_pdf is called, a ValueError is thrown.  [example.pdf](https://github.com/user-attachments/files/16655286/example.pdf) ###...
### Description of the bug I'm adding a redaction region to a part of the PDF, but after calling apply_redactions(), one side of the entire page goes missing (opened in...
### Description of the bug Code in documentation is outdated. ### How to reproduce the bug How to Use Ink Annotations section: page.addInkAnnot function outdated. should be page.add_ink_annot How to...
### Description of the bug The attached PDF page uses a font which ascender value is incorrectly determined by MuPDF. Basis of this post is Discussions post #3772. ### How...
### Description of the bug When setting the border of links I get the following error: ``` max@merry ~/github/zotlinksToPdfs [14:05:18] (base) > $ python demo.py Traceback (most recent call last):...
### Description of the bug The `pymupdf.get_tessdata()` function raises an unexpected error when the installed version of Tesseract OCR is not 4.0 (tested on the latest Debian, with Tesseract 5)....
### Description of the bug I was trying to remove all text from PDF files. My python script looks like the following: ```python for page in document: info = json.loads(page.get_text('json',...
### Description of the bug All attempts at saving a pdf with encryption and an owner_pw result in a PDF that will open in Acrobat Pro (2024.002.20964) on macOS, but...