pdfparser
pdfparser copied to clipboard
PdfParser, a standalone PHP library, provides various tools to extract data from a PDF file.
# Type of pull request * [X] Bug fix (involves code and configuration changes) # About The current `(string)` replacement regexp in `formatContent()` only backchecked two characters for escaped slashes,...
pdfparser version:2.10.0 I have about 600,000 pdf files, all of which use pdfparser for text extraction.This kind of problem was shown to have been solved on 704, probably because of...
- PDFParser Version: 2.9 ### Description: I want to extract the PDF then save text to db and image to storage, but the order matters, if i take page 1,...
Symptom was that some documents' contents was rendering as a bunch of control characters. These are the untranslated strings. This was happening because for two different reasons, these strings weren't...
- PHP Version: 8.2 - PDFParser Version: 2.8 RC 2 ### Description: ### PDF input [在历史的风陵渡口 (高华) (Z-Library).pdf](https://github.com/smalot/pdfparser/files/13848997/Z-Library.pdf) ### Expected output & actual output I get an avalanche of weird...
Hello, Iwant to extract this pdf, but the result is empty. https://www.mediafire.com/file/azb7yddqo2ry55j/123.pdf/file this is my code ``` $parser = new \Smalot\PdfParser\Parser(); // Parse pdf file using Parser library $pdf =...
- PHP Version: PHP 8.3.0 (cli) (built: Nov 24 2023 13:48:03) (NTS) Copyright (c) The PHP Group Zend Engine v4.3.0, Copyright (c) Zend Technologies with Zend OPcache v8.3.0, Copyright (c),...
I try to extract all XObject (images) from the test pdf PNG Images not work.
The error we have in our logs comes from when we parse the data of the pages. We are using a PDF generated by the latest version of FPDF, version...
- PHP Version: 8.2 - PDFParser Version: 2.9 ### Description: I want to parse some CV, and I have sometimes wrong character. I would to try to parse correctly the...