ocr2text
ocr2text copied to clipboard
Convert a PDF via OCR to a TXT file in UTF-8 encoding
There are two errors, one is raise exceptions.ShellError(, the otherpipe.communicate(). Do you need to rely on other libraries?
# README Windows Directions Update ## Description Closes issue #2 by updating directions. Add a new step 16 between unzipping the project and running `pip install --user --requirement requirements.txt` which...
Bumps [pillow](https://github.com/python-pillow/Pillow) from 6.2.0 to 9.0.1. Release notes Sourced from pillow's releases. 9.0.1 https://pillow.readthedocs.io/en/stable/releasenotes/9.0.1.html Changes In show_file, use os.remove to remove temporary images. CVE-2022-24303 #6010 [@radarhere, @hugovk] Restrict builtins within...
There is a well-supported port of the Tesseract OCR engine at https://tesseract.projectnaptha.com/ Combined with using the application framework https://www.electronjs.org/, this could provide a path forward for creating a library-independent build...
I am facing below issue. 
Here's the output ``` pip install --user --requirement requirements.txt Collecting Pillow==6.2.0 Using cached Pillow-6.2.0.tar.gz (37.4 MB) Collecting pdf2image==1.9.0 Using cached pdf2image-1.9.0.tar.gz (7.4 kB) Collecting pytesseract==0.2.7 Using cached pytesseract-0.2.7.tar.gz (169 kB)...
Bumps [pillow](https://github.com/python-pillow/Pillow) from 6.2.0 to 10.0.1. Release notes Sourced from pillow's releases. 10.0.1 https://pillow.readthedocs.io/en/stable/releasenotes/10.0.1.html Changes Updated libwebp to 1.3.2 #7395 [@radarhere] Updated zlib to 1.3 #7344 [@radarhere] 10.0.0 https://pillow.readthedocs.io/en/stable/releasenotes/10.0.0.html Changes...