ocr2text

Convert a PDF via OCR to a TXT file in UTF-8 encoding
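
As a rough illustration of what that description involves, here is a minimal sketch that assumes the pinned dependencies listed in the issues (pdf2image and pytesseract) plus a local Tesseract and Poppler install. The function names `write_pages_as_utf8` and `pdf_to_text` are hypothetical and are not taken from the ocr2text source; the project's actual implementation may differ.

```python
from pathlib import Path


def write_pages_as_utf8(pages, txt_path):
    """Join per-page text and write it to txt_path in UTF-8 (hypothetical helper)."""
    text = "\n\n".join(page.strip() for page in pages) + "\n"
    out = Path(txt_path)
    out.write_text(text, encoding="utf-8")
    return out


def pdf_to_text(pdf_path, txt_path, lang="eng"):
    """Render each PDF page to an image, OCR it, and save the result as UTF-8 text.

    Assumption: pdf2image (needs Poppler) and pytesseract (needs the Tesseract
    binary) are installed. They are imported lazily so the helper above stays
    usable without them.
    """
    from pdf2image import convert_from_path
    import pytesseract

    images = convert_from_path(str(pdf_path))  # one PIL image per page
    pages = [pytesseract.image_to_string(image, lang=lang) for image in images]
    return write_pages_as_utf8(pages, txt_path)
```

A call might look like `pdf_to_text("scan.pdf", "scan.txt")`, where both file names are placeholders. Writing with an explicit `encoding="utf-8"` avoids depending on the platform default, which matters on Windows.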

Results 7 ocr2text issues

There are two errors: one is at `raise exceptions.ShellError(`, the other at `pipe.communicate()`. Do you need to rely on other libraries?

# README Windows Directions Update
## Description
Closes issue #2 by updating directions. Add a new step 16 between unzipping the project and running `pip install --user --requirement requirements.txt` which...

Bumps [pillow](https://github.com/python-pillow/Pillow) from 6.2.0 to 9.0.1. Release notes sourced from pillow's releases. 9.0.1 (https://pillow.readthedocs.io/en/stable/releasenotes/9.0.1.html) changes: In show_file, use os.remove to remove temporary images. CVE-2022-24303 #6010 [@radarhere, @hugovk] Restrict builtins within...

dependencies

There is a well-supported port of the Tesseract OCR engine at https://tesseract.projectnaptha.com/ Combined with using the application framework https://www.electronjs.org/, this could provide a path forward for creating a library-independent build...

enhancement

I am facing the issue below. ![1614771664371](https://user-images.githubusercontent.com/79977734/109800723-90449980-7c43-11eb-9a8b-403a5e3e4919.jpg)

Here's the output:
```
pip install --user --requirement requirements.txt
Collecting Pillow==6.2.0
  Using cached Pillow-6.2.0.tar.gz (37.4 MB)
Collecting pdf2image==1.9.0
  Using cached pdf2image-1.9.0.tar.gz (7.4 kB)
Collecting pytesseract==0.2.7
  Using cached pytesseract-0.2.7.tar.gz (169 kB)
...
```

Bumps [pillow](https://github.com/python-pillow/Pillow) from 6.2.0 to 10.0.1. Release notes sourced from pillow's releases. 10.0.1 (https://pillow.readthedocs.io/en/stable/releasenotes/10.0.1.html) changes: Updated libwebp to 1.3.2 #7395 [@radarhere]; updated zlib to 1.3 #7344 [@radarhere]. 10.0.0 (https://pillow.readthedocs.io/en/stable/releasenotes/10.0.0.html) changes...

dependencies