scipdf_parser

Python PDF parser for scientific publications: content and figures

Results 21 scipdf_parser issues

Hi, I encountered this issue when importing scipdf from a conda environment. I have several environments with spacy installed, each of them most likely at a slightly different version, with...

Hello, is there any functionality in the library to process PDFs in batches and/or use a GPU to accelerate computing? If not, what would be the go-to steps to...
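
For the batching question above, one possible starting point is to fan the per-file parse out over a thread pool. This is only a sketch, not a feature of the library: `parse_many` and `max_workers` are hypothetical names, and `parse_one` is assumed to be any callable that takes a path and returns a parsed result (for example `scipdf.parse_pdf_to_dict`, as shown in the project's README).

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Any, Callable, Dict, Iterable, Tuple


def parse_many(
    paths: Iterable[str],
    parse_one: Callable[[str], Any],
    max_workers: int = 4,
) -> Tuple[Dict[str, Any], Dict[str, Exception]]:
    """Run parse_one over paths concurrently; return (results, failures)."""
    results: Dict[str, Any] = {}
    failures: Dict[str, Exception] = {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Submit every path first so the pool can work on them in parallel.
        futures = {path: pool.submit(parse_one, path) for path in paths}
        for path, future in futures.items():
            try:
                results[path] = future.result()
            except Exception as exc:  # one bad PDF should not stop the batch
                failures[path] = exc
    return results, failures
```

Threads are used here on the assumption that most of the time is spent waiting on the GROBID service rather than in Python itself; if that does not hold in your setup, a process pool may be a better fit.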

Hi, I am trying to run GROBID via scipdf in my Python environment. However, I couldn't find a way to set the "consolidateHeader" option provided by GROBID. Can you please help if there...
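
Whether scipdf exposes this option is not stated in the snippet above. As a possible workaround, the sketch below bypasses scipdf and posts the PDF straight to a GROBID server's `processFulltextDocument` endpoint with `consolidateHeader` set as a form field. The function names, the default URL (`http://localhost:8070`, GROBID's usual local address) and the timeout are assumptions for illustration, and the third-party `requests` package is needed for the actual call.

```python
from typing import Any, Dict


def build_grobid_request(
    grobid_url: str = "http://localhost:8070",
    consolidate_header: int = 1,
) -> Dict[str, Any]:
    """Return the endpoint and form fields for a full-text request."""
    return {
        "url": grobid_url.rstrip("/") + "/api/processFulltextDocument",
        "data": {"consolidateHeader": str(consolidate_header)},
    }


def parse_with_consolidation(pdf_path: str, **kwargs: Any) -> str:
    """POST the PDF to GROBID and return the response body (TEI XML) as text."""
    import requests  # third-party; imported lazily so the builder works without it

    request = build_grobid_request(**kwargs)
    with open(pdf_path, "rb") as handle:
        response = requests.post(
            request["url"],
            files={"input": handle},  # GROBID expects the PDF in the "input" field
            data=request["data"],
            timeout=120,
        )
    response.raise_for_status()
    return response.text
```

The returned TEI XML would still need to be converted into a dictionary separately, since this path skips scipdf's own post-processing.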

Unfortunately I cannot share the PDF as it is proprietary: ``` KeyError Traceback (most recent call last) Cell In [6], line 8 6 for app in apps: 7 app =...

mypy throws these warnings. Are you interested in a PR that fixes this? ``` [...]/.local/lib/python3.10/site-packages/scipdf/pdf/parse_pdf.py:114: SyntaxWarning: "is not" with a literal. Did you mean "!="? if middlename is not "":...
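
The warning quoted above is the kind Python itself emits (since 3.8) when `is` or `is not` is used with a literal: those operators test object identity, not equality. A minimal sketch of the value-based comparison follows; `join_name` is a hypothetical helper written for illustration, not the actual code in `parse_pdf.py`.

```python
def join_name(first: str, middle: str, last: str) -> str:
    """Join name parts, skipping an empty middle name."""
    parts = [first]
    if middle != "":  # compares values; `is not ""` would compare identity
        parts.append(middle)
    parts.append(last)
    return " ".join(parts)
```

Writing `if middle:` would also work when any falsy value should be treated as "no middle name".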

grobid-0.6.2 $ ./gradlew run Configuration(s) specified but the install task does not exist in project :grobid-core. Configuration(s) specified but the install task does not exist in project :grobid-home. Configuration(s) specified...

Sometimes with scanned pages we get '[NO_BLOCKS] PDF parsing resulted in empty content' and with GROBID parsing errors we get '[GENERAL] An exception occurred while running Grobid.' to catch these...
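
One way to detect the two messages quoted above is to look for the bracketed code at the start of the returned text. The sketch assumes the message arrives as plain text in place of the expected XML; `grobid_error_code` is a hypothetical helper, not part of scipdf or GROBID.

```python
import re
from typing import Optional

# Matches a leading bracketed code such as "[NO_BLOCKS]" or "[GENERAL]".
_ERROR_PREFIX = re.compile(r"^\s*\[([A-Z_]+)\]")


def grobid_error_code(text: str) -> Optional[str]:
    """Return the bracketed error code at the start of text, or None."""
    match = _ERROR_PREFIX.match(text)
    return match.group(1) if match else None
```

A caller could then skip or log the file whenever the result is not `None`, instead of passing the text on to an XML parser.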

In PR #22 (https://github.com/titipata/scipdf_parser/commit/d7d537e0baea536ccbbc96ac052aa8497ce5d6bb), a new dependency, the fitz library, was added (line 9 of parse_pdf.py) but was not added to the dependencies list. As a result, when...
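
A missing import such as the one described above can be checked for up front. The sketch below uses only the standard library; `missing_modules` is a hypothetical helper for illustration.

```python
import importlib.util
from typing import Iterable, List


def missing_modules(names: Iterable[str]) -> List[str]:
    """Return the top-level module names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]
```

For example, `missing_modules(["fitz"])` returns `["fitz"]` when the module is absent. Note that `fitz` is the import name used by the PyMuPDF package, so that is the distribution to install.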

Hello everyone, I used to launch "scipdf_parser" on Google Colab and it worked so well! Today I tried to launch it again with the same commands, but it does...

Hey, I found your project and it was exactly what I needed, but it was in a rather crude form and the overall quality of the code did not seem satisfactory to me. So I...