python-boilerpipe

fixed getImages RuntimeError when using boilerpipe-1.2.0.jar built from the latest source

Open benpryke opened this issue 10 years ago • 5 comments

The error that has been fixed was:

>>> from boilerpipe.extract import Extractor
>>> extractor = Extractor(extractor='ArticleExtractor', url='some-url')
>>> extractor.getImages()
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "C:\Program Files (x86)\Python 2.7.3\lib\site-packages\boilerpipe\extract\__init__.py", line 84, in getImages
    } for image in images
  File "C:\Program Files (x86)\Python 2.7.3\lib\site-packages\jpype\_jcollection.py", line 32, in __next__
    return next(self.iterator)
RuntimeError: No matching overloads found. at native\common\jp_method.cpp:121

This occurred when using a version of boilerpipe-1.2.0.jar built from the most recent source.

benpryke, May 22 '15 01:05

What can I say? If something comes to mind, write something.

mdfahad777, Aug 05 '16 19:08

@Ninjakannon If this PR is still applicable, please resolve the conflicts and update.

Also, please add a test case so this can be detected in the future.

tuxdna, Jan 13 '17 05:01

@tuxdna There are no conflicts and as far as I can tell, this repo has no test cases; it is unclear to me how to best go about adding them.

benpryke, Jan 15 '17 14:01

@Ninjakannon Oh right, there are no conflicts. Perhaps I confused this with other PRs in this project.

What would be the best way to replicate the issue that this PR addresses?

tuxdna, Jan 15 '17 14:01

@tuxdna Although I haven't used this for a while, I believe my original merge request comment (at the top) illustrates the bug. To replicate, one must pull the repo, compile boilerpipe-1.2.0.jar from source, extract a webpage, and call the getImages method.

benpryke, Jan 15 '17 16:01
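
The replication steps in the last comment could be wrapped in a small check, which might also serve as a starting point for the requested test case. This is only a sketch and not part of the PR: the helper names try_get_images and reproduce_with_boilerpipe are hypothetical, 'some-url' is the placeholder from the original report, and it assumes python-boilerpipe is installed against a boilerpipe-1.2.0.jar built from source. The Extractor class is passed in as an argument so the helper can also be exercised with a stand-in object when no JVM is available.

```python
def try_get_images(make_extractor, url):
    """Build an extractor for `url` and call getImages().

    Returns (images, error). `error` is None on success; otherwise it is
    the message of the RuntimeError raised by getImages().
    """
    extractor = make_extractor(extractor='ArticleExtractor', url=url)
    try:
        # list() forces full iteration in case a lazy sequence is returned.
        return list(extractor.getImages()), None
    except RuntimeError as exc:
        return None, str(exc)


def reproduce_with_boilerpipe(url='some-url'):
    """Run the check against the real library (needs JPype and the jar)."""
    # Imported here so the helper above stays usable without boilerpipe.
    from boilerpipe.extract import Extractor
    images, error = try_get_images(Extractor, url)
    if error is not None:
        print('getImages failed: %s' % error)
    else:
        print('getImages returned %d image(s)' % len(images))
    return error is None
```

Calling reproduce_with_boilerpipe() with a real page URL before and after applying the fix should show whether the RuntimeError from the report still occurs.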