A fork of boilerpipe with Python 3 support and small fixes, ported from https://pypi.python.org/pypi/boilerpipe-py3.

Results 6 boilerpipe3 issues

Hi! I've been using Boilerpipe with [Bitextor](https://github.com/bitextor/bitextor), and everything has worked fine. However, when I processed a PDF file, specifically [this one](https://ec.europa.eu/clima/system/files/2016-11/analysis_appendix_en.pdf), I ran out of...

Build machines do not have access to the external network, so fetching from the github.com repo directly breaks the build. Changed the reference to point to the zipped boilerpipe library locally.

Using the Extractor leads to the following error, triggered by the lack of the keyword argument `convertStrings` when boilerpipe launches the JVM: "Deprecated: convertStrings was not specified when starting the..."
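The warning above comes from JPype, whose `startJVM` accepts a `convertStrings` keyword. A minimal sketch of passing it explicitly follows. The helper names `jvm_start_kwargs` and `start_jvm_once` are hypothetical and not part of boilerpipe3, and choosing `True` (the legacy behaviour, where Java strings come back as Python `str`) is an assumption rather than the fix the project adopted.

```python
def jvm_start_kwargs(classpath, convert_strings=True):
    """Build keyword arguments for jpype.startJVM (hypothetical helper).

    convert_strings=True keeps the legacy behaviour; JPype's own
    documentation recommends False for new code.
    """
    return {"classpath": classpath, "convertStrings": convert_strings}


def start_jvm_once(classpath):
    """Start the JVM only if it is not already running."""
    import jpype  # imported lazily so the helper above works without JPype

    if not jpype.isJVMStarted():
        jpype.startJVM(jpype.getDefaultJVMPath(), **jvm_start_kwargs(classpath))
```

Passing the flag explicitly, whichever value is chosen, is what silences the deprecation message.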

Pass the argument to the `KeepEverythingWithMinKWordsExtractor` Java constructor. The problem is better explained in issue https://github.com/slaveofcode/boilerpipe3/issues/1.
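One way to picture this change is a small lookup that decides which keyword arguments are forwarded to an extractor's constructor. This is only an illustrative sketch: the `CONSTRUCTOR_ARGS` table and the `constructor_args` function are hypothetical names, not the code in the pull request, and it assumes `kMin` is the only constructor argument involved.

```python
# Assumed mapping: extractor name -> constructor argument names, in order.
CONSTRUCTOR_ARGS = {
    "KeepEverythingWithMinKWordsExtractor": ("kMin",),
}


def constructor_args(extractor_name, **kwargs):
    """Return the positional arguments to pass to the Java constructor."""
    names = CONSTRUCTOR_ARGS.get(extractor_name, ())
    missing = [name for name in names if name not in kwargs]
    if missing:
        raise TypeError(
            f"{extractor_name} requires argument(s): {', '.join(missing)}"
        )
    return tuple(kwargs[name] for name in names)
```

Extractors without an entry in the table, such as `ArticleExtractor`, get an empty tuple and are constructed as before.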

Hi, When I try to extract an article from varzesh3.com (for example https://www.varzesh3.com/news/1554055/) I get this error: ``` Traceback (most recent call last): File "", line 1, in NameError: name...

First, thanks for the port. When trying to use `KeepEverythingWithMinKWordsExtractor`, I get the error: ``` Traceback (most recent call last): File "test.py", line 4, in extractor = Extractor(extractor='KeepEverythingWithMinKWordsExtractor', url=url, kMin=20)...