boilerpipe
Automatically exported from code.google.com/p/boilerpipe
Boilerpipe
Automatically exported from http://code.google.com/p/boilerpipe, then lightly maintained: the wiki pages were extracted by hand and edited.
How to build and use
To build, run:
ant
To use, run:
java -jar /path/to/boilerpipe-core/dist/boilerpipe-1.2-dev.jar ./example.html out.txt
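To convert several pages at once, the same command can be wrapped in a small shell loop. This is only a sketch: the `BOILERPIPE_JAR` variable and the `out_name` and `extract_dir` helpers are hypothetical names introduced here, and it assumes the jar accepts an input file and an output file exactly as in the command above.

```shell
#!/bin/sh
# Assumed location of the jar built by `ant`; adjust to your checkout.
BOILERPIPE_JAR="${BOILERPIPE_JAR:-/path/to/boilerpipe-core/dist/boilerpipe-1.2-dev.jar}"

# Hypothetical helper: map ./pages/a.html to ./pages/a.txt.
out_name() {
    printf '%s\n' "${1%.html}.txt"
}

# Hypothetical helper: run the documented command once per .html file
# found directly inside the directory given as the first argument.
extract_dir() {
    for f in "$1"/*.html; do
        [ -e "$f" ] || continue   # skip when the glob matches nothing
        java -jar "$BOILERPIPE_JAR" "$f" "$(out_name "$f")"
    done
}
```

After sourcing the file, `extract_dir ./pages` would write one `.txt` file next to each `.html` file in `./pages`.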
Old QuickStart
Boilerpipe is an HTML content extraction tool. Check out QuickStart.