nodejs-web-scraper
exclude content based on a list of tags, instead of removeStyleAndScriptTags
There are a few use cases which aren't covered by the removeStyleAndScriptTags option:
- exclude other noise content, e.g. <svg>
- exclude style, but leave scripts
- exclude scripts which don't have
Would you please consider adding a more generic option, removeTags? Ideally, it would also support attributes, but just tag names would be very useful anyway.
Usage:
const config = {
  removeTags: ['style', 'link', 'script', 'svg']
};
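To illustrate the behavior I have in mind, here is a rough, dependency-free sketch. The `removeTags` function below is hypothetical and is not part of nodejs-web-scraper; it uses a naive regex purely to show the intended result, and a real implementation would more likely remove the elements through a proper HTML parser.

```javascript
// Hypothetical helper (an assumption, not the library's API): strips every
// listed tag, together with its contents, from an HTML string.
function removeTags(html, tagNames) {
  return tagNames.reduce((result, tag) => {
    // Escape the tag name so it is safe to embed in a RegExp.
    const name = tag.replace(/[.*+?^${}()|[\]\\]/g, '\\$&');
    // Paired elements: <tag ...> ... </tag>
    const paired = new RegExp(`<${name}\\b[^>]*>[\\s\\S]*?<\\/${name}\\s*>`, 'gi');
    // Leftover void or self-closing elements, e.g. <link ...> or <svg/>
    const single = new RegExp(`<${name}\\b[^>]*>`, 'gi');
    return result.replace(paired, '').replace(single, '');
  }, html);
}

const page =
  '<html><head><style>p{}</style><link rel="x"></head>' +
  '<body><svg><path/></svg><p>Hi</p><script>1</script></body></html>';

// Same list as in the config above.
const cleaned = removeTags(page, ['style', 'link', 'script', 'svg']);
console.log(cleaned);
```

Passing only `['style']` would leave the scripts in place, which covers the second use case above. Matching on attributes would need a parser-based approach and is out of scope for this sketch.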
Thanks
Hey, feel free to maybe do it yourself, and make a pull request, thus contributing :D