http-crawler
http-crawler copied to clipboard
Allow user to choose which links to follow
We currently follow all links, but in some cases this might not be appropriate
We should find a way to allow the user to configure which links to follow.
I've just added the follow_external_links option to crawl, which defaults to True. I think this addresses the likely main use case for this.