Makdi
Makdi
### Which package is this bug report for? If unsure which one to select, leave blank @crawlee/core ### Issue description We have noticed a warning message in our crawlee logs,...
### Which package is this bug report for? If unsure which one to select, leave blank @crawlee/http (HttpCrawler) ### Issue description Error ``` makdi-test | DEBUG HttpCrawler:SessionPool: No 'persistStateKeyValueStoreId' options...
### Which package is the feature request for? If unsure which one to select, leave blank @crawlee/core ### Feature I'm using puppeteer crawler with `keepAlive` as `true` and `crawler.run()` (without...
Example for forbes.com robots txt https://www.forbes.com/robots.txt They have blocked all paths for `GPTBot` ``` User-agent: GPTBot Disallow: / ``` However for url `https://www.forbes.com/test` ```java public boolean canCrawl(String url, String userAgent,...
### Which package is the feature request for? If unsure which one to select, leave blank @crawlee/http (HttpCrawler) ### Feature Hi, I see with both HttpCrawler and PuppeteerCrawler, ProxyConfiguration is...
Does adding multiple puppeteer blockers on single puppeteer [page](https://pptr.dev/api/puppeteer.page/) object work Example both ```javascript await PuppeteerBlocker.fromLists(fetch, [ 'https://secure.fanboy.co.nz/fanboy-annoyance.txt' ]).then(async (blocker) => { await blocker.enableBlockingInPage(page); }); await PuppeteerBlocker.fromPrebuiltAdsOnly(fetch).then(async (blocker) => {...