Makdi

Results 6 issues of Makdi

### Which package is this bug report for? If unsure which one to select, leave blank @crawlee/core ### Issue description We have noticed a warning message in our crawlee logs,...

bug
t-tooling

### Which package is this bug report for? If unsure which one to select, leave blank @crawlee/http (HttpCrawler) ### Issue description Error ``` makdi-test | DEBUG HttpCrawler:SessionPool: No 'persistStateKeyValueStoreId' options...

bug
t-tooling

### Which package is the feature request for? If unsure which one to select, leave blank @crawlee/core ### Feature I'm using puppeteer crawler with `keepAlive` as `true` and `crawler.run()` (without...

feature
t-tooling

Example for forbes.com robots txt https://www.forbes.com/robots.txt They have blocked all paths for `GPTBot` ``` User-agent: GPTBot Disallow: / ``` However for url `https://www.forbes.com/test` ```java public boolean canCrawl(String url, String userAgent,...

robots

### Which package is the feature request for? If unsure which one to select, leave blank @crawlee/http (HttpCrawler) ### Feature Hi, I see with both HttpCrawler and PuppeteerCrawler, ProxyConfiguration is...

feature
t-tooling

Does adding multiple puppeteer blockers on single puppeteer [page](https://pptr.dev/api/puppeteer.page/) object work Example both ```javascript await PuppeteerBlocker.fromLists(fetch, [ 'https://secure.fanboy.co.nz/fanboy-annoyance.txt' ]).then(async (blocker) => { await blocker.enableBlockingInPage(page); }); await PuppeteerBlocker.fromPrebuiltAdsOnly(fetch).then(async (blocker) => {...