core
The complete web scraping toolkit for PHP.
There are two separate crawlers that need to share the same login operation. Should this login step be made into a middleware or a processor? It is now a php...
**Describe the bug** When using the `ExecuteJavascriptMiddleware`, two requests are sent. One by Browsershot, the other by Guzzle. **Reproduction** My spider only has the `ExecuteJavascriptMiddleware` registered as downloader middleware. I...
Bumps [monolog/monolog](https://github.com/Seldaek/monolog) from 3.5.0 to 3.6.0. Release notes (sourced from monolog/monolog's releases), 3.6.0: Added `LineFormatter->setBasePath(...)`, which allows removing the project's path from the stack trace output (#1873). Added `$includeExtra` option...
Firstly, I really like this package. Really nice work! In my app, I want to act on whether the `ExecuteJavascriptMiddleware` drops a response. However, this is not...
I created it to avoid memory limit errors when crawling websites with many links. The downloader middleware drops requests too late, but this spider middleware...
**Describe the bug** According to the [docs](https://roach-php.dev/docs/repl), one can run `php vendor/bin/roach` to start Roach's Interactive Shell, but it actually requires a command (`roach:shell`) to be inserted just before...
Hey, first of all, nice work on this lib! Super easy to migrate from scrapy! :-D I was wondering how one could handle `SpiderInterface::parse()` if, for example, what I'm looking...
Bumps [league/container](https://github.com/thephpleague/container) from 4.2.4 to 5.1.0. Release notes (sourced from league/container's releases), 5.1.0: Added attribute-based resolution for dependencies using `#[Inject]` and `#[Resolve]` attributes (docs: https://container.thephpleague.com/5.x/attribute-resolution/). Support for PHPUnit 12...
This PR adds exception handling to the downloader and allows middleware to act upon caught exceptions. Personally, I need this for handling Javascript (Browsershot) exceptions, so I can retry them....
Bumps [ergebnis/php-cs-fixer-config](https://github.com/ergebnis/php-cs-fixer-config) from 6.45.0 to 6.46.0. Release notes (sourced from ergebnis/php-cs-fixer-config's releases), 6.46.0, What's Changed: Enhancement: Enable `PhpCsFixerCustomFixers/php_unit_requires_constraint` fixer by @localheinz in ergebnis/php-cs-fixer-config#1200 (full changelog: https://github.com/ergebnis/php-cs-fixer-config/compare/6.45.0...6.46.0). Changelog, sourced from ergebnis/php-cs-fixer-config's...