Nasrin
### crawl4ai version

0.6.3

### Expected Behavior

-

### Current Behavior

**Version:** `Crawl4AI v0.6.3`

**Description:** When extracting raw text from a set of PDF URLs using `AsyncWebCrawler` with `PDFCrawlerStrategy` and...
Feature/scraping strategy - refactor: Remove WebScrapingStrategy and fix metadata extraction (#995)
## Summary

This PR refactors the content scraping strategy by removing the BeautifulSoup-based `WebScrapingStrategy` class and making `LXMLWebScrapingStrategy` the sole implementation. This simplifies the codebase by eliminating duplicate functionality while...
### crawl4ai version

0.7.7

### Expected Behavior

When using the embedding strategy in AdaptiveCrawler, the system should:

1. Take the user's query (e.g., "who is on the board of directors")...