Reasat
Hi @Mephisto405 Isn't it weird that the query strategy by ground-truth losses is performing so poorly? Theoretically, this strategy should be as good as ll4al or better. Do you...
@unclecode I am not sure the chunking code written [here](https://github.com/unclecode/crawl4ai/blob/dde14eba7db2de240d7a1dc80f436f5c821571e8/crawl4ai/extraction_strategy.py#L683works) works as intended. If a document has a total token count greater than the chunk_threshold, it does not chunk the doc...
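For reference, here is a minimal sketch of the behavior I would expect from threshold-based chunking. This is not the crawl4ai implementation: the function name `chunk_by_threshold` is hypothetical, and whitespace-separated words stand in for real tokens. The only point is that a document over the threshold should come back as several chunks, each at or under the limit.

```python
def chunk_by_threshold(text: str, chunk_threshold: int) -> list[str]:
    """Split text into chunks of at most chunk_threshold tokens.

    Assumption: a "token" is a whitespace-separated word, which is only a
    rough stand-in for whatever tokenizer the real code uses.
    """
    if chunk_threshold <= 0:
        raise ValueError("chunk_threshold must be positive")

    tokens = text.split()

    # A document at or below the threshold stays as a single chunk.
    if len(tokens) <= chunk_threshold:
        return [" ".join(tokens)] if tokens else []

    # Otherwise, slice the token list into consecutive fixed-size windows.
    return [
        " ".join(tokens[i:i + chunk_threshold])
        for i in range(0, len(tokens), chunk_threshold)
    ]


if __name__ == "__main__":
    doc = " ".join(f"w{i}" for i in range(10))
    for chunk in chunk_by_threshold(doc, 4):
        print(chunk)
```

With a 10-word document and a threshold of 4, this yields three chunks, none longer than 4 words.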
@unclecode thanks a lot for the description! I'll check out the implementation in the next branch.