[Bug]: a gist file link had been 404 error in document. (sample_ecommerce.html)
crawl4ai version
document website
Expected Behavior
link html , fine
Current Behavior
404 error
Is this reproducible?
Yes
Inputs Causing the Bug
While learning JSON schema, I'm stuck and trying to understand through entire example.
[post](https://docs.crawl4ai.com/extraction/no-llm-strategies/)
[404 error link](https://gist.githubusercontent.com/githubusercontent/2d7b8ba3cd8ab6cf3c8da771ddb36878/raw/1ae2f90c6861ce7dd84cc50d3df9920dee5e1fd2/sample_ecommerce.html)
Steps to Reproduce
1. [post](https://docs.crawl4ai.com/extraction/no-llm-strategies/)
2. find [404 error link](https://gist.githubusercontent.com/githubusercontent/2d7b8ba3cd8ab6cf3c8da771ddb36878/raw/1ae2f90c6861ce7dd84cc50d3df9920dee5e1fd2/sample_ecommerce.html)
Code snippets
OS
macos
Python version
fine
Browser
chrome
Browser version
No response
Error logs & Screenshots (if applicable)
No response
@weykon I'm unable to understand what exactly is the issue here. Could you share a code snippet for the issue you are facing!
In the official documentation, at https://docs.crawl4ai.com/extraction/no-llm-strategies/, the third point 3. Advanced Schema & Nested Structures contains an HTML example, which is a link, but it is currently invalid. (I assume that it used to exist).
@Ahmed-Tawfik94 We need to update the documentation here: https://docs.crawl4ai.com/extraction/no-llm-strategies/#3-advanced-schema-nested-structures