raven-reader icon indicating copy to clipboard operation
raven-reader copied to clipboard

"Full Content" does not fetch the entire article

Open niveK77pur opened this issue 2 years ago • 0 comments

Describe the bug Feeds from the https://arstechnica.com/ site only have the first few paragraphs or sections scraped when using Full Content. The rest of the article very consistently does not appear. It requires using View original to see everything until the end.

To Reproduce Steps to reproduce the behavior:

  1. Add feed: http://feeds.arstechnica.com/arstechnica/index
  2. Fetch articles
  3. Compare the contents from Full Content and View original
  4. See how Full Content's text stops midway
  5. If 4. could not be observed, find another (longer) article and repeat from 3.

Expected behavior The article should be scraped in its entirety.

Screenshots The end of an article as seen using Full Content VinLudensScreenshot

The same passage in the article as seen using View original (also see the scroll bar, the article goes on for much longer) VinLudensScreenshot

Desktop:

  • OS: ArcoLinux (Arch)
  • Browser: Raven built-in (?)
  • Version: 1.0.79

Additional context

It appears as though content stops being shown after the 2nd or 3rd advertisement.

To be noted is that their articles tend to be quite long. So far, ArsTechnica is the most prominent one where I observe this behavior, so I am not too sure if it is an isolated case.

niveK77pur avatar May 30 '23 09:05 niveK77pur