rsspls icon indicating copy to clipboard operation
rsspls copied to clipboard

Feature request: add article scraping to extract website contents for item summaries

Open jkramer opened this issue 10 months ago • 2 comments

Would be great if rsspls could optionally fetch the parsed URL, extract the contents using a library like article_scraper or readability for example and use that as the summary.

jkramer avatar Apr 09 '25 15:04 jkramer

Are you able to clarify further on how this suggestion would work? rsspls already fetches the URL specified for each feed in the configuration file.

wezm avatar May 28 '25 06:05 wezm

What I meant is to fetch every URL found by the feed.config.link selector, extract the contents of each page and use that as summary (instead of feed.config.summary). I've done this myself in the meantime as a Rust exercise, feel free to use any of the code if you want to implement the feature: https://codeberg.org/jkramer/snugrss

jkramer avatar May 28 '25 11:05 jkramer

Since you have solved your use case I'm going to close this.

wezm avatar Oct 04 '25 04:10 wezm