
Items received twice for some feeds

Open vincentbernat opened this issue 8 years ago • 3 comments

Hey!

I am subscribed to the following feed: http://hackaday.com/feed/

Most of the items appear twice in my list, although the most recent one may be present only once. I wonder whether this is due to PubSubHubbub. Maybe the received update doesn't exactly match the RSS content?

vincentbernat avatar Aug 05 '17 16:08 vincentbernat

I get something similar for https://threatpost.com/feed. Items usually appear twice, and often as many as four times.


djmoch avatar Aug 06 '17 12:08 djmoch

Duplicates in feeds are usually resolved using the "guid" in the feed source. When a guid is provided, it's supposed to be the unique key for a given entry.
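As a rough illustration of guid-based deduplication (a minimal sketch, not FeedHQ's actual code): entries are assumed to be plain dicts with a `guid` key, and the fallback to `link` when no guid is present is an assumption made for this example. `dedupe_by_guid` is a hypothetical helper name.

```python
def dedupe_by_guid(entries):
    """Keep the first entry seen for each guid, preserving order.

    Assumes each entry is a dict. Falling back to "link" when "guid"
    is missing is an illustrative choice, not FeedHQ behaviour.
    """
    seen = set()
    unique = []
    for entry in entries:
        key = entry.get("guid") or entry.get("link")
        if key is None:
            # Nothing to compare on, so keep the entry rather than guess.
            unique.append(entry)
            continue
        if key in seen:
            continue
        seen.add(key)
        unique.append(entry)
    return unique
```

With this approach, two entries with the same content but different guids are treated as distinct items, which is how duplicates can appear.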

This feed's guids look like http://hackaday.com/?p=266498 or http://hackaday.com/?p=267092 in the feed source, which looks correct.

However, once stored in FeedHQ, duplicates appear with guids that look like http://hackaday.com?p=258890&preview=true&preview_id=258890.

This could be a bug in either WordPress's preview mechanism or WordPress's PubSubHubbub plugin (both of your examples are WP sites). I don't know whether the duplicates come from PubSubHubbub pushes or from the periodic, pull-based updates, though PubSubHubbub seems more likely. The WP plugin looks unmaintained.

I could work around this by discarding pubsubhubbub updates corresponding to a wordpress preview…
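One way such a workaround might look, as a sketch only: it assumes pushed entries are dicts with a `guid` key, and that a preview guid can be recognised by the `preview=true` or `preview_id` query parameters seen in the example above. The names `is_wordpress_preview` and `drop_preview_entries` are hypothetical.

```python
from urllib.parse import parse_qs, urlsplit


def is_wordpress_preview(guid):
    """Return True if the guid's query string looks like a WordPress preview.

    The check is based only on the guid pattern quoted in this thread
    (preview=true and preview_id=...); other preview forms may exist.
    """
    query = parse_qs(urlsplit(guid).query)
    return query.get("preview") == ["true"] or "preview_id" in query


def drop_preview_entries(entries):
    """Discard pushed entries whose guid points at a preview URL."""
    return [
        entry for entry in entries
        if not is_wordpress_preview(entry.get("guid", ""))
    ]
```

Filtering on the guid rather than rewriting it avoids guessing the canonical guid for a preview; the regular entry should still arrive through the normal feed.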

brutasse avatar Aug 07 '17 07:08 brutasse

A workaround would be great. I have this problem with 2-3 websites and it has become very annoying. No other feed reader I have used or tried over the last few weeks has this problem.

sgoehre avatar Mar 18 '18 11:03 sgoehre