Skip to content

mobeets/rss-replay

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Goal: Play back older posts in an RSS feed. (Why? Assume you like reading blogs in your RSS reader, but you'd like to read older blog posts as well.)

Problem: RSS feeds don't typically store their older posts. Google Reader used to archive old posts, but it no longer exists.

Solution: If a blog's RSS feed has been regularly archived on web.archive.org, we can read the archived RSS feeds to find older posts.

First, check to make sure that archives of the feed are available, using this. Then, get a list of timestamps of archives using cdx.

Usage

Inputs

Form

  • pick feed; start date, end date of posts
  • pick frequency of posting

Outputs

  • check to make sure rss path exists on web.archive.org
  • add entry to database, if duplicate does not already exist
  • if new entry was added, create empty rss at new feed url (i.e., set up routing to serve this url)
  • show user new feed url

Task database

  • input feed url, output feed url, start date, end date, post frequency, date of last post, last web.archive timestamp posted, last post id posted

Feed database

  • output feed url, rss content

Processing

  • should run once a day, and go through every item in database
  • for each item in Task database:
    • if "date of last post" is empty, create post; otherwise, check "date of last post" and "post frequency" to see if new post is needed
    • fetch posts using "last web.archive timestamp posted", and try to find next post id after "last post id posted"; if not, fetch the next web.archive timestamp and repeat
    • create new post by updating Feed database

About

Play back older posts in an RSS feed

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published