Join GitHub today
GitHub is home to over 28 million developers working together to host and review code, manage projects, and build software together.Sign up
Import very old content from www.wincent.com #82
May be able to write some hacky script to get the HTML out of articles like this one. A lot of that old content is garbage but it does have some historical interest. I have articles spanning from around 2005 to 2008. (Actually, just found one as old as 2004.)
Possibly use Pandoc or something to convert to Markdown.
Will need import script that can put these on a branch somewhere, then rewrite the content branch to rebase the new content on top of the old content while preserving all the dates correctly.
Copying in some older notes I have:
Many URLs are obviously going to break. For example a blog post like:
Will get moved to a new home at a URL like:
The old page should become a 301 (permanent) redirect.