
I'm doing an interesting project, reviewing all the top-level pages on the
RSS 2.0 spec, checking every link to see if it's broken. For those that are, I try to come up with a suitable replacement, usually the archive.org snapshot taken around the time the page was written. If it isn't broken I don't do anything. Here's an
OPML file with all the links I've found. Most of the external links are gone, but there are some surprises, sites that have been well-preserved, for example
Jeremy Zawodny's. I just came across a link to the
first podcast feed, it's still there. I'm not sure how, but it is. This is a
restoration project, started because there was breakage caused by the move to HTTPS on the server. That breakage has been fixed. I also changed the line-endings from carriage-return to linefeed, which now is the almost-universal standard. At the time, 2003, it was still kind of muddy. As a result you can now read the source of the pages in a current browser. The site is a relic. It's a good idea to fix the broken links every 15 years or so.
💥#
I'll probably continue to post notes here as I go along. There were two crucial broken links on the home page of the spec: 1.
RFC 822, and 2.
Payloads for RSS, both of which are integral to the format. I tried to find a
solid link to the RFC that would stand the test of time. And I discovered that I had copied the Payloads doc, which was originally on a domain that I let go, to
the RSS site. So I'm going to use that link, instead of an archive.org link.
#
This project gives an idea of how important archive.org is going to be to web restorations. Their snapshots are invaluable. Most sites are very temporary things. A great example, I did a project with Jim Moore and Nicco Mele at the Dean campaign in January 2004, an aggregate of a lot of the Dean blogs in RSS format. The Dean campaign was a seminal event in the bootstrap of political blogging. Dean was the first "Web Candidate," certainly not the last. All the websites that the Dean people started are gone today, including the project I did with them, but they're all there on archive.org, so I was able to do a
decent restoration of that project. (Note the updated links are not present on the site yet.)
#
Radio UserLand,
before it was a blogging tool and RSS aggregator, was the "Programming environment for music on the Internet." It didn't take off, so we pivoted. Same codebase, very different application.
#
Andrew Grumet's
blog is still at its original address. Not surprising. ;-)
#
Nick Bradbury's
blog is still at its original address.
#
Jenny Levine's
blog is still at its original address.
#