June 19, 2012
By Craig Silverman:
It’s been said that the Internet never forgets, but that doesn’t necessarily mean it’s easy to recall something from minutes or even seconds before.
Web pages, articles, blog posts and other content are easily altered, changed, deleted. And it’s not always clear what’s new or has been removed. Cached versions of content exist, but it can be tough to locate them.
To help address this issue, a group of participants in this weekend’s Knight Mozilla MIT hackathon have launched a new tool that makes it easier to track changes made to content from two news organizations.
I emailed them last night when they had a rough version of the site online, and they worked until around 5 a.m. this morning to create a more functional website. You can now browse a selection of the changes made to articles that appear on the homepages of CNN and The New York Times, starting yesterday.
“Sometimes the changes are minor — small edits in language or correction of spelling mistakes,” they write on the project website. “Other times, the stories change and evolve rapidly, as a result of breaking news. Occasionally, the lede and substance of an article changes, as in the example to the right.”
Here’s the example they point to, which went viral back in the fall during Occupy Wall Street:
NewsDiffs follows in the tradition of ProPublica’s ChangeTracker, which lets you track changes to pages on the White House website, and the recently announced Politwoops from the Sunlight Foundation, which lets you track the tweets deleted by Twitter accounts belonging to politicians.
The common thread that runs through these tools is that they expose changes or capture deleted content that is otherwise rarely visible to or trackable by the public.
The NewsDiffs team writes that their work “is inspired by the version control tracking used in computer programming.” They called the project NewsDiffs because, as Lee said in an email, “Diff is an incredibly common programming term that lets you compare versions of files.” (I sent them other questions and will add their responses after they’ve had time to catch up on sleep.)
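For readers unfamiliar with the term, here is a minimal sketch of what a diff looks like, using the `difflib` module from Python's standard library. The two article versions are invented for illustration, and the `diff_versions` helper is a hypothetical name. Nothing here is taken from NewsDiffs itself, whose implementation the team has not described in this post.

```python
import difflib


def diff_versions(old_text, new_text):
    """Return a unified diff, as a list of lines, between two versions of a text."""
    return list(
        difflib.unified_diff(
            old_text.splitlines(),
            new_text.splitlines(),
            fromfile="earlier version",
            tofile="later version",
            lineterm="",
        )
    )


# Hypothetical article text, invented for illustration only.
earlier = "City council delays vote on budget.\nThe meeting ran late into the night."
later = "City council approves budget.\nThe meeting ran late into the night."

changes = diff_versions(earlier, later)
for line in changes:
    print(line)
```

In the printed result, lines starting with `-` appear only in the earlier version, lines starting with `+` appear only in the later one, and lines starting with a space are unchanged context.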
The use of versioning in software development was also the inspiration for an argument made by journalist Scott Rosenberg that news organizations should expose the revision history of online content. He also referenced the “view history” tab on Wikipedia articles, which enables anyone to see how a page has evolved over time and who made edits. In his words:
“Let readers see the older versions of stories. Let them see the diffs. Toss no text down the memory hole, and trigger no Orwell alarms.
“Versioning should be the model for how we present the evolution of news stories on the Web. In fact, it makes so much sense that, even though right now no one is using it, I’m convinced it will become the norm over the next decade.”
Rosenberg later followed up to announce the release of a WordPress plugin to enable anyone using that CMS to expose the version history of a post.
NewsDiffs is fun to play around with, and it will be interesting to see how people make use of the data/changes it collects.