We're living in the age of Github. The company -- which has the same name as its central product -- emits a certain species of unalloyed nerd good. It's the kind of software people spend their lives in -- and it's the kind of software the shapes how people think.
Now they're making a play for nerds who don't write code.
For the uninitiated: Github helps programmers (and teams of programmers) keep track of different versions of code, an important task when hundreds of people might be editing the same code base or piece of software. It can compare two different code snippets and tell you what the differences between them are, and who made those changes. It can tell you how all the changes in a project compare to each other. And it can, in the service of that one, huge, change-tracking feature, do a ton of other things too.* The New Yorker published an online introduction to the service earlier this month.
So, Github is a singular phenomenon in the world of code. But three features it's launched in the past year indicate it's moving beyond that world to other quantitative domains.
In the past, if a user uploaded plain text or an image to Github, the service could display them as they appear. This goes for images, too, so if a user uploaded an image, then changed it, it could even display the differences between those changes:
But this year, it's begun to represent more than just text and images. In April, the website began rendering STL files, a file used by 3D modeling software, in 3D. Instead of seeing a series of numbers and descriptions, Github users would now see the object those data described. Earlier this summer, it began representing certain kinds of geographical data, not as text, but as maps, with the help of the Washington, DC-based startup Mapbox.
And, on Thursday, it began displaying tabular data, described in text, as tables. "Tabular data described as text" may sound like an obscure phenomenon, but it's not: Excel easily exports either of the two file-types Github can now display. Today's feature means any Excel file (or at least, any Excel file without internal equations) can enter the domain of Github.
What does that mean? Turns out someone's already been thinking about this. Talking to Northwestern University's Knight Lab a month ago, designer and author Clay Johnson said, "I think the rest of 2013 is going to be a big year for GitHub":
Their latest move to easily allow people to render maps in git repositories blew my mind because it begins the transition to a more interesting audience. There are two obvious steps for GitHub here: the first is creating viewers for other kinds of files -- like perhaps the csv. The second one -- and the one that GitHub is very well suited for -- is showing the diffs.
Check out that last link, and imagine what's going to happen... when you start isolating the differences between two versions of a CSV. The interesting stories in data aren't in the rows, they're in the differences between them. And I think GitHub is getting ready to make as much of a play in data as it has in code.
Johnson nailed it, and it's easy to envision the rest of Github's feature set, especially version- and change-tracking, coming to models, maps and tables. Even before yesterday's news, organizations had already uploaded tabular data to Github: they just didn't expect it to look like tables. (Organizations uploaded maps before the site could render maps, too -- organizations like the City of Chicago.) Github, with its feature set and pre-existing large user base, could become the de facto home of data posted to the Internet.
* To wit:
- It lets users annotate code, marking areas that don't work or need improvement.
- It allows users to correspond with each other, so many companies conduct internal communications through it.
- It provides free hosting for open-source software projects.
- It hosts the sites made with the blogging software Jekyll for free.