• "I thought this was a fascinating take on the need within companies for stories… Companies spend a lot of money looking for these stories. Traditional product companies had to ask people and users to tell their stories, normally through market research. Web companies are at a huge advantage: they have rivers of usage data flowing through their servers, and the problem inverses – how to make sense and tease out meaning and interest from such a torrent." This is very good; I'm looking forward to future installments.
  • "If Ferelden has room for priests, elves, mages and golems then why doesn’t it have room for sceptics and scientists too?" Lovely notion – roleplaying an aetheist in Dragon Age (as best possible within the game). In this case, the player character believes in magic, but not in the montheist religion that much of the world ascribes to; miracles are really just magic at work. Subsitute "magic" for "science" and you begin to see his point. It's a nicely thought-through piece.
  • "The closer has confounded hitters with mostly one pitch: his signature cutter." Lovely motion infographics – informative, and powerfully confirming the narration.
  • "The move during the past 10 years or so has been from cameras being precision mechanical devices to molded polycarbonate containers for electronic components. This has meant a lowering of overall physical quality. What one gets in terms of features, functions and image quality is higher than ever before, but the satisfaction of owning and using a high quality mechanical and optical device has for the most part evaporated. Only the top models within any brand produce a tactile satisfaction and please one's esthetic sense." The quotation is from Michael Reichmann; the discussion that follows is as thoughtful as usual from TOP's readers.
  • And, with their 119th update, Valve helpfully included the list of all their previous patches, as well. Just look at the amount that's changed – and how swiftly. A proper, living game (unlike the stillborn 360 version). Can't wait to play it on the Mac; it's almost like a different game to the one I played at the beginning.
  • "Ben Gimpert is a friend of the Open Library. He and I got together over lunch a few months ago to talk about big data, statistical natural language processing, and extracting meaning from Open Library programmatically. His efforts are beginning to bear some really interesting fruit, and while we work out how we might be able to present it online, we thought you might be interested to hear what he’s been up to." Answer: good things. Ben is awesome, and this work sounds great. (I can't quote a suitable passage, so George's intro will have to do).
  • A few short tips on find; one of the bash tools I use least, and should probably use more.
  • "Diller, Brill, and Murdoch seem be stating a simple fact—we will have to pay them—but this fact is not in fact a fact. Instead, it is a choice, one its proponents often decline to spell out in full, because, spelled out in full, it would read something like this: “Web users will have to pay for what they watch and use, or else we will have to stop making content in the costly and complex way we have grown accustomed to making it. And we don’t know how to do that.”"
  • "ASBOrometer is a mobile application that measures levels of anti-social behaviour at your current location (within England and Wales) and gives you access to key local ASB statistics… This app was created by Jeff Gilfelt and made possible by the data.gov.uk initiative, which is opening up UK government data for public reuse." What sensationalist rot; no number of pretty visualisations make this kind of fearmongering acceptable. It's nice that the data is open; it's a shame this is the best thing people can think to do with it. Whether you like it or not, this information is very, very loaded.

eg2k9.png

I really liked last.fm’s end-of-year charts. I also really liked this analysis of Pitchfork’s scoring in 2009, just for the statistical fun. And then I thought about my favourite score-oriented website, and wondered why no-one’s done this for Eurogamer yet. I’d be the first to argue that scores in reviews aren’t that important – but everybody loves arguing about them in comments threads, and besides, they make for interesting statistics. What I’d really like would be something like the Pitchfork analysis, but looking a bit more like the last.fm site.

And then after two days I just decided to build it myself.

It’s relatively straightforward: a small app to explore a year’s worth of review scores, built around the pillars of reviews, writers, and scores. Most blue things are clickable; writers have pages that show their reviews, as well as their own averages, deviance from EG’s norm, and the scale of their contribution to the overall average. That latter figure is something I call influence; it took a long while to get to, and I write about it more here. Here’s Simon Parkin’s page as a good example of a writer’s page.

Reviews also have pages – here’s one for Modern Warfare 2, which show how the review compares to the site’s average, the writer’s average, and also to Metacritic. And, of course, you can see just how many games scored 7 – or any other score – if you want. Basically: have a click around.

I started two weeks ago, and guess I stopped committing in the middle of last week, but towards the endit was just front-end tweaks. It’s not been a big project at all – about an hour or two’s work a day on average, in evenings, and lunch-hours, over about ten days.

It’s not a very advanced project, and touches lots of bases I’m working with a lot right now – data analysis, visualisation, scraping. That said, it’s got some interesting stuff under the hood. I’m using Typekit for the attractive type, and it’s been a pleasure to work with. The graphs are a combination of the Google Charts API and gRaphaël, which I’ve had reasonable results from recently. gRaphaël’s strength are beautiful visualisations, rather than ultra-accurate charting, so the pair of tools are used for their strengths. Finally, it’s all deployed on Heroku, which has been a joy as ever; cloud deployment of databased apps, on dynamic hosting, as simple as pushing to a new git repo. And, for the scale of the Eurogamer tool, totally free.

So there you go. A little exploration of some numbers, which bring some interesting figures to light, and was also fun to build. It only felt right to share it. As the site says, scores aren’t everything – you should read reviews too, folks – but when you’ve got numerical data, it seems a shame not to do anything with it.

  • "With every piece of metadata that you don’t throw away, you gain a factor more potential ways of slicing through your content and delivering it as a separate product, simply as a result of a database lookup. In the case of Vogue today, say, commissioning an editorial product that simply shows every dress designed by Christian Dior that appears in the archive would involve weeks of intern-work, instantly making it unprofitable or too late. A metadata-complete archive in the future would give you that with a single line of code." Hammersley on the value to journalism of sensible datastorage. Data-driven journalism in the sense that it is not *about* data, but in that it is *treated as data* – and from this more stories can flow.