-
"Marky Markov is an experiment in Markov Chain generation implemented in Ruby. It can be used both from the command-line and as a library within your code." It's very fast, and basically does all the work I've been doing on my projects by hand for me. But better.
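For anyone unfamiliar with the technique, here is a minimal sketch of word-level Markov chain text generation. It is written in Python rather than Ruby and is not Marky Markov's API: the function names `build_chain` and `generate`, the `order` parameter, and the toy corpus are all assumptions made up for illustration.

```python
import random
from collections import defaultdict


def build_chain(text, order=2):
    """Map each run of `order` consecutive words to the words seen right after it."""
    words = text.split()
    chain = defaultdict(list)
    for i in range(len(words) - order):
        state = tuple(words[i:i + order])
        chain[state].append(words[i + order])
    return dict(chain)


def generate(chain, length=20, seed=None):
    """Walk the chain from a random starting state, stopping early at a dead end."""
    rng = random.Random(seed)
    state = rng.choice(sorted(chain))  # sorted so a fixed seed is repeatable
    output = list(state)
    while len(output) < length:
        followers = chain.get(state)
        if not followers:
            break  # this state was never followed by anything in the corpus
        next_word = rng.choice(followers)
        output.append(next_word)
        state = state[1:] + (next_word,)  # slide the window forward one word
    return " ".join(output)


# Hypothetical toy corpus; a real project would feed in a much larger text.
corpus = "the cat sat on the mat and the cat ate the fish on the mat"
chain = build_chain(corpus, order=1)
print(generate(chain, length=12, seed=1))
```

Repeated followers are kept in the list on purpose, so a word that follows a state more often is proportionally more likely to be picked.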
-
MattB writes down his tips for language processing/machine learning; useful that somebody's done this.
-
"Imperfect and internal rhymes are two important features in rap music previously ignored in the music information retrieval literature. We developed a method of scoring potential rhymes using a probabilistic model based on phoneme frequencies in rap lyrics. We used this scoring scheme to automatically identify internal and line-final rhymes in song lyrics and demonstrated the performance of this method compared to rules-based models. We then calculated higher-level rhyme features and used them to compare rhyming styles in song lyrics from different genres, and for different rap artists. We found that these detected features corresponded to real-world descriptions of rhyming style and were strongly characteristic of different rappers, resulting in potential applications to style-based comparison, music recommendation, and authorship identification." Awesome, and something I am going to sit down and read properly.
-
"A series of informative posters detailing how some of the most notable drum sequences were programmed using the Roland TR-808 Drum Machine. Each sequence has been analyzed and represented as to allow users to re-programme each sequence, key for key." Gorgeous. (If I had to pick, I'd take Voodoo Ray – which is a lovely piece of drum programming amongst many other things).
-
"Both within the academy and within tech startups, we’ve been hearing some similar questions lately: Where can I find a good data scientist? What do I need to learn to become a data scientist? Or more succinctly: What is data science?" Great starting point; looking forward to more from the blog.
-
"When someone with a bad case fails to finish a book, they don’t start a new one; they go into a holding pattern, crippled by guilt over their failure and unable to let go and start over. All reading stops. People have confessed to me that it’s been months since they last picked up a book, because they still haven’t finished the last one." Yup. We really don't have to finish this book, sometimes.
-
"…don’t confuse this kind of data exploration, where the goal is to size up the data, with building proper data plumbing, where you want robustness and maintainability. Perl and bash scripts are nice for the former, but can be a nightmare for building data pipelines." Lots of good stuff in this article; this was a highlight.
-
Excellent, detailed article on how Microsoft calculate TrueSkill – an algorithm for matching you to players at about your skill level. This is what is used every time you hit "game with strangers" on an XBL title, basically. Fascinating, detailed, not too challenging if you take it slow/steady – and the implementation is on GitHub…
-
"Being NOTES and SLIDES on a talk given at PLAYFUL 09, concerning CHARLES BABBAGE, HEATH ROBINSON, MENACE and MAGE" Awesome; shame I couldn't be there. I wondered where that link about Michie had come from a few weeks ago…
-
"Someone at work recently asked how he should go about studying machine learning on his own. So I’m putting together a little guide." Ooh, useful. Lots of starting points for machine learning in R.
-
"When you look at the dubstep scene you realize quickly that it’s a fairly young genre. Not in terms of its own existence as a named thing, but as a measure of the age of many of its prominent musicians. They’re of the generation that doesn’t know a world before the Nintendo Entertainment System and a lot of the music reflects that… If you had a giant Venn Diagram of dubstep and 8-bit chiptunes, you’d see a large overlap between the two. Why dubstep is particularly prone to this, more than other electronic styles, I don’t know. Maybe it has to do with its relatively lo-fi, home studio feel of the genre? … There’s a hidden, untold history there, but it’d be best told by someone that knows the genre, and its players, better than I do. In the meantime, I’ll continue enjoying it until it’s pillaged and destroyed for all its worth." Mike on the overlap between dubstep and chiptune culture.
-
"All artworks have been created using data from the game "Unreal Tournament". Each image represents about 30 mins of gameplay in which the computers AI plays against itself. There are 20-25 bots playing each game and they play custom maps which I create. Each map has been specially designed so that the AI bots have a rough idea of where to go in order to create the image I want. I log the position (X,Y,Z) of each bot, every second using a modification for the game, I also log the position of a death. I then run my own program written in Processing to create printable postscript files of that match."
-
"This page will maintain list of AI related libraries for the Ruby programming language." Some interesting stuff here, although it's all in varying degrees of maturity…
-
"What data can we wring out of the rural environment that might prove of use to its residents and visitors? What embedded processes should have APIs opened up to the wider community?" Lovely lovely lovely design of little bits of Ruricomp and what they might look like from the lovely lovely Paul Pod (who was in the studio a week or two ago, and a joy to work with and around). The twitterbots are especially good.
-
"We’ve been working with the fine folks at Flickr (thanks Aaron!) to add Noticings to their third party machine tag services. What on earth does that mean? Basically, a badge on the sidebar of your Flickr photos linking to Noticings…" Ding! Rather excited about this. Lots more to come (especially from my half of the deal) on Noticings soon, but this bit is super-exciting.