A Visual Look at 2 Million Chess Games: Part 2

We’re looking deeper into the amazing MillionBase database, with over two million chess games, to see what else we can learn about the data specifically, and possibly about chess in general. I analyzed some of this data in Part 1. If you haven’t seen that, feel free to give it a read. This post focuses on other aspects of the data and draws on some reader suggestions from the previous post.

One Year in Vancouver: Visualizing Events

We’re looking at 2018 in Vancouver through the lens of event listings on the Georgia Straight Events page. I find their list extremely useful and use it all the time to find cool things to do, and I notice a great many events happening all over the city. I collected all the events over the course of 2018 and wanted to make a “map of the city” from this data.

Text Mining LOST

We’re text mining the transcripts of the TV show LOST, and visualizing them! I retrieved and parsed the transcripts of LOST from Lostpedia and used a few different tools to look at this data. One thing to keep in mind is that this analysis covers only the text that the characters speak. I’m not a big LOST connoisseur, so take these visualizations at face value and not as some objective judgement about your favorite LOST character or writer.

A Visual Look at 2 Million Chess Games

I wanted to do something like this for a long time, and I think it’s finally at a point where I can release it into the wild. We’ll take a look at more than 2 million games, taken from the MillionBase PGN database. I ignored any Chess960 games it contains, but in total there are 2,197,113 games. I was interested to see what kind of visualizations I could make, and what patterns would be revealed by considering so many games.
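
For the curious, here is a minimal sketch of how skipping Chess960 games and counting the rest might look in plain Python. It is not necessarily how the analysis was actually done. It assumes each game starts with an `Event` tag (the first tag in the PGN standard's tag roster) and that Chess960 games are marked with a `Variant` header, which may not hold for every PGN file. The helper names `iter_game_headers`, `is_chess960` and `count_standard_games` are made up for this illustration.

```python
import re
from typing import Dict, Iterable, Iterator

# A PGN tag pair looks like: [Event "Some tournament"]
TAG_RE = re.compile(r'^\[(\w+)\s+"(.*)"\]\s*$')


def iter_game_headers(lines: Iterable[str]) -> Iterator[Dict[str, str]]:
    """Yield one dict of header tags per game, ignoring the move text."""
    headers = None
    for line in lines:
        match = TAG_RE.match(line.strip())
        if not match:
            continue  # move text, comments, or blank lines
        key, value = match.groups()
        if key == "Event":
            # Assumption: an Event tag marks the start of a new game.
            if headers is not None:
                yield headers
            headers = {}
        if headers is not None:
            headers[key] = value
    if headers is not None:
        yield headers


def is_chess960(headers: Dict[str, str]) -> bool:
    """Assumption: Chess960 games carry a Variant tag naming the variant."""
    variant = headers.get("Variant", "").lower()
    return "960" in variant or "fischer" in variant


def count_standard_games(lines: Iterable[str]) -> int:
    """Count the games that are not flagged as Chess960."""
    return sum(1 for h in iter_game_headers(lines) if not is_chess960(h))
```

Since `count_standard_games` accepts any iterable of lines, an open file handle can be passed in directly, so a large PGN file never has to be loaded into memory all at once.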