ca·coph·o·ny - a harsh discordant mixture of sounds.

I always get glimpses of cool ideas when I see great data, and after watching parts of the First Presidential Debate of 2020 I knew I had to do something with it. The transcript over at contains incredibly detailed and millisecond timestamped data. Hearing about how chaotic the debate went (and witnessing it after digging into the data), I wanted to look at how often people spoke over each other.

Color in Books

Project Gutenberg is an amazing source of books that are in the public domain. An avid reader myself, I wanted to make a creative visualization using the text of the great books when I came across the “Best Books Ever” list. Click on the graphic to see it in full size. I hope this inspires you to read one of these classics! Interested in prints of this graphic? I have a few copies printed on beautiful art paper.

One Century of Plane Crashes

An exploration of 100 years of plane crashes. I discovered the ASN Aviation Safety Database some time ago, and knew almost instantly that I would have to dig deep on this one. The earliest crash is from August 2nd, 1919 and the latest record I have is from August 1st, 2019. There are approximately 22 thousand crash records. Planes and aviation represent a great chunk of our progress in technology and manufacturing, as well as bringing the world closer to us, with novel means of not just travel, but other endeavors like war and scientific research.

Visual Look at 2 Million Chess Games: Part 2

We’re looking deeper into the amazing MillionBase database with over two million chess games to see what else we can learn about the data specifically, and possibly about chess in general. I have analyzed some of this data in Part 1. If you haven’t seen that, feel free to give it a read. This post will focus on other aspects of the data, and I have drawn from some reader suggestions from the previous post.

One Year in Vancouver: Visualizing Events

We’re looking at the year of 2018 in Vancouver through the lens of event listings on the Georgia Straight Events page. I find their list extremely useful and use it all the time to find cool things to do, and notice a great deal of events happening all over the city. I collected all the events over the course of 2018 and wanted to make a “map of the city” from this data.

Text Mining LOST

We’re text mining the transcripts of the TV show LOST, and visualizing them! I retrieved and parsed the transcripts of LOST from Lostpedia1 and used a few different tools to look at this data. One thing to keep in mind is that this analysis is on only the text that the characters speak. I’m not a big LOST connoisseur, so take these visualizations at their face value and not as some objective judgement about your favorite LOST character or writer.

A Visual Look at 2 Million Chess Games

I wanted to do something like this for a long time, and finally I think it’s at a point where I can release this into the wild. We’ll take a look at more than 2 million games, taken from the MillionBase PGN database. I ignored any Chess960 games contained, but in total there are 2,197,113 games. I was interested to see what kind of visualizations I can do, and what patterns would be revealed by considering so many games.