Text Mining LOST

We’re text mining the transcripts of the TV show LOST and visualizing them! I retrieved and parsed the transcripts from Lostpedia and used a few different tools to look at the data. One thing to keep in mind is that this analysis covers only the text that the characters speak. I’m not a big LOST connoisseur, so take these visualizations at face value and not as some objective judgment about your favorite LOST character or writer.

A Visual Look at 2 Million Chess Games

I’ve wanted to do something like this for a long time, and I think it’s finally at a point where I can release it into the wild. We’ll take a look at more than 2 million games taken from the MillionBase PGN database. I ignored any Chess960 games it contains, but in total there are 2,197,113 games. I was interested to see what kinds of visualizations I could make and what patterns would emerge from considering so many games.
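
For a rough idea of what skipping Chess960 games might look like, here is a minimal Python sketch. It is not the code behind this post. It assumes the variant is marked with a `Variant` tag in each game's PGN header (a common convention, but not guaranteed for every database), and the function names are made up for illustration.

```python
import re

# A PGN tag pair looks like: [Event "Some tournament"]
TAG_RE = re.compile(r'^\[(\w+)\s+"(.*)"\]\s*$')


def iter_game_headers(lines):
    """Yield one dict of tag pairs per game found in an iterable of PGN lines."""
    headers = {}
    in_headers = False
    for line in lines:
        match = TAG_RE.match(line.strip())
        if match:
            # A tag line that follows movetext starts a new game.
            if not in_headers and headers:
                yield headers
                headers = {}
            in_headers = True
            headers[match.group(1)] = match.group(2)
        else:
            # Blank lines and movetext end the current header section.
            in_headers = False
    if headers:
        yield headers


def is_chess960(headers):
    """Assumption: Chess960 games carry a Variant tag such as "Chess960" or "Fischerandom"."""
    variant = headers.get("Variant", "").lower()
    return "960" in variant or "fischer" in variant


def count_standard_games(lines):
    """Count the games whose headers do not mark them as Chess960."""
    return sum(1 for headers in iter_game_headers(lines) if not is_chess960(headers))
```

Passing an open file object (or any iterable of lines) to `count_standard_games` gives the number of games that would be kept. Because it only reads headers, it never has to parse the moves themselves.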