We’re text mining the transcripts of the TV show LOST, and visualizing them!
I wondered how much the dialogue contributes to a TV show, and whether you could “detect” things in it by looking only at the words the characters speak, removed from how their actors looked or sounded. How much of the writing shows through, especially given the “mass production” mentality behind so much similar entertainment media?
I’ve wanted to do something like this for a long time, and I think it’s finally at a point where I can release it into the wild.
We’ll take a look at more than 2 million games, taken from the MillionBase PGN database. I ignored any Chess960 games it contains; in total there are 2,197,113 games. I was interested to see what kind of visualizations I could make, and what patterns would be revealed by considering so many games.
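As a rough illustration of the "ignore Chess960" step, here is a minimal Python sketch that reads the header tags of each game in a PGN string and skips games marked as Chess960. It is not the code used for the actual analysis. The helper names (`split_games`, `is_chess960`, `count_standard_games`) are hypothetical, and it assumes the variant is recorded in a `Variant` tag and that every game has at least one line of movetext; real databases may mark Chess960 games differently.

```python
import re

# A PGN tag pair looks like: [Event "Some tournament"]
TAG_RE = re.compile(r'^\[(\w+)\s+"(.*)"\]\s*$')


def split_games(pgn_text):
    """Yield one dict of header tags per game found in a PGN string.

    Assumption: every game has at least one non-empty movetext line,
    so the first tag seen after movetext starts a new game.
    """
    headers = {}
    in_headers = False
    for line in pgn_text.splitlines():
        stripped = line.strip()
        match = TAG_RE.match(stripped)
        if match:
            if not in_headers and headers:
                # First tag after movetext: the previous game is complete.
                yield headers
                headers = {}
            in_headers = True
            headers[match.group(1)] = match.group(2)
        elif stripped:
            # Any other non-empty line is treated as movetext.
            in_headers = False
    if headers:
        yield headers


def is_chess960(headers):
    """Assumption: Chess960 games carry a Variant tag naming the variant."""
    variant = headers.get("Variant", "").lower().replace(" ", "")
    return variant in {"chess960", "fischerandom"}


def count_standard_games(pgn_text):
    """Count the games that are not flagged as Chess960."""
    return sum(1 for h in split_games(pgn_text) if not is_chess960(h))
```

For a file of this size you would probably want to stream it line by line instead of loading it into one string, but the filtering logic stays the same.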
So, okay, simple enough: a 100% width/height canvas in the background, scatter some randomly placed dots on it, then link the dots depending on the distance between them.
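The geometry behind that idea can be sketched in a few lines. The Python below is only a sketch of the dot placement and linking logic, not the canvas drawing itself. The function names and the `max_distance` parameter are hypothetical, and fading the link opacity with distance is one assumed reading of "depending on the distance"; a plain on/off threshold would work just as well.

```python
import math
import random


def make_dots(count, width, height, rng):
    """Place `count` dots uniformly at random inside a width x height area."""
    return [(rng.uniform(0, width), rng.uniform(0, height)) for _ in range(count)]


def link_dots(dots, max_distance):
    """Return (i, j, opacity) for every pair of dots closer than max_distance.

    Assumption: opacity falls off linearly, from 1 when the dots coincide
    down to 0 at max_distance, so nearby dots get stronger links.
    """
    links = []
    for i in range(len(dots)):
        for j in range(i + 1, len(dots)):
            distance = math.dist(dots[i], dots[j])
            if distance < max_distance:
                links.append((i, j, 1 - distance / max_distance))
    return links


if __name__ == "__main__":
    rng = random.Random(42)  # fixed seed so repeated runs give the same layout
    dots = make_dots(50, 800, 600, rng)
    for i, j, opacity in link_dots(dots, 120):
        print(f"link {i} -> {j} with opacity {opacity:.2f}")
```

Checking every pair is quadratic in the number of dots, which is fine for a few hundred of them; in a real canvas version each link would become a line drawn between the two dots with that opacity.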