I am a graduate student studying statistics and I play chess once in a while for fun.
I was thinking (just for the fun of it) -- applying some of what I've learned in statistics to chess games: I want to perform statistical analysis of chess games.
Note: My primary purpose is recreation:
- Tinkering around
- Re-producing earlier works -- for example how much of an advantage does white have over black
- Looking at mid-game winning patterns
- Etc. etc.
So to that:
- I need a large collection of chess games (historic data)
- I will then analyze them using R which is a statistical programming language
Question:
- I'm looking a collection of games that I can then use as my data set (I would love to get a list of open source chess databases -- commercial chess database might have restrictions on em)
- I'm looking for any reference material -- like seminal papers in this area.
Thanks.