Ratings
Chess engine rating lists aim to provide statistically significant measures of relative engine strength. These lists play multiple games between engines on standard hardware platforms, so that processor differences are factored out. Some also standardize the opening books, in an attempt to measure the strength differences of the engines only. These lists not only provide a ranking, but also margins of error on the given ratings. Also rating lists typically play games continuously, publishing many updates per year, compared to tournaments which only take place annually.
There are a number of factors that vary among the chess engine rating lists:
- Time control. Longer time controls, such as 40 moves in 120 minutes, are better suited for determining tournament play strength, but also make testing more time-consuming.
- Hardware used. Faster hardware with more memory leads to stronger play.
- 64-bit (vs. 32-bit) hardware and operating systems favor bitboard-based programs
- Multiprocessor vs. single processor hardware.
- Ponder settings (speculative analysis while the opponent is thinking) aka Permanent Brain.
- Transposition table sizes.
- Opening book settings.
- All listed engines are 64-bit
These differences affect the results, and make direct comparisons between rating lists difficult.
Rating list | Time control (moves/minutes) |
Year started |
Last updated | Engine/platform entries |
Games played |
Top three engines | Rating |
---|---|---|---|---|---|---|---|
CCRL | 40/40 Ponder OFF |
2005 | November 10, 2012 | 1241 | 422,452 | Houdini 3 x64 4CPU Critter 1.6a x64 4CPU Rybka 4 x64 4CPU |
3287 3207 3199 |
CEGT | 40/20 Ponder ON |
2006 | October 14, 2012 | 1067 | 620,333 | Houdini 1.5a x64 6CPU Critter 1.6 x64 4CPU Stockfish 2.3 x64 4CPU |
3101 3056 3056 |
IPON | 5m+3s ~16min/game Ponder ON |
2006 | October 17, 2012 | 110 | 213,250 | Houdini 3.0 STD x64 1CPU Komodo 5 x64 1CPU Critter 1.4a x64 1CPU |
3096 3011 2990 |
SWCR | 40/10 Ponder ON |
2009 | January 17, 2012 | 67 | 161,160 | Houdini 2.0c x64 1CPU Critter 1.4 x64 1CPU Rybka 4 Exp. 42 x64 1CPU |
3019 2980 2968 |
SSDF | 40/120 Ponder ON |
1984 | June 28, 2012 | 311 | 121,472 | Deep Rybka 4 x64 2GB Q6600 2.4GHz Naum 4.2 MP x64 2GB Q6600 2.4GHz Deep Shredder 12 x64 2GB Q6600 2.4GHz |
3221 3138 3115 |
WBEC | 40/40 Ponder ON |
2001 | May 15, 2011 | 226 (Historically: 850+) |
100,749 | Rybka 4 x64 2CPU Stockfish 2.0.1 x64 2CPU Thinker 5.5.4A1 x64 2CPU |
3124 3121 3114 |
- Note that the listings in the above table only count the best entry for a given engine.
These ratings, although calculated by using the Elo system (or similar rating methods), have no direct relation to FIDE Elo ratings or to other chess federation ratings of human players. Except for some man versus machine games which the SSDF had organized many years ago (which were far from today's level), there is no calibration between any of these rating lists and player pools. Hence, the results which matter are the ranks and the differences between the ratings, not the absolute level of the numbers. Also, each list calibrates their Elo via a different method. Therefore no Elo comparisons can be made between the lists. Nevertheless, in view of recent man versus machine matches, it is generally undisputed that top computer chess engines should be rated at least in the range of top human performances, and probably significantly higher.
Missing from many rating lists are IPPOLIT and its derivatives (e.g. Fire ). Although very strong and open source, there are allegations from commercial software interests that they were derived from disassembled binary of Rybka. Due to the controversy, all these engines have been blacklisted from many tournaments and rating lists. Although Rybka has been accused of being based on Fruit, it is not blacklisted from computer chess tournaments or rating lists. In June 2011, the ICGA claimed Rybka was derived from Fruit and Crafty and Rybka has been banned from the International Computer Games Association World Computer Chess Championship, and its previous victories (2007, 2008, 2009, and 2010) has been revoked. The ICGA was criticized for this decision by Dr. Søren Riis, a longstanding supporter of the Rybka program. All rating lists continue to include Rybka.
Read more about this topic: Chess Engine, Comparisons