Gameplay Leaderboard

How well do LLMs actually play chess? Data aggregated from independent research — we do not run these evaluations ourselves.

Data by dubesor.de (independent evaluation, not affiliated with Chess AI Bench). Cached Feb 25, 2026.
407 models
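| # | Model | ELO | Games | Win Rate | Accuracy | Legal Move % | Avg Turns |
|---|---|---|---|---|---|---|---|
| 1 | gemini-3-pro-preview_Reasoning | 1845 | 40 | 0.0% | | | |
| 2 | gemini-3.1-pro-preview_Reasoning | 1837 | 15 | 0.0% | | | |
| 3 | gpt-4.5-preview ˟_Continuation | 1800 | 20 | 0.0% | | | |
| 4 | qwen3-max-thinking_Reasoning | 1800 | 1 | 0.0% | | | |
| 5 | gemini-3-pro-preview_Continuation | 1795 | 27 | 0.0% | | | |
| 6 | gpt-5.1-codex_Reasoning | 1785 | 16 | 0.0% | | | |
| 7 | gpt-5-codex_Reasoning | 1777 | 23 | 0.0% | | | |
| 8 | gemini-3-flash-preview_Continuation | 1767 | 24 | 0.0% | | | |
| 9 | gemini-3.1-pro-preview_Continuation | 1661 | 12 | 0.0% | | | |
| 10 | grok-4_Reasoning | 1615 | 27 | 0.0% | | | |
| 11 | chatgpt-4o-latest ˟_Continuation | 1584 | 18 | 0.0% | | | |
| 12 | o3_Reasoning | 1558 | 35 | 0.0% | | | |
| 13 | gpt-5_Reasoning | 1526 | 23 | 0.0% | | | |
| 14 | gpt-5.1_Reasoning | 1526 | 19 | 0.0% | | | |
| 15 | gpt-5-chat_Continuation | 1497 | 28 | 0.0% | | | |
| 16 | gpt-4o_Continuation | 1463 | 37 | 0.0% | | | |
| 17 | gpt-5_Continuation | 1443 | 17 | 0.0% | | | |
| 18 | gemini-3-flash-preview_Reasoning | 1436 | 33 | 0.0% | | | |
| 19 | gpt-5.1-codex_Continuation | 1429 | 14 | 0.0% | | | |
| 20 | Human | 1413 | 245 | 0.0% | | | |
| 21 | gpt-5.1-chat_Continuation | 1401 | 17 | 0.0% | | | |
| 22 | gpt-3.5-turbo-instruct_Continuation | 1379 | 88 | 0.0% | | | |
| 23 | gpt-3.5-turbo_Continuation | 1374 | 63 | 0.0% | | | |
| 24 | gpt-5.1-codex-max_Continuation | 1353 | 14 | 0.0% | | | |
| 25 | o3_Continuation | 1347 | 17 | 0.0% | | | |
| 26 | gpt-5.1-codex-max_Reasoning | 1342 | 18 | 0.0% | | | |
| 27 | gpt-5.2-codex_Reasoning | 1327 | 17 | 0.0% | | | |
| 28 | gpt-4o-2024-11-20_Continuation | 1308 | 23 | 0.0% | | | |
| 29 | gpt-5-codex_Continuation | 1291 | 13 | 0.0% | | | |
| 30 | gpt-4.1-mini_Continuation | 1261 | 28 | 0.0% | | | |
| 31 | gpt-4.1_Continuation | 1239 | 37 | 0.0% | | | |
| 32 | step-3.5-flash_Reasoning | 1232 | 21 | 0.0% | | | |
| 33 | gpt-5.1_Continuation | 1231 | 12 | 0.0% | | | |
| 34 | grok-4.1-fast-reasoning_Reasoning | 1207 | 28 | 0.0% | | | |
| 35 | gpt-4_Continuation | 1203 | 15 | 0.0% | | | |
| 36 | gemini-2.0-flash-001_Continuation | 1196 | 23 | 0.0% | | | |
| 37 | gpt-5.2_Reasoning | 1174 | 18 | 0.0% | | | |
| 38 | grok-4-fast-reasoning_Reasoning | 1145 | 30 | 0.0% | | | |
| 39 | gpt-5.2-codex_Continuation | 1145 | 13 | 0.0% | | | |
| 40 | gpt-5-mini_Continuation | 1117 | 16 | 0.0% | | | |
| 41 | gpt-5-nano_Reasoning | 1112 | 30 | 0.0% | | | |
| 42 | codex-mini ˟_Reasoning | 1108 | 35 | 0.0% | | | |
| 43 | gpt-5-mini_Reasoning | 1107 | 30 | 0.0% | | | |
| 44 | gpt-5.3-codex_Reasoning | 1089 | 8 | 0.0% | | | |
| 45 | gpt-5.3-codex_Continuation | 1082 | 7 | 0.0% | | | |
| 46 | claude-opus-4.1_Continuation | 1078 | 20 | 0.0% | | | |
| 47 | deepseek-v3.2-speciale_Reasoning | 1072 | 23 | 0.0% | | | |
| 48 | gemini-2.5-pro_Continuation | 1069 | 25 | 0.0% | | | |
| 49 | gpt-5-nano_Continuation | 1049 | 16 | 0.0% | | | |
| 50 | o4-mini_Reasoning | 1034 | 37 | 0.0% | | | |
| 51 | gpt-4-turbo_Continuation | 1024 | 19 | 0.0% | | | |
| 52 | o4-mini_Continuation | 995 | 12 | 0.0% | | | |
| 53 | gpt-4.5-preview ˟_Reasoning | 992 | 15 | 0.0% | | | |
| 54 | grok-4_Continuation | 992 | 12 | 0.0% | | | |
| 55 | o1_Continuation | 989 | 7 | 0.0% | | | |
| 56 | gpt-4o-mini_Continuation | 955 | 13 | 0.0% | | | |
| 57 | gpt-5.1-codex-mini_Reasoning | 952 | 22 | 0.0% | | | |
| 58 | gemini-2.5-pro_Reasoning | 943 | 46 | 0.0% | | | |
| 59 | grok-code-fast-1_Reasoning | 927 | 23 | 0.0% | | | |
| 60 | kimi-k2.5_Reasoning | 927 | 26 | 0.0% | | | |
| 61 | gpt-oss-120b_Reasoning | 925 | 35 | 0.0% | | | |
| 62 | gpt-5.2_Continuation | 920 | 12 | 0.0% | | | |
| 63 | gpt-oss-20b_Continuation | 906 | 19 | 0.0% | | | |
| 64 | claude-opus-4.5_Continuation | 900 | 17 | 0.0% | | | |
| 65 | gemini-2.5-flash_Continuation | 896 | 22 | 0.0% | | | |
| 66 | grok-4.1-fast-reasoning_Continuation | 894 | 12 | 0.0% | | | |
| 67 | nemotron-3-nano-30b-a3b_Reasoning | 892 | 26 | 0.0% | | | |
| 68 | gemini-1.5-pro ˟_Continuation | 888 | 10 | 0.0% | | | |
| 69 | claude-opus-4_Continuation | 871 | 18 | 0.0% | | | |
| 70 | o1-mini ˟_Continuation | 869 | 8 | 0.0% | | | |
| 71 | claude-opus-4.5_Reasoning | 869 | 28 | 0.0% | | | |
| 72 | gpt-oss-20b_Reasoning | 854 | 31 | 0.0% | | | |
| 73 | codestral-2508_Reasoning | 854 | 1 | 0.0% | | | |
| 74 | gpt-5.1-chat_Reasoning | 853 | 21 | 0.0% | | | |
| 75 | minimax-m2_Continuation | 838 | 12 | 0.0% | | | |
| 76 | codex-mini ˟_Continuation | 833 | 10 | 0.0% | | | |
| 77 | gpt-4o_Reasoning | 829 | 38 | 0.0% | | | |
| 78 | claude-opus-4.6_Continuation | 827 | 15 | 0.0% | | | |
| 79 | gpt-5.2-chat_Reasoning | 825 | 20 | 0.0% | | | |
| 80 | gpt-5.1-codex-mini_Continuation | 824 | 10 | 0.0% | | | |
| 81 | qwen3.5-397b-a17b_Reasoning | 823 | 15 | 0.0% | | | |
| 82 | deepseek-v3.2-speciale_Continuation | 819 | 10 | 0.0% | | | |
| 83 | seed-oss-36b-instruct_Reasoning | 818 | 13 | 0.0% | | | |
| 84 | gpt-4.1_Reasoning | 817 | 40 | 0.0% | | | |
| 85 | qwen3.5-plus-02-15_Reasoning | 812 | 14 | 0.0% | | | |
| 86 | claude-opus-4.6_Reasoning | 811 | 27 | 0.0% | | | |
| 87 | o1_Reasoning | 807 | 11 | 0.0% | | | |
| 88 | seed-1.6_Reasoning | 802 | 21 | 0.0% | | | |
| 89 | chatgpt-4o-latest ˟_Reasoning | 799 | 17 | 0.0% | | | |
| 90 | lfm-7b ˟_Continuation | 796 | 11 | 0.0% | | | |
| 91 | gpt-5.2-chat_Continuation | 793 | 12 | 0.0% | | | |
| 92 | glm-5_Reasoning | 791 | 22 | 0.0% | | | |
| 93 | claude-sonnet-4.6_Reasoning | 790 | 16 | 0.0% | | | |
| 94 | gpt-5-chat_Reasoning | 787 | 26 | 0.0% | | | |
| 95 | qwen3-next-80b-a3b-thinking_Reasoning | 783 | 16 | 0.0% | | | |
| 96 | claude-sonnet-4_Continuation | 780 | 17 | 0.0% | | | |
| 97 | grok-3_Continuation | 780 | 17 | 0.0% | | | |
| 98 | deepseek-v3.2_Reasoning | 780 | 20 | 0.0% | | | |
| 99 | minimax-m2.1_Continuation | 780 | 8 | 0.0% | | | |
| 100 | kimi-k2.5_Continuation | 772 | 12 | 0.0% | | | |
| 101 | grok-4-fast-reasoning_Continuation | 771 | 14 | 0.0% | | | |
| 102 | o1-mini ˟_Reasoning | 766 | 24 | 0.0% | | | |
| 103 | grok-3-mini_Reasoning | 766 | 38 | 0.0% | | | |
| 104 | grok-4-fast-non-reasoning_Reasoning | 756 | 24 | 0.0% | | | |
| 105 | gemini-2.5-flash_Reasoning | 755 | 31 | 0.0% | | | |
| 106 | deepseek-v3.1-terminus_Continuation | 755 | 1 | 0.0% | | | |
| 107 | lfm-2.5-1.2b-instruct_Reasoning | 755 | 1 | 0.0% | | | |
| 108 | grok-2-latest ˟_Continuation | 744 | 8 | 0.0% | | | |
| 109 | command-a_Reasoning | 743 | 18 | 0.0% | | | |
| 110 | qwen3-8b_Reasoning | 741 | 18 | 0.0% | | | |
| 111 | kimi-k2_Reasoning | 737 | 31 | 0.0% | | | |
| 112 | aurora-alpha_Reasoning | 736 | 1 | 0.0% | | | |
| 113 | kimi-k2-0905_Reasoning | 734 | 32 | 0.0% | | | |
| 114 | kimi-k2-thinking_Continuation | 731 | 11 | 0.0% | | | |
| 115 | gemini-2.5-flash-lite_Continuation | 727 | 15 | 0.0% | | | |
| 116 | minimax-m2_Reasoning | 721 | 24 | 0.0% | | | |
| 117 | lfm-2.5-1.2b-thinking_Reasoning | 721 | 1 | 0.0% | | | |
| 118 | kimi-k2-thinking_Reasoning | 715 | 20 | 0.0% | | | |
| 119 | o3-mini_Continuation | 714 | 10 | 0.0% | | | |
| 120 | glm-5_Continuation | 714 | 10 | 0.0% | | | |
| 121 | gpt-4.1-mini_Reasoning | 713 | 27 | 0.0% | | | |
| 122 | claude-opus-4_Reasoning | 709 | 24 | 0.0% | | | |
| 123 | o3-mini_Reasoning | 707 | 16 | 0.0% | | | |
| 124 | qwen3-32b_Reasoning | 706 | 26 | 0.0% | | | |
| 125 | mistral-large-2-2411_Reasoning | 704 | 35 | 0.0% | | | |
| 126 | qwen2.5-72b-instruct_Reasoning | 704 | 38 | 0.0% | | | |
| 127 | kimi-k2_Continuation | 704 | 14 | 0.0% | | | |
| 128 | qwen-plus-2025-07-28_Reasoning | 704 | 17 | 0.0% | | | |
| 129 | longcat-flash-chat_Reasoning | 701 | 22 | 0.0% | | | |
| 130 | qwen3-14b_Reasoning | 700 | 16 | 0.0% | | | |
| 131 | claude-3.7-sonnet_Continuation | 698 | 12 | 0.0% | | | |
| 132 | claude-opus-4.1_Reasoning | 698 | 28 | 0.0% | | | |
| 133 | internvl3-78b_Reasoning | 697 | 11 | 0.0% | | | |
| 134 | deepseek-r1_Reasoning | 691 | 13 | 0.0% | | | |
| 135 | llama-3.3-70b-instruct_Reasoning | 687 | 52 | 0.0% | | | |
| 136 | qwen3-max_Reasoning | 686 | 21 | 0.0% | | | |
| 137 | claude-haiku-4.5_Reasoning | 680 | 21 | 0.0% | | | |
| 138 | glm-4.6v_Reasoning | 680 | 18 | 0.0% | | | |
| 139 | grok-4.1-fast-non-reasoning_Reasoning | 668 | 14 | 0.0% | | | |
| 140 | qwen2.5-max_Reasoning | 667 | 23 | 0.0% | | | |
| 141 | qwen-plus-2025-07-28_Continuation | 667 | 10 | 0.0% | | | |
| 142 | claude-sonnet-4.6_Continuation | 667 | 10 | 0.0% | | | |
| 143 | qwen3-235b-a22b-thinking-2507_Reasoning | 666 | 19 | 0.0% | | | |
| 144 | gemini-2.0-flash-lite-001_Continuation | 665 | 13 | 0.0% | | | |
| 145 | qwen3-coder-next_Reasoning | 664 | 12 | 0.0% | | | |
| 146 | claude-sonnet-4.5_Reasoning | 663 | 30 | 0.0% | | | |
| 147 | deepseek-r1-0528_Reasoning | 662 | 16 | 0.0% | | | |
| 148 | deepseek-v3.2-exp_Reasoning | 660 | 18 | 0.0% | | | |
| 149 | devstral-2512_Reasoning | 660 | 18 | 0.0% | | | |
| 150 | gpt-4o-mini_Reasoning | 659 | 21 | 0.0% | | | |
| 151 | qwen3-coder-plus_Reasoning | 659 | 14 | 0.0% | | | |
| 152 | grok-2-latest ˟_Reasoning | 659 | 16 | 0.0% | | | |
| 153 | seed-1.6-flash_Reasoning | 659 | 21 | 0.0% | | | |
| 154 | gemma-2-27b-it_Reasoning | 658 | 18 | 0.0% | | | |
| 155 | glm-4.5_Reasoning | 658 | 21 | 0.0% | | | |
| 156 | claude-3.7-sonnet_Reasoning | 655 | 26 | 0.0% | | | |
| 157 | claude-3.5-sonnet_Continuation | 653 | 11 | 0.0% | | | |
| 158 | minimax-m1_Reasoning | 653 | 15 | 0.0% | | | |
| 159 | olmo-3-32b-think_Reasoning | 650 | 15 | 0.0% | | | |
| 160 | phi-4_Reasoning | 649 | 14 | 0.0% | | | |
| 161 | qwen2.5-plus_Reasoning | 649 | 15 | 0.0% | | | |
| 162 | intellect-3_Reasoning | 649 | 7 | 0.0% | | | |
| 163 | deepseek-v3-0324_Continuation | 648 | 13 | 0.0% | | | |
| 164 | hunyuan-a13b-instruct_Continuation | 648 | 8 | 0.0% | | | |
| 165 | gpt-4_Reasoning | 648 | 12 | 0.0% | | | |
| 166 | gemini-2.5-flash-lite_Reasoning | 645 | 28 | 0.0% | | | |
| 167 | claude-3-sonnet ˟_Reasoning | 642 | 6 | 0.0% | | | |
| 168 | claude-sonnet-4_Reasoning | 640 | 33 | 0.0% | | | |
| 169 | qwen3-235b-a22b_Reasoning | 639 | 19 | 0.0% | | | |
| 170 | llama-3.1-nemotron-ultra-253b-v1_Reasoning | 637 | 13 | 0.0% | | | |
| 171 | gpt-oss-120b_Continuation | 637 | 13 | 0.0% | | | |
| 172 | claude-sonnet-4.5_Continuation | 634 | 17 | 0.0% | | | |
| 173 | ernie-4.5-21b-a3b-thinking_Reasoning | 634 | 13 | 0.0% | | | |
| 174 | gpt-4-turbo_Reasoning | 632 | 21 | 0.0% | | | |
| 175 | qwen3-235b-a22b_Continuation | 630 | 10 | 0.0% | | | |
| 176 | claude-haiku-4.5_Continuation | 629 | 15 | 0.0% | | | |
| 177 | ministral-14b-2512_Reasoning | 628 | 13 | 0.0% | | | |
| 178 | gemini-1.5-pro ˟_Reasoning | 627 | 12 | 0.0% | | | |
| 179 | gemini-2.0-flash-001_Reasoning | 626 | 47 | 0.0% | | | |
| 180 | deepseek-v3_Continuation | 626 | 10 | 0.0% | | | |
| 181 | qwen2.5-vl-32b-instruct_Reasoning | 626 | 2 | 0.0% | | | |
| 182 | inflection-3-pi_Reasoning | 625 | 11 | 0.0% | | | |
| 183 | mistral-large-3-2512_Reasoning | 621 | 15 | 0.0% | | | |
| 184 | grok-code-fast-1_Continuation | 619 | 10 | 0.0% | | | |
| 185 | llama-3.3-nemotron-super-49b-v1.5_Reasoning | 619 | 11 | 0.0% | | | |
| 186 | deepseek-v3-0324_Reasoning | 613 | 27 | 0.0% | | | |
| 187 | glm-4.6_Reasoning | 613 | 23 | 0.0% | | | |
| 188 | llama-3.3-70b-instruct_Continuation | 611 | 13 | 0.0% | | | |
| 189 | qwen3-30b-a3b_Reasoning | 610 | 17 | 0.0% | | | |
| 190 | deepseek-v3_Reasoning | 609 | 17 | 0.0% | | | |
| 191 | minimax-m2.1_Reasoning | 607 | 12 | 0.0% | | | |
| 192 | qwen3.5-397b-a17b_Continuation | 605 | 7 | 0.0% | | | |
| 193 | devstral-2512_Continuation | 604 | 9 | 0.0% | | | |
| 194 | llama-3.3-nemotron-super-49b-v1_Reasoning | 602 | 10 | 0.0% | | | |
| 195 | grok-3_Reasoning | 602 | 21 | 0.0% | | | |
| 196 | devstral-small-2505_Reasoning | 599 | 3 | 0.0% | | | |
| 197 | minimax-m2.5_Reasoning | 599 | 14 | 0.0% | | | |
| 198 | magistral-medium-2506_Reasoning | 598 | 10 | 0.0% | | | |
| 199 | inflection-3-pi_Continuation | 598 | 1 | 0.0% | | | |
| 200 | gpt-4.1-nano_Continuation | 597 | 8 | 0.0% | | | |
| 201 | gemini-1.5-flash ˟_Reasoning | 594 | 10 | 0.0% | | | |
| 202 | llama-3.1-70b-instruct_Reasoning | 593 | 13 | 0.0% | | | |
| 203 | qwen3-next-80b-a3b-instruct_Reasoning | 593 | 20 | 0.0% | | | |
| 204 | aurora-alpha_Continuation | 593 | 1 | 0.0% | | | |
| 205 | deepseek-v3.1_Reasoning | 591 | 19 | 0.0% | | | |
| 206 | deepseek-v3.2_Continuation | 591 | 11 | 0.0% | | | |
| 207 | llama-3.1-405b-instruct_Reasoning | 590 | 24 | 0.0% | | | |
| 208 | nemotron-3-nano-30b-a3b_Continuation | 589 | 5 | 0.0% | | | |
| 209 | gpt-4o-2024-11-20_Reasoning | 588 | 25 | 0.0% | | | |
| 210 | gemini-2.0-flash-lite-001_Reasoning | 588 | 18 | 0.0% | | | |
| 211 | devstral-medium_Reasoning | 588 | 12 | 0.0% | | | |
| 212 | glm-4.5-air_Reasoning | 585 | 19 | 0.0% | | | |
| 213 | qwen3-vl-235b-a22b-thinking_Reasoning | 585 | 11 | 0.0% | | | |
| 214 | mimo-v2-flash_Reasoning | 585 | 13 | 0.0% | | | |
| 215 | gemma-3-12b-it_Reasoning | 584 | 16 | 0.0% | | | |
| 216 | claude-3.5-haiku_Reasoning | 581 | 22 | 0.0% | | | |
| 217 | qwen3-coder-480b-a35b_Reasoning | 581 | 12 | 0.0% | | | |
| 218 | qwq-32b_Reasoning | 580 | 13 | 0.0% | | | |
| 219 | gemma-2-27b-it_Continuation | 579 | 6 | 0.0% | | | |
| 220 | magistral-medium-2506:thinking_Reasoning | 578 | 2 | 0.0% | | | |
| 221 | hunyuan-a13b-instruct_Reasoning | 576 | 14 | 0.0% | | | |
| 222 | llama-4-maverick_Reasoning | 575 | 26 | 0.0% | | | |
| 223 | ling-1t_Reasoning | 574 | 17 | 0.0% | | | |
| 224 | qwen2.5-turbo_Reasoning | 573 | 13 | 0.0% | | | |
| 225 | jamba-large-1.7_Reasoning | 572 | 14 | 0.0% | | | |
| 226 | gpt-4.1-nano_Reasoning | 570 | 20 | 0.0% | | | |
| 227 | inflection-3-productivity_Reasoning | 570 | 11 | 0.0% | | | |
| 228 | mistral-small-3.2-24b-instruct_Reasoning | 569 | 14 | 0.0% | | | |
| 229 | ernie-4.5-300b-a47b_Reasoning | 569 | 19 | 0.0% | | | |
| 230 | lfm2-8b-a1b_Reasoning | 569 | 19 | 0.0% | | | |
| 231 | qwen3-next-80b-a3b-thinking_Continuation | 568 | 10 | 0.0% | | | |
| 232 | deepseek-v3.1-terminus_Reasoning | 568 | 14 | 0.0% | | | |
| 233 | claude-opus-4.5-thinking_Reasoning | 567 | 1 | 0.0% | | | |
| 234 | ernie-4.5-21b-a3b_Reasoning | 566 | 18 | 0.0% | | | |
| 235 | mistral-medium-3_Reasoning | 565 | 17 | 0.0% | | | |
| 236 | glm-4.7-flash_Reasoning | 562 | 11 | 0.0% | | | |
| 237 | qwen3-30b-a3b-thinking-2507_Reasoning | 561 | 13 | 0.0% | | | |
| 238 | ministral-8b_Reasoning | 560 | 21 | 0.0% | | | |
| 239 | command-r-plus-08-2024_Reasoning | 560 | 13 | 0.0% | | | |
| 240 | internvl3-78b_Continuation | 559 | 2 | 0.0% | | | |
| 241 | qwen3-30b-a3b-instruct-2507_Reasoning | 558 | 22 | 0.0% | | | |
| 242 | gpt-3.5-turbo-instruct_Reasoning | 553 | 13 | 0.0% | | | |
| 243 | mistral-small-24b-instruct-2501_Reasoning | 553 | 15 | 0.0% | | | |
| 244 | nova-2-lite-v1_Reasoning | 553 | 13 | 0.0% | | | |
| 245 | gemini-1.5-flash-8b ˟_Reasoning | 552 | 10 | 0.0% | | | |
| 246 | deepseek-r1_Continuation | 551 | 2 | 0.0% | | | |
| 247 | lfm-7b ˟_Reasoning | 550 | 25 | 0.0% | | | |
| 248 | gpt-3.5-turbo_Reasoning | 550 | 14 | 0.0% | | | |
| 249 | hermes-4-70b_Reasoning | 550 | 2 | 0.0% | | | |
| 250 | step-3.5-flash_Continuation | 548 | 11 | 0.0% | | | |
| 251 | qwen3-next-80b-a3b-instruct_Continuation | 546 | 11 | 0.0% | | | |
| 252 | glm-4-32b_Reasoning | 545 | 17 | 0.0% | | | |
| 253 | kimi-k2-0905_Continuation | 545 | 13 | 0.0% | | | |
| 254 | minimax-m2.5_Continuation | 544 | 1 | 0.0% | | | |
| 255 | claude-3-opus ˟_Reasoning | 543 | 10 | 0.0% | | | |
| 256 | grok-3-mini_Continuation | 542 | 10 | 0.0% | | | |
| 257 | mistral-medium-3.1_Reasoning | 541 | 17 | 0.0% | | | |
| 258 | qwen3-vl-32b-instruct_Reasoning | 541 | 7 | 0.0% | | | |
| 259 | qwen3-max_Continuation | 540 | 10 | 0.0% | | | |
| 260 | grok-4-fast-non-reasoning_Continuation | 540 | 12 | 0.0% | | | |
| 261 | claude-3-haiku_Reasoning | 539 | 13 | 0.0% | | | |
| 262 | qwen3-235b-a22b-instruct-2507_Reasoning | 539 | 27 | 0.0% | | | |
| 263 | command-r-08-2024_Reasoning | 538 | 12 | 0.0% | | | |
| 264 | llama-4-scout_Continuation | 537 | 10 | 0.0% | | | |
| 265 | gemma-3-27b-it_Reasoning | 536 | 19 | 0.0% | | | |
| 266 | llama-4-maverick_Continuation | 535 | 13 | 0.0% | | | |
| 267 | claude-3.5-sonnet_Reasoning | 533 | 13 | 0.0% | | | |
| 268 | longcat-flash-chat_Continuation | 533 | 10 | 0.0% | | | |
| 269 | deepseek-v3.2-exp_Continuation | 532 | 6 | 0.0% | | | |
| 270 | mimo-v2-flash_Continuation | 531 | 10 | 0.0% | | | |
| 271 | mistral-large-2-2411_Continuation | 530 | 10 | 0.0% | | | |
| 272 | qwen3-vl-235b-a22b-instruct_Reasoning | 527 | 12 | 0.0% | | | |
| 273 | gemma-2-9b-it_Reasoning | 526 | 19 | 0.0% | | | |
| 274 | llama-3.1-405b-instruct_Continuation | 520 | 10 | 0.0% | | | |
| 275 | claude-3.7-sonnet:thinking_Reasoning | 520 | 2 | 0.0% | | | |
| 276 | qwen3-vl-8b-instruct_Reasoning | 519 | 7 | 0.0% | | | |
| 277 | llama-4-scout_Reasoning | 517 | 26 | 0.0% | | | |
| 278 | glm-z1-32b_Reasoning | 517 | 2 | 0.0% | | | |
| 279 | gemini-1.5-flash ˟_Continuation | 517 | 3 | 0.0% | | | |
| 280 | mistral-large-3-2512_Continuation | 517 | 11 | 0.0% | | | |
| 281 | wizardlm-2-8x22b_Reasoning | 515 | 12 | 0.0% | | | |
| 282 | seed-1.6-flash_Continuation | 513 | 2 | 0.0% | | | |
| 283 | deepseek-prover-v2_Reasoning | 510 | 8 | 0.0% | | | |
| 284 | mistral-nemo_Reasoning | 510 | 15 | 0.0% | | | |
| 285 | jamba-large-1.6_Reasoning | 507 | 6 | 0.0% | | | |
| 286 | devstral-small_Reasoning | 506 | 12 | 0.0% | | | |
| 287 | magistral-small-2506_Continuation | 505 | 2 | 0.0% | | | |
| 288 | llama-3.1-8b-instruct_Reasoning | 503 | 30 | 0.0% | | | |
| 289 | magistral-small-2506_Reasoning | 503 | 11 | 0.0% | | | |
| 290 | llama-3-8b-instruct_Reasoning | 503 | 14 | 0.0% | | | |
| 291 | qwen3.5-plus-02-15_Continuation | 503 | 8 | 0.0% | | | |
| 292 | molmo-2-8b_Reasoning | 502 | 12 | 0.0% | | | |
| 293 | gemma-3-27b-it_Continuation | 499 | 7 | 0.0% | | | |
| 294 | olmo-3.1-32b-instruct_Reasoning | 499 | 12 | 0.0% | | | |
| 295 | command-r-08-2024_Continuation | 495 | 6 | 0.0% | | | |
| 296 | grok-4.1-fast-non-reasoning_Continuation | 495 | 11 | 0.0% | | | |
| 297 | qwen3-coder-480b-a35b_Continuation | 494 | 9 | 0.0% | | | |
| 298 | qwen3-4b_Reasoning | 493 | 5 | 0.0% | | | |
| 299 | llama-3.2-3b-instruct_Reasoning | 491 | 14 | 0.0% | | | |
| 300 | kimi-linear-48b-a3b-instruct_Reasoning | 489 | 11 | 0.0% | | | |
| 301 | jamba-large-1.7_Continuation | 488 | 10 | 0.0% | | | |
| 302 | ministral-8b-2512_Reasoning | 488 | 16 | 0.0% | | | |
| 303 | devstral-small_Continuation | 487 | 2 | 0.0% | | | |
| 304 | qwen3-vl-30b-a3b-thinking_Reasoning | 487 | 3 | 0.0% | | | |
| 305 | qwen-2.5-7b-instruct_Reasoning | 487 | 15 | 0.0% | | | |
| 306 | deepseek-r1-0528-qwen3-8b_Reasoning | 486 | 3 | 0.0% | | | |
| 307 | glm-4-32b_Continuation | 486 | 8 | 0.0% | | | |
| 308 | deepseek-v3.1_Continuation | 483 | 8 | 0.0% | | | |
| 309 | ernie-4.5-300b-a47b_Continuation | 482 | 8 | 0.0% | | | |
| 310 | jamba-large-1.6_Continuation | 480 | 5 | 0.0% | | | |
| 311 | hermes-4-405b_Reasoning | 479 | 2 | 0.0% | | | |
| 312 | qwen2.5-72b-instruct_Continuation | 476 | 11 | 0.0% | | | |
| 313 | deepseek-prover-v2_Continuation | 474 | 4 | 0.0% | | | |
| 314 | mistral-small-3.1-24b-instruct_Reasoning | 473 | 19 | 0.0% | | | |
| 315 | inflection-3-productivity_Continuation | 471 | 1 | 0.0% | | | |
| 316 | tng-r1t-chimera_Reasoning | 471 | 1 | 0.0% | | | |
| 317 | olmo-3.1-32b-think_Reasoning | 471 | 1 | 0.0% | | | |
| 318 | llama-3.3-8b-instruct_Reasoning | 470 | 15 | 0.0% | | | |
| 319 | rnj-1-instruct_Reasoning | 464 | 12 | 0.0% | | | |
| 320 | seed-1.6_Continuation | 462 | 3 | 0.0% | | | |
| 321 | phi-4_Continuation | 461 | 7 | 0.0% | | | |
| 322 | mistral-small-creative_Reasoning | 461 | 13 | 0.0% | | | |
| 323 | gemma-3-4b-it_Reasoning | 460 | 14 | 0.0% | | | |
| 324 | mythomax-l2-13b_Reasoning | 460 | 11 | 0.0% | | | |
| 325 | granite-4.0-h-micro_Reasoning | 459 | 11 | 0.0% | | | |
| 326 | minimax-m1_Continuation | 456 | 1 | 0.0% | | | |
| 327 | qwen2.5-vl-72b-instruct_Reasoning | 454 | 1 | 0.0% | | | |
| 328 | deepseek-r1t-chimera_Reasoning | 453 | 1 | 0.0% | | | |
| 329 | olmo-2-0325-32b-instruct_Reasoning | 453 | 13 | 0.0% | | | |
| 330 | olmo-3-7b-think_Reasoning | 453 | 11 | 0.0% | | | |
| 331 | glm-4.6_Continuation | 446 | 9 | 0.0% | | | |
| 332 | glm-4.5-air_Continuation | 443 | 8 | 0.0% | | | |
| 333 | lfm2-8b-a1b_Continuation | 443 | 1 | 0.0% | | | |
| 334 | afm-4.5b_Reasoning | 442 | 14 | 0.0% | | | |
| 335 | ui-tars-1.5-7b_Continuation | 442 | 1 | 0.0% | | | |
| 336 | minimax-m2-her_Reasoning | 441 | 1 | 0.0% | | | |
| 337 | qwen3-coder-plus_Continuation | 440 | 5 | 0.0% | | | |
| 338 | ministral-3b_Reasoning | 439 | 17 | 0.0% | | | |
| 339 | llama-3.1-nemotron-ultra-253b-v1_Continuation | 438 | 2 | 0.0% | | | |
| 340 | seed-oss-36b-instruct_Continuation | 438 | 3 | 0.0% | | | |
| 341 | qwen2.5-plus_Continuation | 437 | 9 | 0.0% | | | |
| 342 | deepseek-r1-0528_Continuation | 437 | 2 | 0.0% | | | |
| 343 | qwen2.5-turbo_Continuation | 437 | 10 | 0.0% | | | |
| 344 | gemma-3-12b-it_Continuation | 435 | 2 | 0.0% | | | |
| 345 | trinity-large-preview_Reasoning | 434 | 1 | 0.0% | | | |
| 346 | ministral-3b-2512_Reasoning | 433 | 11 | 0.0% | | | |
| 347 | llama-3.1-nemotron-70b-instruct_Reasoning | 432 | 2 | 0.0% | | | |
| 348 | command-r7b-12-2024_Reasoning | 431 | 17 | 0.0% | | | |
| 349 | glm-4.7-flash_Continuation | 431 | 1 | 0.0% | | | |
| 350 | jamba-mini-1.6_Reasoning | 430 | 7 | 0.0% | | | |
| 351 | mistral-medium-3_Continuation | 428 | 9 | 0.0% | | | |
| 352 | jamba-mini-1.7_Continuation | 428 | 5 | 0.0% | | | |
| 353 | llama-3.3-nemotron-super-49b-v1.5_Continuation | 428 | 1 | 0.0% | | | |
| 354 | phi-3-medium-128k-instruct_Reasoning | 425 | 12 | 0.0% | | | |
| 355 | lfm-3b ˟_Continuation | 424 | 1 | 0.0% | | | |
| 356 | glm-4.5_Continuation | 423 | 9 | 0.0% | | | |
| 357 | mistral-7b-instruct-v0.1_Reasoning | 422 | 12 | 0.0% | | | |
| 358 | olmo-3-7b-instruct_Reasoning | 422 | 17 | 0.0% | | | |
| 359 | qwen3-coder-next_Continuation | 421 | 5 | 0.0% | | | |
| 360 | lfm-2.2-6b_Reasoning | 417 | 12 | 0.0% | | | |
| 361 | jamba-mini-1.7_Reasoning | 416 | 15 | 0.0% | | | |
| 362 | gemma-3n-e4b-it_Reasoning | 415 | 16 | 0.0% | | | |
| 363 | qwen2.5-max_Continuation | 414 | 7 | 0.0% | | | |
| 364 | qwen3-vl-235b-a22b-thinking_Continuation | 414 | 1 | 0.0% | | | |
| 365 | gemini-1.5-flash-8b ˟_Continuation | 413 | 1 | 0.0% | | | |
| 366 | lfm-3b ˟_Reasoning | 412 | 14 | 0.0% | | | |
| 367 | qwen3-30b-a3b-instruct-2507_Continuation | 412 | 6 | 0.0% | | | |
| 368 | glm-4.6v_Continuation | 411 | 10 | 0.0% | | | |
| 369 | gemma-2-9b-it_Continuation | 410 | 1 | 0.0% | | | |
| 370 | qwen3-30b-a3b-thinking-2507_Continuation | 408 | 1 | 0.0% | | | |
| 371 | claude-3-opus ˟_Continuation | 407 | 2 | 0.0% | | | |
| 372 | mistral-medium-3.1_Continuation | 406 | 8 | 0.0% | | | |
| 373 | ui-tars-1.5-7b_Reasoning | 405 | 19 | 0.0% | | | |
| 374 | claude-3.5-haiku_Continuation | 395 | 2 | 0.0% | | | |
| 375 | qwen3-235b-a22b-thinking-2507_Continuation | 392 | 2 | 0.0% | | | |
| 376 | wizardlm-2-8x22b_Continuation | 391 | 2 | 0.0% | | | |
| 377 | ernie-4.5-21b-a3b_Continuation | 391 | 3 | 0.0% | | | |
| 378 | qwq-32b_Continuation | 385 | 1 | 0.0% | | | |
| 379 | claude-3-haiku_Continuation | 384 | 5 | 0.0% | | | |
| 380 | gemma-3n-e4b-it_Continuation | 384 | 2 | 0.0% | | | |
| 381 | kimi-linear-48b-a3b-instruct_Continuation | 384 | 5 | 0.0% | | | |
| 382 | olmo-3-32b-think_Continuation | 384 | 1 | 0.0% | | | |
| 383 | olmo-2-0325-32b-instruct_Continuation | 384 | 2 | 0.0% | | | |
| 384 | qwen3-30b-a3b_Continuation | 379 | 2 | 0.0% | | | |
| 385 | command-a_Continuation | 378 | 6 | 0.0% | | | |
| 386 | command-r-plus-08-2024_Continuation | 378 | 5 | 0.0% | | | |
| 387 | qwen3-vl-32b-instruct_Continuation | 378 | 1 | 0.0% | | | |
| 388 | mistral-nemo_Continuation | 375 | 2 | 0.0% | | | |
| 389 | mistral-small-3.2-24b-instruct_Continuation | 374 | 3 | 0.0% | | | |
| 390 | ministral-14b-2512_Continuation | 372 | 1 | 0.0% | | | |
| 391 | ministral-8b-2512_Continuation | 370 | 1 | 0.0% | | | |
| 392 | deepseek-r1-distill-llama-8b_Reasoning | 367 | 2 | 0.0% | | | |
| 393 | qwen3-235b-a22b-instruct-2507_Continuation | 366 | 11 | 0.0% | | | |
| 394 | llama-3.1-nemotron-70b-instruct_Continuation | 366 | 1 | 0.0% | | | |
| 395 | olmo-3-7b-instruct_Continuation | 365 | 1 | 0.0% | | | |
| 396 | llama-3.1-8b-instruct_Continuation | 364 | 2 | 0.0% | | | |
| 397 | olmo-3-7b-think_Continuation | 363 | 1 | 0.0% | | | |
| 398 | mistral-small-3.1-24b-instruct_Continuation | 361 | 2 | 0.0% | | | |
| 399 | claude-3-sonnet ˟_Continuation | 361 | 1 | 0.0% | | | |
| 400 | olmo-3.1-32b-instruct_Continuation | 356 | 2 | 0.0% | | | |
| 401 | qwen3-32b_Continuation | 351 | 6 | 0.0% | | | |
| 402 | llama-3.1-70b-instruct_Continuation | 339 | 1 | 0.0% | | | |
| 403 | qwen3-vl-235b-a22b-instruct_Continuation | 339 | 1 | 0.0% | | | |
| 404 | deepseek-r1-distill-qwen-7b_Reasoning | 333 | 2 | 0.0% | | | |
| 405 | mistral-small-24b-instruct-2501_Continuation | 329 | 2 | 0.0% | | | |
| 406 | llama-3.3-nemotron-super-49b-v1_Continuation | 324 | 1 | 0.0% | | | |
| 407 | qwen2.5-vl-32b-instruct_Continuation | 316 | 1 | 0.0% | | | |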
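
To help read gaps in the ELO column, here is a minimal sketch of the conventional Elo expected-score formula. It assumes the ratings above behave like standard Elo ratings on a 400-point logistic scale, which the source page does not state. The function name `expected_score` is hypothetical, chosen for illustration only.

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Expected score (win = 1, draw = 0.5, loss = 0) of player A against player B.

    Assumption: conventional Elo model on a 400-point logistic scale.
    The leaderboard's own rating method may differ.
    """
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


# Ratings copied from the table: rank 1 (1845) and the Human entry (1413).
top_model, human = 1845, 1413
print(round(expected_score(top_model, human), 2))  # 0.92
```

Under this assumption, a gap of roughly 430 points corresponds to an expected score of about 0.92 for the higher-rated side. Entries with very few games (several rows list only 1) would carry much less certainty than this arithmetic suggests.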