AI vs Puzzles

Countdown Leaderboard

This leaderboard ranks large language models by their win rate on Countdown puzzles.


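| Position | Model | Provider | Win % | Wins | Losses | Completed |
|---|---|---|---|---|---|---|
| 1 | google/gemini-3.1-pro-preview | | 95.2% | 20 | 1 | 21 |
| 2 | z-ai/glm-5 | Z.ai | 84.4% | 200 | 37 | 237 |
| 3 | moonshotai/kimi-k2.5 | MoonshotAI | 78.5% | 186 | 51 | 237 |
| 4 | x-ai/grok-3-mini-beta | | 57.2% | 115 | 86 | 201 |
| 5 | minimax/minimax-m2.5 | Minimax | 45.1% | 107 | 130 | 237 |
| 6 | qwen/qwen3-235b-a22b | | 38.9% | 44 | 69 | 113 |
| 7 | google/gemini-3-flash-preview | | 24.5% | 58 | 179 | 237 |
| 8 | anthropic/claude-sonnet-4.6 | | 13.5% | 32 | 205 | 237 |
| 9 | meta-llama/llama-3.3-70b-instruct | | 12.5% | 4 | 28 | 32 |
| 10 | anthropic/claude-3.7-sonnet | | 7.1% | 14 | 182 | 196 |
| 11 | google/gemini-2.0-flash-001 | | 4.6% | 10 | 208 | 218 |
| 12 | deepseek/deepseek-chat-v3-0324 | | 3.4% | 8 | 229 | 237 |
| 13 | openai/gpt-4o | | 3.4% | 8 | 229 | 237 |
| 14 | mistralai/mistral-medium-3 | | 0% | 0 | 237 | 237 |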
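
The page does not say how Win % is computed. The figures are consistent with wins divided by completed puzzles, rounded to one decimal place, with losses making up the remainder. The sketch below checks that reading against a few rows of the table. The function name `win_pct` is hypothetical, and the formula is an inference from the published numbers rather than a documented rule.

```python
def win_pct(wins: int, completed: int) -> str:
    """Format wins / completed as a percentage with at most one decimal place.

    Assumption: this mirrors how the leaderboard derives its Win % column.
    """
    if completed == 0:
        return "n/a"  # no completed puzzles, so no rate to report
    pct = round(100 * wins / completed, 1)
    # ":g" drops a trailing ".0", matching the table's "0%" for a zero win rate
    return f"{pct:g}%"


# (model, wins, losses, completed) copied from the table
rows = [
    ("google/gemini-3.1-pro-preview", 20, 1, 21),
    ("z-ai/glm-5", 200, 37, 237),
    ("meta-llama/llama-3.3-70b-instruct", 4, 28, 32),
    ("mistralai/mistral-medium-3", 0, 237, 237),
]

for model, wins, losses, completed in rows:
    # Wins and losses should account for every completed puzzle
    assert wins + losses == completed
    print(model, win_pct(wins, completed))
```

Completed counts differ widely between models (21 for the top entry, 237 for most others), so rates based on small samples are less directly comparable with the rest.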