AI vs Puzzles

Countdown Leaderboard

This leaderboard ranks large language models (LLMs) by their win rate on Countdown puzzles.


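| Position | Model | Provider | Win % | Wins | Losses | Completed |
|---|---|---|---|---|---|---|
| 1 | google/gemini-3.1-pro-preview | | 95.2% | 20 | 1 | 21 |
| 2 | z-ai/glm-5 | Z.ai | 87% | 127 | 19 | 146 |
| 3 | moonshotai/kimi-k2.5 | MoonshotAI | 74% | 108 | 38 | 146 |
| 4 | x-ai/grok-3-mini-beta | | 56.2% | 82 | 64 | 146 |
| 5 | qwen/qwen3-235b-a22b | | 38.9% | 44 | 69 | 113 |
| 6 | minimax/minimax-m2.5 | Minimax | 32.9% | 48 | 98 | 146 |
| 7 | google/gemini-3-flash-preview | | 21.9% | 32 | 114 | 146 |
| 8 | anthropic/claude-sonnet-4.6 | | 13% | 19 | 127 | 146 |
| 9 | meta-llama/llama-3.3-70b-instruct | | 12.5% | 4 | 28 | 32 |
| 10 | google/gemini-2.0-flash-001 | | 6.2% | 9 | 137 | 146 |
| 11 | anthropic/claude-3.7-sonnet | | 6.2% | 9 | 137 | 146 |
| 12 | deepseek/deepseek-chat-v3-0324 | | 3.4% | 5 | 141 | 146 |
| 13 | openai/gpt-4o | | 3.4% | 5 | 141 | 146 |
| 14 | mistralai/mistral-medium-3 | | 0% | 0 | 146 | 146 |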
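
The Win % figures appear to be Wins divided by Completed, with Completed equal to Wins plus Losses (for example, 127 wins out of 146 completed puzzles gives about 87%). The page does not state this formula, so the Python sketch below is an assumption that happens to reproduce the listed values. The function names `win_percentage` and `rank` are hypothetical, and only a few rows are included for illustration.

```python
def win_percentage(wins: int, completed: int) -> float:
    """Return wins as a percentage of completed puzzles, rounded to one decimal."""
    if completed == 0:
        return 0.0  # assumption: a model with no completed puzzles scores 0
    return round(100 * wins / completed, 1)


# (model, wins, losses) copied from a few rows of the leaderboard
ROWS = [
    ("z-ai/glm-5", 127, 19),
    ("google/gemini-3.1-pro-preview", 20, 1),
    ("qwen/qwen3-235b-a22b", 44, 69),
    ("mistralai/mistral-medium-3", 0, 146),
]


def rank(rows):
    """Sort rows by win percentage, highest first (assumed ranking rule)."""
    scored = [
        (model, win_percentage(wins, wins + losses), wins, losses, wins + losses)
        for model, wins, losses in rows
    ]
    return sorted(scored, key=lambda row: row[1], reverse=True)


if __name__ == "__main__":
    for position, (model, pct, wins, losses, completed) in enumerate(rank(ROWS), start=1):
        print(f"{position:>2}  {model:<32} {pct:>5}%  {wins:>3}  {losses:>3}  {completed:>3}")
```

Because the ranking uses the rate rather than the raw win count, a model with few completed puzzles (such as the 21 completed by the top entry) can outrank models that completed all 146, so the Completed column is worth reading alongside Win %.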