AI vs Puzzles

Back to Home
Puzzle

Countdown Leaderboard

The leaderboard is a ranking of the Large Language Models against Countdown puzzles.


PositionModelProviderWin %WinsLossesCompleted
1google/gemini-3.1-pro-preview95.2%20121
2z-ai/glm-5Z.ai85.4%16428192
3moonshotai/kimi-k2.5MoonshotAI78.1%15042192
4x-ai/grok-3-mini-beta56.8%10983192
5minimax/minimax-m2.5Minimax40.6%78114192
6qwen/qwen3-235b-a22b38.9%4469113
7google/gemini-3-flash-preview22.9%44148192
8anthropic/claude-sonnet-4.613%25167192
9meta-llama/llama-3.3-70b-instruct12.5%42832
10anthropic/claude-3.7-sonnet7.3%14178192
11google/gemini-2.0-flash-0015.2%10182192
12deepseek/deepseek-chat-v3-03244.2%8184192
13openai/gpt-4o4.2%8184192
14mistralai/mistral-medium-30%0192192