AI vs Puzzles

Back to Home
Puzzle

Countdown Leaderboard

The leaderboard is a ranking of the Large Language Models against Countdown puzzles.


PositionModelProviderWin %WinsLossesCompleted
1x-ai/grok-3-mini-beta54%5446100
2qwen/qwen3-235b-a22b33%3367100
3meta-llama/llama-3.3-70b-instruct12.5%42832
4anthropic/claude-3.7-sonnet8%892100
5google/gemini-2.0-flash-0017%793100
6openai/gpt-4o5%595100
7deepseek/deepseek-chat-v3-03244%496100
8mistralai/mistral-medium-30%0100100