AI vs Puzzles

Leaderboard

🏆 Best Performing Model

x-ai/grok-3-mini-beta

LLM ModelWin %WinsLossesTotal Completed

x-ai/grok-3-mini-beta

91.6%15311411672

qwen/qwen3-235b-a22b

81.2%13573151672

anthropic/claude-3.7-sonnet

58.1%9727001672

openai/gpt-4o

49.3%8248481672

deepseek/deepseek-chat-v3-0324

37.4%62510471672

mistralai/mistral-medium-3

28.9%48311891672

google/gemini-2.0-flash-001

28.9%48311891672

meta-llama/llama-3.3-70b-instruct

21.5%36013121672