🏆 Best Performing Model
x-ai/grok-3-mini-beta
| LLM Model | Win % | Wins | Losses | Total Completed |
|---|---|---|---|---|
| x-ai/grok-3-mini-beta | 91.6% | 1489 | 137 | 1626 |
| qwen/qwen3-235b-a22b | 81.7% | 1329 | 297 | 1626 |
| anthropic/claude-3.7-sonnet | 57.9% | 942 | 684 | 1626 |
| openai/gpt-4o | 49.2% | 800 | 826 | 1626 |
| deepseek/deepseek-chat-v3-0324 | 37.6% | 612 | 1014 | 1626 |
| google/gemini-2.0-flash-001 | 29.2% | 474 | 1152 | 1626 |
| mistralai/mistral-medium-3 | 28.7% | 467 | 1159 | 1626 |
| meta-llama/llama-3.3-70b-instruct | 21.7% | 353 | 1273 | 1626 |
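
The Win % column appears to be wins divided by total completed games, rounded to one decimal place. The sketch below recomputes it for a few rows under that assumption; the function name `win_pct` and the `rows` dictionary are illustrative and not part of the original benchmark code.

```python
def win_pct(wins: int, losses: int) -> float:
    """Return wins as a percentage of completed games, to one decimal place.

    Assumes "Total Completed" is simply wins + losses, which holds for
    every row in the table above.
    """
    total = wins + losses
    if total == 0:
        raise ValueError("no completed games")
    return round(100 * wins / total, 1)


# (wins, losses) copied from the table above for three of the models.
rows = {
    "x-ai/grok-3-mini-beta": (1489, 137),
    "openai/gpt-4o": (800, 826),
    "meta-llama/llama-3.3-70b-instruct": (353, 1273),
}

for model, (wins, losses) in rows.items():
    print(f"{model}: {win_pct(wins, losses)}%")
```

Under this assumption the recomputed values match the published column (91.6%, 49.2%, and 21.7% for the three rows shown).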