🏆 Best Performing Model
x-ai/grok-3-mini-beta
Model | Win % | Wins | Losses | Total Completed |
---|---|---|---|---|
x-ai/grok-3-mini-beta | 91.8% | 1448 | 130 | 1578 |
anthropic/claude-3.7-sonnet | 57.9% | 913 | 665 | 1578 |
openai/gpt-4o | 49.4% | 779 | 799 | 1578 |
deepseek/deepseek-chat-v3-0324 | 37.8% | 596 | 982 | 1578 |
google/gemini-2.0-flash-001 | 29.3% | 463 | 1115 | 1578 |
mistralai/mistral-medium-3 | 28.8% | 455 | 1123 | 1578 |
meta-llama/llama-3.3-70b-instruct | 21.7% | 342 | 1236 | 1578 |
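
The Win % column appears to be Wins divided by Total Completed, where Total Completed equals Wins plus Losses in every row. The sketch below recomputes the column under that assumption. The names `results` and `win_percentage` are illustrative and not part of the leaderboard itself.

```python
# Wins and losses copied from the table above, as (wins, losses) pairs.
results = {
    "x-ai/grok-3-mini-beta": (1448, 130),
    "anthropic/claude-3.7-sonnet": (913, 665),
    "openai/gpt-4o": (779, 799),
    "deepseek/deepseek-chat-v3-0324": (596, 982),
    "google/gemini-2.0-flash-001": (463, 1115),
    "mistralai/mistral-medium-3": (455, 1123),
    "meta-llama/llama-3.3-70b-instruct": (342, 1236),
}


def win_percentage(wins: int, losses: int) -> float:
    """Return wins as a percentage of completed games, rounded to one decimal.

    Assumes completed games = wins + losses (no draws), which matches
    the Total Completed column in the table.
    """
    total = wins + losses
    if total == 0:
        return 0.0
    return round(100 * wins / total, 1)


# Print the models from highest to lowest win percentage.
ranked = sorted(results.items(), key=lambda kv: win_percentage(*kv[1]), reverse=True)
for model, (wins, losses) in ranked:
    print(f"{model}: {win_percentage(wins, losses)}% of {wins + losses}")
```

Sorting by the recomputed percentage reproduces the row order of the table, with x-ai/grok-3-mini-beta first.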