AI vs Puzzles

Puzzle #191

Sudoku

Released: 2026-02-06


Models Tested: 8
Solved: 0
LLM Providers: 8

anthropic/claude-3.7-sonnet

anthropic

9 5 4 6 2 3 8 1 7
6 2 3 8 4 7 5 3 9
8 7 6 9 5 1 3 2 4
3 8 2 5 6 9 7 4 1
1 6 7 3 8 2 9 5 4
4 9 5 1 7 4 6 8 6
5 3 8 2 1 6 4 6 3
2 6 1 4 3 5 7 9 8
7 4 9 7 9 8 2 3 1


mistralai/mistral-medium-3

mistral

6 1 2 4 5 3 8 9 7
4 5 3 8 7 6 1 2 9
8 7 9 9 2 1 3 4 5
3 8 2 5 6 9 7 1 4
1 2 7 3 8 2 6 5 4
5 9 4 1 3 7 2 8 6
9 3 6 2 1 8 4 6 5
2 6 1 7 4 5 9 3 8
7 4 5 6 9 3 2 8 1


google/gemini-2.0-flash-001

google

2 1 6 4 5 3 8 9 7
4 7 3 8 2 6 1 5 9
5 8 9 7 4 1 6 3 2
6 3 2 5 8 9 7 1 4
1 5 7 6 3 2 9 8 4
8 9 4 1 7 5 3 2 6
9 2 5 3 1 7 4 6 3
3 6 1 2 9 4 5 7 8
7 4 8 9 6 5 2 3 1


meta-llama/llama-3.3-70b-instruct

meta

5 1 9 7 3 3 8 2 7
2 8 3 8 5 6 1 4 9
4 6 1 9 8 1 5 3 2
3 5 2 5 9 9 6 7 1
1 4 7 2 6 2 3 9 4
9 3 8 1 4 5 7 6 6
7 2 6 4 1 8 9 5 3
8 9 5 3 7 4 2 1 8
6 7 4 6 2 7 1 8 5


openai/gpt-4o

openai

9 6 1 4 5 3 8 2 7
4 8 3 8 2 7 5 1 9
5 7 2 9 6 1 3 8 4
2 4 9 5 3 9 6 7 3
1 3 7 6 8 2 9 5 4
8 9 5 1 7 4 7 3 6
3 5 8 7 1 6 4 6 2
6 2 1 3 4 5 1 9 8
7 4 6 2 9 8 2 4 1


deepseek/deepseek-chat-v3-0324

deepseek

2 5 9 6 4 3 8 1 7
4 7 3 8 2 5 6 1 9
6 8 1 9 7 2 5 4 3
3 6 2 5 8 9 1 7 4
1 8 7 3 6 2 9 5 4
5 9 4 1 7 8 3 2 6
8 3 5 7 1 4 9 6 2
9 2 1 4 5 6 7 3 8
7 4 6 2 9 8 5 3 1


x-ai/grok-3-mini-beta

x-ai

4 1 6 2 9 3 8 5 7
2 5 3 8 1 7 6 4 9
8 7 9 4 6 5 3 1 2
6 3 2 5 7 9 4 8 1
1 6 7 3 8 2 5 9 4
5 9 8 1 4 6 7 3 2
9 2 1 7 3 8 4 6 5
6 3 4 9 5 1 2 7 8
7 8 5 4 2 6 9 3 1


qwen/qwen3-235b-a22b

qwen

5 1 9 2 6 3 8 4 7
6 7 3 8 5 4 2 1 9
2 8 4 9 7 1 5 3 6
8 3 2 5 4 9 7 1 6
1 5 7 6 8 2 9 7 4
4 9 6 1 3 7 5 2 8
9 2 8 7 1 5 4 6 3
3 6 1 4 2 9 7 5 8
7 4 5 3 9 8 1 6 2


Methodology Note

Each model receives the same partially filled Sudoku grid and must fill in every empty cell according to Sudoku rules. Each model gets a single attempt: its solution is evaluated as submitted, with no feedback or retries.
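
The page does not say how submissions are scored, so the following is only a minimal sketch of what such a check might look like, assuming a grid is a list of nine rows of nine integers. The function names `is_valid_solution` and `respects_givens`, and the convention that 0 marks an empty cell in the original puzzle, are assumptions made for illustration and are not taken from the site.

```python
def is_valid_solution(grid):
    """Return True if grid is a complete 9x9 Sudoku with no rule violations."""
    expected = set(range(1, 10))

    # Reject anything that is not a 9x9 grid.
    if len(grid) != 9 or any(len(row) != 9 for row in grid):
        return False

    # Collect the digits in each row, column, and 3x3 box.
    rows = [set(row) for row in grid]
    cols = [set(grid[r][c] for r in range(9)) for c in range(9)]
    boxes = [
        set(grid[r][c] for r in range(br, br + 3) for c in range(bc, bc + 3))
        for br in (0, 3, 6)
        for bc in (0, 3, 6)
    ]

    # Every unit must contain each digit 1-9 exactly once.
    return all(unit == expected for unit in rows + cols + boxes)


def respects_givens(puzzle, solution):
    """Return True if every pre-filled cell of puzzle is unchanged in solution.

    Assumption: empty cells in puzzle are encoded as 0.
    """
    return all(
        puzzle[r][c] in (0, solution[r][c])
        for r in range(9)
        for c in range(9)
    )
```

Under this sketch, a submission counts as solved only if both checks pass. A submitted row such as 6 2 3 8 4 7 5 3 9, which repeats the digit 3, is enough to make `is_valid_solution` return False for the whole grid.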
