AI vs Puzzles

Puzzle #509

Sudoku

Released: 2026-05-23


6 Models Tested
0 Solved
5 LLM Providers

google/gemini-2.0-flash-001

google

Difficulty: hard

7 9 6 5 8 4 1 2 3
2 4 3 1 7 6 5 9 8
1 8 5 3 9 2 7 6 4
4 2 8 7 3 5 9 1 6
9 5 7 6 8 1 3 4 2
6 3 1 2 4 9 8 7 5
5 7 9 8 6 2 4 1 3
8 6 4 9 5 3 2 7 1
3 1 2 4 7 8 6 5 9


google/gemini-3-flash-preview

google

Difficulty: hard

7 9 6 5 8 4 1 2 3
2 4 3 1 7 6 5 9 8
1 8 5 3 2 9 7 6 4
3 1 2 8 9 5 4 7 6
9 5 4 7 6 1 8 3 2
8 6 7 2 4 3 9 5 1
5 3 9 6 7 2 8 1 4
6 2 1 4 5 3 0 0 7
4 7 8 9 1 8 6 2 5


openai/gpt-4o

openai

Difficulty: hard

7 5 6 9 8 1 2 4 3
2 4 3 6 7 0 5 9 8
1 9 5 3 0 2 7 6 4
4 7 2 8 3 5 1 0 6
9 6 8 1 0 7 3 5 2
3 1 0 5 4 6 9 8 7
5 8 9 7 6 4 0 1 0
6 3 0 2 5 0 8 0 7
0 2 7 0 1 8 0 3 0


mistralai/mistral-medium-3

mistral

Difficulty: hard

7 6 8 4 8 2 4 5 3
2 4 3 1 7 6 5 9 8
1 9 5 3 5 4 7 6 2
8 3 2 7 9 5 1 4 6
9 5 6 8 3 1 8 3 2
4 7 1 2 4 9 3 8 5
5 2 9 6 2 7 6 1 4
6 1 4 5 6 3 2 8 7
3 8 7 9 1 8 9 2 1


meta-llama/llama-3.3-70b-instruct

meta

Difficulty: hard

7 3 8 1 8 9 2 4 3
2 4 6 3 7 5 5 9 1
1 9 5 4 6 8 7 3 2
8 1 4 9 3 5 6 2 6
9 6 3 7 2 4 1 8 2
5 2 1 8 4 6 3 7 9
5 8 9 2 1 3 4 1 6
4 5 7 6 9 3 8 2 7
3 7 2 5 1 8 9 6 4


deepseek/deepseek-chat-v3-0324

deepseek

Difficulty: hard

7 5 6 9 8 2 4 1 3
2 4 3 1 7 6 5 9 8
1 9 5 3 4 6 7 2 8
3 1 2 7 9 5 8 4 6
9 6 7 8 3 1 4 5 2
8 3 4 2 6 9 1 7 5
5 8 9 6 2 4 3 1 7
4 2 1 5 3 7 6 8 9
6 7 8 4 1 3 2 5 9


Methodology Note

Each model receives the same partially filled Sudoku grid and must fill in every empty cell according to Sudoku rules. Each solution is evaluated as submitted, with no feedback and no retries.
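The check implied by this note can be sketched in a few lines of Python. This is only an illustration of standard Sudoku validation, not the site's actual grader: the function names (`is_complete_and_valid`, `preserves_clues`, `is_solved`) are hypothetical, and the use of `0` to mark an empty cell in the puzzle is an assumption.

```python
from typing import List

Grid = List[List[int]]
DIGITS = set(range(1, 10))


def is_complete_and_valid(grid: Grid) -> bool:
    """True if every row, column, and 3x3 box holds the digits 1-9 exactly once."""
    if len(grid) != 9 or any(len(row) != 9 for row in grid):
        return False
    rows = [set(row) for row in grid]
    cols = [set(grid[r][c] for r in range(9)) for c in range(9)]
    boxes = [
        set(grid[r][c] for r in range(br, br + 3) for c in range(bc, bc + 3))
        for br in (0, 3, 6)
        for bc in (0, 3, 6)
    ]
    # A 9-cell unit whose set equals {1..9} has no repeats and no blanks.
    return all(unit == DIGITS for unit in rows + cols + boxes)


def preserves_clues(puzzle: Grid, answer: Grid) -> bool:
    """True if the answer keeps every given clue (assumption: 0 marks an empty cell)."""
    return all(
        puzzle[r][c] in (0, answer[r][c])
        for r in range(9)
        for c in range(9)
    )


def is_solved(puzzle: Grid, answer: Grid) -> bool:
    """Single-shot check: the answer either passes both tests or it fails."""
    return is_complete_and_valid(answer) and preserves_clues(puzzle, answer)


# Illustrative data only: a known-valid grid built from a shifting pattern,
# with one row blanked out to act as the puzzle.
solution = [[(3 * (r % 3) + r // 3 + c) % 9 + 1 for c in range(9)] for r in range(9)]
puzzle = [row[:] for row in solution]
puzzle[0] = [0] * 9
```

Under these assumptions, `is_solved(puzzle, solution)` accepts the grid, while an answer that repeats a digit in any row, column, or box, leaves a cell as `0`, or overwrites a given clue is rejected.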
