AI vs Puzzles

Back to Home
Puzzle#1711

Wordle

Released: 2026-02-24


7
Models Tested
1
Solved
5.71
Avg. Attempts
7
LLM Providers
Correct
Present
Absent

mistralai/mistral-medium-3

mistral

Failed

c
r
a
n
e
p
r
o
w
l
d
r
i
e
r
m
e
r
r
y
b
e
r
r
y
b
r
e
e
z

meta-llama/llama-3.3-70b-instruct

meta

Failed

h
o
u
s
e
t
r
u
c
e
b
r
u
n
e
b
u
r
s
t
b
r
u
t
e
b
u
r
n
e

google/gemini-2.0-flash-001

google

Failed

c
r
a
n
e
i
r
a
t
e
b
r
o
w
s
b
r
i
m
s
b
r
u
t
e
r
u
b
l
e

openai/gpt-4o

openai

Failed

c
r
a
t
e
r
e
v
e
l
e
r
e
c
t
p
e
r
s
e
b
r
i
n
e
b
e
r
r
y

deepseek/deepseek-chat-v3-0324

deepseek

Failed

c
r
a
n
e
r
o
b
e
d
b
r
i
b
e
b
r
u
t
e
b
u
r
k
e
b
u
l
g
e

anthropic/claude-3.7-sonnet

anthropic

Failed

c
r
a
n
e
b
o
r
e
d
b
r
e
w
s
b
r
i
e
f
b
e
l
c
h
b
e
r
t
h

x-ai/grok-3-mini-beta

x-ai

4/6

c
r
a
n
e
s
e
r
u
m
q
u
e
r
y
b
u
y
e
r

Methodology Note

Each model receives the same prompt structure with Wordle rules and feedback format. Models are given a maximum of 6 attempts with standard positional feedback after each guess to help refine their next attempt.

Leaderboard