Overall
Combined rankings of AI models across all benchmark categories. Scores are Arena Elo ratings from LMSYS Chatbot Arena.
| Rank | Model | Model ID | Score |
|---|---|---|---|
| 1 | Claude Opus 4.6 | anthropic/claude-opus-4-6-thinking | 1499.0 |
| 2 | Claude Opus 4.7 | anthropic/claude-opus-4-7-thinking | 1486.0 |
| 3 | Gemini 3.5 Flash | google/gemini-3.5-flash | 1482.0 |
| 4 | Gemini 3.1 Pro | google/gemini-3.1-pro-preview | 1481.0 |
| 5 | Gemini 3 Pro | google/gemini-3-pro | 1480.0 |
| 6 | Qwen 3.7 Max | alibaba/qwen3.7-max-preview | 1475.0 |
| 7 | Muse Spark | meta-llama/muse-spark | 1474.0 |
| 8 | GPT-5.4 | openai/gpt-5.4-high | 1472.0 |
| 9 | Qwen 3.5 Max | alibaba/qwen3.5-max-preview | 1471.0 |
| 10 | Ernie 5.1 | baidu/ernie-5.1 | 1470.0 |
| 11 | GLM 5.1 | zai/glm-5.1 | 1469.0 |
| 12 | GPT-5.5 | openai/gpt-5.5-high | 1469.0 |
| 13 | Gemini 3 Flash | google/gemini-3-flash | 1466.0 |
| 14 | Xiaomi: MiMo V2.5 Pro | xiaomi/mimo-v2.5-pro | 1461.0 |
| 15 | Gemini 2.5 Pro | google/gemini-2.5-pro | 1457.0 |
| 16 | kimi k2.6 | moonshot/kimi-k2.6 | 1456.0 |
| 17 | Claude Sonnet 4.6 | anthropic/claude-sonnet-4-6 | 1454.0 |
| 18 | Grok 4.20 | xai/grok-4.20-beta-0309-reasoning | 1454.0 |
| 19 | Grok 4.20 | xai/grok-4.20-multi-agent-beta-0309 | 1451.0 |
| 20 | Claude Opus 4.5 | anthropic/claude-opus-4-5-20251101 | 1449.0 |
| 21 | dola seed 2.0 pro | bytedance/dola-seed-2.0-pro | 1449.0 |
| 22 | amazon nova chat 26 02 10 | amazon/amazon-nova-experimental-chat-26-02-10 | 1448.0 |
| 23 | DeepSeek V4 Pro | deepseek/deepseek-v4-pro-thinking | 1446.0 |
| 24 | gemini 3 flash | google/gemini-3-flash (thinking-minimal) | 1446.0 |
| 25 | ernie 5.0 0110 | baidu/ernie-5.0-0110 | 1446.0 |
| 26 | Grok 4.20 | xai/grok-4.20-beta1 | 1446.0 |
| 27 | glm 5 | zai/glm-5 | 1445.0 |
| 28 | kimi k2.5 | moonshot/kimi-k2.5-thinking | 1445.0 |
| 29 | qwen3.6 max | alibaba/qwen3.6-max-preview | 1444.0 |
| 30 | Gemma 4 31B | google/gemma-4-31b | 1442.0 |
312 models tested