Here’s an updated AI Large Language Model Ranking (2026) based on the most recent benchmark and leaderboard data available as of late February 2026. This ranking synthesizes multiple independent sources, including LMArena community Elo scores, MMLU/GPQA accuracy metrics, and overall performance evaluations, to give a rounded view of how leading models stack up in performance, accuracy, and popularity/usage.

| Rank | Model | Performance (0–10) | Accuracy (0–10) | Popularity (0–10) |
|---|---|---|---|---|
| 1 | GPT-5.2 Pro (OpenAI) | 9.8 | 9.7 | 10 |
| 2 | Gemini 3 Pro (Google) | 9.6 | 9.5 | 9.3 |
| 3 | Claude Opus 4.6 (Anthropic) | 9.5 | 9.4 | 9.2 |
| 4 | Claude Sonnet 4.6 (Anthropic) | 9.3 | 9.2 | 9.0 |
| 5 | GPT-5.2 (OpenAI) | 9.2 | 9.1 | 9.0 |
| 6 | Grok-4.1 (xAI) | 8.9 | 8.8 | 8.6 |
| 7 | DeepSeek V3.1 (DeepSeek) | 8.6 | 8.5 | 8.3 |
| 8 | Llama 4-Scout (Meta) | 8.3 | 8.2 | 8.0 |
| 9 | Gemini 3 Flash (Google) | 8.0 | 7.9 | 7.8 |
| 10 | Qwen3-Max (Alibaba) | 7.8 | 7.7 | 7.5 |

List created and curated by ChatGPT.
Last updated February 25, 2026.
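As an illustration of how a composite ranking like this can be assembled from sub-scores, here is a minimal sketch that ranks models by the unweighted mean of the three columns. The equal weighting and the abbreviated `models` data are assumptions for illustration; the source does not state how its overall ordering was derived.

```python
# Hypothetical sketch: rank models by the equal-weight mean of their
# (performance, accuracy, popularity) sub-scores on a 0-10 scale.
# The equal weighting is an assumption, not the source's actual method.

models = {
    "GPT-5.2 Pro": (9.8, 9.7, 10.0),
    "Gemini 3 Pro": (9.6, 9.5, 9.3),
    "Claude Opus 4.6": (9.5, 9.4, 9.2),
}

def composite(scores):
    """Return the unweighted mean of the sub-score tuple."""
    return sum(scores) / len(scores)

# Sort models by composite score, highest first.
ranked = sorted(models.items(), key=lambda kv: composite(kv[1]), reverse=True)

for rank, (name, scores) in enumerate(ranked, start=1):
    print(f"{rank}. {name}: {composite(scores):.2f}")
```

With these three entries the ordering matches the table above; a real aggregation would likely weight benchmark accuracy and community Elo differently, which is exactly why leaderboards should document their weighting.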