LLM Leaderboard
Frontier LLMs ranked
Industry-standard rankings by Intelligence, Coding, and Math indices. Sourced from Artificial Analysis. Click into any model on Opper for full benchmarks, hosting routes, and privacy posture.
# | Model | Intelligence | Coding | Math | Speed | On Opper |
|---|---|---|---|---|---|---|
| 1 | GPT-5.5 (xhigh)OpenAI | 60.2 | 59.1 | — | 69 tok/s | View → |
| 2 | GPT-5.5 (high)OpenAI | 58.9 | 58.5 | — | 65 tok/s | View → |
| 3 | 57.3 | 52.5 | — | 54 tok/s | View → | |
| 4 | Gemini 3.1 Pro PreviewGoogle | 57.2 | 55.5 | — | 126 tok/s | View → |
| 5 | GPT-5.4 (xhigh)OpenAI | 56.8 | 57.3 | — | 78 tok/s | View → |
| 6 | GPT-5.5 (medium)OpenAI | 56.7 | 56.2 | — | 67 tok/s | View → |
| 7 | Kimi K2.6Moonshot | 53.9 | 47.1 | — | 38 tok/s | View → |
| 9 | GPT-5.3 Codex (xhigh)OpenAI | 53.6 | 53.1 | — | 77 tok/s | View → |
| 10 | Grok 4.3xAI | 53.2 | 41.0 | — | 80 tok/s | View → |
| 11 | 53.0 | 48.1 | — | 50 tok/s | View → | |
| 13 | 51.8 | 53.1 | — | 49 tok/s | View → | |
| 15 | 51.7 | 50.9 | — | 69 tok/s | View → | |
| 16 | 51.5 | 47.5 | — | 35 tok/s | View → | |
| 17 | GLM-5.1 (Reasoning)Zhipu | 51.4 | 43.4 | — | 52 tok/s | View → |
| 18 | GPT-5.2 (xhigh)OpenAI | 51.3 | 48.7 | 99.0 | 71 tok/s | View → |
| 19 | GPT-5.5 (low)OpenAI | 50.8 | 52.1 | — | 64 tok/s | View → |
| 21 | 49.8 | 43.3 | — | 34 tok/s | View → | |
| 22 | GLM-5 (Reasoning)Zhipu | 49.8 | 44.2 | — | 68 tok/s | View → |
| 23 | Claude Opus 4.5 (Reasoning)Anthropic | 49.7 | 47.8 | 91.3 | 56 tok/s | View → |
| 24 | MiniMax-M2.7MiniMax | 49.6 | 41.9 | — | 47 tok/s | View → |
| 28 | GPT-5.2 Codex (xhigh)OpenAI | 49.0 | 43.0 | — | 94 tok/s | View → |
| 29 | GPT-5.4 mini (xhigh)OpenAI | 48.9 | 51.5 | — | 167 tok/s | View → |
| 32 | GPT-5.4 (low)OpenAI | 47.9 | 45.6 | — | 62 tok/s | View → |
| 33 | GPT-5.1 (high)OpenAI | 47.7 | 44.7 | 94.0 | 138 tok/s | View → |
| 34 | GLM-5-TurboZhipu | 46.8 | 36.8 | — | — | View → |
| 35 | Kimi K2.5 (Reasoning)Moonshot | 46.8 | 39.5 | — | 52 tok/s | View → |
| 36 | GPT-5.2 (medium)OpenAI | 46.6 | 44.2 | 96.7 | — | View → |
| 37 | 46.5 | 38.7 | — | 72 tok/s | View → | |
| 38 | 46.5 | 47.6 | — | 46 tok/s | View → | |
| 39 | 46.4 | 42.6 | 97.0 | 198 tok/s | View → | |
| 40 | Qwen3.6 27B (Reasoning)Alibaba | 45.8 | 36.5 | — | 64 tok/s | View → |
| 41 | 45.0 | 41.3 | — | 52 tok/s | View → | |
| 42 | 44.9 | 39.8 | — | — | View → | |
| 44 | GPT-5 (high)OpenAI | 44.6 | 36.0 | 94.3 | 78 tok/s | View → |
| 45 | GPT-5 Codex (high)OpenAI | 44.6 | 38.9 | 98.7 | 170 tok/s | View → |
| 46 | 44.4 | 46.4 | — | 54 tok/s | View → | |
| 47 | GPT-5.4 nano (xhigh)OpenAI | 44.0 | 43.9 | — | 158 tok/s | View → |
| 49 | 43.8 | 35.8 | — | 43 tok/s | View → | |
| 50 | Qwen3.6 35B A3B (Reasoning)Alibaba | 43.5 | 35.1 | — | 189 tok/s | View → |
| 52 | GPT-5.1 Codex (high)OpenAI | 43.1 | 36.6 | 95.7 | 174 tok/s | View → |
| 53 | Claude Opus 4.5 (Non-reasoning)Anthropic | 43.1 | 42.9 | 62.7 | 50 tok/s | View → |
| 54 | Kimi K2.6 (Non-reasoning)Moonshot | 43.0 | 38.4 | — | 36 tok/s | View → |
| 55 | Claude 4.5 Sonnet (Reasoning)Anthropic | 43.0 | 38.6 | 88.0 | 49 tok/s | View → |
| 56 | 42.9 | 36.2 | — | — | View → | |
| 57 | 42.6 | 43.0 | — | 56 tok/s | View → | |
| 58 | GLM-4.7 (Reasoning)Zhipu | 42.1 | 36.3 | 95.0 | 89 tok/s | View → |
| 59 | Qwen3.5 27B (Reasoning)Alibaba | 42.1 | 34.9 | — | 90 tok/s | View → |
| 60 | GPT-5 (medium)OpenAI | 42.0 | 39.0 | 91.7 | 68 tok/s | View → |
| 61 | Claude 4.1 Opus (Reasoning)Anthropic | 42.0 | 36.5 | 80.3 | 36 tok/s | View → |
| 63 | MiniMax-M2.5MiniMax | 41.9 | 37.4 | — | 67 tok/s | View → |
| 64 | DeepSeek V3.2 (Reasoning)DeepSeek | 41.7 | 36.7 | 92.0 | — | View → |
| 65 | 41.6 | 34.7 | — | 159 tok/s | View → | |
| 67 | Grok 4xAI | 41.5 | 40.5 | 92.7 | 42 tok/s | View → |
| 69 | GPT-5 mini (high)OpenAI | 41.2 | 35.3 | 90.7 | 66 tok/s | View → |
| 70 | GPT-5.5 (Non-reasoning)OpenAI | 40.9 | 48.6 | — | 62 tok/s | View → |
| 71 | Kimi K2 ThinkingMoonshot | 40.9 | 34.8 | 94.7 | 128 tok/s | View → |
| 72 | o3-proOpenAI | 40.7 | — | — | 22 tok/s | View → |
| 73 | 40.6 | 39.0 | — | 62 tok/s | View → | |
| 74 | 40.1 | 37.4 | — | 53 tok/s | View → | |
| 75 | Qwen3 Max ThinkingAlibaba | 39.9 | 30.5 | — | 41 tok/s | View → |
| 76 | MiniMax-M2.1MiniMax | 39.4 | 32.8 | 82.7 | 78 tok/s | View → |
| 77 | DeepSeek V4 Pro (Non-reasoning)DeepSeek | 39.3 | 38.4 | — | 34 tok/s | View → |
| 78 | Gemma 4 31B (Reasoning)Google | 39.2 | 38.7 | — | 35 tok/s | View → |
| 80 | GPT-5 (low)OpenAI | 39.2 | 30.7 | 83.0 | 69 tok/s | View → |
| 82 | Claude 4 Opus (Reasoning)Anthropic | 39.0 | 34.0 | 73.3 | 36 tok/s | View → |
| 83 | GPT-5 mini (medium)OpenAI | 38.9 | 32.9 | 85.0 | 70 tok/s | View → |
| 84 | Claude 4 Sonnet (Reasoning)Anthropic | 38.7 | 34.1 | 74.3 | 50 tok/s | View → |
| 85 | 38.6 | 30.9 | 89.3 | 101 tok/s | View → | |
| 87 | 38.6 | 36.4 | 91.7 | 200 tok/s | View → | |
| 89 | o3OpenAI | 38.4 | 38.4 | 88.3 | 87 tok/s | View → |
| 90 | GPT-5.4 nano (medium)OpenAI | 38.1 | 35.0 | — | 156 tok/s | View → |
| 92 | GPT-5.4 mini (medium)OpenAI | 37.7 | 37.5 | — | 174 tok/s | View → |
| 93 | Kimi K2.5 (Non-reasoning)Moonshot | 37.3 | 25.8 | — | 48 tok/s | View → |
| 94 | Qwen3.5 27B (Non-reasoning)Alibaba | 37.2 | 33.4 | — | 92 tok/s | View → |
| 95 | Claude 4.5 Haiku (Reasoning)Anthropic | 37.1 | 32.6 | 83.7 | 125 tok/s | View → |
| 96 | Qwen3.6 27B (Non-reasoning)Alibaba | 37.1 | 26.6 | — | 64 tok/s | View → |
| 97 | Claude 4.5 Sonnet (Non-reasoning)Anthropic | 37.1 | 33.5 | 37.0 | 50 tok/s | View → |
| 98 | Qwen3.5 35B A3B (Reasoning)Alibaba | 37.1 | 30.3 | — | 121 tok/s | View → |
| 99 | 36.5 | 35.1 | — | 71 tok/s | View → | |
| 100 | MiniMax-M2MiniMax | 36.1 | 29.2 | 78.3 | 72 tok/s | View → |
| 101 | 36.0 | 31.2 | — | 201 tok/s | View → | |
| 103 | Claude 4.1 Opus (Non-reasoning)Anthropic | 36.0 | — | — | 37 tok/s | View → |
| 104 | 35.9 | 31.6 | — | 152 tok/s | View → | |
| 107 | GPT-5.4 (Non-reasoning)OpenAI | 35.4 | 41.0 | — | 65 tok/s | View → |
| 109 | 35.0 | 37.8 | 55.7 | 210 tok/s | View → | |
| 111 | Gemini 2.5 ProGoogle | 34.6 | 31.9 | 87.7 | 124 tok/s | View → |
| 113 | 34.2 | 32.0 | 48.0 | 106 tok/s | View → | |
| 114 | 33.9 | 33.7 | 89.7 | — | View → | |
| 117 | GPT-5.2 (Non-reasoning)OpenAI | 33.6 | 34.7 | 51.0 | 64 tok/s | View → |
| 118 | 33.5 | 30.1 | — | 319 tok/s | View → | |
| 120 | gpt-oss-120B (high)OpenAI | 33.3 | 28.6 | 93.4 | 245 tok/s | View → |
| 121 | o4-mini (high)OpenAI | 33.1 | 25.6 | 90.7 | 130 tok/s | View → |
| 122 | Claude 4 Opus (Non-reasoning)Anthropic | 33.0 | — | 36.3 | 36 tok/s | View → |
| 123 | Claude 4 Sonnet (Non-reasoning)Anthropic | 33.0 | 30.6 | 38.0 | 48 tok/s | View → |
| 124 | DeepSeek V3.2 Exp (Reasoning)DeepSeek | 32.9 | 33.3 | 87.7 | — | View → |
| 126 | GLM-4.6 (Reasoning)Zhipu | 32.5 | 29.5 | 86.0 | 30 tok/s | View → |
| 128 | Qwen3.5 9B (Reasoning)Alibaba | 32.4 | 25.3 | — | 63 tok/s | View → |
| 129 | 32.3 | 33.9 | — | — | View → | |
| 130 | 32.1 | 25.2 | 84.7 | 165 tok/s | View → | |
| 132 | DeepSeek V3.2 (Non-reasoning)DeepSeek | 32.1 | 34.6 | 59.0 | — | View → |
| 134 | 31.9 | 27.2 | — | 119 tok/s | View → | |
| 135 | 31.5 | 17.6 | — | 185 tok/s | View → | |
| 136 | Qwen3 MaxAlibaba | 31.4 | 26.4 | 80.7 | 33 tok/s | View → |
| 137 | 31.2 | 22.4 | — | — | View → | |
| 138 | Claude 4.5 Haiku (Non-reasoning)Anthropic | 31.1 | 29.6 | 39.0 | 107 tok/s | View → |
| 139 | 31.1 | 24.6 | 78.3 | — | View → | |
| 140 | 31.0 | 25.1 | — | 64 tok/s | View → | |
| 141 | Kimi K2 0905Moonshot | 30.9 | 25.9 | 57.3 | 19 tok/s | View → |
| 142 | o1OpenAI | 30.8 | 20.5 | — | 97 tok/s | View → |
| 144 | 30.7 | 16.8 | — | 136 tok/s | View → | |
| 146 | 30.3 | 46.7 | — | — | View → | |
| 148 | 30.2 | 30.2 | 44.3 | 29 tok/s | View → | |
| 149 | 30.1 | 25.9 | — | 94 tok/s | View → | |
| 151 | 29.7 | 25.4 | — | 83 tok/s | View → | |
| 152 | 29.5 | — | — | — | View → | |
| 153 | 29.5 | 23.2 | 91.0 | 58 tok/s | View → | |
| 156 | 29.0 | 22.0 | — | 89 tok/s | View → | |
| 158 | 28.5 | 31.9 | 53.7 | — | View → | |
| 160 | 28.4 | 30.0 | 57.7 | — | View → | |
| 161 | Qwen3 Coder NextAlibaba | 28.3 | 22.9 | — | 161 tok/s | View → |
| 163 | DeepSeek V3.1 (Non-reasoning)DeepSeek | 28.1 | 28.4 | 49.7 | — | View → |
| 166 | DeepSeek V3.1 (Reasoning)DeepSeek | 27.7 | 29.7 | 89.7 | — | View → |
| 169 | GPT-5.1 (Non-reasoning)OpenAI | 27.4 | 27.3 | 38.0 | 116 tok/s | View → |
| 170 | Qwen3.5 9B (Non-reasoning)Alibaba | 27.3 | 21.4 | — | — | View → |
| 171 | 27.1 | 29.1 | — | — | View → | |
| 172 | Magistral Medium 1.2Mistral | 27.1 | 21.7 | 82.0 | 43 tok/s | View → |
| 173 | DeepSeek R1 0528 (May '25)DeepSeek | 27.1 | 24.0 | 76.0 | — | View → |
| 174 | Qwen3.5 4B (Reasoning)Alibaba | 27.1 | 17.5 | — | 200 tok/s | View → |
| 175 | 27.0 | 22.2 | 73.3 | 204 tok/s | View → | |
| 176 | GPT-5 nano (high)OpenAI | 26.8 | 20.3 | 83.7 | 124 tok/s | View → |
| 177 | 26.7 | 19.5 | 84.3 | 168 tok/s | View → | |
| 179 | GPT-4.1OpenAI | 26.3 | 21.8 | 34.7 | 101 tok/s | View → |
| 185 | o3-miniOpenAI | 25.9 | 17.9 | — | 126 tok/s | View → |
| 186 | GPT-5 nano (medium)OpenAI | 25.9 | 22.9 | 78.3 | 119 tok/s | View → |
| 187 | o1-proOpenAI | 25.8 | — | — | — | View → |
| 188 | 25.7 | 22.1 | 56.7 | — | View → | |
| 190 | o3-mini (high)OpenAI | 25.2 | 17.3 | — | 127 tok/s | View → |
| 193 | 25.0 | 22.1 | 71.7 | 68 tok/s | View → | |
| 197 | Sonar Reasoning ProPerplexity | 24.6 | — | — | — | View → |
| 198 | gpt-oss-120B (low)OpenAI | 24.5 | 15.5 | 66.7 | 247 tok/s | View → |
| 199 | gpt-oss-20B (high)OpenAI | 24.5 | 18.5 | 89.3 | 282 tok/s | View → |
| 200 | 24.4 | 27.9 | — | 143 tok/s | View → | |
| 201 | MiniMax M1 80kMiniMax | 24.4 | 14.5 | 61.0 | — | View → |
| 202 | 24.3 | 19.0 | 91.0 | 183 tok/s | View → | |
| 203 | 24.3 | — | — | — | View → | |
| 211 | GLM-4.6V (Reasoning)Zhipu | 23.4 | 19.7 | 85.3 | 34 tok/s | View → |
| 212 | 23.3 | 25.3 | — | 160 tok/s | View → | |
| 214 | GLM-4.5-AirZhipu | 23.2 | 23.8 | 80.7 | 69 tok/s | View → |
| 218 | GPT-4.1 miniOpenAI | 22.9 | 18.5 | 46.3 | 78 tok/s | View → |
| 219 | Mistral Large 3Mistral | 22.8 | 22.7 | 38.0 | 56 tok/s | View → |
| 221 | Qwen3.5 4B (Non-reasoning)Alibaba | 22.6 | 13.7 | — | 214 tok/s | View → |
| 223 | DeepSeek V3 0324DeepSeek | 22.3 | 22.0 | 41.0 | — | View → |
| 224 | INTELLECT-3PrimeIntellect | 22.2 | 19.1 | 88.0 | — | View → |
| 225 | 22.1 | 11.0 | — | 101 tok/s | View → | |
| 229 | 21.6 | 18.1 | 68.7 | — | View → | |
| 234 | gpt-oss-20B (low)OpenAI | 20.8 | 14.4 | 62.3 | 230 tok/s | View → |
| 235 | Qwen3 VL 235B A22B InstructAlibaba | 20.8 | 16.5 | 70.7 | 52 tok/s | View → |
| 238 | 20.6 | 17.8 | 60.3 | 185 tok/s | View → | |
| 240 | Qwen3 Next 80B A3B InstructAlibaba | 20.1 | 15.3 | 66.3 | 152 tok/s | View → |
| 243 | Qwen3 Coder 30B A3B InstructAlibaba | 20.0 | 19.4 | 29.0 | 115 tok/s | View → |
| 244 | Qwen3 235B A22B (Reasoning)Alibaba | 19.8 | 17.4 | 82.0 | 68 tok/s | View → |
| 246 | Qwen3 VL 30B A3B (Reasoning)Alibaba | 19.7 | 13.1 | 82.3 | 126 tok/s | View → |
| 249 | 19.4 | 14.5 | 46.7 | — | View → | |
| 265 | GPT-4o (Aug '24)OpenAI | 18.6 | 16.6 | — | 97 tok/s | View → |
| 268 | 18.5 | 13.6 | 21.7 | — | View → | |
| 270 | Magistral Small 1.2Mistral | 18.2 | 14.8 | 80.3 | 113 tok/s | View → |
| 277 | Sonar ReasoningPerplexity | 17.9 | — | — | — | View → |
| 278 | 17.8 | — | — | — | View → | |
| 280 | 17.6 | 9.5 | 53.3 | 288 tok/s | View → | |
| 282 | GPT-4o (Nov '24)OpenAI | 17.3 | 16.7 | 6.0 | 114 tok/s | View → |
| 285 | 17.1 | 11.1 | 26.3 | 35 tok/s | View → | |
| 286 | 17.0 | 14.0 | 23.7 | 65 tok/s | View → | |
| 288 | Magistral Small 1Mistral | 16.8 | 11.1 | 41.3 | — | View → |
| 292 | DeepSeek V3 (Dec '24)DeepSeek | 16.5 | 16.4 | 26.0 | — | View → |
| 293 | Qwen3 32B (Reasoning)Alibaba | 16.5 | 13.8 | 73.0 | 92 tok/s | View → |
| 295 | Qwen3.5 2B (Reasoning)Alibaba | 16.3 | 3.5 | — | — | View → |
| 299 | Qwen3 VL 30B A3B InstructAlibaba | 16.1 | 14.3 | 72.3 | 124 tok/s | View → |
| 300 | Ministral 3 14BMistral | 16.0 | 10.9 | 30.0 | 143 tok/s | View → |
| 310 | Qwen2.5 Instruct 72BAlibaba | 15.6 | 11.9 | 14.0 | 55 tok/s | View → |
| 311 | SonarPerplexity | 15.5 | — | — | — | View → |
| 313 | Qwen3 30B A3B (Reasoning)Alibaba | 15.3 | 11.0 | 72.3 | 69 tok/s | View → |
| 316 | Sonar ProPerplexity | 15.2 | — | — | — | View → |
| 320 | GLM-4.5V (Reasoning)Zhipu | 15.1 | 10.9 | 73.0 | 36 tok/s | View → |
| 325 | 14.9 | 11.8 | 75.0 | 134 tok/s | View → | |
| 327 | Ministral 3 8BMistral | 14.8 | 10.0 | 31.7 | 160 tok/s | View → |
| 328 | 14.8 | 8.3 | 69.7 | 125 tok/s | View → | |
| 331 | Qwen3.5 2B (Non-reasoning)Alibaba | 14.7 | 4.9 | — | 343 tok/s | View → |
| 332 | 14.7 | — | — | — | View → | |
| 334 | 14.5 | 10.7 | 7.7 | 94 tok/s | View → | |
| 335 | GPT-4o (May '24)OpenAI | 14.5 | 24.2 | — | 113 tok/s | View → |
| 337 | Mistral Small 3.1Mistral | 14.5 | 13.9 | 3.7 | 145 tok/s | View → |
| 338 | Qwen3 32B (Non-reasoning)Alibaba | 14.5 | — | 19.7 | 99 tok/s | View → |
| 343 | Qwen3 VL 8B InstructAlibaba | 14.3 | 7.3 | 27.3 | 147 tok/s | View → |
| 357 | Llama 4 ScoutMeta | 13.5 | 6.7 | 14.0 | 135 tok/s | View → |
| 359 | Nova ProAmazon | 13.5 | 11.0 | 7.0 | — | View → |
| 362 | 13.2 | 7.5 | 62.3 | 132 tok/s | View → | |
| 363 | 13.2 | 15.8 | 13.3 | 68 tok/s | View → | |
| 366 | GPT-4.1 nanoOpenAI | 13.0 | 11.2 | 24.0 | 116 tok/s | View → |
| 372 | 12.7 | 7.4 | 35.3 | 264 tok/s | View → | |
| 374 | Nova LiteAmazon | 12.7 | 5.1 | 7.0 | 186 tok/s | View → |
| 375 | 12.7 | 10.8 | 15.3 | 48 tok/s | View → | |
| 377 | GPT-4o miniOpenAI | 12.6 | — | 14.7 | 60 tok/s | View → |
| 381 | 12.5 | 13.3 | 21.7 | 68 tok/s | View → | |
| 385 | Claude 3 HaikuAnthropic | 12.3 | 6.7 | — | — | View → |
| 402 | Ministral 3 3BMistral | 11.2 | 4.8 | 22.0 | 291 tok/s | View → |
| 415 | Qwen3.5 0.8B (Reasoning)Alibaba | 10.5 | 0.0 | — | — | View → |
| 418 | Nova MicroAmazon | 10.3 | 4.1 | 6.0 | 309 tok/s | View → |
| 419 | Gemma 3 27B InstructGoogle | 10.3 | 9.6 | 20.7 | 36 tok/s | View → |
| 422 | 10.1 | 5.9 | 26.7 | 212 tok/s | View → | |
| 428 | Qwen3.5 0.8B (Non-reasoning)Alibaba | 9.9 | 1.0 | — | 364 tok/s | View → |
Benchmarks via Artificial Analysis. Indices are composite scores (0–100). "On Opper" column links to the model's detail page when we host it.