LLM Leaderboard
Frontier LLMs ranked
Industry-standard rankings by the Intelligence, Coding, and Math indices, sourced from Artificial Analysis. Click into any model on Opper for full benchmarks, hosting routes, and privacy posture.
| # | Model | Intelligence | Coding | Math | Speed | On Opper |
|---|---|---|---|---|---|---|
| 1 | | 61.4 | 56.7 | — | 55 tok/s | View → |
| 2 | GPT-5.5 (xhigh) · OpenAI | 60.2 | 59.1 | — | 75 tok/s | View → |
| 3 | GPT-5.5 (high) · OpenAI | 58.9 | 58.5 | — | 60 tok/s | View → |
| 4 | | 57.3 | 52.5 | — | 51 tok/s | View → |
| 5 | Gemini 3.1 Pro Preview · Google | 57.2 | 55.5 | — | 123 tok/s | View → |
| 6 | GPT-5.4 (xhigh) · OpenAI | 56.8 | 57.2 | — | 80 tok/s | View → |
| 7 | GPT-5.5 (medium) · OpenAI | 56.7 | 56.2 | — | 54 tok/s | View → |
| 8 | Qwen3.7 Max · Alibaba | 56.6 | 50.1 | — | 171 tok/s | View → |
| 9 | Gemini 3.5 Flash (high) · Google | 55.3 | 45.0 | — | 198 tok/s | View → |
| 10 | | 54.8 | 43.9 | — | 206 tok/s | View → |
| 12 | Kimi K2.6 · Moonshot | 53.9 | 47.1 | — | 42 tok/s | View → |
| 14 | GPT-5.3 Codex (xhigh) · OpenAI | 53.6 | 53.1 | — | 76 tok/s | View → |
| 16 | | 53.2 | 41.0 | — | 123 tok/s | View → |
| 17 | | 52.9 | 48.1 | — | 49 tok/s | View → |
| 19 | | 51.8 | 53.1 | — | 42 tok/s | View → |
| 21 | | 51.7 | 50.9 | — | 66 tok/s | View → |
| 22 | | 51.5 | 47.5 | — | 50 tok/s | View → |
| 23 | | 51.4 | 43.4 | — | 50 tok/s | View → |
| 24 | GPT-5.2 (xhigh) · OpenAI | 51.3 | 48.7 | 99.0 | 71 tok/s | View → |
| 25 | GPT-5.5 (low) · OpenAI | 50.8 | 52.1 | — | 52 tok/s | View → |
| 27 | | 49.8 | 43.2 | — | 46 tok/s | View → |
| 28 | | 49.8 | 44.2 | — | 79 tok/s | View → |
| 29 | Claude Opus 4.5 (Reasoning) · Anthropic | 49.7 | 47.8 | 91.3 | 54 tok/s | View → |
| 30 | MiniMax-M2.7 · MiniMax | 49.6 | 41.9 | — | 67 tok/s | View → |
| 34 | GPT-5.2 Codex (xhigh) · OpenAI | 49.0 | 43.0 | — | 108 tok/s | View → |
| 35 | GPT-5.4 mini (xhigh) · OpenAI | 48.9 | 51.5 | — | 157 tok/s | View → |
| 36 | | 48.8 | 35.1 | — | 127 tok/s | View → |
| 39 | GPT-5.4 (low) · OpenAI | 47.9 | 45.6 | — | 64 tok/s | View → |
| 41 | GPT-5.1 (high) · OpenAI | 47.7 | 44.7 | 94.0 | 125 tok/s | View → |
| 42 | GLM-5-Turbo · Z.ai | 46.8 | 36.8 | — | — | View → |
| 43 | Kimi K2.5 (Reasoning) · Moonshot | 46.8 | 39.6 | — | 34 tok/s | View → |
| 44 | GPT-5.2 (medium) · OpenAI | 46.6 | 44.2 | 96.7 | — | View → |
| 45 | | 46.5 | 38.7 | — | 120 tok/s | View → |
| 46 | | 46.5 | 47.6 | — | 44 tok/s | View → |
| 47 | | 46.4 | 42.6 | 97.0 | 181 tok/s | View → |
| 48 | | 46.0 | 39.8 | — | — | View → |
| 49 | Qwen3.6 27B (Reasoning) · Alibaba | 45.8 | 36.5 | — | 56 tok/s | View → |
| 50 | | 45.0 | 41.3 | — | 52 tok/s | View → |
| 52 | GPT-5 (high) · OpenAI | 44.6 | 36.0 | 94.3 | 119 tok/s | View → |
| 53 | GPT-5 Codex (high) · OpenAI | 44.6 | 38.9 | 98.7 | 174 tok/s | View → |
| 54 | | 44.4 | 46.4 | — | 46 tok/s | View → |
| 55 | GPT-5.4 nano (xhigh) · OpenAI | 44.0 | 43.9 | — | 161 tok/s | View → |
| 56 | | 43.9 | 31.6 | — | 146 tok/s | View → |
| 58 | | 43.8 | 35.8 | — | 50 tok/s | View → |
| 59 | Qwen3.6 35B A3B (Reasoning) · Alibaba | 43.5 | 35.2 | — | 162 tok/s | View → |
| 62 | GPT-5.1 Codex (high) · OpenAI | 43.1 | 36.6 | 95.7 | 182 tok/s | View → |
| 63 | Claude Opus 4.5 (Non-reasoning) · Anthropic | 43.1 | 42.9 | 62.7 | 48 tok/s | View → |
| 64 | Claude 4.5 Sonnet (Reasoning) · Anthropic | 43.0 | 38.6 | 88.0 | 53 tok/s | View → |
| 65 | Kimi K2.6 (Non-reasoning) · Moonshot | 42.9 | 38.4 | — | 41 tok/s | View → |
| 66 | | 42.9 | 36.2 | — | — | View → |
| 67 | | 42.6 | 43.0 | — | 46 tok/s | View → |
| 69 | | 42.1 | 36.3 | 95.0 | 80 tok/s | View → |
| 70 | Qwen3.5 27B (Reasoning) · Alibaba | 42.1 | 34.9 | — | 83 tok/s | View → |
| 71 | GPT-5 (medium) · OpenAI | 42.0 | 38.9 | 91.7 | 84 tok/s | View → |
| 72 | Claude 4.1 Opus (Reasoning) · Anthropic | 42.0 | 36.5 | 80.3 | 34 tok/s | View → |
| 74 | MiniMax-M2.5 · MiniMax | 41.9 | 37.4 | — | 202 tok/s | View → |
| 76 | DeepSeek V3.2 (Reasoning) · DeepSeek | 41.7 | 36.7 | 92.0 | — | View → |
| 77 | | 41.6 | 34.7 | — | 140 tok/s | View → |
| 79 | Grok 4 · xAI | 41.5 | 40.5 | 92.7 | — | View → |
| 81 | GPT-5 mini (high) · OpenAI | 41.2 | 35.3 | 90.7 | 90 tok/s | View → |
| 82 | GPT-5.5 (Non-reasoning) · OpenAI | 40.9 | 48.6 | — | 51 tok/s | View → |
| 83 | Kimi K2 Thinking · Moonshot | 40.9 | 34.8 | 94.7 | 131 tok/s | View → |
| 84 | o3-pro · OpenAI | 40.7 | — | — | 24 tok/s | View → |
| 85 | | 40.6 | 39.0 | — | 65 tok/s | View → |
| 86 | | 40.1 | 37.4 | — | 53 tok/s | View → |
| 87 | Qwen3 Max Thinking · Alibaba | 39.8 | 30.5 | — | — | View → |
| 88 | MiniMax-M2.1 · MiniMax | 39.4 | 32.8 | 82.7 | 175 tok/s | View → |
| 89 | DeepSeek V4 Pro (Non-reasoning) · DeepSeek | 39.3 | 38.4 | — | 45 tok/s | View → |
| 90 | Gemma 4 31B (Reasoning) · Google | 39.2 | 38.7 | — | 35 tok/s | View → |
| 92 | GPT-5 (low) · OpenAI | 39.2 | 30.7 | 83.0 | 86 tok/s | View → |
| 94 | Claude 4 Opus (Reasoning) · Anthropic | 39.0 | 34.0 | 73.3 | 37 tok/s | View → |
| 95 | GPT-5 mini (medium) · OpenAI | 38.9 | 32.8 | 85.0 | 88 tok/s | View → |
| 96 | Claude 4 Sonnet (Reasoning) · Anthropic | 38.7 | 34.1 | 74.3 | 46 tok/s | View → |
| 98 | | 38.6 | 36.4 | 91.7 | 211 tok/s | View → |
| 99 | | 38.6 | 30.9 | 89.3 | — | View → |
| 102 | o3 · OpenAI | 38.4 | 38.4 | 88.3 | 113 tok/s | View → |
| 103 | GPT-5.4 nano (medium) · OpenAI | 38.1 | 35.0 | — | 160 tok/s | View → |
| 105 | GPT-5.4 mini (medium) · OpenAI | 37.7 | 37.5 | — | 166 tok/s | View → |
| 106 | Kimi K2.5 (Non-reasoning) · Moonshot | 37.3 | 25.8 | — | 33 tok/s | View → |
| 108 | Qwen3.5 27B (Non-reasoning) · Alibaba | 37.2 | 33.4 | — | 91 tok/s | View → |
| 109 | Claude 4.5 Haiku (Reasoning) · Anthropic | 37.1 | 32.6 | 83.7 | 135 tok/s | View → |
| 110 | Qwen3.6 27B (Non-reasoning) · Alibaba | 37.1 | 26.6 | — | 53 tok/s | View → |
| 111 | Claude 4.5 Sonnet (Non-reasoning) · Anthropic | 37.1 | 33.5 | 37.0 | 43 tok/s | View → |
| 112 | Qwen3.5 35B A3B (Reasoning) · Alibaba | 37.1 | 30.3 | — | 133 tok/s | View → |
| 113 | | 36.5 | 35.2 | — | 109 tok/s | View → |
| 115 | MiniMax-M2 · MiniMax | 36.1 | 29.2 | 78.3 | 112 tok/s | View → |
| 116 | | 36.0 | 31.2 | — | 150 tok/s | View → |
| 118 | Claude 4.1 Opus (Non-reasoning) · Anthropic | 36.0 | — | — | 33 tok/s | View → |
| 119 | | 35.9 | 31.6 | — | 152 tok/s | View → |
| 122 | GPT-5.4 (Non-reasoning) · OpenAI | 35.4 | 41.0 | — | 59 tok/s | View → |
| 124 | | 35.0 | 37.8 | 55.7 | 191 tok/s | View → |
| 126 | Gemini 2.5 Pro · Google | 34.6 | 32.0 | 87.7 | 125 tok/s | View → |
| 128 | | 34.2 | 32.0 | 48.0 | 81 tok/s | View → |
| 129 | | 33.9 | 33.7 | 89.7 | — | View → |
| 132 | GPT-5.2 (Non-reasoning) · OpenAI | 33.6 | 34.7 | 51.0 | 61 tok/s | View → |
| 133 | Gemini 3.1 Flash-Lite · Google | 33.5 | 30.1 | — | 291 tok/s | View → |
| 135 | gpt-oss-120b (high) · OpenAI | 33.3 | 28.6 | 93.4 | 360 tok/s | View → |
| 136 | o4-mini (high) · OpenAI | 33.1 | 25.6 | 90.7 | 153 tok/s | View → |
| 137 | Claude 4 Sonnet (Non-reasoning) · Anthropic | 33.0 | 30.6 | 38.0 | 45 tok/s | View → |
| 138 | Claude 4 Opus (Non-reasoning) · Anthropic | 33.0 | — | 36.3 | 35 tok/s | View → |
| 139 | DeepSeek V3.2 Exp (Reasoning) · DeepSeek | 32.9 | 33.3 | 87.7 | — | View → |
| 141 | | 32.5 | 29.5 | 86.0 | 40 tok/s | View → |
| 143 | Qwen3.5 9B (Reasoning) · Alibaba | 32.4 | 25.3 | — | 73 tok/s | View → |
| 144 | | 32.3 | 33.9 | — | 28 tok/s | View → |
| 146 | DeepSeek V3.2 (Non-reasoning) · DeepSeek | 32.1 | 34.6 | 59.0 | — | View → |
| 147 | | 32.1 | 25.2 | 84.7 | 58 tok/s | View → |
| 149 | | 31.9 | 27.2 | — | 171 tok/s | View → |
| 150 | | 31.5 | 17.6 | — | 158 tok/s | View → |
| 151 | Qwen3 Max · Alibaba | 31.4 | 26.4 | 80.7 | 49 tok/s | View → |
| 152 | | 31.2 | 22.4 | — | — | View → |
| 153 | | 31.1 | 24.6 | 78.3 | — | View → |
| 154 | Claude 4.5 Haiku (Non-reasoning) · Anthropic | 31.0 | 29.6 | 39.0 | 98 tok/s | View → |
| 155 | | 31.0 | 25.1 | — | 127 tok/s | View → |
| 156 | Kimi K2 0905 · Moonshot | 30.9 | 25.9 | 57.3 | 24 tok/s | View → |
| 158 | | 30.7 | 16.8 | — | 147 tok/s | View → |
| 159 | o1 · OpenAI | 30.7 | 20.5 | — | 120 tok/s | View → |
| 161 | | 30.3 | 46.7 | — | — | View → |
| 163 | | 30.2 | 30.2 | 44.3 | 48 tok/s | View → |
| 164 | | 30.1 | 25.9 | — | 93 tok/s | View → |
| 166 | | 29.7 | 25.4 | — | 164 tok/s | View → |
| 167 | | 29.5 | — | — | — | View → |
| 168 | | 29.5 | 23.2 | 91.0 | 62 tok/s | View → |
| 171 | | 29.0 | 22.0 | — | 164 tok/s | View → |
| 173 | | 28.5 | 31.9 | 53.7 | — | View → |
| 175 | | 28.4 | 30.0 | 57.7 | — | View → |
| 176 | Qwen3 Coder Next · Alibaba | 28.3 | 22.9 | — | 102 tok/s | View → |
| 178 | DeepSeek V3.1 (Non-reasoning) · DeepSeek | 28.1 | 28.4 | 49.7 | — | View → |
| 181 | DeepSeek V3.1 (Reasoning) · DeepSeek | 27.7 | 29.7 | 89.7 | — | View → |
| 184 | GPT-5.1 (Non-reasoning) · OpenAI | 27.4 | 27.3 | 38.0 | 107 tok/s | View → |
| 185 | Qwen3.5 9B (Non-reasoning) · Alibaba | 27.3 | 21.3 | — | — | View → |
| 186 | | 27.1 | 29.1 | — | 62 tok/s | View → |
| 187 | Magistral Medium 1.2 · Mistral | 27.1 | 21.7 | 82.0 | 39 tok/s | View → |
| 188 | Qwen3.5 4B (Reasoning) · Alibaba | 27.1 | 17.5 | — | 198 tok/s | View → |
| 189 | DeepSeek R1 0528 (May '25) · DeepSeek | 27.1 | 24.0 | 76.0 | — | View → |
| 190 | | 27.0 | 22.2 | 73.3 | 219 tok/s | View → |
| 191 | GPT-5 nano (high) · OpenAI | 26.8 | 20.3 | 83.7 | 153 tok/s | View → |
| 192 | | 26.7 | 19.5 | 84.3 | 142 tok/s | View → |
| 194 | GPT-4.1 · OpenAI | 26.3 | 21.8 | 34.7 | 132 tok/s | View → |
| 200 | o3-mini · OpenAI | 25.9 | 17.9 | — | 200 tok/s | View → |
| 201 | GPT-5 nano (medium) · OpenAI | 25.9 | 22.9 | 78.3 | 168 tok/s | View → |
| 202 | o1-pro · OpenAI | 25.8 | — | — | — | View → |
| 203 | | 25.7 | 22.1 | 56.7 | — | View → |
| 205 | o3-mini (high) · OpenAI | 25.2 | 17.3 | — | 212 tok/s | View → |
| 208 | | 25.0 | 22.1 | 71.7 | 44 tok/s | View → |
| 212 | Sonar Reasoning Pro · Perplexity | 24.6 | — | — | — | View → |
| 213 | gpt-oss-120b (low) · OpenAI | 24.5 | 15.5 | 66.7 | 364 tok/s | View → |
| 214 | gpt-oss-20B (high) · OpenAI | 24.5 | 18.5 | 89.3 | 268 tok/s | View → |
| 215 | | 24.4 | 27.9 | — | 155 tok/s | View → |
| 216 | MiniMax M1 80k · MiniMax | 24.4 | 14.5 | 61.0 | — | View → |
| 217 | | 24.3 | 19.0 | 91.0 | 148 tok/s | View → |
| 218 | | 24.3 | — | — | — | View → |
| 226 | | 23.4 | 19.7 | 85.3 | 61 tok/s | View → |
| 227 | | 23.3 | 25.3 | — | 150 tok/s | View → |
| 229 | GLM-4.5-Air · Z.ai | 23.2 | 23.8 | 80.7 | 68 tok/s | View → |
| 233 | GPT-4.1 mini · OpenAI | 22.9 | 18.5 | 46.3 | 85 tok/s | View → |
| 234 | Mistral Large 3 · Mistral | 22.8 | 22.7 | 38.0 | 60 tok/s | View → |
| 236 | Qwen3.5 4B (Non-reasoning) · Alibaba | 22.6 | 13.7 | — | 206 tok/s | View → |
| 238 | DeepSeek V3 0324 · DeepSeek | 22.3 | 22.0 | 41.0 | — | View → |
| 239 | INTELLECT-3 · PrimeIntellect | 22.2 | 19.1 | 88.0 | — | View → |
| 240 | | 22.1 | 11.0 | — | 120 tok/s | View → |
| 244 | | 21.6 | 18.2 | 68.7 | — | View → |
| 249 | gpt-oss-20B (low) · OpenAI | 20.8 | 14.4 | 62.3 | 274 tok/s | View → |
| 250 | Qwen3 VL 235B A22B Instruct · Alibaba | 20.8 | 16.5 | 70.7 | 48 tok/s | View → |
| 253 | | 20.6 | 17.8 | 60.3 | 186 tok/s | View → |
| 255 | Qwen3 Next 80B A3B Instruct · Alibaba | 20.1 | 15.3 | 66.3 | 132 tok/s | View → |
| 258 | Qwen3 Coder 30B A3B Instruct · Alibaba | 20.0 | 19.4 | 29.0 | 82 tok/s | View → |
| 259 | Qwen3 235B A22B (Reasoning) · Alibaba | 19.8 | 17.4 | 82.0 | 60 tok/s | View → |
| 260 | Qwen3 VL 30B A3B (Reasoning) · Alibaba | 19.7 | 13.1 | 82.3 | 127 tok/s | View → |
| 264 | | 19.4 | 14.5 | 46.7 | — | View → |
| 280 | GPT-4o (Aug '24) · OpenAI | 18.6 | 16.6 | — | 90 tok/s | View → |
| 282 | | 18.5 | 13.6 | 21.7 | — | View → |
| 285 | Magistral Small 1.2 · Mistral | 18.2 | 14.8 | 80.3 | 110 tok/s | View → |
| 294 | Sonar Reasoning · Perplexity | 17.9 | — | — | — | View → |
| 295 | | 17.8 | — | — | — | View → |
| 297 | | 17.6 | 9.5 | 53.3 | 307 tok/s | View → |
| 299 | GPT-4o (Nov '24) · OpenAI | 17.3 | 16.7 | 6.0 | 143 tok/s | View → |
| 300 | DeepSeek R1 Distill Qwen 32B · DeepSeek | 17.2 | — | 63.0 | — | View → |
| 302 | | 17.1 | 11.1 | 26.3 | 68 tok/s | View → |
| 303 | | 17.0 | 14.0 | 23.7 | 65 tok/s | View → |
| 305 | Magistral Small 1 · Mistral | 16.8 | 11.1 | 41.3 | — | View → |
| 309 | DeepSeek V3 (Dec '24) · DeepSeek | 16.5 | 16.4 | 26.0 | — | View → |
| 310 | Qwen3 32B (Reasoning) · Alibaba | 16.5 | 13.8 | 73.0 | 98 tok/s | View → |
| 312 | Qwen3.5 2B (Reasoning) · Alibaba | 16.3 | 3.5 | — | — | View → |
| 316 | Ministral 3 14B · Mistral | 16.0 | 10.9 | 30.0 | 88 tok/s | View → |
| 319 | DeepSeek R1 Distill Llama 70B · DeepSeek | 16.0 | 11.4 | 53.7 | 45 tok/s | View → |
| 321 | Qwen3 VL 30B A3B Instruct · Alibaba | 16.0 | 14.3 | 72.3 | 125 tok/s | View → |
| 324 | DeepSeek R1 Distill Qwen 14B · DeepSeek | 15.8 | — | 55.7 | — | View → |
| 327 | Qwen2.5 Instruct 72B · Alibaba | 15.6 | 11.9 | 14.0 | — | View → |
| 329 | Sonar · Perplexity | 15.5 | — | — | — | View → |
| 330 | Qwen3 30B A3B (Reasoning) · Alibaba | 15.3 | 11.0 | 72.3 | 67 tok/s | View → |
| 333 | Sonar Pro · Perplexity | 15.2 | — | — | — | View → |
| 337 | | 15.1 | 10.9 | 73.0 | 31 tok/s | View → |
| 342 | | 14.9 | 11.7 | 75.0 | 191 tok/s | View → |
| 344 | Ministral 3 8B · Mistral | 14.8 | 10.0 | 31.7 | 101 tok/s | View → |
| 345 | | 14.8 | 8.3 | 69.7 | 115 tok/s | View → |
| 348 | Qwen3.5 2B (Non-reasoning) · Alibaba | 14.7 | 4.9 | — | 315 tok/s | View → |
| 349 | | 14.7 | — | — | — | View → |
| 351 | | 14.5 | 10.7 | 7.7 | 91 tok/s | View → |
| 352 | GPT-4o (May '24) · OpenAI | 14.5 | 24.2 | — | 94 tok/s | View → |
| 354 | Mistral Small 3.1 · Mistral | 14.5 | 13.9 | 3.7 | 155 tok/s | View → |
| 355 | Qwen3 32B (Non-reasoning) · Alibaba | 14.5 | — | 19.7 | 97 tok/s | View → |
| 360 | Qwen3 VL 8B Instruct · Alibaba | 14.3 | 7.3 | 27.3 | 148 tok/s | View → |
| 374 | Llama 4 Scout · Meta | 13.5 | 6.7 | 14.0 | 106 tok/s | View → |
| 376 | Nova Pro · Amazon | 13.5 | 11.0 | 7.0 | — | View → |
| 379 | | 13.2 | 7.5 | 62.3 | 134 tok/s | View → |
| 380 | | 13.2 | 15.8 | 13.3 | 89 tok/s | View → |
| 383 | GPT-4.1 nano · OpenAI | 13.0 | 11.2 | 24.0 | 107 tok/s | View → |
| 390 | | 12.7 | 7.4 | 35.3 | 231 tok/s | View → |
| 392 | Nova Lite · Amazon | 12.7 | 5.1 | 7.0 | 185 tok/s | View → |
| 393 | | 12.7 | 10.8 | 15.3 | 48 tok/s | View → |
| 395 | GPT-4o mini · OpenAI | 12.6 | — | 14.7 | 68 tok/s | View → |
| 399 | | 12.5 | 13.3 | 21.7 | 69 tok/s | View → |
| 403 | Claude 3 Haiku · Anthropic | 12.3 | 6.7 | — | — | View → |
| 420 | Ministral 3 3B · Mistral | 11.2 | 4.8 | 22.0 | 204 tok/s | View → |
| 429 | Hermes 3 - Llama-3.1 70B · Nous Research | 10.6 | — | — | 32 tok/s | View → |
| 433 | Qwen3.5 0.8B (Reasoning) · Alibaba | 10.5 | 0.0 | — | — | View → |
| 436 | Nova Micro · Amazon | 10.3 | 4.1 | 6.0 | 292 tok/s | View → |
| 437 | Gemma 3 27B Instruct · Google | 10.3 | 9.6 | 20.7 | — | View → |
| 440 | | 10.1 | 5.9 | 26.7 | 224 tok/s | View → |
| 446 | Qwen3.5 0.8B (Non-reasoning) · Alibaba | 9.9 | 1.0 | — | 89 tok/s | View → |
Benchmarks via Artificial Analysis. Indices are composite scores (0–100). "On Opper" column links to the model's detail page when we host it.
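
The table is ordered by the Intelligence index, but the same rows can be re-ranked by any other column. The sketch below is one possible way to do that in Python; it is an illustration, not part of Opper or Artificial Analysis tooling. The helper names (`parse_cell`, `parse_row`, `rank_by`) are hypothetical, the sample rows are copied from the table with provider names omitted, and treating "—" as "no value published" is an assumption.

```python
from typing import Optional

# A few rows copied from the table above (provider names omitted for brevity).
SAMPLE = """
| 2 | GPT-5.5 (xhigh) | 60.2 | 59.1 | — | 75 tok/s | View → |
| 5 | Gemini 3.1 Pro Preview | 57.2 | 55.5 | — | 123 tok/s | View → |
| 8 | Qwen3.7 Max | 56.6 | 50.1 | — | 171 tok/s | View → |
| 24 | GPT-5.2 (xhigh) | 51.3 | 48.7 | 99.0 | 71 tok/s | View → |
"""


def parse_cell(cell: str) -> Optional[float]:
    """Turn '56.7' or '55 tok/s' into a float; '—' or empty becomes None.

    Assumption: '—' marks a value that is not published.
    """
    text = cell.strip()
    if text in ("", "—"):
        return None
    # Keep only the leading number, dropping a unit such as 'tok/s'.
    return float(text.split()[0])


def parse_row(line: str) -> dict:
    """Split one pipe-delimited table row into named fields."""
    cells = [c.strip() for c in line.strip().strip("|").split("|")]
    rank, model, intelligence, coding, math, speed = cells[:6]
    return {
        "rank": int(rank),
        "model": model,
        "intelligence": parse_cell(intelligence),
        "coding": parse_cell(coding),
        "math": parse_cell(math),
        "speed": parse_cell(speed),
    }


def rank_by(rows: list, key: str) -> list:
    """Sort rows by one column, highest first, skipping rows with no value."""
    scored = [r for r in rows if r[key] is not None]
    return sorted(scored, key=lambda r: r[key], reverse=True)


rows = [parse_row(line) for line in SAMPLE.strip().splitlines()]

# Re-rank the sample by output speed instead of Intelligence.
for row in rank_by(rows, "speed"):
    print(f"{row['model']}: {row['speed']:.0f} tok/s")
```

Rows without a value in the chosen column are dropped rather than ranked last, so a Math ranking only compares models that actually have a Math index.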