LLM Performance Leaderboard

Interactive comparison of large language models across multiple benchmarks

xAIOpenAIGoogleDeepSeekAlibabaAnthropicZhipuStepFun01.AINexusflowTencentMetaRWKVTsinghuaOpenAssistantLMSYSDatabricksStability AIHuggingFaceMicrosoftUWTogether AIMistralAllen AIUC BerkeleyNomic AIMosaicMLStanfordNousResearchIBMCognitive ComputationsNvidiaUpstage AITII01 AIOpenChatSnowflakeDeepSeek AIAllenAI/UWAi2CohereReka AIInternLMZhipu AIAmazonAI21 LabsNexusFlowPrinceton

Filter by BenchmarkClick a benchmark to sort models

RankModel NameOrganizationLicensearenaElo ScoreVotes
1Grok-3-Preview-02-24xAI
Proprietary
1412
3,364
2GPT-4.5-PreviewOpenAI
Proprietary
1411
3,242
3Gemini-2.0-Flash-Thinking-Exp-01-21Google
Proprietary
1384
17,487
4Gemini-2.0-Pro-Exp-02-05Google
Proprietary
1380
15,466
5ChatGPT-4o-latest (2025-01-29)OpenAI
Proprietary
1377
17,221
6DeepSeek-R1DeepSeek
MIT
1363
8,580
7Gemini-2.0-Flash-001Google
Proprietary
1357
13,257
8o1-2024-12-17OpenAI
Proprietary
1352
19,785
9Qwen2.5-MaxAlibaba
Proprietary
1336
11,930
10o3-mini-highOpenAI
Proprietary
1329
9,102
11o3-mini-highOpenAI
1325
17,988
12DeepSeek-V3DeepSeek
DeepSeek
1318
22,007
13DeepSeek-V3DeepSeek
1318
22,846
14QwQ-32BAlibaba
1316
7,735
15GLM-4-Plus-0111Zhipu
Proprietary
1311
6,035
16Gemini-2.0-Flash-LiteGoogle
1311
2,212
17Qwen-Plus-0125Alibaba
Proprietary
1310
6,054
18GLM-4-Plus-0111Zhipu
1310
6,026
19Claude 3.7 SonnetAnthropic
Proprietary
1309
4,254
20Gemini-2.0-Flash-Lite-Preview-02-05Google
Proprietary
1308
12,774
21Step-2-16K-ExpStepFun
Proprietary
1305
5,132
22Qwen-Plus-0125Alibaba
1305
6,055
23o3-miniOpenAI
1305
24,877
24o1-miniOpenAI
Proprietary
1304
54,923
25o3-miniOpenAI
Proprietary
1304
15,463
26Step-2-16K-ExpStepFun
1304
5,128
27o1-miniOpenAI
1304
54,960
28Command A.Cohere
1303
7,547
29Gemini-1.5-Pro-002Google
Proprietary
1302
57,551
30Hunyuan-TurboSTencent
1302
2,452
31Gemini-1.5-Pro-0902Google
1302
58,660
32Hunyuan-Turbo-0110Tencent
1302
2,511
33llama-3.3-Nemotron-Super-49B-v1Nvidia
1296
1,236
34Grok-2-08-13xAI
Proprietary
1288
67,038
35Grok-2-08-13xAI
1288
67,102
36Yi-Lightning01.AI
Proprietary
1287
28,946
37Yi-Lightning01 AI
1287
28,972
38GPT-4o (2024-05-13)OpenAI
1285
117,769
39Claude 3.5 Sonnet (20241022)Anthropic
Proprietary
1284
59,139
40Claude 3.5 SonnetAnthropic
1283
64,670
41Qwen2.5-plus-1127Alibaba
1282
10,723
42Deepseek-v2.5-1210DeepSeek
DeepSeek
1279
7,247
43Deepseek-v2.5-1210DeepSeek
1279
7,246
44Athene-v2-Chat-72BNexusflow
Athene V2
1275
26,092
45Athene-v2-Chat-72BNexusFlow
1275
26,093
46GPT-4o-mini-2024-07-18OpenAI
Proprietary
1272
66,710
47Hunyuan-Large-2025-02-10Tencent
1272
3,859
48GPT-4o-mini-2024-07-18OpenAI
1272
71,388
49Hunyuan-Large-2025-02-10Tencent
Proprietary
1271
3,860
50Gemini-1.5-Flash-002Google
Proprietary
1271
36,979
51Gemini-1.5-Flash-0902Google
1271
37,025
52llama-4-Maverick-17B-128F-InstructMeta
1271
4,917
53Llama-3.1-405B-Instruct-bf16Meta
Llama 3.1
1269
34,228
54Llama-3.1-Nemotron-70B-InstructNvidia
1269
7,580
55Meta-llama-3.1-405B-Instruct-bf16Meta
1269
43,795
56Claude 3.5 Sonnet (20240620)Anthropic
1268
86,162
57Meta-llama-3.1-405B-Instruct-fp8Meta
1267
63,055
58Gemini Advanced App. (2024-05-14)Google
1267
52,143
59Grok-2-Mini-08-13xAI
1266
55,452
60GPT-4o-2024-08-06OpenAI
1265
47,982
61Qwen-Max-0919Alibaba
1263
17,440
62Gemini-1.5-Pro-0901Google
1260
82,436
63Hunyuan-Standard-2025-02-10Tencent
1260
4,017
64Deepseek-v2.5DeepSeek
1258
26,344
65Qwen2.5-72B-InstructAlibaba
1257
41,532
66Llama-3.3-70B-InstructMeta
1257
38,101
67GPT-4-Turbo-2024-04-09OpenAI
1256
102,158
68Mistral-Large-2407Mistral
1251
48,212
69Athene-70BNexusFlow
1250
20,584
70GPT-4-1106-previewOpenAI
1250
103,746
71Mistral-Large-2411Mistral
1249
29,895
72Meta-llama-3.1-70B-InstructMeta
1248
58,654
73Claude 3 OpusAnthropic
1247
202,697
74GLM-4-PlusZhipu AI
1246
27,784
75Amazon-Nova-Pro-1.0Amazon
1245
24,285
76GPT-4-0125-previewOpenAI
1245
97,076
77Llama-3.1-Tulu-3-70BAi2
1244
3,014
78Claude 3.5 Haiku (20241022)Anthropic
1237
33,322
79Reka-Core-20240904Reka AI
1235
7,938
80Gemini-1.5-Flash-0901Google
1227
65,565
81Jamba-1.5-LargeAI21 Labs
1222
65,665
82Gemma-2-27B-itGoogle
1220
79,536
83Mistral-Small-24B-Instruct-2501Mistral
1217
14,573
84Amazon-Nova-Lite-1.0Amazon
1217
20,648
85Qwen2.5-Coder-32B-InstructAlibaba
1217
5,729
86Command R+ (08-2024)Cohere
1215
20,612
87Gemini-1.5-Flash-88-091Google
1212
37,686
88Llama-3.1-Nemotron-518B-InstructNvidia
1211
3,887
89Aya-Expanse-32BCohere
1209
28,749
90Gemma-2-9B-it-SimPOPrinceton
1209
10,548
91GLM-4-0529Zhipu AI
1207
10,220
92Llama-3-70B-InstructMeta
1206
163,632
93Reka-Flash-20240904Reka AI
1205
8,138
94Phi-4Microsoft
1205
25,224
95Claude 3 SonnetAnthropic
1201
113,061
96Nemotron-4-340B-InstructNvidia
1199
10,540
97Amazon-Nova-Micro-1.0Amazon
1198
20,663
98Zephyr-ORPO-141b-A35b-v0.1HuggingFace
1197
4,857
99Gemma-2-9B-itGoogle
1192
57,211
100Command R+ (04-2024)Cohere
1190
80,859
101Hunyuan-Standard-256KTencent
1189
2,901
102Qwen2-72B-InstructAlibaba
1187
38,877
103GPT-4-0314OpenAI
1186
55,978
104Llama-3.1-Tulu-3-8BAi2
1185
3,076
105Ministral-8B-2410Mistral
1182
5,113
106Aya-Expanse-8BCohere
1180
10,391
107Command R (08-2024)Cohere
1180
10,848
108Claude 3 HaikuAnthropic
1179
122,304
109DeepSeek-Coder-V2-InstructDeepSeek AI
1178
15,754
110Meta-llama-3.1-8B-InstructMeta
1176
52,597
111Jamba-1.5-MiniAI21 Labs
1176
9,272
112GPT-4-0613OpenAI
1163
91,640
113Qwen1.5-110B-ChatAlibaba
1161
27,431
114Yi-1.5-34B-Chat01 AI
1157
25,137
115Mistral-Large-2402Mistral
1157
64,928
116Reka-Flash-21B-onlineReka AI
1156
16,038
117QwQ-32B-PreviewAlibaba
1153
3,411
118Llama-3-8B-InstructMeta
1152
109,106
119Command R+ (04-2024)Cohere
1149
56,391
120InternLM2.5-20B-chatInternLM
1149
10,597
121Mixtral-8x22b-Instruct-v0.1Mistral
1148
53,756
122Mistral MediumMistral
1148
35,561
123Qwen1.5-72B-ChatAlibaba
1147
40,677
124Reka-Flash-21BReka AI
1147
25,809
125Gemma-2-2b-itGoogle
1144
48,905
126Granite-3.1-8B-InstructIBM
1143
3,294
127Gemini-1.0-Pro-0901Google
1131
18,803
128Qwen1.5-32B-ChatAlibaba
1125
22,759
129Phi-3-Medium-4k-InstructMicrosoft
1123
26,109
130Granite-3.1-2B-InstructIBM
1120
3,383
131Starling-LM-7B-betaNexusflow
1119
16,672
132Mixtral-8x7B-Instruct-v0.1Mistral
1114
76,135
133Yi-34B-Chat01 AI
1111
15,914
134Gemini ProGoogle
1111
6,556
135Qwen1.5-14B-ChatAlibaba
1109
18,680
136WizardLM-70B-v1.0Microsoft
1106
8,384
137GPT-3.5-Turbo-0125OpenAI
1106
68,878
138DBRX-Instruct-PreviewDatabricks
1103
33,734
139Meta-llama-3.2-3B-InstructMeta
1103
8,399
140Phi-3-Small-8k-InstructMicrosoft
1102
18,473
141Tulu-2-DPO-70BAllenAI/UW
1098
6,662
142Llama-2-70B-chatMeta
1093
39,599
143Granite-3.0-8B-InstructIBM
1093
7,002
144OpenChat-3.5-0106OpenChat
1091
12,987
145Vicuna-33BLMSYS
1091
22,948
146Snowflake Arctic InstructSnowflake
1090
34,177
147Starling-LM-7B-alphaUC Berkeley
1088
10,417
148Granite-3.0-2B-InstructIBM
1084
7,184
149Gemma-1.4-7B-itGoogle
1084
25,066
150NV-Llama2-70B-SteerLM-ChatNvidia
1081
3,636
151DeepSeek-LLM-67B-ChatDeepSeek AI
1077
4,988
152OpenChat-3.5OpenChat
1076
8,106
153OpenHermes-2.5-Mistral-7BNousResearch
1074
5,089
154Mistral-7B-Instruct-v0.2Mistral
1072
20,065
155Qwen1.5-7B-ChatAlibaba
1070
4,871
156Phi-3-Mini-4K-Instruct-June-24Microsoft
1070
12,803
157GPT-3.5-Turbo-1106OpenAI
1067
17,035
158Phi-3-Mini-4K-InstructMicrosoft
1066
21,098
159Dolphin-2.2.1-Mistral-7BCognitive Computations
1063
1,713
160Llama-2-13b-chatMeta
1063
19,721
161SOLAR-10.7B-Instruct-v1.0Upstage AI
1062
4,288
162Nous-Hermes-2-Mixtral-8x7B-DPONousResearch
1061
3,839
163WizardLM-13b-v1.2Microsoft
1059
7,175
164Meta-llama-3.2-13B-InstructMeta
1054
8,518
165Vicuna-13BLMSYS
1054
19,776
166Zephyr-7B-betaHuggingFace
1053
11,318
167SmoLLM2-1.7B-InstructHuggingFace
1046
2,376
168MPT-30B-chatMosaicML
1045
2,645
169falcon-180b-chatTII
1044
1,328
170CodeLlama-34B-instructMeta
1043
7,512
171CodeLlama-70B-instructMeta
1042
1,190
172Zephyr-7B-alphaHuggingFace
1040
1,811
173Gemma-7B-itGoogle
1037
9,176
174Phi-3-Mini-128k-InstructMicrosoft
1037
21,628
175Llama-2-7B-chatMeta
1037
14,535
176Qwen-14B-ChatAlibaba
1035
5,068
177Guanaco-33BUW
1033
2,998
178Gemma-1.1-2b-itGoogle
1021
11,354
179StripedHyena-Nous-7BTogether AI
1017
5,278
180OLMo-7B-instructAllen AI
1015
6,499
181Mistral-7B-Instruct-v0.1Mistral
1008
9,145
182Vicuna-7BLMSYS
1005
7,014
183PaLM-Chat-Bison-001Google
1003
8,712
184Gemma-2B-itGoogle
989
4,922
185Qwen1.5-4B-ChatAlibaba
988
7,819
186Koala-13BUC Berkeley
964
7,021
187ChatGLM3-6BTsinghua
955
4,763
188GPT4All-13B-SnoozyNomic AI
932
1,788
189MPT-7B-ChatMosaicML
928
3,997
190ChatGLM2-6BTsinghua
924
2,711
191RWKV-4-Raven-14BRWKV
922
4,920
192Alpaca-13BStanford
901
5,865
193OpenAssistant-Pythia-12BOpenAssistant
893
6,368
194ChatGLM-6BTsinghua
879
4,987
195FastChat-T5-3BLMSYS
868
4,290
196StableLM-Tuned-Alpha-7BStability AI
840
3,339
197Dolly-V2-12BDatabricks
822
3,481
198LLaMA-13BMeta
799
2,445

🚀 Real-time updates | 🔍 Interactive visualizations | 📊 Data-driven insights

Data aggregated from multiple benchmark sources • Last updated: March 2025