Model Rankings
This is a leaderboard based on Chatbot Arena (lmarena.ai) data, generated through an automated process.
Data update time: 2026-02-11 08:18:25 UTC / 2026-02-11 16:18:25 CST (Beijing Time)
Leaderboard
1
12
Anthropic claude-opus-4-6-thinking Anthropic · Proprietary
1505±10
3,679
2
12
Anthropic claude-opus-4-6 Anthropic · Proprietary
1503±9
4,427
3
33
gemini-3-pro Google · Proprietary
1486±4
35,513
4
47
grok-4.1-thinking xAI · Proprietary
1475±4
35,214
5
49
gemini-3-flash Google · Proprietary
1472±5
26,167
6
49
Anthropic claude-opus-4-5-20251101-thinking-32k Anthropic · Proprietary
1471±5
27,282
7
49
Anthropic claude-opus-4-5-20251101 Anthropic · Proprietary
1467±4
32,179
8
510
grok-4.1 xAI · Proprietary
1464±4
39,284
9
511
gemini-3-flash (thinking-minimal) Google · Proprietary
1462±5
17,361
10
814
gpt-5.1-high OpenAI · Proprietary
1458±4
31,519
11
921
ernie-5.0-0110 Baidu · Proprietary
1452±6 Preliminary
11,080
12
1021
Anthropic claude-sonnet-4-5-20250929 Anthropic · Proprietary
1451±4
43,519
13
1121
Anthropic claude-sonnet-4-5-20250929-thinking-32k Anthropic · Proprietary
1450±4
45,809
14
1023
ernie-5.0-preview-1203 Baidu · Proprietary
1449±7
9,773
15
1121
gemini-2.5-pro Google · Proprietary
1449±3
94,838
16
1122
Anthropic claude-opus-4-1-20250805-thinking-16k Anthropic · Proprietary
1449±4
49,947
17
1025
Moonshot AI kimi-k2.5-thinking Moonshot · Modified MIT
1448±7
8,083
18
1125
Anthropic claude-opus-4-1-20250805 Anthropic · Proprietary
1445±3
74,911
19
1127
gpt-4.5-preview-2025-02-27 OpenAI · Proprietary
1444±6
14,549
20
1525
chatgpt-4o-latest-20250326 OpenAI · Proprietary
1442±3
82,329
21
1129
glm-4.7 Z.ai · MIT
1441±6
12,013
22
1136
Moonshot AI kimi-k2.5-instant Moonshot · Modified MIT
1438±10
3,830
23
1729
gpt-5.1 OpenAI · Proprietary
1438±4
33,736
24
1630
gpt-5.2 OpenAI · Proprietary
1438±6
12,759
25
1732
gpt-5.2-high OpenAI · Proprietary
1436±6
16,100
26
2032
gpt-5-high OpenAI · Proprietary
1434±5
32,619
27
2035
qwen3-max-preview Alibaba · Proprietary
1434±5
27,842
28
2136
o3-2025-04-16 OpenAI · Proprietary
1432±4
61,351
29
2138
grok-4-1-fast-reasoning xAI · Proprietary
1431±4
28,142
30
2343
Moonshot AI kimi-k2-thinking-turbo Moonshot · Modified MIT
1429±4
33,118
31
2447
gpt-5-chat OpenAI · Proprietary
1426±4
31,821
32
2750
glm-4.6 Z.ai · MIT
1425±4
35,331
33
2450
qwen3-max-2025-09-23 Alibaba · Proprietary
1425±6
9,223
34
2950
Anthropic claude-opus-4-20250514-thinking-16k Anthropic · Proprietary
1424±4
37,973
35
2652
deepseek-v3.2-exp DeepSeek · MIT
1423±6
11,762
36
2652
deepseek-v3.2-exp-thinking DeepSeek · MIT
1423±7
8,999
37
3050
qwen3-235b-a22b-instruct-2507 Alibaba · Apache 2.0
1422±3
69,176
38
2654
grok-4-fast-chat xAI · Proprietary
1422±8
6,989
39
3052
deepseek-v3.2 DeepSeek · MIT
1421±5
27,695
40
3052
deepseek-v3.2-thinking DeepSeek · MIT
1421±5
22,732
41
3157
deepseek-r1-0528 DeepSeek · MIT
1418±6
19,289
42
2958
ernie-5.0-preview-1022 Baidu · Proprietary
1418±9
4,616
43
3157
deepseek-v3.1 DeepSeek · MIT
1418±6
15,298
44
3157
Moonshot AI kimi-k2-0905-preview Moonshot · Modified MIT
1417±6
11,973
45
3157
deepseek-v3.1-thinking DeepSeek · MIT
1417±7
11,985
46
3257
Moonshot AI kimi-k2-0711-preview Moonshot · Modified MIT
1417±5
28,662
47
3257
mistral-large-3 Mistral · Apache 2.0
1416±5
24,050
48
3060
deepseek-v3.1-terminus DeepSeek · MIT
1416±10
3,761
49
3064
deepseek-v3.1-terminus-thinking DeepSeek · MIT
1415±10
3,548
50
3259
qwen3-vl-235b-a22b-instruct Alibaba · Apache 2.0
1415±6
11,684
51
3657
gpt-4.1-2025-04-14 OpenAI · Proprietary
1413±4
52,211
52
3657
Anthropic claude-opus-4-20250514 Anthropic · Proprietary
1413±4
45,573
53
4060
grok-3-preview-02-24 xAI · Proprietary
1411±4
33,970
54
4160
mistral-medium-2508 Mistral · Proprietary
1411±3
63,013
55
4160
gemini-2.5-flash Google · Proprietary
1410±3
94,114
56
4064
glm-4.5 Z.ai · MIT
1410±5
24,792
57
4163
grok-4-0709 xAI · Proprietary
1410±4
42,143
58
4970
gemini-2.5-flash-preview-09-2025 Google · Proprietary
1405±4
32,875
59
5170
Anthropic claude-haiku-4-5-20251001 Anthropic · Proprietary
1404±4
44,511
60
5071
grok-4-fast-reasoning xAI · Proprietary
1403±5
18,637
61
5572
o1-2024-12-17 OpenAI · Proprietary
1402±4
27,822
62
5572
qwen3-235b-a22b-no-thinking Alibaba · Apache 2.0
1401±4
39,546
63
5573
qwen3-next-80b-a3b-instruct Alibaba · Apache 2.0
1401±5
22,856
64
5873
Anthropic claude-sonnet-4-20250514-thinking-32k Anthropic · Proprietary
1400±4
36,207
65
5679
longcat-flash-chat Meituan · MIT
1399±6
11,548
66
5881
qwen3-235b-a22b-thinking-2507 Alibaba · Apache 2.0
1398±6
9,254
67
5879
deepseek-r1 DeepSeek · MIT
1398±5
18,537
68
5886
qwen3-vl-235b-a22b-thinking Alibaba · Apache 2.0
1395±7
7,971
69
5887
amazon-nova-experimental-chat-12-10 Amazon · Proprietary
1395±10
3,718
70
6084
mimo-v2-flash (non-thinking) Xiaomi · MIT
1394±5
16,651
71
6182
deepseek-v3-0324 DeepSeek · MIT
1394±4
46,690
72
5888
Tencent hunyuan-vision-1.5-thinking Tencent · Proprietary
1393±12
2,225
73
6386
mai-1-preview Microsoft AI · Proprietary
1391±5
18,134
74
6586
o4-mini-2025-04-16 OpenAI · Proprietary
1391±4
46,676
75
6587
gpt-5-mini-high OpenAI · Proprietary
1390±5
27,145
76
6587
Anthropic claude-sonnet-4-20250514 Anthropic · Proprietary
1390±4
41,645
77
6787
Anthropic claude-3-7-sonnet-20250219-thinking-32k Anthropic · Proprietary
1388±4
39,889
78
6588
o1-preview OpenAI · Proprietary
1388±5
31,120
79
6788
Minimax minimax-m2.1-preview MiniMax · MIT
1387±5
16,114
80
6590
Tencent hunyuan-t1-20250711 Tencent · Proprietary
1387±9
4,801
81
6888
qwen3-coder-480b-a35b-instruct Alibaba · Apache 2.0
1386±5
26,652
82
6590
Stepfun step-3.5-flash StepFun · Apache 2.0
1386±8
5,571
83
6988
mistral-medium-2505 Mistral · Proprietary
1385±5
34,548
84
7090
qwen3-30b-a3b-instruct-2507 Alibaba · Apache 2.0
1383±5
24,092
85
7091
Tencent hunyuan-turbos-20250416 Tencent · Proprietary
1382±6
11,053
86
7391
gpt-4.1-mini-2025-04-14 OpenAI · Proprietary
1382±4
40,532
87
7791
gemini-2.5-flash-lite-preview-09-2025-no-thinking Google · Proprietary
1380±4
47,414
88
69101
glm-4.6v Z.ai · MIT
1378±11
2,819
89
8298
gemini-2.5-flash-lite-preview-06-17-thinking Google · Proprietary
1375±4
33,929
90
8298
qwen3-235b-a22b Alibaba · Apache 2.0
1375±5
27,164
91
8598
qwen2.5-max Alibaba · Proprietary
1374±4
33,294
92
8898
Anthropic claude-3-5-sonnet-20241022 Anthropic · Proprietary
1373±3
89,471
93
88101
Anthropic claude-3-7-sonnet-20250219 Anthropic · Proprietary
1372±4
44,434
94
88101
glm-4.5-air Z.ai · MIT
1371±4
31,382
95
88104
qwen3-next-80b-a3b-thinking Alibaba · Apache 2.0
1369±6
13,859
96
88104
Minimax minimax-m1 MiniMax · Apache 2.0
1367±4
36,838
97
88106
amazon-nova-experimental-chat-11-10 Amazon · Proprietary
1367±5
16,393
98
92105
gemma-3-27b-it Google · Gemma
1365±4
48,694
99
92109
o3-mini-high OpenAI · Proprietary
1364±5
18,584
100
88114
glm-4.7-flash Z.ai · MIT
1363±7
7,376
101
92112
grok-3-mini-high xAI · Proprietary
1363±5
17,523
102
95112
gemini-2.0-flash-001 Google · Proprietary
1361±4
44,828
103
95119
deepseek-v3 DeepSeek · DeepSeek
1358±5
21,788
104
98121
grok-3-mini-beta xAI · Proprietary
1357±5
23,765
105
95126
intellect-3 Prime Intellect · MIT
1356±8
5,325
106
99125
mistral-small-2506 Mistral · Apache 2.0
1356±5
18,343
107
100125
gpt-oss-120b OpenAI · Apache 2.0
1354±4
30,963
108
100126
gemini-2.0-flash-lite-preview-02-05 Google · Proprietary
1353±4
24,951
109
97127
glm-4.5v Z.ai · MIT
1353±8
4,973
110
102125
Cohere command-a-03-2025 Cohere · CC-BY-NC-4.0
1353±3
57,439
111
103126
gemini-1.5-pro-002 Google · Proprietary
1351±3
55,607
112
103129
amazon-nova-experimental-chat-10-20 Amazon · Proprietary
1349±6
11,439
113
99139
Tencent hunyuan-turbos-20250226 Tencent · Proprietary
1349±12
2,226
114
104128
o3-mini OpenAI · Proprietary
1348±3
58,638
115
100139
amazon-nova-experimental-chat-10-09 Amazon · Proprietary
1347±11
2,890
116
99140
Nvidia llama-3.1-nemotron-ultra-253b-v1 Nvidia · Nvidia Open Model
1347±12
2,546
117
102138
qwen3-32b Alibaba · Apache 2.0
1347±9
3,932
118
103133
ling-flash-2.0 Ant Group · MIT
1347±7
7,038
119
103136
Minimax minimax-m2 MiniMax · Apache 2.0
1347±8
6,742
120
104136
Stepfun step-3 StepFun · Apache 2.0
1346±7
6,592
121
103137
qwen-plus-0125 Alibaba · Proprietary
1346±8
5,823
122
108130
gpt-4o-2024-05-13 OpenAI · Proprietary
1346±3
112,863
123
105140
glm-4-plus-0111 Zhipu · Proprietary
1343±8
5,760
124
111133
Anthropic claude-3-5-sonnet-20240620 Anthropic · Proprietary
1343±3
82,417
125
105142
gemma-3-12b-it Google · Gemma
1342±10
3,829
126
105144
Nvidia nvidia-llama-3.3-nemotron-super-49b-v1.5 Nvidia · Nvidia Open
1341±10
3,432
127
104145
Tencent hunyuan-turbo-0110 Tencent · Proprietary
1340±12
2,295
128
112144
gpt-5-nano-high OpenAI · Proprietary
1338±7
8,399
129
113144
nova-2-lite Amazon · Proprietary
1337±6
12,258
130
115141
o1-mini OpenAI · Proprietary
1337±4
51,986
131
115144
qwq-32b Alibaba · Apache 2.0
1336±4
26,118
132
117143
Metallama-3.1-405b-instruct-bf16 Meta · Llama 3.1 Community
1335±4
41,392
133
117144
gpt-4o-2024-08-06 OpenAI · Proprietary
1335±4
45,498
134
119144
grok-2-2024-08-13 xAI · Proprietary
1335±4
63,495
135
115144
gemini-advanced-0514 Google · Proprietary
1335±5
50,142
136
114151
Stepfun step-2-16k-exp-202412 StepFun · Proprietary
1334±9
4,829
137
121144
Metallama-3.1-405b-instruct-fp8 Meta · Llama 3.1 Community
1334±4
59,655
138
120152
olmo-3.1-32b-instruct Allen AI · Apache 2.0
1331±6
11,634
139
125157
01.AI yi-lightning 01 AI · Proprietary
1329±5
27,340
140
126157
qwen3-30b-a3b Alibaba · Apache 2.0
1328±5
27,431
141
127157
Metallama-4-maverick-17b-128e-instruct Meta · Llama 4
1328±4
41,136
142
117166
Nvidia llama-3.3-nemotron-49b-super-v1 Nvidia · Nvidia
1327±12
2,230
143
123166
Tencent hunyuan-large-2025-02-10 Tencent · Proprietary
1326±10
3,738
144
137162
gpt-4-turbo-2024-04-09 OpenAI · Proprietary
1324±4
98,130
145
137162
Anthropic claude-3-5-haiku-20241022 Anthropic · Proprietary
1324±3
71,183
146
128166
deepseek-v2.5-1210 DeepSeek · DeepSeek
1323±8
6,793
147
137162
gemini-1.5-pro-001 Google · Proprietary
1323±4
79,132
148
137163
Metallama-4-scout-17b-16e-instruct Meta · Llama
1323±5
31,235
149
138162
Anthropic claude-3-opus-20240229 Anthropic · Proprietary
1322±3
194,904
150
136167
gpt-4.1-nano-2025-04-14 OpenAI · Proprietary
1322±8
6,107
151
137166
Stepfun step-1o-turbo-202506 StepFun · Proprietary
1321±7
9,687
152
137168
ring-flash-2.0 Ant Group · MIT
1320±7
7,204
153
142166
Metallama-3.3-70b-instruct Meta · Llama-3.3
1320±3
55,576
154
139166
glm-4-plus Zhipu AI · Proprietary
1319±5
26,134
155
139168
gemma-3n-e4b-it Google · Gemma
1319±5
23,342
156
139169
qwen-max-0919 Alibaba · Qwen
1318±6
16,479
157
142166
gpt-4o-mini-2024-07-18 OpenAI · Proprietary
1318±3
68,801
158
139173
gpt-oss-20b OpenAI · Apache 2.0
1317±6
10,810
159
142173
Nvidia nvidia-nemotron-3-nano-30b-a3b-bf16 Nvidia · NVIDIA Open Model
1317±6
15,500
160
142175
qwen2.5-plus-1127 Alibaba · Proprietary
1315±6
10,179
161
146173
athene-v2-chat NexusFlow · NexusFlow
1314±5
24,746
162
147173
mistral-large-2407 Mistral · Mistral Research
1314±4
45,460
163
147174
gpt-4-0125-preview OpenAI · Proprietary
1313±4
93,439
164
147173
gpt-4-1106-preview OpenAI · Proprietary
1313±4
100,107
165
142178
Tencent hunyuan-standard-2025-02-10 Tencent · Proprietary
1312±10
3,905
166
139182
mercury Inception AI · Proprietary
1310±14
1,920
167
154177
gemini-1.5-flash-002 Google · Proprietary
1310±4
34,909
168
158178
grok-2-mini-2024-08-13 xAI · Proprietary
1308±4
52,574
169
158178
deepseek-v2.5 DeepSeek · DeepSeek
1307±5
24,574
170
158178
athene-70b-0725 NexusFlow · CC-BY-NC-4.0
1306±6
19,622
171
154178
olmo-3-32b-think Allen AI · Apache 2.0
1306±8
5,892
172
162178
mistral-large-2411 Mistral · MRL
1305±4
28,081
173
158178
magistral-medium-2506 Mistral · Proprietary
1305±6
12,065
174
164178
mistral-small-3.1-24b-instruct-2503 Mistral · Apache 2.0
1305±4
34,130
175
157185
gemma-3-4b-it Google · Gemma
1303±9
4,177
176
165178
qwen2.5-72b-instruct Alibaba · Qwen
1303±4
39,409
177
165188
Nvidia llama-3.1-nemotron-70b-instruct Nvidia · Llama 3.1
1299±8
7,136
178
166189
Tencent hunyuan-large-vision Tencent · Proprietary
1296±9
5,600
179
175189
Metallama-3.1-70b-instruct Meta · Llama 3.1 Community
1294±4
55,234
180
176190
amazon-nova-pro-v1.0 Amazon · Proprietary
1290±5
24,753
181
176193
jamba-1.5-large AI21 Labs · Jamba Open
1289±7
8,659
182
177191
gemma-2-27b-it Google · Gemma license
1288±3
75,764
183
175196
ibm-granite-h-small IBM · Apache 2.0
1288±8
5,682
184
176193
reka-core-20240904 Reka AI · Proprietary
1288±7
7,309
185
177193
gpt-4-0314 OpenAI · Proprietary
1287±5
54,167
186
175199
llama-3.1-tulu-3-70b Ai2 · Llama 3.1
1287±10
2,846
187
175199
Nvidia llama-3.1-nemotron-51b-instruct Nvidia · Llama 3.1
1287±10
3,749
188
177198
olmo-3.1-32b-think Allen AI · Apache 2.0
1286±7
8,479
189
178193
gemini-1.5-flash-001 Google · Proprietary
1286±4
62,823
190
181199
Anthropic claude-3-sonnet-20240229 Anthropic · Proprietary
1281±4
109,289
191
180199
gemma-2-9b-it-simpo Princeton · MIT
1280±7
10,069
192
182199
Nvidia nemotron-4-340b-instruct Nvidia · NVIDIA Open Model
1278±5
19,661
193
182201
Cohere command-r-plus-08-2024 Cohere · CC-BY-NC-4.0
1277±7
9,869
194
186199
Metallama-3-70b-instruct Meta · Llama 3 Community
1276±4
156,880
195
187200
gpt-4-0613 OpenAI · Proprietary
1276±4
88,721
196
186202
mistral-small-24b-instruct-2501 Mistral · Apache 2.0
1274±6
14,677
197
186203
glm-4-0520 Zhipu AI · Proprietary
1274±7
9,788
198
187205
reka-flash-20240904 Reka AI · Proprietary
1272±7
7,537
199
188208
qwen2.5-coder-32b-instruct Alibaba · Apache 2.0
1271±8
5,430
200
194208
Cohere c4ai-aya-expanse-32b Cohere · CC-BY-NC-4.0
1267±5
27,123
201
196208
gemma-2-9b-it Google · Gemma license
1266±4
54,615
202
195209
deepseek-coder-v2 DeepSeek · DeepSeek License
1264±6
15,147
203
198209
Cohere command-r-plus Cohere · CC-BY-NC-4.0
1262±4
77,556
204
197209
qwen2-72b-instruct Alibaba · Qianwen LICENSE
1262±5
37,325
205
199209
Anthropic claude-3-haiku-20240307 Anthropic · Proprietary
1261±4
117,705
206
198210
amazon-nova-lite-v1.0 Amazon · Proprietary
1261±5
19,376
207
199210
gemini-1.5-flash-8b-001 Google · Proprietary
1259±4
35,556
208
202210
Azure phi-4 Microsoft · MIT
1256±5
24,126
209
199216
olmo-2-0325-32b-instruct Allen AI · Apache-2.0
1252±11
3,335
210
206215
Cohere command-r-08-2024 Cohere · CC-BY-NC-4.0
1251±7
10,141
211
209219
mistral-large-2402 Mistral · Proprietary
1243±5
62,437
212
209219
amazon-nova-micro-v1.0 Amazon · Proprietary
1241±5
19,355
213
209223
jamba-1.5-mini AI21 Labs · Jamba Open
1239±7
8,854
214
209227
ministral-8b-2410 Mistral · MRL
1237±9
4,780
215
210227
gemini-pro-dev-api Google · Proprietary
1235±7
18,352
216
211227
qwen1.5-110b-chat Alibaba · Qianwen LICENSE
1234±6
26,191
217
211228
reka-flash-21b-20240226-online Reka AI · Proprietary
1234±7
15,451
218
211227
qwen1.5-72b-chat Alibaba · Qianwen LICENSE
1234±5
39,296
219
209229
Tencent hunyuan-standard-256k Tencent · Proprietary
1233±12
2,729
220
213228
mixtral-8x22b-instruct-v0.1 Mistral · Apache 2.0
1230±5
51,417
221
213229
Cohere command-r Cohere · CC-BY-NC-4.0
1227±5
54,038
222
213229
reka-flash-21b-20240226 Reka AI · Proprietary
1227±6
24,806
223
214230
gpt-3.5-turbo-0125 OpenAI · Proprietary
1224±5
66,191
224
218230
Metallama-3-8b-instruct Meta · Llama 3 Community
1223±4
104,636
225
214231
mistral-medium Mistral · Proprietary
1223±6
34,552
226
214231
Cohere c4ai-aya-expanse-8b Cohere · CC-BY-NC-4.0
1223±7
9,827
227
213234
gemini-pro Google · Proprietary
1222±12
6,390
228
214234
llama-3.1-tulu-3-8b Ai2 · Llama 3.1
1221±11
2,895
229
225234
01.AI yi-1.5-34b-chat 01 AI · Apache-2.0
1214±5
24,142
230
220236
HuggingFace zephyr-orpo-141b-A35b-v0.1 HuggingFace · Apache 2.0
1213±11
4,653
231
227234
Metallama-3.1-8b-instruct Meta · Llama 3.1 Community
1212±4
49,605
232
223240
granite-3.1-8b-instruct IBM · Apache 2.0
1209±11
3,092
233
227240
qwen1.5-32b-chat Alibaba · Qianwen LICENSE
1204±6
21,744
234
227242
gpt-3.5-turbo-1106 OpenAI · Proprietary
1203±9
16,616
235
231241
gemma-2-2b-it Google · Gemma license
1199±4
46,618
236
231242
Azure phi-3-medium-4k-instruct Microsoft · MIT
1198±5
25,055
237
232242
mixtral-8x7b-instruct-v0.1 Mistral · Apache 2.0
1198±4
73,505
238
232247
dbrx-instruct-preview Databricks · DBRX LICENSE
1195±6
32,196
239
232251
InternLM internlm2_5-20b-chat InternLM · Other
1191±7
9,902
240
232251
qwen1.5-14b-chat Alibaba · Qianwen LICENSE
1191±7
17,841
241
235257
Azure wizardlm-70b Microsoft · Llama 2 Community
1185±9
8,214
242
234258
deepseek-llm-67b-chat DeepSeek · DeepSeek License
1185±12
4,933
243
238254
01.AI yi-34b-chat 01 AI · Yi License
1184±7
15,483
244
238257
OpenChat openchat-3.5-0106 OpenChat · Apache-2.0
1183±8
12,636
245
238258
OpenChat openchat-3.5 OpenChat · Apache-2.0
1183±10
7,967
246
238258
granite-3.0-8b-instruct IBM · Apache 2.0
1182±9
6,643
247
239258
gemma-1.1-7b-it Google · Gemma license
1180±6
23,893
248
239258
Snowflake snowflake-arctic-instruct Snowflake · Apache 2.0
1180±6
32,836
249
238259
granite-3.1-2b-instruct IBM · Apache 2.0
1180±11
3,191
250
239259
tulu-2-dpo-70b AllenAI/UW · AI2 ImpACT Low-risk
1178±10
6,534
251
239262
openhermes-2.5-mistral-7b Nous Research · Apache-2.0
1176±10
5,006
252
241261
vicuna-33b LMSYS · Non-commercial
1173±6
22,479
253
241264
starling-lm-7b-beta Nexusflow · Apache-2.0
1172±7
16,057
254
241262
Azure phi-3-small-8k-instruct Microsoft · MIT
1172±6
17,763
255
242262
Metallama-2-70b-chat Meta · Llama 2 Community
1171±6
38,491
256
242265
starling-lm-7b-alpha UC Berkeley · CC-BY-NC-4.0
1168±8
10,224
257
244265
Metallama-3.2-3b-instruct Meta · Llama 3.2
1167±8
7,936
258
242268
nous-hermes-2-mixtral-8x7b-dpo Nous Research · Apache-2.0
1165±12
3,776
259
249275
qwq-32b-preview Alibaba · Apache 2.0
1157±12
3,233
260
255272
granite-3.0-2b-instruct IBM · Apache 2.0
1156±8
6,837
261
251275
Nvidia llama2-70b-steerlm-chat Nvidia · Llama 2 Community
1156±13
3,584
262
252278
solar-10.7b-instruct-v1.0 Upstage AI · CC-BY-NC-4.0
1153±13
4,155
263
251280
dolphin-2.2.1-mistral-7b Cognitive Computations · Apache-2.0
1152±15
1,679
264
256278
mpt-30b-chat MosaicML · CC-BY-NC-SA-4.0
1151±12
2,571
265
258275
mistral-7b-instruct-v0.2 Mistral · Apache-2.0
1150±7
19,402
266
258277
Azure wizardlm-13b Microsoft · Llama 2 Community
1150±9
7,046
267
255282
falcon-180b-chat TII · Falcon-180B TII License
1147±17
1,295
268
258281
qwen1.5-7b-chat Alibaba · Qianwen LICENSE
1144±10
4,735
269
259280
Azure phi-3-mini-4k-instruct-june-2024 Microsoft · MIT
1143±6
12,296
270
259281
Metallama-2-13b-chat Meta · Llama 2 Community
1142±7
19,171
271
259281
vicuna-13b LMSYS · Llama 2 Community
1141±7
19,366
272
259283
qwen-14b-chat Alibaba · Qianwen LICENSE
1139±11
4,964
273
260283
palm-2 Google · Proprietary
1138±9
8,554
274
260283
Metacode llama-34b-instruct Meta · Llama 2 Community
1137±9
7,363
275
260283
gemma-7b-it Google · Gemma license
1136±10
8,925
276
263284
HuggingFace zephyr-7b-beta HuggingFace · MIT
1131±9
11,116
277
266284
Azure phi-3-mini-128k-instruct Microsoft · MIT
1130±7
20,691
278
268284
Azure phi-3-mini-4k-instruct Microsoft · MIT
1129±6
20,115
279
264287
guanaco-33b UW · Non-commercial
1128±12
2,921
280
263288
HuggingFace zephyr-7b-alpha HuggingFace · MIT
1127±16
1,785
281
271288
stripedhyena-nous-7b Together AI · Apache 2.0
1121±11
5,184
282
266289
Metacode llama-70b-instruct Meta · Llama 2 Community
1119±18
1,143
283
276288
vicuna-7b LMSYS · Llama 2 Community
1115±9
6,923
284
272289
HuggingFace smollm2-1.7b-instruct HuggingFace · Apache 2.0
1115±14
2,201
285
279288
gemma-1.1-2b-it Google · Gemma license
1114±8
10,853
286
279288
Metallama-3.2-1b-instruct Meta · Llama 3.2
1112±8
8,045
287
279289
mistral-7b-instruct Mistral · Apache 2.0
1110±9
8,977
288
280289
Metallama-2-7b-chat Meta · Llama 2 Community
1108±7
14,148
289
285293
gemma-2b-it Google · Gemma license
1092±12
4,779
290
289292
qwen1.5-4b-chat Alibaba · Qianwen LICENSE
1091±9
7,598
291
289296
olmo-7b-instruct Allen AI · Apache-2.0
1075±11
6,329
292
290296
koala-13b UC Berkeley · Non-commercial
1071±10
6,964
293
291296
alpaca-13b Stanford · Non-commercial
1068±12
5,745
294
289297
gpt4all-13b-snoozy Nomic AI · Non-commercial
1066±15
1,743
295
291297
mpt-7b-chat MosaicML · CC-BY-NC-SA-4.0
1062±12
3,925
296
291297
chatglm3-6b Tsinghua · Apache-2.0
1056±12
4,658
297
294299
RWKV RWKV-4-Raven-14B RWKV · Apache 2.0
1042±11
4,845
298
297299
chatglm2-6b Tsinghua · Apache-2.0
1025±14
2,657
299
297299
oasst-pythia-12b OpenAssistant · Apache 2.0
1023±11
6,311
300
300303
chatglm-6b Tsinghua · Non-commercial
996±13
4,914
301
300303
fastchat-t5-3b LMSYS · Apache 2.0
992±12
4,203
302
300303
dolly-v2-12b Databricks · MIT
981±14
3,412
303
300304
Metallama-13b Meta · Non-commercial
973±16
2,391
304
303304
Stability stablelm-tuned-alpha-7b Stability AI · CC-BY-NC-SA-4.0
953±13
3,287
Notes
Rank (UB): Rankings computed based on the Bradley-Terry model. This ranking reflects the models' overall performance in the arena and provides their Elo scores' upper bound estimate, helping to understand the models' potential competitiveness.
Model: Names of the large language models (LLMs). Some model names may contain embedded links.
Score: The models' Elo scores in the arena obtained via user votes. Elo is a relative ranking system; higher scores indicate better performance.
95% confidence interval (±): The 95% confidence interval for the model's Elo score (for example:
±6). A smaller interval indicates more stable and reliable model scoring.Votes: The total number of votes the model received in the arena. More votes generally imply higher statistical reliability of its score.
Organization/Company: The organization or company that provides the model.
License: The type of license for the model, e.g., Proprietary, Apache 2.0, MIT, etc.
Data sources and update frequency
The data for this leaderboard is fetched automatically by scripts directly from the 1 2 official website. This leaderboard is automatically updated daily by GitHub Actions.
Disclaimer
This report is for reference only. The leaderboard data is dynamic and based on user preference votes on Chatbot Arena during specific time periods. The completeness and accuracy of the data depend on upstream sources. Different models may use different license agreements; please consult the model provider's official documentation when using them.
Last updated
Was this helpful?