Model Rankings
This is a leaderboard based on Chatbot Arena (lmarena.ai) data, generated through an automated process.
Data update time: 2025-12-22 08:09:23 UTC / 2025-12-22 16:09:23 CST (Beijing Time)
Leaderboard
1
1◄─►2
gemini-3-pro
1490
±6
19,199
Proprietary
2
2◄─►6
grok-4.1-thinking
1477
±6
20,074
xAI
Proprietary
3
1◄─►7
gemini-3-flash
1475Preliminary
±10
4,431
Proprietary
4
2◄─►8
Anthropicclaude-opus-4-5-20251101-thinking-32k
1469
±6
12,829
Anthropic
Proprietary
5
2◄─►8
Anthropicclaude-opus-4-5-20251101
1465
±6
13,654
Anthropic
Proprietary
6
3◄─►8
grok-4.1
1465
±6
20,553
xAI
Proprietary
7
2◄─►11
gemini-3-flash (thinking-minimal)
1463Preliminary
±9
5,605
Proprietary
8
4◄─►11
gpt-5.1-high
1457
±6
17,084
OpenAI
Proprietary
9
7◄─►15
gemini-2.5-pro
1451
±3
79,900
Proprietary
10
7◄─►15
Anthropicclaude-sonnet-4-5-20250929-thinking-32k
1450
±4
31,225
Anthropic
Proprietary
11
9◄─►15
Anthropicclaude-opus-4-1-20250805-thinking-16k
1448
±4
46,958
Anthropic
Proprietary
12
9◄─►18
Anthropicclaude-sonnet-4-5-20250929
1446
±5
26,513
Anthropic
Proprietary
13
9◄─►22
gpt-4.5-preview-2025-02-27
1442
±6
14,644
OpenAI
Proprietary
14
7◄─►27
gpt-5.2
1442Preliminary
±12
2,422
OpenAI
Proprietary
15
12◄─►22
Anthropicclaude-opus-4-1-20250805
1440
±4
60,044
Anthropic
Proprietary
16
12◄─►22
chatgpt-4o-latest-20250326
1440
±3
66,721
OpenAI
Proprietary
17
9◄─►24
gpt-5.2-high
1440Preliminary
±9
6,192
OpenAI
Proprietary
18
12◄─►24
gpt-5.1
1437
±6
18,423
OpenAI
Proprietary
19
13◄─►24
gpt-5-high
1436
±5
32,779
OpenAI
Proprietary
20
13◄─►25
o3-2025-04-16
1434
±4
61,385
OpenAI
Proprietary
21
13◄─►29
qwen3-max-preview
1433
±5
28,039
Alibaba
Proprietary
22
13◄─►33
grok-4-1-fast-reasoning
1431
±6
12,526
xAI
Proprietary
23
16◄─►38
MoonshotAIkimi-k2-thinking-turbo
1428
±5
18,724
Moonshot
Modified MIT
24
16◄─►43
ernie-5.0-preview-1103
1427Preliminary
±8
6,699
Baidu
Proprietary
25
20◄─►43
glm-4.6
1425
±5
27,300
Z.ai
MIT
26
21◄─►43
gpt-5-chat
1425
±4
32,045
OpenAI
Proprietary
27
19◄─►43
qwen3-max-2025-09-23
1424
±6
9,227
Alibaba
Proprietary
28
20◄─►43
deepseek-v3.2-exp
1423
±7
11,929
DeepSeek AI
MIT
29
22◄─►43
Anthropicclaude-opus-4-20250514-thinking-16k
1423
±4
37,714
Anthropic
Proprietary
30
21◄─►46
deepseek-v3.2-thinking
1422
±7
8,916
DeepSeek AI
MIT
31
23◄─►43
qwen3-235b-a22b-instruct-2507
1422
±4
54,369
Alibaba
Apache 2.0
32
22◄─►46
deepseek-v3.2-exp-thinking
1421
±7
9,198
DeepSeek AI
MIT
33
22◄─►49
grok-4-fast-chat
1420
±8
7,040
xAI
Proprietary
34
22◄─►49
mistral-large-3
1419Preliminary
±7
9,518
Mistral
Apache 2.0
35
23◄─►49
MoonshotAIkimi-k2-0905-preview
1418
±7
11,845
Moonshot
Modified MIT
36
23◄─►49
deepseek-r1-0528
1418
±6
19,162
DeepSeek
MIT
37
24◄─►50
deepseek-v3.1
1416
±6
15,213
DeepSeek
MIT
38
24◄─►49
MoonshotAIkimi-k2-0711-preview
1416
±5
28,532
Moonshot
Modified MIT
39
24◄─►51
deepseek-v3.1-thinking
1416
±7
11,947
DeepSeek
MIT
40
24◄─►51
deepseek-v3.2
1415
±7
10,253
DeepSeek AI
MIT
41
23◄─►54
deepseek-v3.1-terminus
1415
±10
3,737
DeepSeek AI
MIT
42
24◄─►51
qwen3-vl-235b-a22b-instruct
1414
±7
9,011
Alibaba
Apache 2.0
43
23◄─►57
deepseek-v3.1-terminus-thinking
1414
±10
3,511
DeepSeek AI
MIT
44
31◄─►51
gpt-4.1-2025-04-14
1412
±4
52,398
OpenAI
Proprietary
45
31◄─►51
Anthropicclaude-opus-4-20250514
1412
±4
45,477
Anthropic
Proprietary
46
31◄─►51
mistral-medium-2508
1411
±4
48,471
Mistral
Proprietary
47
33◄─►54
grok-3-preview-02-24
1410
±4
34,039
xAI
Proprietary
48
33◄─►56
grok-4-0709
1409
±4
42,410
xAI
Proprietary
49
33◄─►57
glm-4.5
1408
±5
24,736
Z.ai
MIT
50
38◄─►56
gemini-2.5-flash
1408
±3
79,419
Proprietary
51
39◄─►60
gemini-2.5-flash-preview-09-2025
1405
±4
32,599
Proprietary
52
45◄─►62
Anthropicclaude-haiku-4-5-20251001
1402
±4
29,594
Anthropic
Proprietary
53
45◄─►63
grok-4-fast-reasoning
1402
±5
18,821
xAI
Proprietary
54
47◄─►63
o1-2024-12-17
1400
±4
28,039
OpenAI
Proprietary
55
47◄─►65
qwen3-next-80b-a3b-instruct
1400
±5
23,035
Alibaba
Apache 2.0
56
45◄─►66
longcat-flash-chat
1400
±6
11,470
Meituan
MIT
57
49◄─►65
qwen3-235b-a22b-no-thinking
1399
±4
39,246
Alibaba
Apache 2.0
58
51◄─►66
Anthropicclaude-sonnet-4-20250514-thinking-32k
1399
±4
36,070
Anthropic
Proprietary
59
51◄─►71
qwen3-235b-a22b-thinking-2507
1397
±6
9,311
Alibaba
Apache 2.0
60
52◄─►71
deepseek-r1
1396
±5
18,718
DeepSeek
MIT
61
52◄─►72
qwen3-vl-235b-a22b-thinking
1394
±7
7,965
Alibaba
Apache 2.0
62
53◄─►72
gpt-5-mini-high
1392
±5
27,344
OpenAI
Proprietary
63
55◄─►71
deepseek-v3-0324
1392
±4
46,624
DeepSeek
MIT
64
51◄─►79
Tencenthunyuan-vision-1.5-thinking
1391
±12
2,210
Tencent
Proprietary
65
57◄─►72
o4-mini-2025-04-16
1391
±4
46,703
OpenAI
Proprietary
66
55◄─►74
mai-1-preview
1390
±5
18,116
Microsoft AI
Proprietary
67
59◄─►75
Anthropicclaude-sonnet-4-20250514
1389
±4
41,493
Anthropic
Proprietary
68
59◄─►75
o1-preview
1387
±5
31,505
OpenAI
Proprietary
69
59◄─►75
Anthropicclaude-3-7-sonnet-20250219-thinking-32k
1387
±4
39,817
Anthropic
Proprietary
70
59◄─►77
qwen3-coder-480b-a35b-instruct
1386
±5
23,591
Alibaba
Apache 2.0
71
59◄─►80
Tencenthunyuan-t1-20250711
1385
±9
4,803
Tencent
Proprietary
72
62◄─►79
mistral-medium-2505
1383
±5
34,401
Mistral
Proprietary
73
65◄─►79
qwen3-30b-a3b-instruct-2507
1382
±5
24,120
Alibaba
Apache 2.0
74
66◄─►80
gpt-4.1-mini-2025-04-14
1380
±4
40,370
OpenAI
Proprietary
75
65◄─►83
Tencenthunyuan-turbos-20250416
1380
±6
11,094
Tencent
Proprietary
76
69◄─►83
gemini-2.5-flash-lite-preview-09-2025-no-thinking
1378
±4
32,630
Proprietary
77
70◄─►84
gemini-2.5-flash-lite-preview-06-17-thinking
1375
±4
33,818
Proprietary
78
70◄─►85
qwen3-235b-a22b
1374
±5
27,077
Alibaba
Apache 2.0
79
73◄─►85
qwen2.5-max
1373
±4
33,489
Alibaba
Proprietary
80
75◄─►85
Anthropicclaude-3-5-sonnet-20241022
1372
±3
89,727
Anthropic
Proprietary
81
75◄─►88
Anthropicclaude-3-7-sonnet-20250219
1371
±4
44,473
Anthropic
Proprietary
82
69◄─►94
amazon-nova-experimental-chat-11-10
1370Preliminary
±12
2,743
Amazon
Proprietary
83
75◄─►88
glm-4.5-air
1370
±4
31,551
Z.ai
MIT
84
77◄─►91
qwen3-next-80b-a3b-thinking
1367
±6
13,779
Alibaba
Apache 2.0
85
78◄─►90
Minimaxminimax-m1
1366
±4
36,732
MiniMax
Apache 2.0
86
81◄─►91
gemma-3-27b-it
1364
±4
49,181
Gemma
87
81◄─►96
o3-mini-high
1362
±5
18,735
OpenAI
Proprietary
88
81◄─►96
grok-3-mini-high
1362
±5
17,505
xAI
Proprietary
89
83◄─►98
gemini-2.0-flash-001
1360
±4
45,015
Proprietary
90
83◄─►107
deepseek-v3
1357
±5
21,994
DeepSeek
DeepSeek
91
84◄─►107
grok-3-mini-beta
1357
±5
23,701
xAI
Proprietary
92
86◄─►112
mistral-small-2506
1355
±5
18,254
Mistral
Apache 2.0
93
89◄─►113
gpt-oss-120b
1352
±4
31,145
OpenAI
Apache 2.0
94
89◄─►112
gemini-2.0-flash-lite-preview-02-05
1352
±4
25,215
Proprietary
95
86◄─►115
glm-4.5v
1352
±8
4,971
Z.ai
MIT
96
90◄─►112
Coherecommand-a-03-2025
1352
±3
57,648
Cohere
CC-BY-NC-4.0
97
90◄─►113
gemini-1.5-pro-002
1351
±3
56,012
Proprietary
98
86◄─►116
intellect-3
1351
±9
4,753
Prime Intellect
MIT
99
90◄─►116
amazon-nova-experimental-chat-10-20
1349
±6
10,971
Amazon
Proprietary
100
92◄─►115
o3-mini
1348
±3
58,693
OpenAI
Proprietary
101
87◄─►126
Tencenthunyuan-turbos-20250226
1346
±12
2,250
Tencent
Proprietary
102
90◄─►119
ling-flash-2.0
1346
±7
7,126
Ant Group
MIT
103
90◄─►122
Minimaxminimax-m2
1346
±8
7,093
MiniMax
Apache 2.0
104
90◄─►121
Stepfunstep-3
1346
±7
6,620
StepFun
Apache 2.0
105
87◄─►127
Nvidiallama-3.1-nemotron-ultra-253b-v1
1346
±12
2,573
Nvidia
Nvidia Open Model
106
89◄─►126
amazon-nova-experimental-chat-10-09
1345
±11
2,878
Amazon
Proprietary
107
94◄─►116
gpt-4o-2024-05-13
1345
±3
113,568
OpenAI
Proprietary
108
90◄─►126
qwen3-32b
1345
±9
3,943
Alibaba
Apache 2.0
109
90◄─►126
qwen-plus-0125
1344
±8
5,861
Alibaba
Proprietary
110
92◄─►126
glm-4-plus-0111
1343
±8
5,806
Zhipu
Proprietary
111
97◄─►119
Anthropicclaude-3-5-sonnet-20240620
1342
±3
82,864
Anthropic
Proprietary
112
92◄─►129
gemma-3-12b-it
1340
±9
3,866
Gemma
113
92◄─►131
Nvidianvidia-llama-3.3-nemotron-super-49b-v1.5
1340
±10
3,469
Nvidia
Nvidia Open
114
92◄─►133
Tencenthunyuan-turbo-0110
1339
±11
2,322
Tencent
Proprietary
115
97◄─►128
gpt-5-nano-high
1338
±7
8,369
OpenAI
Proprietary
116
99◄─►131
nova-2-lite
1336
±7
8,997
Amazon
Proprietary
117
102◄─►128
o1-mini
1335
±4
52,301
OpenAI
Proprietary
118
104◄─►128
Metallama-3.1-405b-instruct-bf16
1335
±4
41,932
Meta
Llama 3.1 Community
119
104◄─►131
gpt-4o-2024-08-06
1334
±4
45,787
OpenAI
Proprietary
120
106◄─►129
grok-2-2024-08-13
1334
±4
63,725
xAI
Proprietary
121
105◄─►131
qwq-32b
1334
±4
26,196
Alibaba
Apache 2.0
122
102◄─►131
gemini-advanced-0514
1334
±5
50,654
Proprietary
123
106◄─►131
Metallama-3.1-405b-instruct-fp8
1333
±3
60,272
Meta
Llama 3.1 Community
124
102◄─►141
Stepfunstep-2-16k-exp-202412
1332
±9
4,895
StepFun
Proprietary
125
112◄─►142
01.AIyi-lightning
1328
±5
27,624
01 AI
Proprietary
126
115◄─►143
Metallama-4-maverick-17b-128e-instruct
1327
±4
41,093
Meta
Llama 4
127
106◄─►153
Nvidianvidia-nemotron-3-nano-30b-a3b-bf16
1326Preliminary
±10
4,247
Nvidia
NVIDIA Open Model
128
117◄─►145
qwen3-30b-a3b
1326
±5
27,405
Alibaba
Apache 2.0
129
106◄─►154
Nvidiallama-3.3-nemotron-49b-super-v1
1325
±12
2,243
Nvidia
Nvidia
130
111◄─►153
Tencenthunyuan-large-2025-02-10
1325
±10
3,760
Tencent
Proprietary
131
123◄─►146
gpt-4-turbo-2024-04-09
1324
±4
98,965
OpenAI
Proprietary
132
124◄─►148
Anthropicclaude-3-5-haiku-20241022
1322
±3
71,241
Anthropic
Proprietary
133
124◄─►148
Metallama-4-scout-17b-16e-instruct
1322
±5
31,074
Meta
Llama
134
117◄─►154
deepseek-v2.5-1210
1322
±8
6,877
DeepSeek
DeepSeek
135
124◄─►148
gemini-1.5-pro-001
1322
±4
79,769
Proprietary
136
124◄─►148
Anthropicclaude-3-opus-20240229
1322
±3
196,368
Anthropic
Proprietary
137
123◄─►154
gpt-4.1-nano-2025-04-14
1321
±8
6,143
OpenAI
Proprietary
138
124◄─►154
ring-flash-2.0
1320
±7
7,250
Ant Group
MIT
139
124◄─►154
Stepfunstep-1o-turbo-202506
1319
±7
9,635
StepFun
Proprietary
140
127◄─►153
Metallama-3.3-70b-instruct
1319
±3
55,961
Meta
Llama-3.3
141
125◄─►154
gemma-3n-e4b-it
1318
±5
23,421
Gemma
142
126◄─►154
glm-4-plus
1318
±5
26,342
Zhipu AI
Proprietary
143
124◄─►155
gpt-oss-20b
1318
±6
10,828
OpenAI
Apache 2.0
144
127◄─►155
qwen-max-0919
1317
±6
16,598
Alibaba
Qwen
145
129◄─►154
gpt-4o-mini-2024-07-18
1316
±3
69,290
OpenAI
Proprietary
146
128◄─►161
qwen2.5-plus-1127
1314
±6
10,252
Alibaba
Proprietary
147
133◄─►159
gpt-4-1106-preview
1313
±4
101,117
OpenAI
Proprietary
148
133◄─►159
mistral-large-2407
1313
±4
45,968
Mistral
Mistral Research
149
133◄─►159
gpt-4-0125-preview
1313
±4
94,534
OpenAI
Proprietary
150
133◄─►160
athene-v2-chat
1313
±4
24,880
NexusFlow
NexusFlow
151
124◄─►164
mercury
1311
±14
1,961
Inception AI
Proprietary
152
129◄─►164
Tencenthunyuan-standard-2025-02-10
1310
±10
3,920
Tencent
Proprietary
153
136◄─►161
gemini-1.5-flash-002
1310
±4
35,180
Proprietary
154
133◄─►164
olmo-3-32b-think
1308
±9
5,307
Allen AI
Apache 2.0
155
146◄─►163
grok-2-mini-2024-08-13
1307
±4
52,789
xAI
Proprietary
156
146◄─►164
deepseek-v2.5
1306
±5
24,839
DeepSeek
DeepSeek
157
146◄─►164
athene-70b-0725
1305
±6
19,796
NexusFlow
CC-BY-NC-4.0
158
149◄─►164
mistral-large-2411
1304
±4
28,455
Mistral
MRL
159
146◄─►164
magistral-medium-2506
1304
±6
11,957
Mistral
Proprietary
160
150◄─►164
mistral-small-3.1-24b-instruct-2503
1303
±4
34,030
Mistral
Apache 2.0
161
144◄─►170
gemma-3-4b-it
1302
±9
4,195
Gemma
162
152◄─►164
qwen2.5-72b-instruct
1302
±4
39,632
Alibaba
Qwen
163
152◄─►173
Nvidiallama-3.1-nemotron-70b-instruct
1297
±8
7,216
Nvidia
Llama 3.1
164
153◄─►175
Tencenthunyuan-large-vision
1294
±9
5,575
Tencent
Proprietary
165
162◄─►173
Metallama-3.1-70b-instruct
1293
±4
56,003
Meta
Llama 3.1 Community
166
162◄─►178
jamba-1.5-large
1288
±7
8,730
AI21 Labs
Jamba Open
167
163◄─►176
amazon-nova-pro-v1.0
1288
±4
25,218
Amazon
Proprietary
168
162◄─►183
ibm-granite-h-small
1287
±8
5,690
IBM
Apache 2.0
169
163◄─►176
gemma-2-27b-it
1287
±3
76,195
Gemma license
170
162◄─►178
reka-core-20240904
1287
±7
7,380
Reka AI
Proprietary
171
163◄─►178
gpt-4-0314
1286
±5
54,754
OpenAI
Proprietary
172
162◄─►184
Nvidiallama-3.1-nemotron-51b-instruct
1286
±10
3,777
Nvidia
Llama 3.1
173
162◄─►184
llama-3.1-tulu-3-70b
1286
±10
2,881
Ai2
Llama 3.1
174
165◄─►178
gemini-1.5-flash-001
1284
±4
63,418
Proprietary
175
166◄─►183
Anthropicclaude-3-sonnet-20240229
1281
±4
110,173
Anthropic
Proprietary
176
165◄─►184
gemma-2-9b-it-simpo
1278
±7
10,108
Princeton
MIT
177
168◄─►184
Nvidianemotron-4-340b-instruct
1277
±5
19,913
Nvidia
NVIDIA Open Model
178
168◄─►185
Coherecommand-r-plus-08-2024
1277
±7
9,931
Cohere
CC-BY-NC-4.0
179
172◄─►184
Metallama-3-70b-instruct
1276
±3
158,908
Meta
Llama 3 Community
180
172◄─►185
gpt-4-0613
1275
±4
89,612
OpenAI
Proprietary
181
172◄─►187
mistral-small-24b-instruct-2501
1273
±6
14,830
Mistral
Apache 2.0
182
172◄─►189
glm-4-0520
1273
±7
9,857
Zhipu AI
Proprietary
183
172◄─►189
reka-flash-20240904
1272
±7
7,583
Reka AI
Proprietary
184
174◄─►193
qwen2.5-coder-32b-instruct
1269
±8
5,452
Alibaba
Apache 2.0
185
179◄─►193
Coherec4ai-aya-expanse-32b
1266
±5
27,362
Cohere
CC-BY-NC-4.0
186
181◄─►193
gemma-2-9b-it
1264
±4
54,954
Gemma license
187
181◄─►195
deepseek-coder-v2
1264
±6
15,242
DeepSeek AI
DeepSeek License
188
182◄─►194
Coherecommand-r-plus
1263
±4
78,401
Cohere
CC-BY-NC-4.0
189
182◄─►195
qwen2-72b-instruct
1262
±5
37,688
Alibaba
Qianwen LICENSE
190
184◄─►195
Anthropicclaude-3-haiku-20240307
1261
±4
118,626
Anthropic
Proprietary
191
184◄─►195
amazon-nova-lite-v1.0
1259
±5
19,760
Amazon
Proprietary
192
184◄─►195
gemini-1.5-flash-8b-001
1259
±4
35,914
Proprietary
193
187◄─►195
Azurephi-4
1255
±4
24,354
Microsoft
MIT
194
184◄─►201
olmo-2-0325-32b-instruct
1252
±11
3,377
Allen AI
Apache-2.0
195
188◄─►200
Coherecommand-r-08-2024
1251
±7
10,229
Cohere
CC-BY-NC-4.0
196
194◄─►204
mistral-large-2402
1242
±5
63,404
Mistral
Proprietary
197
194◄─►204
amazon-nova-micro-v1.0
1241
±5
19,774
Amazon
Proprietary
198
194◄─►209
jamba-1.5-mini
1239
±7
8,918
AI21 Labs
Jamba Open
199
194◄─►212
ministral-8b-2410
1237
±9
4,833
Mistral
MRL
200
195◄─►212
gemini-pro-dev-api
1234
±7
18,454
Proprietary
201
196◄─►212
qwen1.5-110b-chat
1234
±5
26,679
Alibaba
Qianwen LICENSE
202
196◄─►212
qwen1.5-72b-chat
1234
±5
39,689
Alibaba
Qianwen LICENSE
203
196◄─►213
reka-flash-21b-20240226-online
1233
±7
15,606
Reka AI
Proprietary
204
194◄─►214
Tencenthunyuan-standard-256k
1233
±12
2,761
Tencent
Proprietary
205
198◄─►213
mixtral-8x22b-instruct-v0.1
1230
±4
52,214
Mistral
Apache 2.0
206
198◄─►214
Coherecommand-r
1228
±5
54,710
Cohere
CC-BY-NC-4.0
207
198◄─►214
reka-flash-21b-20240226
1227
±6
25,026
Reka AI
Proprietary
208
199◄─►215
gpt-3.5-turbo-0125
1224
±5
67,214
OpenAI
Proprietary
209
199◄─►216
mistral-medium
1223
±5
34,893
Mistral
Proprietary
210
199◄─►216
Coherec4ai-aya-expanse-8b
1223
±7
9,922
Cohere
CC-BY-NC-4.0
211
203◄─►215
Metallama-3-8b-instruct
1223
±4
106,055
Meta
Llama 3 Community
212
198◄─►219
gemini-pro
1222
±12
6,418
Proprietary
213
198◄─►218
llama-3.1-tulu-3-8b
1221
±11
2,943
Ai2
Llama 3.1
214
205◄─►220
HuggingFacezephyr-orpo-141b-A35b-v0.1
1214
±11
4,712
HuggingFace
Apache 2.0
215
210◄─►219
01.AIyi-1.5-34b-chat
1213
±5
24,417
01 AI
Apache-2.0
216
212◄─►219
Metallama-3.1-8b-instruct
1211
±4
50,234
Meta
Llama 3.1 Community
217
208◄─►225
granite-3.1-8b-instruct
1210
±11
3,142
IBM
Apache 2.0
218
212◄─►224
qwen1.5-32b-chat
1205
±6
22,068
Alibaba
Qianwen LICENSE
219
213◄─►227
gpt-3.5-turbo-1106
1202
±9
16,760
OpenAI
Proprietary
220
216◄─►227
Azurephi-3-medium-4k-instruct
1198
±5
25,301
Microsoft
MIT
221
217◄─►227
gemma-2-2b-it
1198
±4
46,901
Gemma license
222
217◄─►227
mixtral-8x7b-instruct-v0.1
1198
±4
74,303
Mistral
Apache 2.0
223
217◄─►232
dbrx-instruct-preview
1196
±6
32,760
Databricks
DBRX LICENSE
224
217◄─►236
qwen1.5-14b-chat
1193
±7
18,066
Alibaba
Qianwen LICENSE
225
218◄─►236
InternLMinternlm2_5-20b-chat
1192
±7
10,038
InternLM
Other
226
219◄─►242
Azurewizardlm-70b
1185
±9
8,270
Microsoft
Llama 2 Community
227
219◄─►243
deepseek-llm-67b-chat
1184
±12
4,950
DeepSeek AI
DeepSeek License
228
223◄─►240
01.AIyi-34b-chat
1184
±7
15,624
01 AI
Yi License
229
223◄─►242
granite-3.0-8b-instruct
1183
±9
6,727
IBM
Apache 2.0
230
223◄─►242
OpenChatopenchat-3.5-0106
1183
±8
12,712
OpenChat
Apache-2.0
231
223◄─►243
OpenChatopenchat-3.5
1182
±10
8,009
OpenChat
Apache-2.0
232
223◄─►244
granite-3.1-2b-instruct
1181
±11
3,235
IBM
Apache 2.0
233
224◄─►242
Snowflakesnowflake-arctic-instruct
1180
±6
33,272
Snowflake
Apache 2.0
234
224◄─►243
gemma-1.1-7b-it
1180
±6
24,327
Gemma license
235
224◄─►245
tulu-2-dpo-70b
1179
±10
6,579
AllenAI/UW
AI2 ImpACT Low-risk
236
224◄─►247
openhermes-2.5-mistral-7b
1176
±10
5,026
NousResearch
Apache-2.0
237
226◄─►246
vicuna-33b
1173
±6
22,613
LMSYS
Non-commercial
238
226◄─►247
starling-lm-7b-beta
1173
±7
16,190
Nexusflow
Apache-2.0
239
226◄─►247
Azurephi-3-small-8k-instruct
1172
±6
17,983
Microsoft
MIT
240
227◄─►247
Metallama-2-70b-chat
1171
±5
38,767
Meta
Llama 2 Community
241
227◄─►250
starling-lm-7b-alpha
1168
±8
10,267
UC Berkeley
CC-BY-NC-4.0
242
231◄─►251
Metallama-3.2-3b-instruct
1167
±8
8,043
Meta
Llama 3.2
243
226◄─►252
nous-hermes-2-mixtral-8x7b-dpo
1166
±12
3,792
NousResearch
Apache-2.0
244
234◄─►257
qwq-32b-preview
1159
±11
3,256
Alibaba
Apache 2.0
245
235◄─►260
Nvidiallama2-70b-steerlm-chat
1157
±13
3,605
Nvidia
Llama 2 Community
246
241◄─►257
granite-3.0-2b-instruct
1157
±8
6,922
IBM
Apache 2.0
247
237◄─►261
solar-10.7b-instruct-v1.0
1154
±13
4,187
Upstage AI
CC-BY-NC-4.0
248
236◄─►265
dolphin-2.2.1-mistral-7b
1152
±15
1,685
Cognitive Computations
Apache-2.0
249
241◄─►263
mpt-30b-chat
1151
±12
2,606
MosaicML
CC-BY-NC-SA-4.0
250
243◄─►261
mistral-7b-instruct-v0.2
1151
±7
19,603
Mistral
Apache-2.0
251
242◄─►261
Azurewizardlm-13b
1150
±9
7,122
Microsoft
Llama 2 Community
252
241◄─►268
falcon-180b-chat
1147
±17
1,312
TII
Falcon-180B TII License
253
244◄─►266
qwen1.5-7b-chat
1144
±10
4,782
Alibaba
Qianwen LICENSE
254
244◄─►265
Azurephi-3-mini-4k-instruct-june-2024
1143
±6
12,415
Microsoft
MIT
255
244◄─►265
Metallama-2-13b-chat
1143
±7
19,357
Meta
Llama 2 Community
256
244◄─►266
vicuna-13b
1142
±7
19,539
LMSYS
Llama 2 Community
257
244◄─►268
qwen-14b-chat
1139
±11
5,004
Alibaba
Qianwen LICENSE
258
246◄─►268
palm-2
1137
±9
8,634
Proprietary
259
246◄─►268
Metacodellama-34b-instruct
1137
±9
7,417
Meta
Llama 2 Community
260
246◄─►268
gemma-7b-it
1135
±9
9,034
Gemma license
261
250◄─►269
HuggingFacezephyr-7b-beta
1132
±9
11,220
HuggingFace
MIT
262
251◄─►269
Azurephi-3-mini-128k-instruct
1131
±7
21,024
Microsoft
MIT
263
254◄─►269
Azurephi-3-mini-4k-instruct
1129
±6
20,539
Microsoft
MIT
264
247◄─►273
HuggingFacezephyr-7b-alpha
1128
±16
1,803
HuggingFace
MIT
265
250◄─►272
guanaco-33b
1128
±12
2,955
UW
Non-commercial
266
256◄─►273
stripedhyena-nous-7b
1121
±11
5,214
Together AI
Apache 2.0
267
251◄─►274
Metacodellama-70b-instruct
1119
±18
1,151
Meta
Llama 2 Community
268
256◄─►273
HuggingFacesmollm2-1.7b-instruct
1119
±14
2,244
HuggingFace
Apache 2.0
269
261◄─►273
vicuna-7b
1115
±9
6,972
LMSYS
Llama 2 Community
270
264◄─►273
gemma-1.1-2b-it
1114
±8
11,035
Gemma license
271
264◄─►273
Metallama-3.2-1b-instruct
1113
±8
8,166
Meta
Llama 3.2
272
264◄─►274
mistral-7b-instruct
1111
±9
9,042
Mistral
Apache 2.0
273
265◄─►274
Metallama-2-7b-chat
1109
±7
14,272
Meta
Llama 2 Community
274
271◄─►278
gemma-2b-it
1091
±12
4,817
Gemma license
275
274◄─►276
qwen1.5-4b-chat
1091
±9
7,662
Alibaba
Qianwen LICENSE
276
274◄─►281
olmo-7b-instruct
1074
±11
6,412
Allen AI
Apache-2.0
277
275◄─►281
koala-13b
1071
±10
6,998
UC Berkeley
Non-commercial
278
276◄─►281
alpaca-13b
1067
±11
5,828
Stanford
Non-commercial
279
275◄─►282
gpt4all-13b-snoozy
1066
±15
1,773
Nomic AI
Non-commercial
280
276◄─►282
mpt-7b-chat
1062
±12
3,977
MosaicML
CC-BY-NC-SA-4.0
281
276◄─►282
chatglm3-6b
1057
±12
4,692
Tsinghua
Apache-2.0
282
279◄─►284
RWKVRWKV-4-Raven-14B
1042
±11
4,898
RWKV
Apache 2.0
283
282◄─►284
chatglm2-6b
1026
±14
2,683
Tsinghua
Apache-2.0
284
282◄─►284
oasst-pythia-12b
1023
±11
6,343
OpenAssistant
Apache 2.0
285
285◄─►288
chatglm-6b
996
±13
4,968
Tsinghua
Non-commercial
286
285◄─►288
fastchat-t5-3b
992
±12
4,270
LMSYS
Apache 2.0
287
285◄─►288
dolly-v2-12b
980
±14
3,471
Databricks
MIT
288
285◄─►289
Metallama-13b
972
±16
2,441
Meta
Non-commercial
289
288◄─►289
Stabilitystablelm-tuned-alpha-7b
953
±13
3,325
Stability AI
CC-BY-NC-SA-4.0
Description
Rank (UB): Rank computed based on the Bradley-Terry model. This ranking reflects the model's overall performance in the arena and provides its Elo score's upper bound estimate, helping to understand the model's potential competitiveness.
Model: The name of the large language model (LLM). Some model names may include embedded links.
Score: The model's Elo score in the arena determined by user votes. The higher the Elo score, the better the model performed.
95% Confidence Interval (±): The 95% confidence interval for the model's Elo score (for example:
±6). A smaller interval indicates the model's score is more stable and reliable.Votes: The total number of votes the model received in the arena. More votes generally imply greater statistical reliability of its score.
Organization/Company: The organization or company that provides the model.
License: The model's license type, such as Proprietary, Apache 2.0, MIT, etc.
Data sources and update frequency
The leaderboard data is automatically fetched by scripts directly from the 1 2 official website. This leaderboard is automatically updated daily via GitHub Actions.
Disclaimer
This report is for reference only. Leaderboard data is dynamic and based on users' preference votes on the Chatbot Arena during specific time periods. The integrity and accuracy of the data depend on upstream data sources. Different models may use different licenses; please refer to the model provider's official documentation when using them.
Last updated
Was this helpful?