Model Leaderboard
This leaderboard is generated by an automated pipeline from Chatbot Arena (lmarena.ai) data.
Data updated: 2026-01-23 08:09:03 UTC / 2026-01-23 16:09:03 CST (Beijing time)
Leaderboard
| Rank | Rank range | Model | Score | 95% CI (±) | Votes | Organization | License |
|---|---|---|---|---|---|---|---|
| 1 | 1◄─►1 | gemini-3-pro | 1490 | ±5 | 27,827 | Google | Proprietary |
| 2 | 2◄─►4 | grok-4.1-thinking | 1477 | ±5 | 27,985 | xAI | Proprietary |
| 3 | 2◄─►8 | gemini-3-flash | 1472 | ±6 | 13,245 | Google | Proprietary |
| 4 | 2◄─►8 | claude-opus-4-5-20251101-thinking-32k | 1470 | ±5 | 19,898 | Anthropic | Proprietary |
| 5 | 3◄─►9 | claude-opus-4-5-20251101 | 1467 | ±5 | 21,241 | Anthropic | Proprietary |
| 6 | 3◄─►9 | grok-4.1 | 1465 | ±5 | 32,015 | xAI | Proprietary |
| 7 | 3◄─►10 | gemini-3-flash (thinking-minimal) | 1462 | ±7 | 9,644 | Google | Proprietary |
| 8 | 3◄─►15 | ernie-5.0-0110 | 1459 (Preliminary) | ±9 | 4,829 | Baidu | Proprietary |
| 9 | 5◄─►14 | gpt-5.1-high | 1458 | ±5 | 24,439 | OpenAI | Proprietary |
| 10 | 8◄─►18 | gemini-2.5-pro | 1451 | ±3 | 87,641 | Google | Proprietary |
| 11 | 8◄─►18 | claude-sonnet-4-5-20250929-thinking-32k | 1451 | ±4 | 38,441 | Anthropic | Proprietary |
| 12 | 7◄─►20 | ernie-5.0-preview-1203 | 1450 | ±7 | 9,709 | Baidu | Proprietary |
| 13 | 8◄─►18 | claude-sonnet-4-5-20250929 | 1450 | ±4 | 35,025 | Anthropic | Proprietary |
| 14 | 9◄─►18 | claude-opus-4-1-20250805-thinking-16k | 1449 | ±4 | 50,061 | Anthropic | Proprietary |
| 15 | 10◄─►20 | claude-opus-4-1-20250805 | 1445 | ±3 | 67,599 | Anthropic | Proprietary |
| 16 | 8◄─►24 | gpt-5.2 | 1445 | ±9 | 5,187 | OpenAI | Proprietary |
| 17 | 10◄─►22 | gpt-4.5-preview-2025-02-27 | 1444 | ±6 | 14,549 | OpenAI | Proprietary |
| 18 | 14◄─►22 | chatgpt-4o-latest-20250326 | 1442 | ±3 | 74,853 | OpenAI | Proprietary |
| 19 | 10◄─►25 | glm-4.7 | 1441 (Preliminary) | ±7 | 9,556 | Z.ai | MIT |
| 20 | 14◄─►33 | gpt-5.2-high | 1436 | ±8 | 8,594 | OpenAI | Proprietary |
| 21 | 16◄─►27 | gpt-5.1 | 1435 | ±5 | 26,241 | OpenAI | Proprietary |
| 22 | 16◄─►28 | gpt-5-high | 1435 | ±5 | 32,688 | OpenAI | Proprietary |
| 23 | 18◄─►31 | qwen3-max-preview | 1434 | ±5 | 27,894 | Alibaba | Proprietary |
| 24 | 18◄─►32 | o3-2025-04-16 | 1433 | ±4 | 61,435 | OpenAI | Proprietary |
| 25 | 19◄─►35 | grok-4-1-fast-reasoning | 1430 | ±5 | 21,151 | xAI | Proprietary |
| 26 | 20◄─►38 | kimi-k2-thinking-turbo | 1429 | ±5 | 26,054 | Moonshot | Modified MIT |
| 27 | 21◄─►44 | gpt-5-chat | 1426 | ±4 | 31,883 | OpenAI | Proprietary |
| 28 | 23◄─►45 | glm-4.6 | 1425 | ±4 | 33,537 | Z.ai | MIT |
| 29 | 20◄─►45 | qwen3-max-2025-09-23 | 1424 | ±6 | 9,225 | Alibaba | Proprietary |
| 30 | 24◄─►45 | claude-opus-4-20250514-thinking-16k | 1424 | ±4 | 38,020 | Anthropic | Proprietary |
| 31 | 22◄─►48 | deepseek-v3.2-exp | 1423 | ±7 | 11,812 | DeepSeek | MIT |
| 32 | 22◄─►48 | deepseek-v3.2-exp-thinking | 1423 | ±7 | 9,017 | DeepSeek | MIT |
| 33 | 26◄─►45 | qwen3-235b-a22b-instruct-2507 | 1422 | ±3 | 62,099 | Alibaba | Apache 2.0 |
| 34 | 22◄─►50 | grok-4-fast-chat | 1422 | ±8 | 7,001 | xAI | Proprietary |
| 35 | 26◄─►51 | deepseek-v3.2-thinking | 1420 | ±6 | 15,802 | DeepSeek | MIT |
| 36 | 27◄─►52 | deepseek-v3.2 | 1418 | ±5 | 20,503 | DeepSeek | MIT |
| 37 | 27◄─►52 | deepseek-r1-0528 | 1418 | ±6 | 19,306 | DeepSeek | MIT |
| 38 | 27◄─►53 | kimi-k2-0905-preview | 1418 | ±6 | 11,981 | Moonshot | Modified MIT |
| 39 | 25◄─►55 | ernie-5.0-preview-1022 | 1417 | ±9 | 4,643 | Baidu | Proprietary |
| 40 | 27◄─►52 | kimi-k2-0711-preview | 1417 | ±5 | 28,683 | Moonshot | Modified MIT |
| 41 | 27◄─►53 | deepseek-v3.1-thinking | 1417 | ±7 | 11,984 | DeepSeek | MIT |
| 42 | 27◄─►53 | deepseek-v3.1 | 1417 | ±6 | 15,294 | DeepSeek | MIT |
| 43 | 26◄─►57 | deepseek-v3.1-terminus | 1416 | ±10 | 3,764 | DeepSeek | MIT |
| 44 | 25◄─►57 | deepseek-v3.1-terminus-thinking | 1416 | ±10 | 3,552 | DeepSeek | MIT |
| 45 | 28◄─►55 | qwen3-vl-235b-a22b-instruct | 1415 | ±6 | 11,700 | Alibaba | Apache 2.0 |
| 46 | 32◄─►53 | claude-opus-4-20250514 | 1413 | ±4 | 45,596 | Anthropic | Proprietary |
| 47 | 32◄─►53 | gpt-4.1-2025-04-14 | 1413 | ±4 | 52,274 | OpenAI | Proprietary |
| 48 | 34◄─►55 | mistral-medium-2508 | 1412 | ±3 | 55,868 | Mistral | Proprietary |
| 49 | 32◄─►57 | mistral-large-3 | 1411 | ±5 | 16,782 | Mistral | Apache 2.0 |
| 50 | 34◄─►57 | grok-3-preview-02-24 | 1410 | ±4 | 33,991 | xAI | Proprietary |
| 51 | 36◄─►59 | grok-4-0709 | 1409 | ±4 | 42,229 | xAI | Proprietary |
| 52 | 35◄─►63 | glm-4.5 | 1409 | ±5 | 24,811 | Z.ai | MIT |
| 53 | 39◄─►57 | gemini-2.5-flash | 1409 | ±3 | 86,990 | Google | Proprietary |
| 54 | 44◄─►67 | gemini-2.5-flash-preview-09-2025 | 1404 | ±4 | 32,974 | Google | Proprietary |
| 55 | 44◄─►67 | grok-4-fast-reasoning | 1404 | ±5 | 18,700 | xAI | Proprietary |
| 56 | 47◄─►67 | claude-haiku-4-5-20251001 | 1403 | ±4 | 37,029 | Anthropic | Proprietary |
| 57 | 47◄─►67 | o1-2024-12-17 | 1402 | ±4 | 27,822 | OpenAI | Proprietary |
| 58 | 53◄─►69 | qwen3-235b-a22b-no-thinking | 1401 | ±4 | 39,576 | Alibaba | Apache 2.0 |
| 59 | 52◄─►69 | qwen3-next-80b-a3b-instruct | 1401 | ±5 | 22,903 | Alibaba | Apache 2.0 |
| 60 | 53◄─►69 | claude-sonnet-4-20250514-thinking-32k | 1400 | ±4 | 36,250 | Anthropic | Proprietary |
| 61 | 53◄─►75 | longcat-flash-chat | 1399 | ±6 | 11,571 | Meituan | MIT |
| 62 | 54◄─►75 | qwen3-235b-a22b-thinking-2507 | 1398 | ±6 | 9,258 | Alibaba | Apache 2.0 |
| 63 | 54◄─►75 | deepseek-r1 | 1397 | ±5 | 18,537 | DeepSeek | MIT |
| 64 | 52◄─►82 | amazon-nova-experimental-chat-12-10 | 1396 | ±10 | 3,398 | Amazon | Proprietary |
| 65 | 54◄─►80 | mimo-v2-flash (non-thinking) | 1395 | ±7 | 9,293 | Xiaomi | MIT |
| 66 | 54◄─►80 | qwen3-vl-235b-a22b-thinking | 1395 | ±7 | 7,981 | Alibaba | Apache 2.0 |
| 67 | 58◄─►76 | deepseek-v3-0324 | 1394 | ±4 | 46,730 | DeepSeek | MIT |
| 68 | 53◄─►83 | hunyuan-vision-1.5-thinking | 1393 | ±12 | 2,225 | Tencent | Proprietary |
| 69 | 58◄─►82 | mai-1-preview | 1391 | ±5 | 18,164 | Microsoft AI | Proprietary |
| 70 | 61◄─►81 | o4-mini-2025-04-16 | 1391 | ±4 | 46,724 | OpenAI | Proprietary |
| 71 | 61◄─►82 | gpt-5-mini-high | 1390 | ±5 | 27,213 | OpenAI | Proprietary |
| 72 | 61◄─►82 | claude-sonnet-4-20250514 | 1390 | ±4 | 41,675 | Anthropic | Proprietary |
| 73 | 61◄─►82 | claude-3-7-sonnet-20250219-thinking-32k | 1389 | ±4 | 39,913 | Anthropic | Proprietary |
| 74 | 61◄─►82 | o1-preview | 1388 | ±5 | 31,120 | OpenAI | Proprietary |
| 75 | 61◄─►85 | hunyuan-t1-20250711 | 1387 | ±9 | 4,806 | Tencent | Proprietary |
| 76 | 64◄─►83 | qwen3-coder-480b-a35b-instruct | 1386 | ±5 | 26,653 | Alibaba | Apache 2.0 |
| 77 | 65◄─►83 | mistral-medium-2505 | 1384 | ±5 | 34,576 | Mistral | Proprietary |
| 78 | 67◄─►85 | qwen3-30b-a3b-instruct-2507 | 1383 | ±5 | 24,110 | Alibaba | Apache 2.0 |
| 79 | 65◄─►88 | hunyuan-turbos-20250416 | 1382 | ±6 | 11,060 | Tencent | Proprietary |
| 80 | 68◄─►86 | gpt-4.1-mini-2025-04-14 | 1382 | ±4 | 40,558 | OpenAI | Proprietary |
| 81 | 65◄─►89 | minimax-m2.1-preview | 1382 (Preliminary) | ±7 | 8,722 | MiniMax | MIT |
| 82 | 74◄─►90 | gemini-2.5-flash-lite-preview-09-2025-no-thinking | 1378 | ±4 | 40,079 | Google | Proprietary |
| 83 | 65◄─►95 | glm-4.6v | 1378 | ±11 | 2,831 | Z.ai | MIT |
| 84 | 77◄─►92 | gemini-2.5-flash-lite-preview-06-17-thinking | 1375 | ±4 | 33,956 | Google | Proprietary |
| 85 | 77◄─►92 | qwen3-235b-a22b | 1374 | ±5 | 27,180 | Alibaba | Apache 2.0 |
| 86 | 79◄─►92 | qwen2.5-max | 1374 | ±4 | 33,297 | Alibaba | Proprietary |
| 87 | 80◄─►92 | claude-3-5-sonnet-20241022 | 1373 | ±3 | 89,483 | Anthropic | Proprietary |
| 88 | 80◄─►94 | claude-3-7-sonnet-20250219 | 1372 | ±4 | 44,462 | Anthropic | Proprietary |
| 89 | 81◄─►95 | glm-4.5-air | 1371 | ±4 | 31,456 | Z.ai | MIT |
| 90 | 82◄─►98 | qwen3-next-80b-a3b-thinking | 1369 | ±6 | 13,862 | Alibaba | Apache 2.0 |
| 91 | 83◄─►98 | minimax-m1 | 1367 | ±4 | 36,897 | MiniMax | Apache 2.0 |
| 92 | 87◄─►98 | gemma-3-27b-it | 1365 | ±4 | 48,730 | Google | Gemma |
| 93 | 87◄─►102 | o3-mini-high | 1364 | ±5 | 18,584 | OpenAI | Proprietary |
| 94 | 83◄─►106 | amazon-nova-experimental-chat-11-10 | 1364 | ±7 | 9,583 | Amazon | Proprietary |
| 95 | 88◄─►106 | grok-3-mini-high | 1362 | ±5 | 17,542 | xAI | Proprietary |
| 96 | 90◄─►106 | gemini-2.0-flash-001 | 1361 | ±4 | 44,828 | Google | Proprietary |
| 97 | 90◄─►111 | deepseek-v3 | 1358 | ±5 | 21,788 | DeepSeek | DeepSeek |
| 98 | 93◄─►117 | grok-3-mini-beta | 1356 | ±5 | 23,789 | xAI | Proprietary |
| 99 | 93◄─►118 | mistral-small-2506 | 1356 | ±5 | 18,366 | Mistral | Apache 2.0 |
| 100 | 90◄─►120 | intellect-3 | 1355 | ±8 | 5,341 | Prime Intellect | MIT |
| 101 | 94◄─►120 | gpt-oss-120b | 1354 | ±4 | 31,010 | OpenAI | Apache 2.0 |
| 102 | 94◄─►120 | gemini-2.0-flash-lite-preview-02-05 | 1353 | ±4 | 24,951 | Google | Proprietary |
| 103 | 93◄─►121 | glm-4.5v | 1353 | ±8 | 4,983 | Z.ai | MIT |
| 104 | 97◄─►120 | command-a-03-2025 | 1353 | ±3 | 57,503 | Cohere | CC-BY-NC-4.0 |
| 105 | 97◄─►120 | gemini-1.5-pro-002 | 1352 | ±3 | 55,607 | Google | Proprietary |
| 106 | 93◄─►132 | hunyuan-turbos-20250226 | 1349 | ±12 | 2,226 | Tencent | Proprietary |
| 107 | 98◄─►121 | o3-mini | 1348 | ±3 | 58,662 | OpenAI | Proprietary |
| 108 | 94◄─►132 | amazon-nova-experimental-chat-10-09 | 1347 | ±11 | 2,900 | Amazon | Proprietary |
| 109 | 94◄─►133 | llama-3.1-nemotron-ultra-253b-v1 | 1347 | ±12 | 2,546 | Nvidia | Nvidia Open Model |
| 110 | 98◄─►124 | amazon-nova-experimental-chat-10-20 | 1347 | ±6 | 11,466 | Amazon | Proprietary |
| 111 | 97◄─►132 | qwen3-32b | 1346 | ±9 | 3,932 | Alibaba | Apache 2.0 |
| 112 | 98◄─►129 | ling-flash-2.0 | 1346 | ±7 | 7,051 | Ant Group | MIT |
| 113 | 98◄─►130 | step-3 | 1346 | ±7 | 6,600 | StepFun | Apache 2.0 |
| 114 | 97◄─►131 | minimax-m2 | 1346 | ±8 | 6,767 | MiniMax | Apache 2.0 |
| 115 | 100◄─►122 | gpt-4o-2024-05-13 | 1346 | ±3 | 112,863 | OpenAI | Proprietary |
| 116 | 97◄─►132 | qwen-plus-0125 | 1346 | ±8 | 5,823 | Alibaba | Proprietary |
| 117 | 98◄─►133 | glm-4-plus-0111 | 1343 | ±8 | 5,760 | Zhipu | Proprietary |
| 118 | 105◄─►127 | claude-3-5-sonnet-20240620 | 1343 | ±3 | 82,417 | Anthropic | Proprietary |
| 119 | 99◄─►136 | gemma-3-12b-it | 1342 | ±10 | 3,829 | Google | Gemma |
| 120 | 100◄─►137 | nvidia-llama-3.3-nemotron-super-49b-v1.5 | 1340 | ±10 | 3,430 | Nvidia | Nvidia Open |
| 121 | 98◄─►139 | hunyuan-turbo-0110 | 1340 | ±12 | 2,295 | Tencent | Proprietary |
| 122 | 107◄─►137 | gpt-5-nano-high | 1338 | ±7 | 8,401 | OpenAI | Proprietary |
| 123 | 109◄─►134 | o1-mini | 1337 | ±4 | 51,986 | OpenAI | Proprietary |
| 124 | 108◄─►138 | nova-2-lite | 1336 | ±6 | 12,298 | Amazon | Proprietary |
| 125 | 110◄─►134 | llama-3.1-405b-instruct-bf16 | 1336 | ±4 | 41,392 | Meta | Llama 3.1 Community |
| 126 | 109◄─►137 | qwq-32b | 1336 | ±4 | 26,141 | Alibaba | Apache 2.0 |
| 127 | 110◄─►137 | gpt-4o-2024-08-06 | 1335 | ±4 | 45,498 | OpenAI | Proprietary |
| 128 | 109◄─►138 | gemini-advanced-0514 | 1335 | ±5 | 50,142 | Google | Proprietary |
| 129 | 112◄─►136 | grok-2-2024-08-13 | 1335 | ±4 | 63,495 | xAI | Proprietary |
| 130 | 113◄─►137 | llama-3.1-405b-instruct-fp8 | 1334 | ±4 | 59,655 | Meta | Llama 3.1 Community |
| 131 | 108◄─►146 | step-2-16k-exp-202412 | 1334 | ±9 | 4,829 | StepFun | Proprietary |
| 132 | 119◄─►150 | yi-lightning | 1329 | ±5 | 27,340 | 01 AI | Proprietary |
| 133 | 121◄─►150 | llama-4-maverick-17b-128e-instruct | 1328 | ±4 | 41,173 | Meta | Llama 4 |
| 134 | 120◄─►152 | qwen3-30b-a3b | 1328 | ±5 | 27,448 | Alibaba | Apache 2.0 |
| 135 | 111◄─►160 | llama-3.3-nemotron-49b-super-v1 | 1327 | ±12 | 2,230 | Nvidia | Nvidia |
| 136 | 117◄─►160 | hunyuan-large-2025-02-10 | 1326 | ±10 | 3,738 | Tencent | Proprietary |
| 137 | 131◄─►156 | gpt-4-turbo-2024-04-09 | 1325 | ±4 | 98,130 | OpenAI | Proprietary |
| 138 | 131◄─►156 | claude-3-5-haiku-20241022 | 1324 | ±3 | 71,211 | Anthropic | Proprietary |
| 139 | 131◄─►156 | gemini-1.5-pro-001 | 1323 | ±4 | 79,132 | Google | Proprietary |
| 140 | 123◄─►160 | deepseek-v2.5-1210 | 1323 | ±8 | 6,793 | DeepSeek | DeepSeek |
| 141 | 131◄─►158 | llama-4-scout-17b-16e-instruct | 1323 | ±5 | 31,266 | Meta | Llama |
| 142 | 131◄─►156 | claude-3-opus-20240229 | 1323 | ±3 | 194,904 | Anthropic | Proprietary |
| 143 | 130◄─►161 | gpt-4.1-nano-2025-04-14 | 1322 | ±8 | 6,107 | OpenAI | Proprietary |
| 144 | 131◄─►161 | step-1o-turbo-202506 | 1321 | ±7 | 9,701 | StepFun | Proprietary |
| 145 | 128◄─►167 | olmo-3.1-32b-instruct | 1321 | ±10 | 4,161 | Allen AI | Apache 2.0 |
| 146 | 131◄─►162 | ring-flash-2.0 | 1320 | ±7 | 7,207 | Ant Group | MIT |
| 147 | 134◄─►160 | llama-3.3-70b-instruct | 1320 | ±3 | 55,593 | Meta | Llama-3.3 |
| 148 | 132◄─►160 | glm-4-plus | 1319 | ±5 | 26,134 | Zhipu AI | Proprietary |
| 149 | 132◄─►161 | gemma-3n-e4b-it | 1319 | ±5 | 23,367 | Google | Gemma |
| 150 | 132◄─►163 | qwen-max-0919 | 1318 | ±6 | 16,479 | Alibaba | Qwen |
| 151 | 132◄─►167 | gpt-oss-20b | 1318 | ±6 | 10,812 | OpenAI | Apache 2.0 |
| 152 | 135◄─►161 | gpt-4o-mini-2024-07-18 | 1318 | ±3 | 68,801 | OpenAI | Proprietary |
| 153 | 135◄─►167 | nvidia-nemotron-3-nano-30b-a3b-bf16 | 1317 | ±6 | 11,861 | Nvidia | NVIDIA Open Model |
| 154 | 135◄─►169 | qwen2.5-plus-1127 | 1315 | ±6 | 10,179 | Alibaba | Proprietary |
| 155 | 139◄─►167 | mistral-large-2407 | 1314 | ±4 | 45,460 | Mistral | Mistral Research |
| 156 | 139◄─►167 | athene-v2-chat | 1314 | ±4 | 24,746 | NexusFlow | NexusFlow |
| 157 | 140◄─►167 | gpt-4-1106-preview | 1314 | ±4 | 100,107 | OpenAI | Proprietary |
| 158 | 140◄─►167 | gpt-4-0125-preview | 1314 | ±4 | 93,439 | OpenAI | Proprietary |
| 159 | 135◄─►172 | hunyuan-standard-2025-02-10 | 1312 | ±10 | 3,905 | Tencent | Proprietary |
| 160 | 145◄─►171 | gemini-1.5-flash-002 | 1310 | ±4 | 34,909 | Google | Proprietary |
| 161 | 134◄─►177 | mercury | 1309 | ±14 | 1,928 | Inception AI | Proprietary |
| 162 | 151◄─►172 | grok-2-mini-2024-08-13 | 1308 | ±4 | 52,574 | xAI | Proprietary |
| 163 | 151◄─►172 | deepseek-v2.5 | 1307 | ±5 | 24,574 | DeepSeek | DeepSeek |
| 164 | 151◄─►172 | athene-70b-0725 | 1306 | ±6 | 19,622 | NexusFlow | CC-BY-NC-4.0 |
| 165 | 151◄─►172 | magistral-medium-2506 | 1305 | ±6 | 12,077 | Mistral | Proprietary |
| 166 | 158◄─►172 | mistral-large-2411 | 1305 | ±4 | 28,081 | Mistral | MRL |
| 167 | 149◄─►176 | olmo-3-32b-think | 1305 | ±8 | 5,916 | Allen AI | Apache 2.0 |
| 168 | 158◄─►172 | mistral-small-3.1-24b-instruct-2503 | 1304 | ±4 | 34,152 | Mistral | Apache 2.0 |
| 169 | 150◄─►179 | gemma-3-4b-it | 1303 | ±9 | 4,177 | Google | Gemma |
| 170 | 159◄─►172 | qwen2.5-72b-instruct | 1303 | ±4 | 39,409 | Alibaba | Qwen |
| 171 | 159◄─►182 | llama-3.1-nemotron-70b-instruct | 1298 | ±8 | 7,136 | Nvidia | Llama 3.1 |
| 172 | 160◄─►183 | hunyuan-large-vision | 1296 | ±9 | 5,605 | Tencent | Proprietary |
| 173 | 168◄─►183 | llama-3.1-70b-instruct | 1294 | ±4 | 55,234 | Meta | Llama 3.1 Community |
| 174 | 170◄─►185 | amazon-nova-pro-v1.0 | 1290 | ±5 | 24,753 | Amazon | Proprietary |
| 175 | 169◄─►187 | jamba-1.5-large | 1289 | ±7 | 8,659 | AI21 Labs | Jamba Open |
| 176 | 171◄─►185 | gemma-2-27b-it | 1288 | ±3 | 75,764 | Google | Gemma license |
| 177 | 168◄─►190 | ibm-granite-h-small | 1288 | ±8 | 5,688 | IBM | Apache 2.0 |
| 178 | 170◄─►187 | reka-core-20240904 | 1288 | ±7 | 7,309 | Reka AI | Proprietary |
| 179 | 171◄─►187 | gpt-4-0314 | 1288 | ±5 | 54,167 | OpenAI | Proprietary |
| 180 | 168◄─►193 | llama-3.1-nemotron-51b-instruct | 1287 | ±10 | 3,749 | Nvidia | Llama 3.1 |
| 181 | 168◄─►193 | llama-3.1-tulu-3-70b | 1287 | ±10 | 2,846 | Ai2 | Llama 3.1 |
| 182 | 172◄─►187 | gemini-1.5-flash-001 | 1286 | ±4 | 62,823 | Google | Proprietary |
| 183 | 171◄─►193 | olmo-3.1-32b-think | 1285 | ±7 | 8,429 | Allen AI | Apache 2.0 |
| 184 | 174◄─►193 | claude-3-sonnet-20240229 | 1282 | ±4 | 109,289 | Anthropic | Proprietary |
| 185 | 174◄─►193 | gemma-2-9b-it-simpo | 1280 | ±7 | 10,069 | Princeton | MIT |
| 186 | 176◄─►193 | nemotron-4-340b-instruct | 1278 | ±5 | 19,661 | Nvidia | NVIDIA Open Model |
| 187 | 176◄─►195 | command-r-plus-08-2024 | 1277 | ±7 | 9,869 | Cohere | CC-BY-NC-4.0 |
| 188 | 180◄─►193 | llama-3-70b-instruct | 1277 | ±4 | 156,880 | Meta | Llama 3 Community |
| 189 | 180◄─►194 | gpt-4-0613 | 1276 | ±4 | 88,721 | OpenAI | Proprietary |
| 190 | 181◄─►196 | mistral-small-24b-instruct-2501 | 1274 | ±6 | 14,677 | Mistral | Apache 2.0 |
| 191 | 180◄─►198 | glm-4-0520 | 1274 | ±7 | 9,788 | Zhipu AI | Proprietary |
| 192 | 181◄─►199 | reka-flash-20240904 | 1273 | ±7 | 7,537 | Reka AI | Proprietary |
| 193 | 181◄─►202 | qwen2.5-coder-32b-instruct | 1271 | ±8 | 5,430 | Alibaba | Apache 2.0 |
| 194 | 188◄─►202 | c4ai-aya-expanse-32b | 1267 | ±5 | 27,123 | Cohere | CC-BY-NC-4.0 |
| 195 | 190◄─►202 | gemma-2-9b-it | 1266 | ±4 | 54,615 | Google | Gemma license |
| 196 | 189◄─►203 | deepseek-coder-v2 | 1265 | ±6 | 15,147 | DeepSeek | DeepSeek License |
| 197 | 191◄─►203 | command-r-plus | 1263 | ±4 | 77,556 | Cohere | CC-BY-NC-4.0 |
| 198 | 191◄─►203 | qwen2-72b-instruct | 1263 | ±5 | 37,325 | Alibaba | Qianwen LICENSE |
| 199 | 193◄─►203 | claude-3-haiku-20240307 | 1262 | ±4 | 117,705 | Anthropic | Proprietary |
| 200 | 192◄─►204 | amazon-nova-lite-v1.0 | 1261 | ±5 | 19,376 | Amazon | Proprietary |
| 201 | 193◄─►204 | gemini-1.5-flash-8b-001 | 1259 | ±4 | 35,556 | Google | Proprietary |
| 202 | 196◄─►204 | phi-4 | 1256 | ±5 | 24,126 | Microsoft | MIT |
| 203 | 193◄─►210 | olmo-2-0325-32b-instruct | 1252 | ±11 | 3,335 | Allen AI | Apache-2.0 |
| 204 | 200◄─►209 | command-r-08-2024 | 1251 | ±7 | 10,141 | Cohere | CC-BY-NC-4.0 |
| 205 | 203◄─►213 | mistral-large-2402 | 1243 | ±5 | 62,437 | Mistral | Proprietary |
| 206 | 203◄─►213 | amazon-nova-micro-v1.0 | 1241 | ±5 | 19,355 | Amazon | Proprietary |
| 207 | 203◄─►217 | jamba-1.5-mini | 1239 | ±7 | 8,854 | AI21 Labs | Jamba Open |
| 208 | 203◄─►221 | ministral-8b-2410 | 1237 | ±9 | 4,780 | Mistral | MRL |
| 209 | 204◄─►221 | gemini-pro-dev-api | 1236 | ±7 | 18,352 | Google | Proprietary |
| 210 | 205◄─►221 | qwen1.5-110b-chat | 1235 | ±6 | 26,191 | Alibaba | Qianwen LICENSE |
| 211 | 205◄─►222 | reka-flash-21b-20240226-online | 1234 | ±7 | 15,451 | Reka AI | Proprietary |
| 212 | 205◄─►221 | qwen1.5-72b-chat | 1234 | ±5 | 39,296 | Alibaba | Qianwen LICENSE |
| 213 | 203◄─►223 | hunyuan-standard-256k | 1233 | ±12 | 2,729 | Tencent | Proprietary |
| 214 | 207◄─►222 | mixtral-8x22b-instruct-v0.1 | 1231 | ±5 | 51,417 | Mistral | Apache 2.0 |
| 215 | 207◄─►223 | command-r | 1228 | ±5 | 54,038 | Cohere | CC-BY-NC-4.0 |
| 216 | 207◄─►223 | reka-flash-21b-20240226 | 1227 | ±6 | 24,806 | Reka AI | Proprietary |
| 217 | 208◄─►224 | gpt-3.5-turbo-0125 | 1225 | ±5 | 66,191 | OpenAI | Proprietary |
| 218 | 208◄─►225 | mistral-medium | 1224 | ±5 | 34,552 | Mistral | Proprietary |
| 219 | 212◄─►224 | llama-3-8b-instruct | 1224 | ±4 | 104,636 | Meta | Llama 3 Community |
| 220 | 208◄─►225 | c4ai-aya-expanse-8b | 1223 | ±7 | 9,827 | Cohere | CC-BY-NC-4.0 |
| 221 | 207◄─►228 | gemini-pro | 1223 | ±12 | 6,390 | Google | Proprietary |
| 222 | 208◄─►228 | llama-3.1-tulu-3-8b | 1221 | ±11 | 2,895 | Ai2 | Llama 3.1 |
| 223 | 219◄─►228 | yi-1.5-34b-chat | 1214 | ±5 | 24,142 | 01 AI | Apache-2.0 |
| 224 | 214◄─►229 | zephyr-orpo-141b-A35b-v0.1 | 1214 | ±11 | 4,653 | HuggingFace | Apache 2.0 |
| 225 | 221◄─►228 | llama-3.1-8b-instruct | 1212 | ±4 | 49,605 | Meta | Llama 3.1 Community |
| 226 | 217◄─►234 | granite-3.1-8b-instruct | 1209 | ±11 | 3,092 | IBM | Apache 2.0 |
| 227 | 221◄─►234 | qwen1.5-32b-chat | 1205 | ±6 | 21,744 | Alibaba | Qianwen LICENSE |
| 228 | 221◄─►236 | gpt-3.5-turbo-1106 | 1204 | ±9 | 16,616 | OpenAI | Proprietary |
| 229 | 225◄─►236 | phi-3-medium-4k-instruct | 1199 | ±5 | 25,055 | Microsoft | MIT |
| 230 | 226◄─►236 | gemma-2-2b-it | 1199 | ±4 | 46,618 | Google | Gemma license |
| 231 | 226◄─►236 | mixtral-8x7b-instruct-v0.1 | 1198 | ±4 | 73,505 | Mistral | Apache 2.0 |
| 232 | 226◄─►241 | dbrx-instruct-preview | 1196 | ±6 | 32,196 | Databricks | DBRX LICENSE |
| 233 | 226◄─►245 | internlm2_5-20b-chat | 1192 | ±7 | 9,902 | InternLM | Other |
| 234 | 226◄─►245 | qwen1.5-14b-chat | 1192 | ±7 | 17,841 | Alibaba | Qianwen LICENSE |
| 235 | 228◄─►251 | wizardlm-70b | 1186 | ±9 | 8,214 | Microsoft | Llama 2 Community |
| 236 | 228◄─►252 | deepseek-llm-67b-chat | 1185 | ±12 | 4,933 | DeepSeek | DeepSeek License |
| 237 | 232◄─►248 | yi-34b-chat | 1185 | ±7 | 15,483 | 01 AI | Yi License |
| 238 | 232◄─►251 | openchat-3.5-0106 | 1183 | ±8 | 12,636 | OpenChat | Apache-2.0 |
| 239 | 232◄─►252 | openchat-3.5 | 1183 | ±10 | 7,967 | OpenChat | Apache-2.0 |
| 240 | 232◄─►252 | granite-3.0-8b-instruct | 1183 | ±9 | 6,643 | IBM | Apache 2.0 |
| 241 | 233◄─►252 | gemma-1.1-7b-it | 1181 | ±6 | 23,893 | Google | Gemma license |
| 242 | 233◄─►252 | snowflake-arctic-instruct | 1181 | ±6 | 32,836 | Snowflake | Apache 2.0 |
| 243 | 232◄─►254 | granite-3.1-2b-instruct | 1180 | ±11 | 3,191 | IBM | Apache 2.0 |
| 244 | 233◄─►252 | tulu-2-dpo-70b | 1179 | ±10 | 6,534 | AllenAI/UW | AI2 ImpACT Low-risk |
| 245 | 233◄─►256 | openhermes-2.5-mistral-7b | 1176 | ±10 | 5,006 | NousResearch | Apache-2.0 |
| 246 | 235◄─►255 | vicuna-33b | 1174 | ±6 | 22,479 | LMSYS | Non-commercial |
| 247 | 235◄─►258 | starling-lm-7b-beta | 1172 | ±7 | 16,057 | Nexusflow | Apache-2.0 |
| 248 | 235◄─►256 | phi-3-small-8k-instruct | 1172 | ±6 | 17,763 | Microsoft | MIT |
| 249 | 236◄─►256 | llama-2-70b-chat | 1172 | ±5 | 38,491 | Meta | Llama 2 Community |
| 250 | 236◄─►259 | starling-lm-7b-alpha | 1168 | ±8 | 10,224 | UC Berkeley | CC-BY-NC-4.0 |
| 251 | 238◄─►259 | llama-3.2-3b-instruct | 1167 | ±8 | 7,936 | Meta | Llama 3.2 |
| 252 | 236◄─►262 | nous-hermes-2-mixtral-8x7b-dpo | 1166 | ±12 | 3,776 | NousResearch | Apache-2.0 |
| 253 | 244◄─►269 | qwq-32b-preview | 1157 | ±12 | 3,233 | Alibaba | Apache 2.0 |
| 254 | 249◄─►266 | granite-3.0-2b-instruct | 1157 | ±8 | 6,837 | IBM | Apache 2.0 |
| 255 | 244◄─►269 | llama2-70b-steerlm-chat | 1156 | ±13 | 3,584 | Nvidia | Llama 2 Community |
| 256 | 246◄─►272 | solar-10.7b-instruct-v1.0 | 1153 | ±13 | 4,155 | Upstage AI | CC-BY-NC-4.0 |
| 257 | 245◄─►274 | dolphin-2.2.1-mistral-7b | 1153 | ±15 | 1,679 | Cognitive Computations | Apache-2.0 |
| 258 | 250◄─►272 | mpt-30b-chat | 1151 | ±12 | 2,571 | MosaicML | CC-BY-NC-SA-4.0 |
| 259 | 252◄─►269 | mistral-7b-instruct-v0.2 | 1151 | ±7 | 19,402 | Mistral | Apache-2.0 |
| 260 | 252◄─►271 | wizardlm-13b | 1150 | ±9 | 7,046 | Microsoft | Llama 2 Community |
| 261 | 249◄─►276 | falcon-180b-chat | 1148 | ±17 | 1,295 | TII | Falcon-180B TII License |
| 262 | 252◄─►275 | qwen1.5-7b-chat | 1145 | ±10 | 4,735 | Alibaba | Qianwen LICENSE |
| 263 | 253◄─►274 | phi-3-mini-4k-instruct-june-2024 | 1144 | ±6 | 12,296 | Microsoft | MIT |
| 264 | 253◄─►275 | llama-2-13b-chat | 1142 | ±7 | 19,171 | Meta | Llama 2 Community |
| 265 | 253◄─►275 | vicuna-13b | 1142 | ±7 | 19,366 | LMSYS | Llama 2 Community |
| 266 | 253◄─►277 | qwen-14b-chat | 1140 | ±11 | 4,964 | Alibaba | Qianwen LICENSE |
| 267 | 254◄─►277 | palm-2 | 1138 | ±9 | 8,554 | Google | Proprietary |
| 268 | 254◄─►277 | codellama-34b-instruct | 1138 | ±9 | 7,363 | Meta | Llama 2 Community |
| 269 | 254◄─►277 | gemma-7b-it | 1137 | ±10 | 8,925 | Google | Gemma license |
| 270 | 257◄─►278 | zephyr-7b-beta | 1132 | ±9 | 11,116 | HuggingFace | MIT |
| 271 | 260◄─►278 | phi-3-mini-128k-instruct | 1130 | ±7 | 20,691 | Microsoft | MIT |
| 272 | 262◄─►278 | phi-3-mini-4k-instruct | 1129 | ±6 | 20,115 | Microsoft | MIT |
| 273 | 258◄─►281 | guanaco-33b | 1128 | ±12 | 2,921 | UW | Non-commercial |
| 274 | 257◄─►282 | zephyr-7b-alpha | 1128 | ±16 | 1,785 | HuggingFace | MIT |
| 275 | 265◄─►282 | stripedhyena-nous-7b | 1122 | ±11 | 5,184 | Together AI | Apache 2.0 |
| 276 | 260◄─►283 | codellama-70b-instruct | 1120 | ±18 | 1,143 | Meta | Llama 2 Community |
| 277 | 270◄─►282 | vicuna-7b | 1116 | ±9 | 6,923 | LMSYS | Llama 2 Community |
| 278 | 266◄─►283 | smollm2-1.7b-instruct | 1115 | ±14 | 2,201 | HuggingFace | Apache 2.0 |
| 279 | 273◄─►282 | gemma-1.1-2b-it | 1115 | ±8 | 10,853 | Google | Gemma license |
| 280 | 273◄─►282 | llama-3.2-1b-instruct | 1112 | ±8 | 8,045 | Meta | Llama 3.2 |
| 281 | 273◄─►283 | mistral-7b-instruct | 1111 | ±9 | 8,977 | Mistral | Apache 2.0 |
| 282 | 274◄─►283 | llama-2-7b-chat | 1109 | ±7 | 14,148 | Meta | Llama 2 Community |
| 283 | 279◄─►287 | gemma-2b-it | 1092 | ±12 | 4,779 | Google | Gemma license |
| 284 | 283◄─►286 | qwen1.5-4b-chat | 1091 | ±9 | 7,598 | Alibaba | Qianwen LICENSE |
| 285 | 283◄─►290 | olmo-7b-instruct | 1075 | ±11 | 6,329 | Allen AI | Apache-2.0 |
| 286 | 284◄─►290 | koala-13b | 1071 | ±10 | 6,964 | UC Berkeley | Non-commercial |
| 287 | 285◄─►290 | alpaca-13b | 1069 | ±12 | 5,745 | Stanford | Non-commercial |
| 288 | 283◄─►291 | gpt4all-13b-snoozy | 1067 | ±15 | 1,743 | Nomic AI | Non-commercial |
| 289 | 285◄─►291 | mpt-7b-chat | 1063 | ±12 | 3,925 | MosaicML | CC-BY-NC-SA-4.0 |
| 290 | 285◄─►291 | chatglm3-6b | 1057 | ±12 | 4,658 | Tsinghua | Apache-2.0 |
| 291 | 288◄─►293 | RWKV-4-Raven-14B | 1042 | ±11 | 4,845 | RWKV | Apache 2.0 |
| 292 | 291◄─►293 | chatglm2-6b | 1025 | ±14 | 2,657 | Tsinghua | Apache-2.0 |
| 293 | 291◄─►293 | oasst-pythia-12b | 1023 | ±11 | 6,311 | OpenAssistant | Apache 2.0 |
| 294 | 294◄─►297 | chatglm-6b | 997 | ±13 | 4,914 | Tsinghua | Non-commercial |
| 295 | 294◄─►297 | fastchat-t5-3b | 992 | ±12 | 4,203 | LMSYS | Apache 2.0 |
| 296 | 294◄─►297 | dolly-v2-12b | 981 | ±14 | 3,412 | Databricks | MIT |
| 297 | 294◄─►298 | llama-13b | 973 | ±16 | 2,391 | Meta | Non-commercial |
| 298 | 297◄─►298 | stablelm-tuned-alpha-7b | 954 | ±13 | 3,287 | Stability AI | CC-BY-NC-SA-4.0 |
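Notes
Rank (UB): The rank computed from the Bradley-Terry model. It reflects the model's overall performance in the arena and provides an upper-bound estimate of its Elo score, which helps gauge the model's potential competitiveness.
Model: The name of the large language model (LLM). Some model names may have embedded links.
Score: The Elo rating the model has earned through user votes in the arena. Elo is a relative rating system; a higher score indicates better performance.
95% confidence interval (±): The 95% confidence interval of the model's Elo rating (for example, ±6). The narrower the interval, the more stable and reliable the rating.
Votes: The total number of votes the model has received in the arena. More votes generally mean the rating is statistically more reliable.
Organization: The organization or company that provides the model.
License: The type of license the model is released under, for example Proprietary, Apache 2.0, or MIT.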
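To make the score and confidence-interval columns concrete, here is a minimal Python sketch. It assumes the conventional Elo logistic scale (base 10, 400 points), which this page does not state explicitly, and the function names are hypothetical helpers introduced only for illustration. It estimates how often one model would be preferred over another from their scores, and checks whether two ± intervals overlap. Non-overlap is only a rough indication that two models are distinguishable, not a formal test.

```python
def expected_win_probability(score_a: float, score_b: float) -> float:
    """Elo-style probability that model A is preferred over model B.

    Assumption: the conventional base-10, 400-point Elo scale.
    """
    return 1.0 / (1.0 + 10.0 ** ((score_b - score_a) / 400.0))


def intervals_overlap(score_a: float, half_width_a: float,
                      score_b: float, half_width_b: float) -> bool:
    """Return True if the two score ± half-width intervals share any point."""
    low_a, high_a = score_a - half_width_a, score_a + half_width_a
    low_b, high_b = score_b - half_width_b, score_b + half_width_b
    return low_a <= high_b and low_b <= high_a


# Two rows from the leaderboard: gemini-3-pro (1490 ±5) and
# grok-4.1-thinking (1477 ±5).
p = expected_win_probability(1490, 1477)
print(f"Expected preference rate for the higher-rated model: {p:.3f}")
print("Intervals overlap:", intervals_overlap(1490, 5, 1477, 5))
```

A 13-point gap corresponds to a preference rate only slightly above one half under this assumption, which is why the interval width and vote count matter when comparing closely ranked models.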
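Data Source and Update Frequency
The leaderboard data is fetched directly from the official Chatbot Arena (lmarena.ai) website by an automated script. The leaderboard is updated automatically every day by GitHub Actions.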
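Disclaimer
This report is for reference only. The leaderboard data changes over time and is based on user preference votes cast on Chatbot Arena during a specific period. The completeness and accuracy of the data depend on the upstream data source. Different models may be released under different licenses; always consult the model provider's official documentation before use.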