模型榜单
这是一个基于 Chatbot Arena (lmarena.ai) 数据的排行榜,通过自动化流程生成。
数据更新时间: 2026-03-09 08:17:09 UTC / 2026-03-09 16:17:09 CST (北京时间)
排行榜
1
14
Anthropicclaude-opus-4-6Anthropic · Proprietary
1504±7
9,170
2
14
Anthropicclaude-opus-4-6-thinkingAnthropic · Proprietary
1502±7
8,313
3
14
gemini-3.1-pro-previewGoogle · Proprietary
1500±9Preliminary
4,041
4
17
grok-4.20-beta1xAI · Proprietary
1491±8Preliminary
5,280
5
47
gemini-3-proGoogle · Proprietary
1485±4
39,923
6
412
gpt-5.4-highOpenAI · Proprietary
1479±10Preliminary
3,503
7
412
gpt-5.2-chat-latest-20260210OpenAI · Proprietary
1479±8
5,786
8
612
gemini-3-flashGoogle · Proprietary
1473±4
30,600
9
612
grok-4.1-thinkingxAI · Proprietary
1473±4
39,309
10
615
Anthropicclaude-opus-4-5-20251101-thinking-32kAnthropic · Proprietary
1470±4
32,516
11
616
Anthropicclaude-opus-4-5-20251101Anthropic · Proprietary
1467±4
37,462
12
619
Bytedancedola-seed-2.0-previewBytedance · Proprietary
1465±8Preliminary
6,712
13
1018
grok-4.1xAI · Proprietary
1462±4
43,536
14
1020
gemini-3-flash (thinking-minimal)Google · Proprietary
1462±5
22,846
15
1030
gpt-5.4OpenAI · Proprietary
1457±10Preliminary
3,417
16
1128
Anthropicclaude-sonnet-4-6Anthropic · Proprietary
1457±8
5,509
17
1226
gpt-5.1-highOpenAI · Proprietary
1455±4
36,204
18
1231
glm-5Z.ai · MIT
1452±7
8,095
19
1431
MoonshotAIkimi-k2.5-thinkingMoonshot · Modified MIT
1451±6
12,710
20
1531
ernie-5.0-0110Baidu · Proprietary
1451±6
15,402
21
1530
Anthropicclaude-sonnet-4-5-20250929Anthropic · Proprietary
1451±4
48,736
22
1333
qwen3.5-397b-a17bAlibaba · Apache 2.0
1451±7
6,836
23
1530
Anthropicclaude-sonnet-4-5-20250929-thinking-32kAnthropic · Proprietary
1450±3
50,801
24
1533
ernie-5.0-preview-1203Baidu · Proprietary
1449±7
9,712
25
1530
gemini-2.5-proGoogle · Proprietary
1449±3
99,103
26
1531
Anthropicclaude-opus-4-1-20250805-thinking-16kAnthropic · Proprietary
1448±3
49,560
27
1633
Anthropicclaude-opus-4-1-20250805Anthropic · Proprietary
1446±3
77,173
28
1636
gpt-4.5-preview-2025-02-27OpenAI · Proprietary
1444±6
14,549
29
2135
chatgpt-4o-latest-20250326OpenAI · Proprietary
1443±3
82,864
30
1737
gpt-5.2-highOpenAI · Proprietary
1442±5
21,161
31
1738
glm-4.7Z.ai · MIT
1441±6
11,936
32
2539
gpt-5.2OpenAI · Proprietary
1439±5
18,159
33
2839
gpt-5.1OpenAI · Proprietary
1438±4
38,793
34
2549
gemini-3.1-flash-lite-previewGoogle · Proprietary
1435±9Preliminary
3,829
35
2944
qwen3-max-previewAlibaba · Proprietary
1434±5
27,631
36
3045
gpt-5-highOpenAI · Proprietary
1434±5
32,325
37
2849
MoonshotAIkimi-k2.5-instantMoonshot · Modified MIT
1434±6
8,823
38
3146
o3-2025-04-16OpenAI · Proprietary
1432±4
60,909
39
3249
grok-4-1-fast-reasoningxAI · Proprietary
1430±4
33,097
40
3451
MoonshotAIkimi-k2-thinking-turboMoonshot · Modified MIT
1429±4
37,861
41
3459
gpt-5-chatOpenAI · Proprietary
1426±4
31,590
42
3661
glm-4.6Z.ai · MIT
1425±4
35,102
43
3463
qwen3-max-2025-09-23Alibaba · Proprietary
1425±6
9,168
44
3463
deepseek-v3.2-expDeepSeek · MIT
1424±6
11,672
45
3762
Anthropicclaude-opus-4-20250514-thinking-16kAnthropic · Proprietary
1424±4
37,678
46
3465
deepseek-v3.2-exp-thinkingDeepSeek · MIT
1423±7
8,942
47
4162
qwen3-235b-a22b-instruct-2507Alibaba · Apache 2.0
1422±3
73,293
48
3567
grok-4-fast-chatxAI · Proprietary
1422±8
6,964
49
4165
deepseek-v3.2DeepSeek · MIT
1421±4
32,541
50
4165
deepseek-v3.2-thinkingDeepSeek · MIT
1420±4
27,370
51
4167
deepseek-r1-0528DeepSeek · MIT
1419±6
19,153
52
3771
ernie-5.0-preview-1022Baidu · Proprietary
1419±9
4,555
53
4170
deepseek-v3.1DeepSeek · MIT
1418±6
15,192
54
3775
qwen3.5-122b-a10bAlibaba · Apache 2.0
1418±10
3,251
55
4171
deepseek-v3.1-thinkingDeepSeek · MIT
1417±7
11,916
56
4171
MoonshotAIkimi-k2-0905-previewMoonshot · Modified MIT
1417±6
11,907
57
4270
MoonshotAIkimi-k2-0711-previewMoonshot · Modified MIT
1417±5
28,423
58
4078
deepseek-v3.1-terminusDeepSeek · MIT
1416±10
3,744
59
4080
deepseek-v3.1-terminus-thinkingDeepSeek · MIT
1416±10
3,535
60
4275
qwen3-vl-235b-a22b-instructAlibaba · Apache 2.0
1415±6
11,596
61
4571
mistral-large-3Mistral · Apache 2.0
1414±4
29,075
62
4184
amazon-nova-experimental-chat-26-01-10Amazon · Proprietary
1414±10
3,387
63
4772
gpt-4.1-2025-04-14OpenAI · Proprietary
1413±4
51,800
64
4774
Anthropicclaude-opus-4-20250514Anthropic · Proprietary
1413±4
45,279
65
5075
grok-3-preview-02-24xAI · Proprietary
1411±4
33,823
66
5275
gemini-2.5-flashGoogle · Proprietary
1411±3
98,369
67
4386
qwen3.5-27bAlibaba · Apache 2.0
1411±10
3,368
68
5275
mistral-medium-2508Mistral · Proprietary
1410±3
67,631
69
5081
glm-4.5Z.ai · MIT
1410±5
24,595
70
5280
grok-4-0709xAI · Proprietary
1409±4
41,749
71
5884
Anthropicclaude-haiku-4-5-20251001Anthropic · Proprietary
1406±3
49,326
72
5986
gemini-2.5-flash-preview-09-2025Google · Proprietary
1404±4
32,518
73
5986
grok-4-fast-reasoningxAI · Proprietary
1403±5
18,431
74
5494
qwen3.5-flashAlibaba · Proprietary
1402±9
4,330
75
5991
Minimaxminimax-m2.5MiniMax · Modified MIT
1402±7
8,017
76
6587
qwen3-235b-a22b-no-thinkingAlibaba · Apache 2.0
1402±4
39,264
77
6588
qwen3-next-80b-a3b-instructAlibaba · Apache 2.0
1402±5
22,670
78
6688
o1-2024-12-17OpenAI · Proprietary
1402±4
27,822
79
6594
longcat-flash-chatMeituan · MIT
1400±6
11,486
80
6989
Anthropicclaude-sonnet-4-20250514-thinking-32kAnthropic · Proprietary
1400±4
35,948
81
6996
qwen3-235b-a22b-thinking-2507Alibaba · Apache 2.0
1399±6
9,177
82
7196
deepseek-r1DeepSeek · MIT
1398±5
18,537
83
68104
qwen3.5-35b-a3bAlibaba · Apache 2.0
1395±10
3,427
84
71103
qwen3-vl-235b-a22b-thinkingAlibaba · Apache 2.0
1395±7
7,919
85
69104
amazon-nova-experimental-chat-12-10Amazon · Proprietary
1395±10
3,696
86
7499
deepseek-v3-0324DeepSeek · MIT
1394±4
46,409
87
66106
Tencenthunyuan-vision-1.5-thinkingTencent · Proprietary
1394±12
2,216
88
75103
mai-1-previewMicrosoft AI · Proprietary
1392±5
18,005
89
77103
mimo-v2-flash (non-thinking)Xiaomi · MIT
1392±5
21,546
90
79103
o4-mini-2025-04-16OpenAI · Proprietary
1391±4
46,343
91
79104
gpt-5-mini-highOpenAI · Proprietary
1390±5
26,935
92
79104
Anthropicclaude-sonnet-4-20250514Anthropic · Proprietary
1389±4
41,340
93
78106
Stepfunstep-3.5-flashStepFun · Apache 2.0
1389±6
10,401
94
81106
o1-previewOpenAI · Proprietary
1388±5
31,120
95
83104
Anthropicclaude-3-7-sonnet-20250219-thinking-32kAnthropic · Proprietary
1388±4
39,706
96
81106
mimo-v2-flash (thinking)Xiaomi · MIT
1387±6
10,861
97
78109
Tencenthunyuan-t1-20250711Tencent · Proprietary
1387±9
4,764
98
83106
qwen3-coder-480b-a35b-instructAlibaba · Apache 2.0
1386±5
26,393
99
83106
Minimaxminimax-m2.1-previewMiniMax · MIT
1385±5
17,078
100
84106
mistral-medium-2505Mistral · Proprietary
1385±5
34,358
101
84108
qwen3-30b-a3b-instruct-2507Alibaba · Apache 2.0
1384±5
23,933
102
84109
Tencenthunyuan-turbos-20250416Tencent · Proprietary
1383±6
10,995
103
88109
gpt-4.1-mini-2025-04-14OpenAI · Proprietary
1382±4
40,288
104
93109
gemini-2.5-flash-lite-preview-09-2025-no-thinkingGoogle · Proprietary
1380±4
46,840
105
84119
glm-4.6vZ.ai · MIT
1378±11
2,784
106
100116
gemini-2.5-flash-lite-preview-06-17-thinkingGoogle · Proprietary
1375±4
33,641
107
100116
qwen3-235b-a22bAlibaba · Apache 2.0
1375±5
27,000
108
101116
qwen2.5-maxAlibaba · Proprietary
1374±4
33,189
109
93119
trinity-largeArcee AI · Apache 2.0
1374±9
4,191
110
105118
Anthropicclaude-3-5-sonnet-20241022Anthropic · Proprietary
1372±3
89,270
111
105119
glm-4.5-airZ.ai · MIT
1372±4
31,132
112
105119
Anthropicclaude-3-7-sonnet-20250219Anthropic · Proprietary
1371±4
44,242
113
105122
qwen3-next-80b-a3b-thinkingAlibaba · Apache 2.0
1369±6
13,767
114
105125
glm-4.7-flashZ.ai · MIT
1367±6
11,734
115
105122
Minimaxminimax-m1MiniMax · Apache 2.0
1367±4
36,424
116
105125
amazon-nova-experimental-chat-11-10Amazon · Proprietary
1366±5
20,934
117
108123
gemma-3-27b-itGoogle · Gemma
1365±4
48,420
118
108127
o3-mini-highOpenAI · Proprietary
1364±5
18,584
119
109129
grok-3-mini-highxAI · Proprietary
1363±5
17,400
120
113130
gemini-2.0-flash-001Google · Proprietary
1361±4
44,666
121
113138
deepseek-v3DeepSeek · DeepSeek
1358±5
21,788
122
115138
grok-3-mini-betaxAI · Proprietary
1357±5
23,585
123
113145
intellect-3Prime Intellect · MIT
1356±8
5,285
124
116143
mistral-small-2506Mistral · Apache 2.0
1356±5
18,219
125
119144
gpt-oss-120bOpenAI · Apache 2.0
1354±4
30,747
126
120145
gemini-2.0-flash-lite-preview-02-05Google · Proprietary
1353±4
24,951
127
121144
Coherecommand-a-03-2025Cohere · CC-BY-NC-4.0
1353±3
57,059
128
116146
glm-4.5vZ.ai · MIT
1353±8
4,947
129
121145
gemini-1.5-pro-002Google · Proprietary
1351±3
55,607
130
121146
amazon-nova-experimental-chat-10-20Amazon · Proprietary
1351±6
11,317
131
118156
Tencenthunyuan-turbos-20250226Tencent · Proprietary
1349±12
2,226
132
123146
o3-miniOpenAI · Proprietary
1348±3
58,415
133
119158
amazon-nova-experimental-chat-10-09Amazon · Proprietary
1347±11
2,873
134
121151
ling-flash-2.0Ant Group · MIT
1347±7
6,988
135
118159
Nvidiallama-3.1-nemotron-ultra-253b-v1Nvidia · Nvidia Open Model
1347±12
2,546
136
121156
qwen3-32bAlibaba · Apache 2.0
1347±9
3,932
137
121153
Minimaxminimax-m2MiniMax · Apache 2.0
1347±8
6,684
138
121152
Stepfunstep-3StepFun · Apache 2.0
1347±7
6,565
139
121156
qwen-plus-0125Alibaba · Proprietary
1346±8
5,823
140
126149
gpt-4o-2024-05-13OpenAI · Proprietary
1346±3
112,863
141
123159
glm-4-plus-0111Zhipu · Proprietary
1343±8
5,760
142
129154
Anthropicclaude-3-5-sonnet-20240620Anthropic · Proprietary
1342±3
82,417
143
123161
gemma-3-12b-itGoogle · Gemma
1342±10
3,829
144
123163
Nvidianvidia-llama-3.3-nemotron-super-49b-v1.5Nvidia · Nvidia Open
1341±10
3,398
145
122164
Tencenthunyuan-turbo-0110Tencent · Proprietary
1340±12
2,295
146
132163
nova-2-liteAmazon · Proprietary
1338±6
12,099
147
132163
gpt-5-nano-highOpenAI · Proprietary
1337±7
8,348
148
133160
o1-miniOpenAI · Proprietary
1337±4
51,986
149
133162
qwq-32bAlibaba · Apache 2.0
1336±4
25,985
150
136163
grok-2-2024-08-13xAI · Proprietary
1335±4
63,495
151
137163
Metallama-3.1-405b-instruct-bf16Meta · Llama 3.1 Community
1335±4
41,392
152
136163
gpt-4o-2024-08-06OpenAI · Proprietary
1335±4
45,498
153
134163
gemini-advanced-0514Google · Proprietary
1335±5
50,142
154
132170
Stepfunstep-2-16k-exp-202412StepFun · Proprietary
1334±9
4,829
155
140163
Metallama-3.1-405b-instruct-fp8Meta · Llama 3.1 Community
1333±4
59,655
156
140171
olmo-3.1-32b-instructAi2 · Apache 2.0
1331±6
12,238
157
124194
molmo-2-8bAi2 · Apache 2.0
1329±21
812
158
143174
01.AIyi-lightning01 AI · Proprietary
1328±5
27,340
159
144174
qwen3-30b-a3bAlibaba · Apache 2.0
1328±5
27,260
160
145175
Metallama-4-maverick-17b-128e-instructMeta · Llama 4
1327±4
40,913
161
135185
Nvidiallama-3.3-nemotron-49b-super-v1Nvidia · Nvidia
1327±12
2,230
162
141185
Tencenthunyuan-large-2025-02-10Tencent · Proprietary
1326±10
3,738
163
155181
gpt-4-turbo-2024-04-09OpenAI · Proprietary
1324±4
98,130
164
146185
deepseek-v2.5-1210DeepSeek · DeepSeek
1323±8
6,793
165
155181
Anthropicclaude-3-5-haiku-20241022Anthropic · Proprietary
1323±3
70,951
166
155181
gemini-1.5-pro-001Google · Proprietary
1323±4
79,132
167
155183
Metallama-4-scout-17b-16e-instructMeta · Llama
1322±5
31,023
168
155185
Stepfunstep-1o-turbo-202506StepFun · Proprietary
1322±7
9,606
169
154186
gpt-4.1-nano-2025-04-14OpenAI · Proprietary
1322±8
6,107
170
156182
Anthropicclaude-3-opus-20240229Anthropic · Proprietary
1322±3
194,904
171
155187
ring-flash-2.0Ant Group · MIT
1320±7
7,147
172
157186
gemma-3n-e4b-itGoogle · Gemma
1319±5
23,170
173
157185
glm-4-plusZhipu AI · Proprietary
1319±5
26,134
174
160185
Metallama-3.3-70b-instructMeta · Llama-3.3
1319±3
55,436
175
157188
qwen-max-0919Alibaba · Qwen
1318±6
16,479
176
160186
gpt-4o-mini-2024-07-18OpenAI · Proprietary
1318±3
68,794
177
160188
Nvidianvidia-nemotron-3-nano-30b-a3b-bf16Nvidia · NVIDIA Open Model
1317±6
15,394
178
159192
gpt-oss-20bOpenAI · Apache 2.0
1317±6
10,751
179
160193
qwen2.5-plus-1127Alibaba · Proprietary
1315±6
10,179
180
163192
athene-v2-chatNexusFlow · NexusFlow
1314±5
24,746
181
164192
mistral-large-2407Mistral · Mistral Research
1314±4
45,460
182
165193
gpt-4-0125-previewOpenAI · Proprietary
1313±4
93,439
183
165193
gpt-4-1106-previewOpenAI · Proprietary
1313±4
100,107
184
160197
Tencenthunyuan-standard-2025-02-10Tencent · Proprietary
1311±10
3,905
185
174196
gemini-1.5-flash-002Google · Proprietary
1310±4
34,909
186
160204
mercuryInception AI · Proprietary
1308±14
1,884
187
177197
grok-2-mini-2024-08-13xAI · Proprietary
1308±4
52,574
188
177197
deepseek-v2.5DeepSeek · DeepSeek
1307±5
24,574
189
171197
olmo-3-32b-thinkAi2 · Apache 2.0
1306±8
5,863
190
177197
athene-70b-0725NexusFlow · CC-BY-NC-4.0
1306±6
19,622
191
180197
mistral-large-2411Mistral · MRL
1305±4
28,081
192
177197
magistral-medium-2506Mistral · Proprietary
1305±6
11,974
193
183197
mistral-small-3.1-24b-instruct-2503Mistral · Apache 2.0
1304±4
33,870
194
175204
gemma-3-4b-itGoogle · Gemma
1303±9
4,177
195
184197
qwen2.5-72b-instructAlibaba · Qwen
1303±4
39,409
196
184207
Nvidiallama-3.1-nemotron-70b-instructNvidia · Llama 3.1
1299±8
7,136
197
185208
Tencenthunyuan-large-visionTencent · Proprietary
1296±9
5,563
198
194208
Metallama-3.1-70b-instructMeta · Llama 3.1 Community
1293±4
55,234
199
194209
amazon-nova-pro-v1.0Amazon · Proprietary
1290±5
24,753
200
194212
jamba-1.5-largeAI21 Labs · Jamba Open
1288±7
8,659
201
196210
gemma-2-27b-itGoogle · Gemma license
1288±3
75,764
202
194212
reka-core-20240904Reka AI · Proprietary
1288±7
7,309
203
194218
ibm-granite-h-smallIBM · Apache 2.0
1287±8
5,619
204
196212
gpt-4-0314OpenAI · Proprietary
1287±5
54,167
205
194218
llama-3.1-tulu-3-70bAi2 · Llama 3.1
1286±10
2,846
206
194218
Nvidiallama-3.1-nemotron-51b-instructNvidia · Llama 3.1
1286±10
3,749
207
197212
gemini-1.5-flash-001Google · Proprietary
1285±4
62,823
208
196218
olmo-3.1-32b-thinkAi2 · Apache 2.0
1285±7
8,441
209
200218
Anthropicclaude-3-sonnet-20240229Anthropic · Proprietary
1281±4
109,289
210
199218
gemma-2-9b-it-simpoPrinceton · MIT
1279±7
10,069
211
201218
Nvidianemotron-4-340b-instructNvidia · NVIDIA Open Model
1277±5
19,661
212
201220
Coherecommand-r-plus-08-2024Cohere · CC-BY-NC-4.0
1276±7
9,869
213
205218
Metallama-3-70b-instructMeta · Llama 3 Community
1276±4
156,880
214
205219
gpt-4-0613OpenAI · Proprietary
1275±4
88,721
215
205221
mistral-small-24b-instruct-2501Mistral · Apache 2.0
1274±6
14,677
216
205222
glm-4-0520Zhipu AI · Proprietary
1273±7
9,788
217
205224
reka-flash-20240904Reka AI · Proprietary
1272±7
7,537
218
205227
qwen2.5-coder-32b-instructAlibaba · Apache 2.0
1270±8
5,430
219
213227
Coherec4ai-aya-expanse-32bCohere · CC-BY-NC-4.0
1267±5
27,123
220
215227
gemma-2-9b-itGoogle · Gemma license
1265±4
54,615
221
214228
deepseek-coder-v2DeepSeek · DeepSeek License
1264±6
15,147
222
217228
Coherecommand-r-plusCohere · CC-BY-NC-4.0
1262±4
77,556
223
216228
qwen2-72b-instructAlibaba · Qianwen LICENSE
1261±5
37,325
224
218228
Anthropicclaude-3-haiku-20240307Anthropic · Proprietary
1261±4
117,705
225
217229
amazon-nova-lite-v1.0Amazon · Proprietary
1261±5
19,376
226
218229
gemini-1.5-flash-8b-001Google · Proprietary
1259±4
35,556
227
221229
Azurephi-4Microsoft · MIT
1256±5
24,126
228
218235
olmo-2-0325-32b-instructAi2 · Apache-2.0
1252±11
3,335
229
225234
Coherecommand-r-08-2024Cohere · CC-BY-NC-4.0
1250±7
10,141
230
228238
mistral-large-2402Mistral · Proprietary
1242±5
62,437
231
228238
amazon-nova-micro-v1.0Amazon · Proprietary
1241±5
19,355
232
228241
jamba-1.5-miniAI21 Labs · Jamba Open
1239±7
8,854
233
228246
ministral-8b-2410Mistral · MRL
1237±9
4,780
234
229246
gemini-pro-dev-apiGoogle · Proprietary
1235±7
18,352
235
230245
qwen1.5-110b-chatAlibaba · Qianwen LICENSE
1234±6
26,191
236
228248
Tencenthunyuan-standard-256kTencent · Proprietary
1233±12
2,729
237
230247
reka-flash-21b-20240226-onlineReka AI · Proprietary
1233±7
15,451
238
230246
qwen1.5-72b-chatAlibaba · Qianwen LICENSE
1233±5
39,296
239
232247
mixtral-8x22b-instruct-v0.1Mistral · Apache 2.0
1229±5
51,417
240
233248
Coherecommand-rCohere · CC-BY-NC-4.0
1227±5
54,038
241
232248
reka-flash-21b-20240226Reka AI · Proprietary
1226±6
24,806
242
233249
gpt-3.5-turbo-0125OpenAI · Proprietary
1224±5
66,191
243
237249
Metallama-3-8b-instructMeta · Llama 3 Community
1223±4
104,636
244
233250
Coherec4ai-aya-expanse-8bCohere · CC-BY-NC-4.0
1223±7
9,827
245
234250
mistral-mediumMistral · Proprietary
1223±6
34,552
246
232253
gemini-proGoogle · Proprietary
1221±12
6,390
247
233252
llama-3.1-tulu-3-8bAi2 · Llama 3.1
1220±11
2,895
248
244253
01.AIyi-1.5-34b-chat01 AI · Apache-2.0
1213±5
24,142
249
239255
HuggingFacezephyr-orpo-141b-A35b-v0.1HuggingFace · Apache 2.0
1212±11
4,653
250
246253
Metallama-3.1-8b-instructMeta · Llama 3.1 Community
1211±4
49,605
251
242259
granite-3.1-8b-instructIBM · Apache 2.0
1208±11
3,092
252
247259
qwen1.5-32b-chatAlibaba · Qianwen LICENSE
1204±6
21,744
253
246261
gpt-3.5-turbo-1106OpenAI · Proprietary
1202±9
16,616
254
250260
gemma-2-2b-itGoogle · Gemma license
1199±4
46,618
255
250261
Azurephi-3-medium-4k-instructMicrosoft · MIT
1198±5
25,055
256
251261
mixtral-8x7b-instruct-v0.1Mistral · Apache 2.0
1197±4
73,505
257
251266
dbrx-instruct-previewDatabricks · DBRX LICENSE
1195±6
32,196
258
251270
InternLMinternlm2_5-20b-chatInternLM · Other
1191±7
9,902
259
251270
qwen1.5-14b-chatAlibaba · Qianwen LICENSE
1190±7
17,841
260
254276
Azurewizardlm-70bMicrosoft · Llama 2 Community
1184±9
8,214
261
253277
deepseek-llm-67b-chatDeepSeek · DeepSeek License
1184±12
4,933
262
257273
01.AIyi-34b-chat01 AI · Yi License
1183±7
15,483
263
257277
OpenChatopenchat-3.5-0106OpenChat · Apache-2.0
1182±8
12,636
264
257277
OpenChatopenchat-3.5OpenChat · Apache-2.0
1182±10
7,967
265
257277
granite-3.0-8b-instructIBM · Apache 2.0
1182±9
6,643
266
258277
gemma-1.1-7b-itGoogle · Gemma license
1180±6
23,893
267
258277
Snowflakesnowflake-arctic-instructSnowflake · Apache 2.0
1179±6
32,836
268
257278
granite-3.1-2b-instructIBM · Apache 2.0
1179±11
3,191
269
258278
tulu-2-dpo-70bAllenAI/UW · AI2 ImpACT Low-risk
1178±10
6,534
270
258281
openhermes-2.5-mistral-7bNousResearch · Apache-2.0
1175±10
5,006
271
260280
vicuna-33bLMSYS · Non-commercial
1172±6
22,479
272
260282
starling-lm-7b-betaNexusflow · Apache-2.0
1171±7
16,057
273
260281
Azurephi-3-small-8k-instructMicrosoft · MIT
1171±6
17,763
274
261281
Metallama-2-70b-chatMeta · Llama 2 Community
1170±6
38,491
275
261284
starling-lm-7b-alphaUC Berkeley · CC-BY-NC-4.0
1167±8
10,224
276
262284
Metallama-3.2-3b-instructMeta · Llama 3.2
1166±8
7,936
277
261287
nous-hermes-2-mixtral-8x7b-dpoNousResearch · Apache-2.0
1164±12
3,776
278
268292
qwq-32b-previewAlibaba · Apache 2.0
1157±12
3,233
279
274291
granite-3.0-2b-instructIBM · Apache 2.0
1156±8
6,837
280
270294
Nvidiallama2-70b-steerlm-chatNvidia · Llama 2 Community
1155±13
3,584
281
271297
solar-10.7b-instruct-v1.0Upstage AI · CC-BY-NC-4.0
1152±13
4,155
282
270299
dolphin-2.2.1-mistral-7bCognitive Computations · Apache-2.0
1152±15
1,679
283
275297
mpt-30b-chatMosaicML · CC-BY-NC-SA-4.0
1150±12
2,571
284
277294
mistral-7b-instruct-v0.2Mistral · Apache-2.0
1149±7
19,402
285
277296
Azurewizardlm-13bMicrosoft · Llama 2 Community
1149±9
7,046
286
275301
falcon-180b-chatTII · Falcon-180B TII License
1147±17
1,295
287
277300
qwen1.5-7b-chatAlibaba · Qianwen LICENSE
1144±10
4,735
288
278298
Azurephi-3-mini-4k-instruct-june-2024Microsoft · MIT
1143±6
12,296
289
278300
Metallama-2-13b-chatMeta · Llama 2 Community
1141±7
19,171
290
278300
vicuna-13bLMSYS · Llama 2 Community
1140±7
19,366
291
278302
qwen-14b-chatAlibaba · Qianwen LICENSE
1138±11
4,964
292
279302
palm-2Google · Proprietary
1137±9
8,554
293
280302
Metacodellama-34b-instructMeta · Llama 2 Community
1136±9
7,363
294
280302
gemma-7b-itGoogle · Gemma license
1136±10
8,925
295
282303
HuggingFacezephyr-7b-betaHuggingFace · MIT
1131±9
11,116
296
286304
Azurephi-3-mini-128k-instructMicrosoft · MIT
1129±7
20,691
297
287303
Azurephi-3-mini-4k-instructMicrosoft · MIT
1128±6
20,115
298
283307
guanaco-33bUW · Non-commercial
1127±12
2,921
299
282307
HuggingFacezephyr-7b-alphaHuggingFace · MIT
1127±16
1,785
300
290307
stripedhyena-nous-7bTogether AI · Apache 2.0
1121±11
5,184
301
285308
Metacodellama-70b-instructMeta · Llama 2 Community
1119±18
1,143
302
295307
vicuna-7bLMSYS · Llama 2 Community
1114±9
6,923
303
291308
HuggingFacesmollm2-1.7b-instructHuggingFace · Apache 2.0
1114±14
2,201
304
297307
gemma-1.1-2b-itGoogle · Gemma license
1114±8
10,853
305
298307
Metallama-3.2-1b-instructMeta · Llama 3.2
1111±8
8,045
306
298308
mistral-7b-instructMistral · Apache 2.0
1109±9
8,977
307
298308
Metallama-2-7b-chatMeta · Llama 2 Community
1108±7
14,148
308
304312
gemma-2b-itGoogle · Gemma license
1091±12
4,779
309
308311
qwen1.5-4b-chatAlibaba · Qianwen LICENSE
1090±9
7,598
310
308315
olmo-7b-instructAi2 · Apache-2.0
1074±11
6,329
311
309315
koala-13bUC Berkeley · Non-commercial
1070±10
6,964
312
310315
alpaca-13bStanford · Non-commercial
1067±12
5,745
313
308316
gpt4all-13b-snoozyNomic AI · Non-commercial
1065±15
1,743
314
310316
mpt-7b-chatMosaicML · CC-BY-NC-SA-4.0
1061±12
3,925
315
310316
chatglm3-6bTsinghua · Apache-2.0
1056±12
4,658
316
313318
RWKVRWKV-4-Raven-14BRWKV · Apache 2.0
1041±11
4,845
317
316318
chatglm2-6bTsinghua · Apache-2.0
1024±14
2,657
318
316318
oasst-pythia-12bOpenAssistant · Apache 2.0
1022±11
6,311
319
319322
chatglm-6bTsinghua · Non-commercial
995±13
4,914
320
319322
fastchat-t5-3bLMSYS · Apache 2.0
991±12
4,203
321
319322
dolly-v2-12bDatabricks · MIT
980±14
3,412
322
319323
Metallama-13bMeta · Non-commercial
972±16
2,391
323
322323
Stabilitystablelm-tuned-alpha-7bStability AI · CC-BY-NC-SA-4.0
952±13
3,287
说明
排名 (UB):基于 Bradley-Terry 模型计算的排名。此排名反映了模型在竞技场中的综合表现,并提供了其 Elo 分数的 上界 估计,帮助理解模型的潜在竞争力。
模型:大型语言模型 (LLM) 的名称。部分模型名称可能已嵌入相关链接。
分数:模型在竞技场中通过用户投票获得的 Elo 评分。Elo 评分是一种相对排名系统,分数越高表示模型表现越好。
95% 置信区间 (±):模型 Elo 评分的95%置信区间(例如:
±6)。这个区间越小,表示模型的评分越稳定和可靠。票数:该模型在竞技场中收到的总投票数量。投票数越多,通常意味着其评分的统计可靠性越高。
组织/公司:提供该模型的组织或公司。
许可证:模型的许可协议类型,例如专有 (Proprietary)、Apache 2.0、MIT 等。
数据来源与更新频率
本排行榜数据由自动化脚本直接从 1 2 官方网站获取。此排行榜由 GitHub Actions 每天自动更新。
免责声明
本报告仅供参考。排行榜数据是动态变化的,并基于特定时间段内用户在 Chatbot Arena 上的偏好投票。数据的完整性和准确性取决于上游数据源。不同模型可能采用不同的许可协议,使用时请务必参考模型提供商的官方说明。
最后更新于
这有帮助吗?