模型榜单
这是一个基于 Chatbot Arena (lmarena.ai) 数据的排行榜,通过自动化流程生成。
数据更新时间: 2026-02-25 08:18:37 UTC / 2026-02-25 16:18:37 CST (北京时间)
排行榜
1
14
Anthropicclaude-opus-4-6Anthropic · Proprietary
1504±7
7,042
2
14
Anthropicclaude-opus-4-6-thinkingAnthropic · Proprietary
1503±8
6,181
3
15
gemini-3.1-pro-previewGoogle · Proprietary
1500±9Preliminary
4,060
4
16
grok-4.20-beta1xAI · Proprietary
1492±10Preliminary
3,695
5
46
gemini-3-proGoogle · Proprietary
1486±4
37,854
6
311
gpt-5.2-chat-latest-20260210OpenAI · Proprietary
1481±11
3,093
7
611
gemini-3-flashGoogle · Proprietary
1473±5
28,847
8
611
grok-4.1-thinkingxAI · Proprietary
1473±4
37,167
9
614
Bytedancedola-seed-2.0-previewBytedance · Proprietary
1472±9Preliminary
4,180
10
611
Anthropicclaude-opus-4-5-20251101-thinking-32kAnthropic · Proprietary
1472±4
30,067
11
614
Anthropicclaude-opus-4-5-20251101Anthropic · Proprietary
1467±4
35,014
12
1018
grok-4.1xAI · Proprietary
1463±4
41,343
13
1018
gemini-3-flash (thinking-minimal)Google · Proprietary
1462±5
20,204
14
1027
Anthropicclaude-sonnet-4-6Anthropic · Proprietary
1457±10
3,294
15
1222
gpt-5.1-highOpenAI · Proprietary
1457±4
33,984
16
1227
glm-5Z.ai · MIT
1454±8
6,061
17
1227
ernie-5.0-0110Baidu · Proprietary
1453±6
13,419
18
1231
qwen3.5-397b-a17bAlibaba · Apache 2.0
1451±9
4,541
19
1429
MoonshotAIkimi-k2.5-thinkingMoonshot · Modified MIT
1450±6
10,687
20
1427
Anthropicclaude-sonnet-4-5-20250929-thinking-32kAnthropic · Proprietary
1450±3
48,477
21
1427
Anthropicclaude-sonnet-4-5-20250929Anthropic · Proprietary
1450±4
46,264
22
1527
gemini-2.5-proGoogle · Proprietary
1449±3
96,876
23
1429
ernie-5.0-preview-1203Baidu · Proprietary
1449±7
9,723
24
1529
Anthropicclaude-opus-4-1-20250805-thinking-16kAnthropic · Proprietary
1449±4
49,618
25
1529
Anthropicclaude-opus-4-1-20250805Anthropic · Proprietary
1446±3
77,242
26
1534
gpt-4.5-preview-2025-02-27OpenAI · Proprietary
1444±6
14,549
27
2132
chatgpt-4o-latest-20250326OpenAI · Proprietary
1443±3
82,954
28
1536
glm-4.7Z.ai · MIT
1441±6
11,936
29
2135
gpt-5.2-highOpenAI · Proprietary
1440±5
18,792
30
2637
gpt-5.1OpenAI · Proprietary
1437±4
36,267
31
2537
gpt-5.2OpenAI · Proprietary
1437±6
15,640
32
2544
MoonshotAIkimi-k2.5-instantMoonshot · Modified MIT
1435±7
6,722
33
2741
qwen3-max-previewAlibaba · Proprietary
1434±5
27,649
34
2741
gpt-5-highOpenAI · Proprietary
1434±5
32,357
35
2843
o3-2025-04-16OpenAI · Proprietary
1433±4
60,965
36
2945
grok-4-1-fast-reasoningxAI · Proprietary
1431±4
30,701
37
3049
MoonshotAIkimi-k2-thinking-turboMoonshot · Modified MIT
1429±4
35,751
38
3255
gpt-5-chatOpenAI · Proprietary
1426±4
31,614
39
3457
glm-4.6Z.ai · MIT
1425±4
35,129
40
3258
qwen3-max-2025-09-23Alibaba · Proprietary
1425±6
9,172
41
3558
Anthropicclaude-opus-4-20250514-thinking-16kAnthropic · Proprietary
1424±4
37,728
42
3260
deepseek-v3.2-expDeepSeek · MIT
1424±6
11,685
43
3260
deepseek-v3.2-exp-thinkingDeepSeek · MIT
1423±7
8,944
44
3758
qwen3-235b-a22b-instruct-2507Alibaba · Apache 2.0
1422±3
71,166
45
3463
grok-4-fast-chatxAI · Proprietary
1422±8
6,962
46
3760
deepseek-v3.2-thinkingDeepSeek · MIT
1421±5
25,287
47
3862
deepseek-v3.2DeepSeek · MIT
1419±4
30,323
48
3864
deepseek-r1-0528DeepSeek · MIT
1419±6
19,178
49
3665
ernie-5.0-preview-1022Baidu · Proprietary
1418±9
4,563
50
3865
deepseek-v3.1DeepSeek · MIT
1418±6
15,198
51
3865
MoonshotAIkimi-k2-0905-previewMoonshot · Modified MIT
1417±6
11,915
52
3865
deepseek-v3.1-thinkingDeepSeek · MIT
1417±7
11,919
53
3965
MoonshotAIkimi-k2-0711-previewMoonshot · Modified MIT
1417±5
28,445
54
3769
deepseek-v3.1-terminusDeepSeek · MIT
1416±10
3,746
55
3774
deepseek-v3.1-terminus-thinkingDeepSeek · MIT
1416±10
3,538
56
4065
mistral-large-3Mistral · Apache 2.0
1415±4
26,713
57
3968
qwen3-vl-235b-a22b-instructAlibaba · Apache 2.0
1415±6
11,599
58
3875
amazon-nova-experimental-chat-26-01-10Amazon · Proprietary
1415±10
3,252
59
4365
gpt-4.1-2025-04-14OpenAI · Proprietary
1413±4
51,856
60
4365
Anthropicclaude-opus-4-20250514Anthropic · Proprietary
1413±4
45,310
61
4669
grok-3-preview-02-24xAI · Proprietary
1411±4
33,845
62
4768
mistral-medium-2508Mistral · Proprietary
1411±3
65,199
63
4868
gemini-2.5-flashGoogle · Proprietary
1411±3
96,163
64
4674
glm-4.5Z.ai · MIT
1410±5
24,608
65
4974
grok-4-0709xAI · Proprietary
1409±4
41,771
66
5779
Anthropicclaude-haiku-4-5-20251001Anthropic · Proprietary
1405±4
46,912
67
5779
gemini-2.5-flash-preview-09-2025Google · Proprietary
1404±4
32,547
68
5779
grok-4-fast-reasoningxAI · Proprietary
1404±5
18,446
69
6281
o1-2024-12-17OpenAI · Proprietary
1402±4
27,822
70
6281
qwen3-235b-a22b-no-thinkingAlibaba · Apache 2.0
1402±4
39,301
71
6282
qwen3-next-80b-a3b-instructAlibaba · Apache 2.0
1402±5
22,675
72
6287
longcat-flash-chatMeituan · MIT
1400±6
11,492
73
6682
Anthropicclaude-sonnet-4-20250514-thinking-32kAnthropic · Proprietary
1400±4
35,984
74
6091
Minimaxminimax-m2.5MiniMax · Modified MIT
1399±8
5,631
75
6590
qwen3-235b-a22b-thinking-2507Alibaba · Apache 2.0
1399±6
9,189
76
6689
deepseek-r1DeepSeek · MIT
1398±5
18,537
77
6697
qwen3-vl-235b-a22b-thinkingAlibaba · Apache 2.0
1395±7
7,926
78
6698
amazon-nova-experimental-chat-12-10Amazon · Proprietary
1395±10
3,700
79
6992
deepseek-v3-0324DeepSeek · MIT
1394±4
46,442
80
6299
Tencenthunyuan-vision-1.5-thinkingTencent · Proprietary
1394±12
2,216
81
7197
mai-1-previewMicrosoft AI · Proprietary
1392±5
18,020
82
7397
o4-mini-2025-04-16OpenAI · Proprietary
1391±4
46,384
83
6998
Stepfunstep-3.5-flashStepFun · Apache 2.0
1391±7
8,214
84
7398
gpt-5-mini-highOpenAI · Proprietary
1390±5
26,951
85
7398
mimo-v2-flash (non-thinking)Xiaomi · MIT
1390±5
19,235
86
7398
Anthropicclaude-sonnet-4-20250514Anthropic · Proprietary
1390±4
41,371
87
7598
Anthropicclaude-3-7-sonnet-20250219-thinking-32kAnthropic · Proprietary
1388±4
39,733
88
7498
o1-previewOpenAI · Proprietary
1388±5
31,120
89
7499
mimo-v2-flash (thinking)Xiaomi · MIT
1387±6
10,877
90
73102
Tencenthunyuan-t1-20250711Tencent · Proprietary
1387±9
4,767
91
7699
qwen3-coder-480b-a35b-instructAlibaba · Apache 2.0
1387±5
26,414
92
7799
Minimaxminimax-m2.1-previewMiniMax · MIT
1386±5
17,099
93
7899
mistral-medium-2505Mistral · Proprietary
1385±5
34,390
94
78101
qwen3-30b-a3b-instruct-2507Alibaba · Apache 2.0
1384±5
23,952
95
78102
Tencenthunyuan-turbos-20250416Tencent · Proprietary
1383±6
11,000
96
81102
gpt-4.1-mini-2025-04-14OpenAI · Proprietary
1382±4
40,321
97
88102
gemini-2.5-flash-lite-preview-09-2025-no-thinkingGoogle · Proprietary
1380±4
46,869
98
78112
glm-4.6vZ.ai · MIT
1378±11
2,785
99
78116
trinity-largeArcee AI · Apache 2.0
1376±14
1,690
100
93109
qwen3-235b-a22bAlibaba · Apache 2.0
1375±5
27,026
101
93109
gemini-2.5-flash-lite-preview-06-17-thinkingGoogle · Proprietary
1375±4
33,681
102
94109
qwen2.5-maxAlibaba · Proprietary
1374±4
33,204
103
98109
Anthropicclaude-3-5-sonnet-20241022Anthropic · Proprietary
1372±3
89,297
104
98112
glm-4.5-airZ.ai · MIT
1372±4
31,168
105
98112
Anthropicclaude-3-7-sonnet-20250219Anthropic · Proprietary
1371±4
44,277
106
98115
qwen3-next-80b-a3b-thinkingAlibaba · Apache 2.0
1369±6
13,768
107
98115
Minimaxminimax-m1MiniMax · Apache 2.0
1367±4
36,573
108
98119
amazon-nova-experimental-chat-11-10Amazon · Proprietary
1365±5
18,678
109
102117
gemma-3-27b-itGoogle · Gemma
1365±4
48,462
110
98122
glm-4.7-flashZ.ai · MIT
1364±6
10,247
111
102120
o3-mini-highOpenAI · Proprietary
1364±5
18,584
112
102123
grok-3-mini-highxAI · Proprietary
1363±5
17,418
113
105123
gemini-2.0-flash-001Google · Proprietary
1361±4
44,689
114
105131
deepseek-v3DeepSeek · DeepSeek
1359±5
21,788
115
107132
grok-3-mini-betaxAI · Proprietary
1357±5
23,619
116
105138
intellect-3Prime Intellect · MIT
1356±8
5,289
117
109136
mistral-small-2506Mistral · Apache 2.0
1356±5
18,240
118
111137
gpt-oss-120bOpenAI · Apache 2.0
1354±4
30,767
119
112138
gemini-2.0-flash-lite-preview-02-05Google · Proprietary
1353±4
24,951
120
108140
glm-4.5vZ.ai · MIT
1353±8
4,954
121
114137
Coherecommand-a-03-2025Cohere · CC-BY-NC-4.0
1353±3
57,108
122
114138
gemini-1.5-pro-002Google · Proprietary
1351±3
55,607
123
114140
amazon-nova-experimental-chat-10-20Amazon · Proprietary
1350±6
11,338
124
109149
Tencenthunyuan-turbos-20250226Tencent · Proprietary
1349±12
2,226
125
116140
o3-miniOpenAI · Proprietary
1348±3
58,461
126
111151
amazon-nova-experimental-chat-10-09Amazon · Proprietary
1347±11
2,877
127
110152
Nvidiallama-3.1-nemotron-ultra-253b-v1Nvidia · Nvidia Open Model
1347±12
2,546
128
114147
Minimaxminimax-m2MiniMax · Apache 2.0
1347±8
6,688
129
114149
qwen3-32bAlibaba · Apache 2.0
1347±9
3,932
130
114145
ling-flash-2.0Ant Group · MIT
1347±7
6,998
131
114147
Stepfunstep-3StepFun · Apache 2.0
1346±7
6,568
132
114149
qwen-plus-0125Alibaba · Proprietary
1346±8
5,823
133
119142
gpt-4o-2024-05-13OpenAI · Proprietary
1346±3
112,863
134
116152
glm-4-plus-0111Zhipu · Proprietary
1343±8
5,760
135
122147
Anthropicclaude-3-5-sonnet-20240620Anthropic · Proprietary
1343±3
82,417
136
116154
gemma-3-12b-itGoogle · Gemma
1342±10
3,829
137
116155
Nvidianvidia-llama-3.3-nemotron-super-49b-v1.5Nvidia · Nvidia Open
1342±10
3,402
138
115157
Tencenthunyuan-turbo-0110Tencent · Proprietary
1340±12
2,295
139
122156
gpt-5-nano-highOpenAI · Proprietary
1338±7
8,355
140
125156
nova-2-liteAmazon · Proprietary
1337±6
12,112
141
126153
o1-miniOpenAI · Proprietary
1337±4
51,986
142
126156
qwq-32bAlibaba · Apache 2.0
1336±4
26,003
143
130155
Metallama-3.1-405b-instruct-bf16Meta · Llama 3.1 Community
1335±4
41,392
144
130156
grok-2-2024-08-13xAI · Proprietary
1335±4
63,495
145
127156
gpt-4o-2024-08-06OpenAI · Proprietary
1335±4
45,498
146
126156
gemini-advanced-0514Google · Proprietary
1335±5
50,142
147
125163
Stepfunstep-2-16k-exp-202412StepFun · Proprietary
1334±9
4,829
148
133156
Metallama-3.1-405b-instruct-fp8Meta · Llama 3.1 Community
1334±4
59,655
149
133165
olmo-3.1-32b-instructAllen AI · Apache 2.0
1330±6
12,252
150
117187
molmo-2-8bAI2 · Apache 2.0
1329±21
816
151
136167
01.AIyi-lightning01 AI · Proprietary
1329±5
27,340
152
137167
qwen3-30b-a3bAlibaba · Apache 2.0
1328±5
27,289
153
138167
Metallama-4-maverick-17b-128e-instructMeta · Llama 4
1328±4
40,938
154
127178
Nvidiallama-3.3-nemotron-49b-super-v1Nvidia · Nvidia
1327±12
2,230
155
134178
Tencenthunyuan-large-2025-02-10Tencent · Proprietary
1327±10
3,738
156
148174
gpt-4-turbo-2024-04-09OpenAI · Proprietary
1324±4
98,130
157
148174
Anthropicclaude-3-5-haiku-20241022Anthropic · Proprietary
1324±3
70,983
158
140178
deepseek-v2.5-1210DeepSeek · DeepSeek
1323±8
6,793
159
148174
gemini-1.5-pro-001Google · Proprietary
1323±4
79,132
160
148176
Metallama-4-scout-17b-16e-instructMeta · Llama
1323±5
31,061
161
149174
Anthropicclaude-3-opus-20240229Anthropic · Proprietary
1322±3
194,904
162
148178
Stepfunstep-1o-turbo-202506StepFun · Proprietary
1322±7
9,622
163
147178
gpt-4.1-nano-2025-04-14OpenAI · Proprietary
1322±8
6,107
164
148180
ring-flash-2.0Ant Group · MIT
1320±7
7,149
165
153178
Metallama-3.3-70b-instructMeta · Llama-3.3
1319±3
55,456
166
150178
glm-4-plusZhipu AI · Proprietary
1319±5
26,134
167
149178
gemma-3n-e4b-itGoogle · Gemma
1319±5
23,197
168
150181
qwen-max-0919Alibaba · Qwen
1318±6
16,479
169
153178
gpt-4o-mini-2024-07-18OpenAI · Proprietary
1318±3
68,794
170
153185
gpt-oss-20bOpenAI · Apache 2.0
1317±6
10,760
171
153184
Nvidianvidia-nemotron-3-nano-30b-a3b-bf16Nvidia · NVIDIA Open Model
1317±6
15,408
172
153186
qwen2.5-plus-1127Alibaba · Proprietary
1315±6
10,179
173
157185
athene-v2-chatNexusFlow · NexusFlow
1315±5
24,746
174
157185
mistral-large-2407Mistral · Mistral Research
1314±4
45,460
175
158186
gpt-4-0125-previewOpenAI · Proprietary
1313±4
93,439
176
158186
gpt-4-1106-previewOpenAI · Proprietary
1313±4
100,107
177
153190
Tencenthunyuan-standard-2025-02-10Tencent · Proprietary
1312±10
3,905
178
167189
gemini-1.5-flash-002Google · Proprietary
1310±4
34,909
179
153196
mercuryInception AI · Proprietary
1309±14
1,887
180
169190
grok-2-mini-2024-08-13xAI · Proprietary
1308±4
52,574
181
169190
deepseek-v2.5DeepSeek · DeepSeek
1307±5
24,574
182
169190
athene-70b-0725NexusFlow · CC-BY-NC-4.0
1306±6
19,622
183
167190
olmo-3-32b-thinkAllen AI · Apache 2.0
1306±8
5,869
184
173190
mistral-large-2411Mistral · MRL
1305±4
28,081
185
170190
magistral-medium-2506Mistral · Proprietary
1305±6
11,991
186
176190
mistral-small-3.1-24b-instruct-2503Mistral · Apache 2.0
1304±4
33,907
187
168197
gemma-3-4b-itGoogle · Gemma
1303±9
4,177
188
177190
qwen2.5-72b-instructAlibaba · Qwen
1303±4
39,409
189
177200
Nvidiallama-3.1-nemotron-70b-instructNvidia · Llama 3.1
1299±8
7,136
190
178201
Tencenthunyuan-large-visionTencent · Proprietary
1296±9
5,565
191
187201
Metallama-3.1-70b-instructMeta · Llama 3.1 Community
1294±4
55,234
192
188202
amazon-nova-pro-v1.0Amazon · Proprietary
1290±5
24,753
193
187205
jamba-1.5-largeAI21 Labs · Jamba Open
1289±7
8,659
194
189203
gemma-2-27b-itGoogle · Gemma license
1288±3
75,764
195
187205
reka-core-20240904Reka AI · Proprietary
1288±7
7,309
196
187211
ibm-granite-h-smallIBM · Apache 2.0
1287±8
5,622
197
189205
gpt-4-0314OpenAI · Proprietary
1287±5
54,167
198
187211
llama-3.1-tulu-3-70bAi2 · Llama 3.1
1287±10
2,846
199
187211
Nvidiallama-3.1-nemotron-51b-instructNvidia · Llama 3.1
1286±10
3,749
200
190205
gemini-1.5-flash-001Google · Proprietary
1286±4
62,823
201
189211
olmo-3.1-32b-thinkAllen AI · Apache 2.0
1285±7
8,437
202
193211
Anthropicclaude-3-sonnet-20240229Anthropic · Proprietary
1281±4
109,289
203
192211
gemma-2-9b-it-simpoPrinceton · MIT
1279±7
10,069
204
194211
Nvidianemotron-4-340b-instructNvidia · NVIDIA Open Model
1278±5
19,661
205
194213
Coherecommand-r-plus-08-2024Cohere · CC-BY-NC-4.0
1277±7
9,869
206
198211
Metallama-3-70b-instructMeta · Llama 3 Community
1276±4
156,880
207
198212
gpt-4-0613OpenAI · Proprietary
1275±4
88,721
208
198214
mistral-small-24b-instruct-2501Mistral · Apache 2.0
1274±6
14,677
209
198215
glm-4-0520Zhipu AI · Proprietary
1273±7
9,788
210
198217
reka-flash-20240904Reka AI · Proprietary
1272±7
7,537
211
198220
qwen2.5-coder-32b-instructAlibaba · Apache 2.0
1271±8
5,430
212
206220
Coherec4ai-aya-expanse-32bCohere · CC-BY-NC-4.0
1267±5
27,123
213
208220
gemma-2-9b-itGoogle · Gemma license
1266±4
54,615
214
207221
deepseek-coder-v2DeepSeek · DeepSeek License
1264±6
15,147
215
210221
Coherecommand-r-plusCohere · CC-BY-NC-4.0
1262±4
77,556
216
209221
qwen2-72b-instructAlibaba · Qianwen LICENSE
1262±5
37,325
217
211221
Anthropicclaude-3-haiku-20240307Anthropic · Proprietary
1261±4
117,705
218
210222
amazon-nova-lite-v1.0Amazon · Proprietary
1261±5
19,376
219
211222
gemini-1.5-flash-8b-001Google · Proprietary
1259±4
35,556
220
214222
Azurephi-4Microsoft · MIT
1256±5
24,126
221
211228
olmo-2-0325-32b-instructAllen AI · Apache-2.0
1252±11
3,335
222
218227
Coherecommand-r-08-2024Cohere · CC-BY-NC-4.0
1250±7
10,141
223
221231
mistral-large-2402Mistral · Proprietary
1243±5
62,437
224
221231
amazon-nova-micro-v1.0Amazon · Proprietary
1241±5
19,355
225
221235
jamba-1.5-miniAI21 Labs · Jamba Open
1239±7
8,854
226
221239
ministral-8b-2410Mistral · MRL
1237±9
4,780
227
222239
gemini-pro-dev-apiGoogle · Proprietary
1235±7
18,352
228
223239
qwen1.5-110b-chatAlibaba · Qianwen LICENSE
1234±6
26,191
229
221241
Tencenthunyuan-standard-256kTencent · Proprietary
1233±12
2,729
230
223240
reka-flash-21b-20240226-onlineReka AI · Proprietary
1233±7
15,451
231
223239
qwen1.5-72b-chatAlibaba · Qianwen LICENSE
1233±5
39,296
232
225240
mixtral-8x22b-instruct-v0.1Mistral · Apache 2.0
1230±5
51,417
233
225241
Coherecommand-rCohere · CC-BY-NC-4.0
1227±5
54,038
234
225241
reka-flash-21b-20240226Reka AI · Proprietary
1227±6
24,806
235
226242
gpt-3.5-turbo-0125OpenAI · Proprietary
1224±5
66,191
236
230242
Metallama-3-8b-instructMeta · Llama 3 Community
1223±4
104,636
237
226243
mistral-mediumMistral · Proprietary
1223±6
34,552
238
226243
Coherec4ai-aya-expanse-8bCohere · CC-BY-NC-4.0
1223±7
9,827
239
225246
gemini-proGoogle · Proprietary
1222±12
6,390
240
226246
llama-3.1-tulu-3-8bAi2 · Llama 3.1
1221±11
2,895
241
237246
01.AIyi-1.5-34b-chat01 AI · Apache-2.0
1213±5
24,142
242
232248
HuggingFacezephyr-orpo-141b-A35b-v0.1HuggingFace · Apache 2.0
1213±11
4,653
243
239246
Metallama-3.1-8b-instructMeta · Llama 3.1 Community
1212±4
49,605
244
235252
granite-3.1-8b-instructIBM · Apache 2.0
1209±11
3,092
245
239252
qwen1.5-32b-chatAlibaba · Qianwen LICENSE
1204±6
21,744
246
239254
gpt-3.5-turbo-1106OpenAI · Proprietary
1203±9
16,616
247
243253
gemma-2-2b-itGoogle · Gemma license
1199±4
46,618
248
243254
Azurephi-3-medium-4k-instructMicrosoft · MIT
1198±5
25,055
249
244254
mixtral-8x7b-instruct-v0.1Mistral · Apache 2.0
1197±4
73,505
250
244259
dbrx-instruct-previewDatabricks · DBRX LICENSE
1195±6
32,196
251
244263
InternLMinternlm2_5-20b-chatInternLM · Other
1191±7
9,902
252
244263
qwen1.5-14b-chatAlibaba · Qianwen LICENSE
1191±7
17,841
253
247269
Azurewizardlm-70bMicrosoft · Llama 2 Community
1185±9
8,214
254
246270
deepseek-llm-67b-chatDeepSeek · DeepSeek License
1185±12
4,933
255
250266
01.AIyi-34b-chat01 AI · Yi License
1184±7
15,483
256
250269
OpenChatopenchat-3.5-0106OpenChat · Apache-2.0
1182±8
12,636
257
250270
OpenChatopenchat-3.5OpenChat · Apache-2.0
1182±10
7,967
258
250270
granite-3.0-8b-instructIBM · Apache 2.0
1182±9
6,643
259
251270
gemma-1.1-7b-itGoogle · Gemma license
1180±6
23,893
260
251270
Snowflakesnowflake-arctic-instructSnowflake · Apache 2.0
1180±6
32,836
261
250271
granite-3.1-2b-instructIBM · Apache 2.0
1179±11
3,191
262
251271
tulu-2-dpo-70bAllenAI/UW · AI2 ImpACT Low-risk
1178±10
6,534
263
251274
openhermes-2.5-mistral-7bNousResearch · Apache-2.0
1175±10
5,006
264
253273
vicuna-33bLMSYS · Non-commercial
1173±6
22,479
265
253276
starling-lm-7b-betaNexusflow · Apache-2.0
1172±7
16,057
266
253274
Azurephi-3-small-8k-instructMicrosoft · MIT
1171±6
17,763
267
254274
Metallama-2-70b-chatMeta · Llama 2 Community
1171±6
38,491
268
254277
starling-lm-7b-alphaUC Berkeley · CC-BY-NC-4.0
1168±8
10,224
269
256277
Metallama-3.2-3b-instructMeta · Llama 3.2
1167±8
7,936
270
254280
nous-hermes-2-mixtral-8x7b-dpoNousResearch · Apache-2.0
1165±12
3,776
271
261286
qwq-32b-previewAlibaba · Apache 2.0
1157±12
3,233
272
267284
granite-3.0-2b-instructIBM · Apache 2.0
1156±8
6,837
273
263287
Nvidiallama2-70b-steerlm-chatNvidia · Llama 2 Community
1155±13
3,584
274
264290
solar-10.7b-instruct-v1.0Upstage AI · CC-BY-NC-4.0
1153±13
4,155
275
263292
dolphin-2.2.1-mistral-7bCognitive Computations · Apache-2.0
1152±15
1,679
276
268290
mpt-30b-chatMosaicML · CC-BY-NC-SA-4.0
1150±12
2,571
277
270287
mistral-7b-instruct-v0.2Mistral · Apache-2.0
1150±7
19,402
278
270289
Azurewizardlm-13bMicrosoft · Llama 2 Community
1149±9
7,046
279
267294
falcon-180b-chatTII · Falcon-180B TII License
1147±17
1,295
280
270293
qwen1.5-7b-chatAlibaba · Qianwen LICENSE
1144±10
4,735
281
271292
Azurephi-3-mini-4k-instruct-june-2024Microsoft · MIT
1143±6
12,296
282
271293
Metallama-2-13b-chatMeta · Llama 2 Community
1142±7
19,171
283
271293
vicuna-13bLMSYS · Llama 2 Community
1141±7
19,366
284
271295
qwen-14b-chatAlibaba · Qianwen LICENSE
1139±11
4,964
285
272295
palm-2Google · Proprietary
1137±9
8,554
286
272295
Metacodellama-34b-instructMeta · Llama 2 Community
1137±9
7,363
287
273295
gemma-7b-itGoogle · Gemma license
1136±10
8,925
288
275296
HuggingFacezephyr-7b-betaHuggingFace · MIT
1131±9
11,116
289
278297
Azurephi-3-mini-128k-instructMicrosoft · MIT
1129±7
20,691
290
280296
Azurephi-3-mini-4k-instructMicrosoft · MIT
1129±6
20,115
291
276300
guanaco-33bUW · Non-commercial
1127±12
2,921
292
275300
HuggingFacezephyr-7b-alphaHuggingFace · MIT
1127±16
1,785
293
283300
stripedhyena-nous-7bTogether AI · Apache 2.0
1121±11
5,184
294
278301
Metacodellama-70b-instructMeta · Llama 2 Community
1119±18
1,143
295
288300
vicuna-7bLMSYS · Llama 2 Community
1115±9
6,923
296
284301
HuggingFacesmollm2-1.7b-instructHuggingFace · Apache 2.0
1114±14
2,201
297
290300
gemma-1.1-2b-itGoogle · Gemma license
1114±8
10,853
298
291300
Metallama-3.2-1b-instructMeta · Llama 3.2
1111±8
8,045
299
291301
mistral-7b-instructMistral · Apache 2.0
1110±9
8,977
300
291301
Metallama-2-7b-chatMeta · Llama 2 Community
1108±7
14,148
301
297305
gemma-2b-itGoogle · Gemma license
1092±12
4,779
302
301304
qwen1.5-4b-chatAlibaba · Qianwen LICENSE
1090±9
7,598
303
301308
olmo-7b-instructAllen AI · Apache-2.0
1075±11
6,329
304
302308
koala-13bUC Berkeley · Non-commercial
1071±10
6,964
305
303308
alpaca-13bStanford · Non-commercial
1068±12
5,745
306
301309
gpt4all-13b-snoozyNomic AI · Non-commercial
1066±15
1,743
307
303309
mpt-7b-chatMosaicML · CC-BY-NC-SA-4.0
1062±12
3,925
308
303309
chatglm3-6bTsinghua · Apache-2.0
1056±12
4,658
309
306311
RWKVRWKV-4-Raven-14BRWKV · Apache 2.0
1041±11
4,845
310
309311
chatglm2-6bTsinghua · Apache-2.0
1024±14
2,657
311
309311
oasst-pythia-12bOpenAssistant · Apache 2.0
1022±11
6,311
312
312315
chatglm-6bTsinghua · Non-commercial
996±13
4,914
313
312315
fastchat-t5-3bLMSYS · Apache 2.0
991±12
4,203
314
312315
dolly-v2-12bDatabricks · MIT
980±14
3,412
315
312316
Metallama-13bMeta · Non-commercial
972±16
2,391
316
315316
Stabilitystablelm-tuned-alpha-7bStability AI · CC-BY-NC-SA-4.0
953±13
3,287
说明
排名 (UB):基于 Bradley-Terry 模型计算的排名。此排名反映了模型在竞技场中的综合表现,并提供了其 Elo 分数的 上界 估计,帮助理解模型的潜在竞争力。
模型:大型语言模型 (LLM) 的名称。部分模型名称可能已嵌入相关链接。
分数:模型在竞技场中通过用户投票获得的 Elo 评分。Elo 评分是一种相对排名系统,分数越高表示模型表现越好。
95% 置信区间 (±):模型 Elo 评分的95%置信区间(例如:
±6)。这个区间越小,表示模型的评分越稳定和可靠。票数:该模型在竞技场中收到的总投票数量。投票数越多,通常意味着其评分的统计可靠性越高。
组织/公司:提供该模型的组织或公司。
许可证:模型的许可协议类型,例如专有 (Proprietary)、Apache 2.0、MIT 等。
数据来源与更新频率
本排行榜数据由自动化脚本直接从 1 2 官方网站获取。此排行榜由 GitHub Actions 每天自动更新。
免责声明
本报告仅供参考。排行榜数据是动态变化的,并基于特定时间段内用户在 Chatbot Arena 上的偏好投票。数据的完整性和准确性取决于上游数据源。不同模型可能采用不同的许可协议,使用时请务必参考模型提供商的官方说明。
最后更新于
这有帮助吗?