Model Leaderboard
This leaderboard is generated by an automated pipeline from Chatbot Arena (lmarena.ai) data.
Data updated: 2026-01-16 08:08:37 UTC / 2026-01-16 16:08:37 CST (Beijing time)
Leaderboard
| Rank | Rank spread | Model | Score | 95% CI (±) | Votes | Organization | License |
| --- | --- | --- | --- | --- | --- | --- | --- |
| 1 | 1◄─►1 | gemini-3-pro | 1489 | ±5 | 26,385 | Google | Proprietary |
| 2 | 2◄─►6 | grok-4.1-thinking | 1477 | ±5 | 26,505 | xAI | Proprietary |
| 3 | 2◄─►9 | gemini-3-flash | 1471 | ±6 | 11,599 | Google | Proprietary |
| 4 | 2◄─►9 | claude-opus-4-5-20251101-thinking-32k | 1468 | ±5 | 18,518 | Anthropic | Proprietary |
| 5 | 2◄─►9 | claude-opus-4-5-20251101 | 1467 | ±5 | 19,770 | Anthropic | Proprietary |
| 6 | 3◄─►9 | grok-4.1 | 1466 | ±5 | 30,490 | xAI | Proprietary |
| 7 | 2◄─►10 | gemini-3-flash (thinking-minimal) | 1464 | ±9 | 5,530 | Google | Proprietary |
| 8 | 3◄─►15 | ernie-5.0-0110 | 1460 (Preliminary) | ±9 | 4,813 | Baidu | Proprietary |
| 9 | 3◄─►10 | gpt-5.1-high | 1460 | ±5 | 23,068 | OpenAI | Proprietary |
| 10 | 7◄─►19 | claude-sonnet-4-5-20250929-thinking-32k | 1452 | ±4 | 37,043 | Anthropic | Proprietary |
| 11 | 9◄─►19 | gemini-2.5-pro | 1450 | ±3 | 86,296 | Google | Proprietary |
| 12 | 9◄─►19 | claude-sonnet-4-5-20250929 | 1450 | ±4 | 33,416 | Anthropic | Proprietary |
| 13 | 9◄─►19 | claude-opus-4-1-20250805-thinking-16k | 1449 | ±4 | 50,088 | Anthropic | Proprietary |
| 14 | 9◄─►21 | ernie-5.0-preview-1203 | 1447 | ±7 | 8,397 | Baidu | Proprietary |
| 15 | 10◄─►20 | claude-opus-4-1-20250805 | 1445 | ±3 | 66,158 | Anthropic | Proprietary |
| 16 | 10◄─►22 | gpt-4.5-preview-2025-02-27 | 1444 | ±6 | 14,549 | OpenAI | Proprietary |
| 17 | 9◄─►25 | gpt-5.2-high | 1444 | ±8 | 7,230 | OpenAI | Proprietary |
| 18 | 10◄─►24 | glm-4.7 | 1443 (Preliminary) | ±7 | 8,258 | Z.ai | MIT |
| 19 | 14◄─►22 | chatgpt-4o-latest-20250326 | 1442 | ±3 | 73,467 | OpenAI | Proprietary |
| 20 | 10◄─►29 | gpt-5.2 | 1441 | ±10 | 3,723 | OpenAI | Proprietary |
| 21 | 15◄─►27 | gpt-5.1 | 1436 | ±5 | 24,788 | OpenAI | Proprietary |
| 22 | 16◄─►29 | gpt-5-high | 1435 | ±5 | 32,700 | OpenAI | Proprietary |
| 23 | 18◄─►32 | qwen3-max-preview | 1434 | ±5 | 27,923 | Alibaba | Proprietary |
| 24 | 18◄─►32 | o3-2025-04-16 | 1433 | ±4 | 61,465 | OpenAI | Proprietary |
| 25 | 19◄─►38 | grok-4-1-fast-reasoning | 1430 | ±5 | 19,706 | xAI | Proprietary |
| 26 | 20◄─►40 | kimi-k2-thinking-turbo | 1429 | ±5 | 24,742 | Moonshot | Modified MIT |
| 27 | 20◄─►46 | ernie-5.0-preview-1103 | 1428 | ±7 | 9,066 | Baidu | Proprietary |
| 28 | 21◄─►45 | gpt-5-chat | 1426 | ±4 | 31,898 | OpenAI | Proprietary |
| 29 | 21◄─►47 | qwen3-max-2025-09-23 | 1424 | ±6 | 9,234 | Alibaba | Proprietary |
| 30 | 25◄─►46 | glm-4.6 | 1424 | ±4 | 32,342 | Z.ai | MIT |
| 31 | 25◄─►46 | claude-opus-4-20250514-thinking-16k | 1424 | ±4 | 38,036 | Anthropic | Proprietary |
| 32 | 23◄─►49 | deepseek-v3.2-exp | 1423 | ±7 | 11,826 | DeepSeek | MIT |
| 33 | 23◄─►49 | deepseek-v3.2-exp-thinking | 1423 | ±7 | 9,015 | DeepSeek | MIT |
| 34 | 25◄─►46 | qwen3-235b-a22b-instruct-2507 | 1423 | ±3 | 60,768 | Alibaba | Apache 2.0 |
| 35 | 23◄─►51 | grok-4-fast-chat | 1422 | ±8 | 7,002 | xAI | Proprietary |
| 36 | 25◄─►51 | deepseek-v3.2-thinking | 1420 | ±6 | 14,504 | DeepSeek | MIT |
| 37 | 27◄─►53 | deepseek-r1-0528 | 1418 | ±6 | 19,309 | DeepSeek | MIT |
| 38 | 26◄─►54 | kimi-k2-0905-preview | 1418 | ±6 | 11,985 | Moonshot | Modified MIT |
| 39 | 27◄─►54 | deepseek-v3.2 | 1418 | ±6 | 19,082 | DeepSeek | MIT |
| 40 | 27◄─►53 | kimi-k2-0711-preview | 1417 | ±5 | 28,682 | Moonshot | Modified MIT |
| 41 | 25◄─►56 | ernie-5.0-preview-1022 | 1417 | ±9 | 4,647 | Baidu | Proprietary |
| 42 | 27◄─►54 | deepseek-v3.1 | 1417 | ±6 | 15,298 | DeepSeek | MIT |
| 43 | 27◄─►54 | deepseek-v3.1-thinking | 1417 | ±7 | 11,990 | DeepSeek | MIT |
| 44 | 25◄─►59 | deepseek-v3.1-terminus-thinking | 1416 | ±10 | 3,559 | DeepSeek | MIT |
| 45 | 26◄─►59 | deepseek-v3.1-terminus | 1416 | ±10 | 3,768 | DeepSeek | MIT |
| 46 | 28◄─►56 | qwen3-vl-235b-a22b-instruct | 1415 | ±6 | 11,704 | Alibaba | Apache 2.0 |
| 47 | 33◄─►54 | claude-opus-4-20250514 | 1414 | ±4 | 45,618 | Anthropic | Proprietary |
| 48 | 33◄─►54 | gpt-4.1-2025-04-14 | 1413 | ±4 | 52,289 | OpenAI | Proprietary |
| 49 | 32◄─►56 | mistral-large-3 | 1413 | ±6 | 15,465 | Mistral | Apache 2.0 |
| 50 | 35◄─►56 | mistral-medium-2508 | 1412 | ±4 | 54,571 | Mistral | Proprietary |
| 51 | 35◄─►58 | grok-3-preview-02-24 | 1411 | ±4 | 34,000 | xAI | Proprietary |
| 52 | 37◄─►59 | grok-4-0709 | 1410 | ±4 | 42,260 | xAI | Proprietary |
| 53 | 37◄─►64 | glm-4.5 | 1409 | ±5 | 24,816 | Z.ai | MIT |
| 54 | 39◄─►59 | gemini-2.5-flash | 1409 | ±3 | 85,645 | Google | Proprietary |
| 55 | 45◄─►68 | gemini-2.5-flash-preview-09-2025 | 1404 | ±4 | 33,008 | Google | Proprietary |
| 56 | 45◄─►68 | grok-4-fast-reasoning | 1404 | ±5 | 18,716 | xAI | Proprietary |
| 57 | 49◄─►68 | claude-haiku-4-5-20251001 | 1403 | ±4 | 35,608 | Anthropic | Proprietary |
| 58 | 49◄─►68 | o1-2024-12-17 | 1402 | ±4 | 27,822 | OpenAI | Proprietary |
| 59 | 54◄─►70 | qwen3-235b-a22b-no-thinking | 1401 | ±4 | 39,590 | Alibaba | Apache 2.0 |
| 60 | 54◄─►70 | qwen3-next-80b-a3b-instruct | 1401 | ±5 | 22,912 | Alibaba | Apache 2.0 |
| 61 | 54◄─►70 | claude-sonnet-4-20250514-thinking-32k | 1401 | ±4 | 36,273 | Anthropic | Proprietary |
| 62 | 54◄─►76 | longcat-flash-chat | 1399 | ±6 | 11,577 | Meituan | MIT |
| 63 | 55◄─►77 | qwen3-235b-a22b-thinking-2507 | 1398 | ±6 | 9,262 | Alibaba | Apache 2.0 |
| 64 | 55◄─►76 | deepseek-r1 | 1398 | ±5 | 18,537 | DeepSeek | MIT |
| 65 | 50◄─►82 | amazon-nova-experimental-chat-12-10 | 1396 | ±10 | 3,298 | Amazon | Proprietary |
| 66 | 55◄─►80 | qwen3-vl-235b-a22b-thinking | 1395 | ±7 | 7,981 | Alibaba | Apache 2.0 |
| 67 | 55◄─►81 | mimo-v2-flash (non-thinking) | 1395 | ±7 | 7,821 | Xiaomi | MIT |
| 68 | 59◄─►78 | deepseek-v3-0324 | 1394 | ±4 | 46,744 | DeepSeek | MIT |
| 69 | 54◄─►83 | hunyuan-vision-1.5-thinking | 1393 | ±12 | 2,225 | Tencent | Proprietary |
| 70 | 59◄─►82 | mai-1-preview | 1391 | ±5 | 18,174 | Microsoft AI | Proprietary |
| 71 | 62◄─►81 | o4-mini-2025-04-16 | 1391 | ±4 | 46,744 | OpenAI | Proprietary |
| 72 | 62◄─►82 | gpt-5-mini-high | 1391 | ±5 | 27,240 | OpenAI | Proprietary |
| 73 | 62◄─►82 | claude-sonnet-4-20250514 | 1390 | ±4 | 41,697 | Anthropic | Proprietary |
| 74 | 62◄─►82 | claude-3-7-sonnet-20250219-thinking-32k | 1389 | ±4 | 39,925 | Anthropic | Proprietary |
| 75 | 62◄─►82 | o1-preview | 1389 | ±5 | 31,120 | OpenAI | Proprietary |
| 76 | 62◄─►85 | hunyuan-t1-20250711 | 1387 | ±9 | 4,806 | Tencent | Proprietary |
| 77 | 64◄─►83 | qwen3-coder-480b-a35b-instruct | 1386 | ±5 | 26,667 | Alibaba | Apache 2.0 |
| 78 | 66◄─►83 | mistral-medium-2505 | 1384 | ±5 | 34,594 | Mistral | Proprietary |
| 79 | 65◄─►88 | minimax-m2.1-preview | 1383 (Preliminary) | ±7 | 7,332 | MiniMax | MIT |
| 80 | 67◄─►85 | qwen3-30b-a3b-instruct-2507 | 1383 | ±5 | 24,125 | Alibaba | Apache 2.0 |
| 81 | 66◄─►88 | hunyuan-turbos-20250416 | 1382 | ±6 | 11,063 | Tencent | Proprietary |
| 82 | 69◄─►85 | gpt-4.1-mini-2025-04-14 | 1382 | ±4 | 40,576 | OpenAI | Proprietary |
| 83 | 75◄─►89 | gemini-2.5-flash-lite-preview-09-2025-no-thinking | 1379 | ±4 | 38,666 | Google | Proprietary |
| 84 | 78◄─►91 | gemini-2.5-flash-lite-preview-06-17-thinking | 1375 | ±4 | 33,970 | Google | Proprietary |
| 85 | 78◄─►92 | qwen3-235b-a22b | 1374 | ±5 | 27,185 | Alibaba | Apache 2.0 |
| 86 | 81◄─►92 | qwen2.5-max | 1374 | ±4 | 33,301 | Alibaba | Proprietary |
| 87 | 81◄─►91 | claude-3-5-sonnet-20241022 | 1373 | ±3 | 89,495 | Anthropic | Proprietary |
| 88 | 81◄─►94 | claude-3-7-sonnet-20250219 | 1372 | ±4 | 44,478 | Anthropic | Proprietary |
| 89 | 83◄─►95 | glm-4.5-air | 1371 | ±4 | 31,476 | Z.ai | MIT |
| 90 | 84◄─►98 | qwen3-next-80b-a3b-thinking | 1369 | ±6 | 13,872 | Alibaba | Apache 2.0 |
| 91 | 84◄─►98 | minimax-m1 | 1367 | ±4 | 36,915 | MiniMax | Apache 2.0 |
| 92 | 88◄─►98 | gemma-3-27b-it | 1365 | ±4 | 48,740 | Google | Gemma |
| 93 | 88◄─►102 | o3-mini-high | 1364 | ±5 | 18,584 | OpenAI | Proprietary |
| 94 | 86◄─►108 | amazon-nova-experimental-chat-11-10 | 1363 | ±7 | 8,278 | Amazon | Proprietary |
| 95 | 89◄─►106 | grok-3-mini-high | 1362 | ±5 | 17,555 | xAI | Proprietary |
| 96 | 90◄─►106 | gemini-2.0-flash-001 | 1361 | ±4 | 44,838 | Google | Proprietary |
| 97 | 90◄─►111 | deepseek-v3 | 1359 | ±5 | 21,788 | DeepSeek | DeepSeek |
| 98 | 93◄─►117 | grok-3-mini-beta | 1356 | ±5 | 23,801 | xAI | Proprietary |
| 99 | 93◄─►118 | mistral-small-2506 | 1356 | ±5 | 18,377 | Mistral | Apache 2.0 |
| 100 | 90◄─►120 | intellect-3 | 1355 | ±8 | 5,341 | Prime Intellect | MIT |
| 101 | 94◄─►120 | gpt-oss-120b | 1354 | ±4 | 31,035 | OpenAI | Apache 2.0 |
| 102 | 94◄─►120 | gemini-2.0-flash-lite-preview-02-05 | 1353 | ±4 | 24,951 | Google | Proprietary |
| 103 | 93◄─►121 | glm-4.5v | 1353 | ±8 | 4,984 | Z.ai | MIT |
| 104 | 96◄─►120 | command-a-03-2025 | 1353 | ±3 | 57,523 | Cohere | CC-BY-NC-4.0 |
| 105 | 97◄─►120 | gemini-1.5-pro-002 | 1352 | ±3 | 55,607 | Google | Proprietary |
| 106 | 93◄─►132 | hunyuan-turbos-20250226 | 1349 | ±12 | 2,226 | Tencent | Proprietary |
| 107 | 98◄─►121 | o3-mini | 1348 | ±3 | 58,670 | OpenAI | Proprietary |
| 108 | 94◄─►133 | llama-3.1-nemotron-ultra-253b-v1 | 1347 | ±12 | 2,546 | Nvidia | Nvidia Open Model |
| 109 | 94◄─►133 | amazon-nova-experimental-chat-10-09 | 1347 | ±11 | 2,904 | Amazon | Proprietary |
| 110 | 98◄─►124 | amazon-nova-experimental-chat-10-20 | 1347 | ±6 | 11,471 | Amazon | Proprietary |
| 111 | 96◄─►132 | qwen3-32b | 1346 | ±9 | 3,932 | Alibaba | Apache 2.0 |
| 112 | 97◄─►131 | minimax-m2 | 1346 | ±8 | 6,775 | MiniMax | Apache 2.0 |
| 113 | 98◄─►129 | ling-flash-2.0 | 1346 | ±7 | 7,054 | Ant Group | MIT |
| 114 | 98◄─►130 | step-3 | 1346 | ±7 | 6,603 | StepFun | Apache 2.0 |
| 115 | 100◄─►122 | gpt-4o-2024-05-13 | 1346 | ±3 | 112,863 | OpenAI | Proprietary |
| 116 | 97◄─►132 | qwen-plus-0125 | 1346 | ±8 | 5,823 | Alibaba | Proprietary |
| 117 | 105◄─►127 | claude-3-5-sonnet-20240620 | 1343 | ±3 | 82,417 | Anthropic | Proprietary |
| 118 | 98◄─►133 | glm-4-plus-0111 | 1343 | ±8 | 5,760 | Zhipu | Proprietary |
| 119 | 99◄─►136 | gemma-3-12b-it | 1342 | ±10 | 3,829 | Google | Gemma |
| 120 | 98◄─►138 | hunyuan-turbo-0110 | 1340 | ±12 | 2,295 | Tencent | Proprietary |
| 121 | 100◄─►137 | nvidia-llama-3.3-nemotron-super-49b-v1.5 | 1340 | ±10 | 3,430 | Nvidia | Nvidia Open |
| 122 | 107◄─►137 | gpt-5-nano-high | 1338 | ±7 | 8,401 | OpenAI | Proprietary |
| 123 | 109◄─►134 | o1-mini | 1337 | ±4 | 51,986 | OpenAI | Proprietary |
| 124 | 110◄─►134 | llama-3.1-405b-instruct-bf16 | 1336 | ±4 | 41,392 | Meta | Llama 3.1 Community |
| 125 | 108◄─►137 | nova-2-lite | 1336 | ±6 | 12,303 | Amazon | Proprietary |
| 126 | 109◄─►137 | qwq-32b | 1336 | ±4 | 26,138 | Alibaba | Apache 2.0 |
| 127 | 110◄─►137 | gpt-4o-2024-08-06 | 1336 | ±4 | 45,498 | OpenAI | Proprietary |
| 128 | 109◄─►137 | gemini-advanced-0514 | 1335 | ±5 | 50,142 | Google | Proprietary |
| 129 | 112◄─►136 | grok-2-2024-08-13 | 1335 | ±4 | 63,495 | xAI | Proprietary |
| 130 | 113◄─►137 | llama-3.1-405b-instruct-fp8 | 1334 | ±4 | 59,655 | Meta | Llama 3.1 Community |
| 131 | 108◄─►145 | step-2-16k-exp-202412 | 1334 | ±9 | 4,829 | StepFun | Proprietary |
| 132 | 119◄─►149 | yi-lightning | 1329 | ±5 | 27,340 | 01 AI | Proprietary |
| 133 | 121◄─►149 | llama-4-maverick-17b-128e-instruct | 1328 | ±4 | 41,190 | Meta | Llama 4 |
| 134 | 121◄─►152 | qwen3-30b-a3b | 1328 | ±5 | 27,458 | Alibaba | Apache 2.0 |
| 135 | 111◄─►159 | llama-3.3-nemotron-49b-super-v1 | 1327 | ±12 | 2,230 | Nvidia | Nvidia |
| 136 | 116◄─►159 | hunyuan-large-2025-02-10 | 1327 | ±10 | 3,738 | Tencent | Proprietary |
| 137 | 131◄─►155 | gpt-4-turbo-2024-04-09 | 1325 | ±4 | 98,130 | OpenAI | Proprietary |
| 138 | 131◄─►154 | claude-3-5-haiku-20241022 | 1324 | ±3 | 71,227 | Anthropic | Proprietary |
| 139 | 131◄─►155 | gemini-1.5-pro-001 | 1323 | ±4 | 79,132 | Google | Proprietary |
| 140 | 123◄─►159 | deepseek-v2.5-1210 | 1323 | ±8 | 6,793 | DeepSeek | DeepSeek |
| 141 | 131◄─►157 | llama-4-scout-17b-16e-instruct | 1323 | ±5 | 31,288 | Meta | Llama |
| 142 | 131◄─►155 | claude-3-opus-20240229 | 1323 | ±3 | 194,904 | Anthropic | Proprietary |
| 143 | 130◄─►160 | gpt-4.1-nano-2025-04-14 | 1322 | ±8 | 6,107 | OpenAI | Proprietary |
| 144 | 131◄─►160 | step-1o-turbo-202506 | 1321 | ±7 | 9,707 | StepFun | Proprietary |
| 145 | 131◄─►160 | ring-flash-2.0 | 1320 | ±7 | 7,209 | Ant Group | MIT |
| 146 | 134◄─►159 | llama-3.3-70b-instruct | 1320 | ±3 | 55,602 | Meta | Llama-3.3 |
| 147 | 132◄─►159 | glm-4-plus | 1319 | ±5 | 26,134 | Zhipu AI | Proprietary |
| 148 | 132◄─►160 | gemma-3n-e4b-it | 1319 | ±5 | 23,379 | Google | Gemma |
| 149 | 132◄─►161 | qwen-max-0919 | 1318 | ±6 | 16,479 | Alibaba | Qwen |
| 150 | 136◄─►160 | gpt-4o-mini-2024-07-18 | 1318 | ±3 | 68,801 | OpenAI | Proprietary |
| 151 | 132◄─►166 | gpt-oss-20b | 1318 | ±6 | 10,816 | OpenAI | Apache 2.0 |
| 152 | 134◄─►166 | nvidia-nemotron-3-nano-30b-a3b-bf16 | 1317 | ±7 | 10,362 | Nvidia | NVIDIA Open Model |
| 153 | 135◄─►168 | qwen2.5-plus-1127 | 1315 | ±6 | 10,179 | Alibaba | Proprietary |
| 154 | 139◄─►166 | mistral-large-2407 | 1315 | ±4 | 45,460 | Mistral | Mistral Research |
| 155 | 139◄─►166 | athene-v2-chat | 1314 | ±4 | 24,746 | NexusFlow | NexusFlow |
| 156 | 140◄─►166 | gpt-4-1106-preview | 1314 | ±4 | 100,107 | OpenAI | Proprietary |
| 157 | 140◄─►166 | gpt-4-0125-preview | 1314 | ±4 | 93,439 | OpenAI | Proprietary |
| 158 | 135◄─►171 | hunyuan-standard-2025-02-10 | 1312 | ±10 | 3,905 | Tencent | Proprietary |
| 159 | 145◄─►170 | gemini-1.5-flash-002 | 1310 | ±4 | 34,909 | Google | Proprietary |
| 160 | 134◄─►176 | mercury | 1309 | ±14 | 1,929 | Inception AI | Proprietary |
| 161 | 151◄─►171 | grok-2-mini-2024-08-13 | 1308 | ±4 | 52,574 | xAI | Proprietary |
| 162 | 151◄─►171 | deepseek-v2.5 | 1307 | ±5 | 24,574 | DeepSeek | DeepSeek |
| 163 | 151◄─►171 | athene-70b-0725 | 1306 | ±6 | 19,622 | NexusFlow | CC-BY-NC-4.0 |
| 164 | 151◄─►171 | magistral-medium-2506 | 1306 | ±6 | 12,083 | Mistral | Proprietary |
| 165 | 157◄─►171 | mistral-large-2411 | 1305 | ±4 | 28,081 | Mistral | MRL |
| 166 | 150◄─►176 | olmo-3-32b-think | 1305 | ±8 | 5,919 | Allen AI | Apache 2.0 |
| 167 | 157◄─►171 | mistral-small-3.1-24b-instruct-2503 | 1305 | ±4 | 34,159 | Mistral | Apache 2.0 |
| 168 | 151◄─►178 | gemma-3-4b-it | 1303 | ±9 | 4,177 | Google | Gemma |
| 169 | 158◄─►171 | qwen2.5-72b-instruct | 1303 | ±4 | 39,409 | Alibaba | Qwen |
| 170 | 158◄─►180 | llama-3.1-nemotron-70b-instruct | 1299 | ±8 | 7,136 | Nvidia | Llama 3.1 |
| 171 | 159◄─►181 | hunyuan-large-vision | 1296 | ±9 | 5,608 | Tencent | Proprietary |
| 172 | 167◄─►181 | llama-3.1-70b-instruct | 1294 | ±4 | 55,234 | Meta | Llama 3.1 Community |
| 173 | 169◄─►183 | amazon-nova-pro-v1.0 | 1290 | ±5 | 24,753 | Amazon | Proprietary |
| 174 | 167◄─►185 | jamba-1.5-large | 1289 | ±7 | 8,659 | AI21 Labs | Jamba Open |
| 175 | 170◄─►183 | gemma-2-27b-it | 1289 | ±3 | 75,764 | Google | Gemma license |
| 176 | 167◄─►188 | ibm-granite-h-small | 1289 | ±8 | 5,688 | IBM | Apache 2.0 |
| 177 | 169◄─►185 | reka-core-20240904 | 1288 | ±7 | 7,309 | Reka AI | Proprietary |
| 178 | 170◄─►185 | gpt-4-0314 | 1288 | ±5 | 54,167 | OpenAI | Proprietary |
| 179 | 167◄─►191 | llama-3.1-nemotron-51b-instruct | 1287 | ±10 | 3,749 | Nvidia | Llama 3.1 |
| 180 | 167◄─►191 | llama-3.1-tulu-3-70b | 1287 | ±10 | 2,846 | Ai2 | Llama 3.1 |
| 181 | 171◄─►185 | gemini-1.5-flash-001 | 1286 | ±4 | 62,823 | Google | Proprietary |
| 182 | 173◄─►191 | claude-3-sonnet-20240229 | 1282 | ±4 | 109,289 | Anthropic | Proprietary |
| 183 | 173◄─►191 | gemma-2-9b-it-simpo | 1280 | ±7 | 10,069 | Princeton | MIT |
| 184 | 175◄─►191 | nemotron-4-340b-instruct | 1278 | ±5 | 19,661 | Nvidia | NVIDIA Open Model |
| 185 | 175◄─►193 | command-r-plus-08-2024 | 1277 | ±7 | 9,869 | Cohere | CC-BY-NC-4.0 |
| 186 | 179◄─►191 | llama-3-70b-instruct | 1277 | ±4 | 156,880 | Meta | Llama 3 Community |
| 187 | 179◄─►191 | gpt-4-0613 | 1276 | ±4 | 88,721 | OpenAI | Proprietary |
| 188 | 180◄─►194 | mistral-small-24b-instruct-2501 | 1274 | ±6 | 14,677 | Mistral | Apache 2.0 |
| 189 | 179◄─►196 | glm-4-0520 | 1274 | ±7 | 9,788 | Zhipu AI | Proprietary |
| 190 | 180◄─►196 | reka-flash-20240904 | 1273 | ±7 | 7,537 | Reka AI | Proprietary |
| 191 | 180◄─►200 | qwen2.5-coder-32b-instruct | 1271 | ±8 | 5,430 | Alibaba | Apache 2.0 |
| 192 | 186◄─►200 | c4ai-aya-expanse-32b | 1268 | ±5 | 27,123 | Cohere | CC-BY-NC-4.0 |
| 193 | 188◄─►200 | gemma-2-9b-it | 1266 | ±4 | 54,615 | Google | Gemma license |
| 194 | 187◄─►201 | deepseek-coder-v2 | 1265 | ±6 | 15,147 | DeepSeek | DeepSeek License |
| 195 | 189◄─►201 | command-r-plus | 1263 | ±4 | 77,556 | Cohere | CC-BY-NC-4.0 |
| 196 | 189◄─►201 | qwen2-72b-instruct | 1263 | ±5 | 37,325 | Alibaba | Qianwen LICENSE |
| 197 | 191◄─►201 | claude-3-haiku-20240307 | 1262 | ±4 | 117,705 | Anthropic | Proprietary |
| 198 | 191◄─►202 | amazon-nova-lite-v1.0 | 1261 | ±5 | 19,376 | Amazon | Proprietary |
| 199 | 191◄─►202 | gemini-1.5-flash-8b-001 | 1259 | ±4 | 35,556 | Google | Proprietary |
| 200 | 194◄─►202 | phi-4 | 1256 | ±5 | 24,126 | Microsoft | MIT |
| 201 | 191◄─►208 | olmo-2-0325-32b-instruct | 1253 | ±11 | 3,335 | Allen AI | Apache-2.0 |
| 202 | 198◄─►207 | command-r-08-2024 | 1251 | ±7 | 10,141 | Cohere | CC-BY-NC-4.0 |
| 203 | 201◄─►211 | mistral-large-2402 | 1244 | ±5 | 62,437 | Mistral | Proprietary |
| 204 | 201◄─►211 | amazon-nova-micro-v1.0 | 1241 | ±5 | 19,355 | Amazon | Proprietary |
| 205 | 201◄─►215 | jamba-1.5-mini | 1239 | ±7 | 8,854 | AI21 Labs | Jamba Open |
| 206 | 201◄─►219 | ministral-8b-2410 | 1237 | ±9 | 4,780 | Mistral | MRL |
| 207 | 202◄─►219 | gemini-pro-dev-api | 1236 | ±7 | 18,352 | Google | Proprietary |
| 208 | 203◄─►219 | qwen1.5-110b-chat | 1235 | ±6 | 26,191 | Alibaba | Qianwen LICENSE |
| 209 | 203◄─►220 | reka-flash-21b-20240226-online | 1234 | ±7 | 15,451 | Reka AI | Proprietary |
| 210 | 203◄─►219 | qwen1.5-72b-chat | 1234 | ±5 | 39,296 | Alibaba | Qianwen LICENSE |
| 211 | 201◄─►221 | hunyuan-standard-256k | 1233 | ±12 | 2,729 | Tencent | Proprietary |
| 212 | 205◄─►220 | mixtral-8x22b-instruct-v0.1 | 1231 | ±5 | 51,417 | Mistral | Apache 2.0 |
| 213 | 205◄─►221 | command-r | 1228 | ±5 | 54,038 | Cohere | CC-BY-NC-4.0 |
| 214 | 205◄─►221 | reka-flash-21b-20240226 | 1228 | ±6 | 24,806 | Reka AI | Proprietary |
| 215 | 206◄─►221 | gpt-3.5-turbo-0125 | 1225 | ±5 | 66,191 | OpenAI | Proprietary |
| 216 | 206◄─►223 | mistral-medium | 1224 | ±5 | 34,552 | Mistral | Proprietary |
| 217 | 210◄─►222 | llama-3-8b-instruct | 1224 | ±4 | 104,636 | Meta | Llama 3 Community |
| 218 | 206◄─►223 | c4ai-aya-expanse-8b | 1223 | ±7 | 9,827 | Cohere | CC-BY-NC-4.0 |
| 219 | 205◄─►226 | gemini-pro | 1223 | ±12 | 6,390 | Google | Proprietary |
| 220 | 206◄─►226 | llama-3.1-tulu-3-8b | 1221 | ±11 | 2,895 | Ai2 | Llama 3.1 |
| 221 | 217◄─►226 | yi-1.5-34b-chat | 1214 | ±5 | 24,142 | 01 AI | Apache-2.0 |
| 222 | 212◄─►227 | zephyr-orpo-141b-A35b-v0.1 | 1214 | ±11 | 4,653 | HuggingFace | Apache 2.0 |
| 223 | 219◄─►226 | llama-3.1-8b-instruct | 1212 | ±4 | 49,605 | Meta | Llama 3.1 Community |
| 224 | 215◄─►232 | granite-3.1-8b-instruct | 1210 | ±11 | 3,092 | IBM | Apache 2.0 |
| 225 | 219◄─►232 | qwen1.5-32b-chat | 1205 | ±6 | 21,744 | Alibaba | Qianwen LICENSE |
| 226 | 219◄─►234 | gpt-3.5-turbo-1106 | 1204 | ±9 | 16,616 | OpenAI | Proprietary |
| 227 | 223◄─►234 | phi-3-medium-4k-instruct | 1199 | ±5 | 25,055 | Microsoft | MIT |
| 228 | 224◄─►234 | gemma-2-2b-it | 1199 | ±4 | 46,618 | Google | Gemma license |
| 229 | 224◄─►234 | mixtral-8x7b-instruct-v0.1 | 1198 | ±4 | 73,505 | Mistral | Apache 2.0 |
| 230 | 224◄─►239 | dbrx-instruct-preview | 1196 | ±6 | 32,196 | Databricks | DBRX LICENSE |
| 231 | 224◄─►243 | qwen1.5-14b-chat | 1192 | ±7 | 17,841 | Alibaba | Qianwen LICENSE |
| 232 | 224◄─►243 | internlm2_5-20b-chat | 1192 | ±7 | 9,902 | InternLM | Other |
| 233 | 226◄─►249 | wizardlm-70b | 1186 | ±9 | 8,214 | Microsoft | Llama 2 Community |
| 234 | 226◄─►250 | deepseek-llm-67b-chat | 1186 | ±12 | 4,933 | DeepSeek | DeepSeek License |
| 235 | 230◄─►246 | yi-34b-chat | 1185 | ±7 | 15,483 | 01 AI | Yi License |
| 236 | 230◄─►250 | openchat-3.5 | 1184 | ±10 | 7,967 | OpenChat | Apache-2.0 |
| 237 | 230◄─►249 | openchat-3.5-0106 | 1184 | ±8 | 12,636 | OpenChat | Apache-2.0 |
| 238 | 230◄─►250 | granite-3.0-8b-instruct | 1183 | ±9 | 6,643 | IBM | Apache 2.0 |
| 239 | 231◄─►250 | gemma-1.1-7b-it | 1181 | ±6 | 23,893 | Google | Gemma license |
| 240 | 231◄─►250 | snowflake-arctic-instruct | 1181 | ±6 | 32,836 | Snowflake | Apache 2.0 |
| 241 | 230◄─►252 | granite-3.1-2b-instruct | 1180 | ±11 | 3,191 | IBM | Apache 2.0 |
| 242 | 231◄─►250 | tulu-2-dpo-70b | 1179 | ±10 | 6,534 | AllenAI/UW | AI2 ImpACT Low-risk |
| 243 | 231◄─►254 | openhermes-2.5-mistral-7b | 1176 | ±10 | 5,006 | NousResearch | Apache-2.0 |
| 244 | 233◄─►253 | vicuna-33b | 1174 | ±6 | 22,479 | LMSYS | Non-commercial |
| 245 | 233◄─►256 | starling-lm-7b-beta | 1172 | ±7 | 16,057 | Nexusflow | Apache-2.0 |
| 246 | 233◄─►254 | phi-3-small-8k-instruct | 1172 | ±6 | 17,763 | Microsoft | MIT |
| 247 | 234◄─►254 | llama-2-70b-chat | 1172 | ±5 | 38,491 | Meta | Llama 2 Community |
| 248 | 234◄─►257 | starling-lm-7b-alpha | 1169 | ±8 | 10,224 | UC Berkeley | CC-BY-NC-4.0 |
| 249 | 236◄─►257 | llama-3.2-3b-instruct | 1168 | ±8 | 7,936 | Meta | Llama 3.2 |
| 250 | 234◄─►260 | nous-hermes-2-mixtral-8x7b-dpo | 1166 | ±12 | 3,776 | NousResearch | Apache-2.0 |
| 251 | 242◄─►267 | qwq-32b-preview | 1158 | ±12 | 3,233 | Alibaba | Apache 2.0 |
| 252 | 247◄─►264 | granite-3.0-2b-instruct | 1157 | ±8 | 6,837 | IBM | Apache 2.0 |
| 253 | 242◄─►267 | llama2-70b-steerlm-chat | 1157 | ±13 | 3,584 | Nvidia | Llama 2 Community |
| 254 | 244◄─►270 | solar-10.7b-instruct-v1.0 | 1154 | ±13 | 4,155 | Upstage AI | CC-BY-NC-4.0 |
| 255 | 243◄─►272 | dolphin-2.2.1-mistral-7b | 1153 | ±15 | 1,679 | Cognitive Computations | Apache-2.0 |
| 256 | 248◄─►270 | mpt-30b-chat | 1151 | ±12 | 2,571 | MosaicML | CC-BY-NC-SA-4.0 |
| 257 | 250◄─►267 | mistral-7b-instruct-v0.2 | 1151 | ±7 | 19,402 | Mistral | Apache-2.0 |
| 258 | 250◄─►269 | wizardlm-13b | 1150 | ±9 | 7,046 | Microsoft | Llama 2 Community |
| 259 | 247◄─►274 | falcon-180b-chat | 1148 | ±17 | 1,295 | TII | Falcon-180B TII License |
| 260 | 250◄─►273 | qwen1.5-7b-chat | 1145 | ±10 | 4,735 | Alibaba | Qianwen LICENSE |
| 261 | 251◄─►272 | phi-3-mini-4k-instruct-june-2024 | 1144 | ±6 | 12,296 | Microsoft | MIT |
| 262 | 251◄─►273 | llama-2-13b-chat | 1142 | ±7 | 19,171 | Meta | Llama 2 Community |
| 263 | 251◄─►273 | vicuna-13b | 1142 | ±7 | 19,366 | LMSYS | Llama 2 Community |
| 264 | 251◄─►275 | qwen-14b-chat | 1140 | ±11 | 4,964 | Alibaba | Qianwen LICENSE |
| 265 | 252◄─►275 | palm-2 | 1139 | ±9 | 8,554 | Google | Proprietary |
| 266 | 252◄─►275 | codellama-34b-instruct | 1138 | ±9 | 7,363 | Meta | Llama 2 Community |
| 267 | 252◄─►275 | gemma-7b-it | 1137 | ±10 | 8,925 | Google | Gemma license |
| 268 | 255◄─►276 | zephyr-7b-beta | 1132 | ±9 | 11,116 | HuggingFace | MIT |
| 269 | 258◄─►276 | phi-3-mini-128k-instruct | 1130 | ±7 | 20,691 | Microsoft | MIT |
| 270 | 260◄─►276 | phi-3-mini-4k-instruct | 1129 | ±6 | 20,115 | Microsoft | MIT |
| 271 | 256◄─►279 | guanaco-33b | 1129 | ±12 | 2,921 | UW | Non-commercial |
| 272 | 255◄─►280 | zephyr-7b-alpha | 1128 | ±16 | 1,785 | HuggingFace | MIT |
| 273 | 263◄─►280 | stripedhyena-nous-7b | 1122 | ±11 | 5,184 | Together AI | Apache 2.0 |
| 274 | 258◄─►281 | codellama-70b-instruct | 1120 | ±18 | 1,143 | Meta | Llama 2 Community |
| 275 | 268◄─►280 | vicuna-7b | 1116 | ±9 | 6,923 | LMSYS | Llama 2 Community |
| 276 | 264◄─►281 | smollm2-1.7b-instruct | 1115 | ±14 | 2,201 | HuggingFace | Apache 2.0 |
| 277 | 271◄─►280 | gemma-1.1-2b-it | 1115 | ±8 | 10,853 | Google | Gemma license |
| 278 | 271◄─►280 | llama-3.2-1b-instruct | 1112 | ±8 | 8,045 | Meta | Llama 3.2 |
| 279 | 271◄─►281 | mistral-7b-instruct | 1111 | ±9 | 8,977 | Mistral | Apache 2.0 |
| 280 | 272◄─►281 | llama-2-7b-chat | 1109 | ±7 | 14,148 | Meta | Llama 2 Community |
| 281 | 277◄─►285 | gemma-2b-it | 1093 | ±12 | 4,779 | Google | Gemma license |
| 282 | 281◄─►284 | qwen1.5-4b-chat | 1092 | ±9 | 7,598 | Alibaba | Qianwen LICENSE |
| 283 | 281◄─►288 | olmo-7b-instruct | 1076 | ±11 | 6,329 | Allen AI | Apache-2.0 |
| 284 | 282◄─►288 | koala-13b | 1072 | ±10 | 6,964 | UC Berkeley | Non-commercial |
| 285 | 283◄─►288 | alpaca-13b | 1069 | ±12 | 5,745 | Stanford | Non-commercial |
| 286 | 281◄─►289 | gpt4all-13b-snoozy | 1067 | ±15 | 1,743 | Nomic AI | Non-commercial |
| 287 | 283◄─►289 | mpt-7b-chat | 1063 | ±12 | 3,925 | MosaicML | CC-BY-NC-SA-4.0 |
| 288 | 283◄─►289 | chatglm3-6b | 1057 | ±12 | 4,658 | Tsinghua | Apache-2.0 |
| 289 | 286◄─►291 | RWKV-4-Raven-14B | 1043 | ±11 | 4,845 | RWKV | Apache 2.0 |
| 290 | 289◄─►291 | chatglm2-6b | 1026 | ±14 | 2,657 | Tsinghua | Apache-2.0 |
| 291 | 289◄─►291 | oasst-pythia-12b | 1024 | ±11 | 6,311 | OpenAssistant | Apache 2.0 |
| 292 | 292◄─►295 | chatglm-6b | 997 | ±13 | 4,914 | Tsinghua | Non-commercial |
| 293 | 292◄─►295 | fastchat-t5-3b | 993 | ±12 | 4,203 | LMSYS | Apache 2.0 |
| 294 | 292◄─►295 | dolly-v2-12b | 982 | ±14 | 3,412 | Databricks | MIT |
| 295 | 292◄─►296 | llama-13b | 974 | ±16 | 2,391 | Meta | Non-commercial |
| 296 | 295◄─►296 | stablelm-tuned-alpha-7b | 954 | ±13 | 3,287 | Stability AI | CC-BY-NC-SA-4.0 |
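Notes
Rank (UB): The model's rank, derived from scores fitted with a Bradley-Terry model. It reflects the model's overall performance in the Arena. UB stands for upper bound: the rank is an upper-bound estimate that takes the uncertainty of the score into account, which helps gauge the model's potential competitiveness.
Model: The name of the large language model (LLM). Some model names may carry embedded links.
Score: The Elo rating the model earned from user votes in the Arena. Elo is a relative rating system; a higher score indicates better performance.
95% CI (±): The 95% confidence interval of the model's Elo score (for example, ±6). The narrower the interval, the more stable and reliable the score.
Votes: The total number of votes the model has received in the Arena. More votes generally mean higher statistical reliability of the score.
Organization: The organization or company that provides the model.
License: The type of license the model is released under, for example Proprietary, Apache 2.0, or MIT.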
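As a rough illustration of how the score and the 95% interval can be read together, the Python sketch below computes an expected head-to-head preference rate from two scores and an upper-bound rank from the intervals. It assumes the scores sit on the conventional Elo scale (base 10, 400-point divisor) and that a model counts as statistically better only when its whole interval lies above the other model's interval. Both are assumptions, not something stated on this page, and `Entry`, `expected_win_probability`, and `rank_upper_bound` are hypothetical names chosen for the example. The four sample rows are copied from the leaderboard data.

```python
from typing import NamedTuple


class Entry(NamedTuple):
    model: str
    score: float  # Arena score, assumed to be on the Elo scale
    ci: float     # half-width of the 95% confidence interval (the ± value)


def expected_win_probability(score_a: float, score_b: float) -> float:
    """Expected rate at which A is preferred over B (standard Elo formula, assumed scale)."""
    return 1.0 / (1.0 + 10.0 ** ((score_b - score_a) / 400.0))


def rank_upper_bound(target: Entry, entries: list[Entry]) -> int:
    """1 + the number of models whose interval lies entirely above the target's interval."""
    upper = target.score + target.ci
    better = sum(1 for e in entries if e.score - e.ci > upper)
    return 1 + better


# Sample rows taken from the leaderboard data.
top = [
    Entry("gemini-3-pro", 1489, 5),
    Entry("grok-4.1-thinking", 1477, 5),
    Entry("gemini-3-flash", 1471, 6),
    Entry("grok-4.1", 1466, 5),
]

if __name__ == "__main__":
    p = expected_win_probability(top[0].score, top[1].score)
    print(f"{top[0].model} vs {top[1].model}: {p:.3f}")
    for entry in top:
        print(entry.model, rank_upper_bound(entry, top))
```

Under these assumptions, the 12-point gap between gemini-3-pro and grok-4.1-thinking corresponds to an expected preference rate of only about 52%, which is why the interval matters as much as the score itself. For these four rows the computed upper-bound ranks are 1, 2, 2, and 3, which coincide with the first number of the range listed next to each model's rank.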
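Data Source and Update Frequency
The data for this leaderboard is fetched directly from the official Chatbot Arena (lmarena.ai) website by an automated script. The leaderboard is updated automatically every day by GitHub Actions.
Disclaimer
This report is for reference only. Leaderboard data changes over time and is based on user preference votes cast on Chatbot Arena during a specific period. The completeness and accuracy of the data depend on the upstream data source. Different models may be released under different licenses; always consult the model provider's official documentation before use.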