模型榜单
这是一个基于 Chatbot Arena (lmarena.ai) 数据的排行榜,通过自动化流程生成。
数据更新时间: 2026-01-12 08:09:21 UTC / 2026-01-12 16:09:21 CST (北京时间)
排行榜
1
1◄─►1
gemini-3-pro
1490
±5
25,178
Proprietary
2
2◄─►5
grok-4.1-thinking
1477
±5
25,440
xAI
Proprietary
3
2◄─►7
gemini-3-flash
1471
±7
10,204
Proprietary
4
2◄─►7
Anthropicclaude-opus-4-5-20251101-thinking-32k
1469
±6
17,570
Anthropic
Proprietary
5
3◄─►8
grok-4.1
1466
±5
29,539
xAI
Proprietary
6
3◄─►8
Anthropicclaude-opus-4-5-20251101
1465
±6
18,754
Anthropic
Proprietary
7
2◄─►8
gemini-3-flash (thinking-minimal)
1464
±9
5,624
Proprietary
8
5◄─►11
gpt-5.1-high
1457
±5
22,116
OpenAI
Proprietary
9
8◄─►15
gemini-2.5-pro
1450
±3
84,916
Proprietary
10
8◄─►16
Anthropicclaude-sonnet-4-5-20250929-thinking-32k
1450
±4
36,318
Anthropic
Proprietary
11
9◄─►17
Anthropicclaude-opus-4-1-20250805-thinking-16k
1448
±4
50,339
Anthropic
Proprietary
12
9◄─►17
Anthropicclaude-sonnet-4-5-20250929
1448
±4
32,372
Anthropic
Proprietary
13
8◄─►21
ernie-5.0-preview-1203
1446
±8
7,250
Baidu
Proprietary
14
9◄─►21
gpt-4.5-preview-2025-02-27
1443
±6
14,646
OpenAI
Proprietary
15
10◄─►20
Anthropicclaude-opus-4-1-20250805
1443
±3
65,088
Anthropic
Proprietary
16
9◄─►24
glm-4.7
1443Preliminary
±8
7,157
Z.ai
MIT
17
11◄─►20
chatgpt-4o-latest-20250326
1442
±3
72,793
OpenAI
Proprietary
18
13◄─►26
gpt-5-high
1436
±5
32,723
OpenAI
Proprietary
19
13◄─►27
gpt-5.1
1435
±5
23,716
OpenAI
Proprietary
20
13◄─►33
gpt-5.2-high
1434
±7
9,657
OpenAI
Proprietary
21
15◄─►29
qwen3-max-preview
1434
±5
28,055
Alibaba
Proprietary
22
17◄─►28
o3-2025-04-16
1433
±4
61,321
OpenAI
Proprietary
23
17◄─►33
grok-4-1-fast-reasoning
1431
±5
18,610
xAI
Proprietary
24
18◄─►37
MoonshotAIkimi-k2-thinking-turbo
1430
±5
23,702
Moonshot
Modified MIT
25
17◄─►42
ernie-5.0-preview-1103
1429
±7
9,049
Baidu
Proprietary
26
18◄─►44
gpt-5.2
1425
±6
10,415
OpenAI
Proprietary
27
21◄─►44
gpt-5-chat
1425
±4
32,024
OpenAI
Proprietary
28
19◄─►45
qwen3-max-2025-09-23
1424
±6
9,211
Alibaba
Proprietary
29
22◄─►44
glm-4.6
1424
±4
31,660
Z.ai
MIT
30
20◄─►45
deepseek-v3.2-exp
1423
±6
11,910
DeepSeek AI
MIT
31
22◄─►44
Anthropicclaude-opus-4-20250514-thinking-16k
1423
±4
37,671
Anthropic
Proprietary
32
24◄─►44
qwen3-235b-a22b-instruct-2507
1422
±3
59,848
Alibaba
Apache 2.0
33
22◄─►48
deepseek-v3.2-exp-thinking
1422
±7
9,179
DeepSeek AI
MIT
34
22◄─►51
grok-4-fast-chat
1420
±8
7,013
xAI
Proprietary
35
24◄─►50
deepseek-v3.2
1420
±6
18,091
DeepSeek AI
MIT
36
24◄─►50
deepseek-v3.2-thinking
1419
±6
13,621
DeepSeek AI
MIT
37
25◄─►50
deepseek-r1-0528
1419
±6
19,149
DeepSeek
MIT
38
25◄─►51
MoonshotAIkimi-k2-0905-preview
1418
±6
12,061
Moonshot
Modified MIT
39
25◄─►52
deepseek-v3.1
1417
±6
15,189
DeepSeek
MIT
40
26◄─►51
MoonshotAIkimi-k2-0711-preview
1416
±5
28,491
Moonshot
Modified MIT
41
25◄─►52
deepseek-v3.1-thinking
1416
±7
11,932
DeepSeek
MIT
42
24◄─►57
deepseek-v3.1-terminus
1415
±10
3,731
DeepSeek AI
MIT
43
26◄─►53
qwen3-vl-235b-a22b-instruct
1414
±7
11,676
Alibaba
Apache 2.0
44
25◄─►61
deepseek-v3.1-terminus-thinking
1414
±10
3,506
DeepSeek AI
MIT
45
31◄─►54
mistral-large-3
1413
±6
14,472
Mistral
Apache 2.0
46
33◄─►53
gpt-4.1-2025-04-14
1412
±4
52,315
OpenAI
Proprietary
47
33◄─►53
Anthropicclaude-opus-4-20250514
1412
±4
45,407
Anthropic
Proprietary
48
33◄─►53
mistral-medium-2508
1412
±4
53,609
Mistral
Proprietary
49
34◄─►56
grok-3-preview-02-24
1411
±4
34,031
xAI
Proprietary
50
36◄─►59
grok-4-0709
1409
±4
42,343
xAI
Proprietary
51
34◄─►61
glm-4.5
1409
±5
24,694
Z.ai
MIT
52
40◄─►59
gemini-2.5-flash
1408
±3
84,420
Proprietary
53
42◄─►66
gemini-2.5-flash-preview-09-2025
1405
±4
33,365
Proprietary
54
47◄─►66
Anthropicclaude-haiku-4-5-20251001
1403
±4
34,960
Anthropic
Proprietary
55
46◄─►66
grok-4-fast-reasoning
1402
±5
18,781
xAI
Proprietary
56
49◄─►68
o1-2024-12-17
1401
±4
28,039
OpenAI
Proprietary
57
49◄─►69
qwen3-next-80b-a3b-instruct
1401
±5
22,974
Alibaba
Apache 2.0
58
47◄─►70
longcat-flash-chat
1400
±6
11,455
Meituan
MIT
59
51◄─►69
qwen3-235b-a22b-no-thinking
1400
±4
39,205
Alibaba
Apache 2.0
60
53◄─►70
Anthropicclaude-sonnet-4-20250514-thinking-32k
1399
±4
36,020
Anthropic
Proprietary
61
53◄─►75
qwen3-235b-a22b-thinking-2507
1398
±6
9,299
Alibaba
Apache 2.0
62
51◄─►76
mimo-v2-flash (non-thinking)
1397
±8
6,623
Xiaomi
MIT
63
53◄─►75
deepseek-r1
1396
±5
18,722
DeepSeek
MIT
64
48◄─►79
amazon-nova-experimental-chat-12-10
1396
±10
3,217
Amazon
Proprietary
65
53◄─►77
qwen3-vl-235b-a22b-thinking
1394
±7
7,951
Alibaba
Apache 2.0
66
56◄─►76
deepseek-v3-0324
1393
±4
46,580
DeepSeek
MIT
67
56◄─►77
gpt-5-mini-high
1392
±5
27,302
OpenAI
Proprietary
68
53◄─►82
Tencenthunyuan-vision-1.5-thinking
1392
±12
2,202
Tencent
Proprietary
69
59◄─►79
o4-mini-2025-04-16
1391
±4
46,663
OpenAI
Proprietary
70
57◄─►79
mai-1-preview
1391
±5
18,092
Microsoft AI
Proprietary
71
61◄─►80
Anthropicclaude-sonnet-4-20250514
1389
±4
41,446
Anthropic
Proprietary
72
61◄─►80
o1-preview
1388
±5
31,505
OpenAI
Proprietary
73
63◄─►80
Anthropicclaude-3-7-sonnet-20250219-thinking-32k
1387
±4
39,795
Anthropic
Proprietary
74
61◄─►81
qwen3-coder-480b-a35b-instruct
1387
±5
26,656
Alibaba
Apache 2.0
75
61◄─►84
Tencenthunyuan-t1-20250711
1385
±9
4,800
Tencent
Proprietary
76
61◄─►84
Minimaxminimax-m2.1-preview
1384Preliminary
±8
6,148
MiniMax
MIT
77
65◄─►83
mistral-medium-2505
1384
±5
34,374
Mistral
Proprietary
78
67◄─►83
qwen3-30b-a3b-instruct-2507
1382
±5
24,077
Alibaba
Apache 2.0
79
70◄─►84
gpt-4.1-mini-2025-04-14
1381
±4
40,335
OpenAI
Proprietary
80
67◄─►87
Tencenthunyuan-turbos-20250416
1381
±6
11,082
Tencent
Proprietary
81
73◄─►87
gemini-2.5-flash-lite-preview-09-2025-no-thinking
1379
±4
37,839
Proprietary
82
74◄─►89
gemini-2.5-flash-lite-preview-06-17-thinking
1375
±4
33,772
Proprietary
83
75◄─►89
qwen3-235b-a22b
1374
±5
27,047
Alibaba
Apache 2.0
84
77◄─►89
qwen2.5-max
1373
±4
33,487
Alibaba
Proprietary
85
80◄─►89
Anthropicclaude-3-5-sonnet-20241022
1372
±3
89,708
Anthropic
Proprietary
86
80◄─►93
Anthropicclaude-3-7-sonnet-20250219
1371
±4
44,455
Anthropic
Proprietary
87
80◄─►93
glm-4.5-air
1371
±4
31,472
Z.ai
MIT
88
82◄─►96
qwen3-next-80b-a3b-thinking
1368
±6
13,750
Alibaba
Apache 2.0
89
82◄─►95
Minimaxminimax-m1
1367
±4
36,679
MiniMax
Apache 2.0
90
86◄─►98
gemma-3-27b-it
1365
±4
49,137
Gemma
91
86◄─►102
o3-mini-high
1363
±5
18,725
OpenAI
Proprietary
92
86◄─►103
grok-3-mini-high
1362
±5
17,475
xAI
Proprietary
93
86◄─►108
amazon-nova-experimental-chat-11-10
1361
±7
7,391
Amazon
Proprietary
94
88◄─►104
gemini-2.0-flash-001
1360
±4
45,001
Proprietary
95
88◄─►112
deepseek-v3
1358
±5
21,995
DeepSeek
DeepSeek
96
90◄─►112
grok-3-mini-beta
1357
±5
23,670
xAI
Proprietary
97
91◄─►117
mistral-small-2506
1355
±5
18,225
Mistral
Apache 2.0
98
88◄─►118
intellect-3
1354
±8
5,424
Prime Intellect
MIT
99
90◄─►120
glm-4.5v
1353
±8
4,961
Z.ai
MIT
100
92◄─►118
gpt-oss-120b
1353
±4
31,079
OpenAI
Apache 2.0
101
93◄─►118
gemini-2.0-flash-lite-preview-02-05
1353
±4
25,222
Proprietary
102
94◄─►117
Coherecommand-a-03-2025
1352
±3
57,583
Cohere
CC-BY-NC-4.0
103
94◄─►118
gemini-1.5-pro-002
1351
±3
56,014
Proprietary
104
97◄─►120
o3-mini
1348
±3
58,663
OpenAI
Proprietary
105
95◄─►122
amazon-nova-experimental-chat-10-20
1347
±6
11,615
Amazon
Proprietary
106
91◄─►130
Tencenthunyuan-turbos-20250226
1347
±12
2,254
Tencent
Proprietary
107
91◄─►130
amazon-nova-experimental-chat-10-09
1347
±11
2,876
Amazon
Proprietary
108
95◄─►123
ling-flash-2.0
1347
±7
7,111
Ant Group
MIT
109
94◄─►127
Minimaxminimax-m2
1346
±8
7,073
MiniMax
Apache 2.0
110
95◄─►126
Stepfunstep-3
1346
±7
6,600
StepFun
Apache 2.0
111
91◄─►131
Nvidiallama-3.1-nemotron-ultra-253b-v1
1346
±12
2,573
Nvidia
Nvidia Open Model
112
99◄─►121
gpt-4o-2024-05-13
1345
±3
113,568
OpenAI
Proprietary
113
94◄─►130
qwen3-32b
1345
±9
3,944
Alibaba
Apache 2.0
114
95◄─►130
qwen-plus-0125
1345
±8
5,862
Alibaba
Proprietary
115
97◄─►131
glm-4-plus-0111
1343
±8
5,808
Zhipu
Proprietary
116
102◄─►124
Anthropicclaude-3-5-sonnet-20240620
1342
±3
82,864
Anthropic
Proprietary
117
97◄─►135
Nvidianvidia-llama-3.3-nemotron-super-49b-v1.5
1341
±10
3,465
Nvidia
Nvidia Open
118
97◄─►133
gemma-3-12b-it
1341
±9
3,866
Gemma
119
97◄─►137
Tencenthunyuan-turbo-0110
1339
±11
2,324
Tencent
Proprietary
120
103◄─►133
gpt-5-nano-high
1338
±7
8,348
OpenAI
Proprietary
121
105◄─►135
nova-2-lite
1337
±6
12,362
Amazon
Proprietary
122
107◄─►132
o1-mini
1336
±4
52,303
OpenAI
Proprietary
123
109◄─►132
Metallama-3.1-405b-instruct-bf16
1335
±4
41,948
Meta
Llama 3.1 Community
124
110◄─►135
gpt-4o-2024-08-06
1335
±4
45,787
OpenAI
Proprietary
125
109◄─►135
qwq-32b
1335
±4
26,177
Alibaba
Apache 2.0
126
111◄─►135
grok-2-2024-08-13
1334
±4
63,725
xAI
Proprietary
127
108◄─►135
gemini-advanced-0514
1334
±5
50,652
Proprietary
128
111◄─►135
Metallama-3.1-405b-instruct-fp8
1333
±3
60,273
Meta
Llama 3.1 Community
129
106◄─►146
Stepfunstep-2-16k-exp-202412
1333
±9
4,896
StepFun
Proprietary
130
117◄─►147
01.AIyi-lightning
1328
±5
27,624
01 AI
Proprietary
131
119◄─►148
Metallama-4-maverick-17b-128e-instruct
1327
±4
41,039
Meta
Llama 4
132
121◄─►150
qwen3-30b-a3b
1326
±5
27,383
Alibaba
Apache 2.0
133
111◄─►158
Nvidiallama-3.3-nemotron-49b-super-v1
1326
±12
2,246
Nvidia
Nvidia
134
115◄─►157
Tencenthunyuan-large-2025-02-10
1325
±10
3,760
Tencent
Proprietary
135
128◄─►153
gpt-4-turbo-2024-04-09
1324
±4
98,965
OpenAI
Proprietary
136
129◄─►153
Anthropicclaude-3-5-haiku-20241022
1323
±3
71,210
Anthropic
Proprietary
137
121◄─►157
deepseek-v2.5-1210
1323
±8
6,877
DeepSeek
DeepSeek
138
129◄─►154
Metallama-4-scout-17b-16e-instruct
1322
±5
31,044
Meta
Llama
139
129◄─►153
gemini-1.5-pro-001
1322
±4
79,769
Proprietary
140
129◄─►153
Anthropicclaude-3-opus-20240229
1322
±3
196,365
Anthropic
Proprietary
141
128◄─►158
gpt-4.1-nano-2025-04-14
1321
±8
6,146
OpenAI
Proprietary
142
129◄─►158
Stepfunstep-1o-turbo-202506
1320
±7
9,632
StepFun
Proprietary
143
129◄─►159
ring-flash-2.0
1320
±7
7,243
Ant Group
MIT
144
132◄─►157
Metallama-3.3-70b-instruct
1319
±3
55,942
Meta
Llama-3.3
145
131◄─►158
glm-4-plus
1319
±5
26,343
Zhipu AI
Proprietary
146
130◄─►158
gemma-3n-e4b-it
1319
±5
23,401
Gemma
147
129◄─►159
gpt-oss-20b
1318
±6
10,820
OpenAI
Apache 2.0
148
132◄─►159
qwen-max-0919
1317
±6
16,599
Alibaba
Qwen
149
129◄─►164
Nvidianvidia-nemotron-3-nano-30b-a3b-bf16
1317
±7
9,342
Nvidia
NVIDIA Open Model
150
133◄─►158
gpt-4o-mini-2024-07-18
1317
±3
69,291
OpenAI
Proprietary
151
133◄─►165
qwen2.5-plus-1127
1314
±6
10,254
Alibaba
Proprietary
152
138◄─►164
mistral-large-2407
1314
±4
45,968
Mistral
Mistral Research
153
137◄─►165
athene-v2-chat
1313
±4
24,880
NexusFlow
NexusFlow
154
138◄─►164
gpt-4-0125-preview
1313
±4
94,534
OpenAI
Proprietary
155
138◄─►164
gpt-4-1106-preview
1313
±4
101,116
OpenAI
Proprietary
156
129◄─►169
mercury
1312
±14
1,960
Inception AI
Proprietary
157
133◄─►169
Tencenthunyuan-standard-2025-02-10
1311
±10
3,923
Tencent
Proprietary
158
141◄─►167
gemini-1.5-flash-002
1310
±4
35,180
Proprietary
159
150◄─►168
grok-2-mini-2024-08-13
1307
±4
52,792
xAI
Proprietary
160
150◄─►169
deepseek-v2.5
1306
±5
24,839
DeepSeek
DeepSeek
161
150◄─►169
athene-70b-0725
1305
±6
19,795
NexusFlow
CC-BY-NC-4.0
162
154◄─►169
mistral-large-2411
1305
±4
28,454
Mistral
MRL
163
147◄─►170
olmo-3-32b-think
1305
±8
5,977
Allen AI
Apache 2.0
164
150◄─►169
magistral-medium-2506
1305
±6
11,952
Mistral
Proprietary
165
156◄─►169
mistral-small-3.1-24b-instruct-2503
1304
±4
33,996
Mistral
Apache 2.0
166
150◄─►175
gemma-3-4b-it
1303
±9
4,196
Gemma
167
156◄─►169
qwen2.5-72b-instruct
1302
±4
39,634
Alibaba
Qwen
168
157◄─►178
Nvidiallama-3.1-nemotron-70b-instruct
1298
±8
7,216
Nvidia
Llama 3.1
169
158◄─►180
Tencenthunyuan-large-vision
1295
±9
5,573
Tencent
Proprietary
170
166◄─►178
Metallama-3.1-70b-instruct
1293
±4
56,002
Meta
Llama 3.1 Community
171
167◄─►183
jamba-1.5-large
1289
±7
8,730
AI21 Labs
Jamba Open
172
168◄─►181
amazon-nova-pro-v1.0
1289
±4
25,218
Amazon
Proprietary
173
167◄─►188
ibm-granite-h-small
1288
±8
5,681
IBM
Apache 2.0
174
168◄─►181
gemma-2-27b-it
1288
±3
76,196
Gemma license
175
167◄─►183
reka-core-20240904
1287
±7
7,380
Reka AI
Proprietary
176
168◄─►183
gpt-4-0314
1287
±5
54,754
OpenAI
Proprietary
177
167◄─►189
Nvidiallama-3.1-nemotron-51b-instruct
1286
±10
3,777
Nvidia
Llama 3.1
178
167◄─►189
llama-3.1-tulu-3-70b
1286
±10
2,882
Ai2
Llama 3.1
179
170◄─►183
gemini-1.5-flash-001
1285
±4
63,418
Proprietary
180
171◄─►189
Anthropicclaude-3-sonnet-20240229
1281
±4
110,173
Anthropic
Proprietary
181
170◄─►189
gemma-2-9b-it-simpo
1279
±7
10,108
Princeton
MIT
182
173◄─►189
Nvidianemotron-4-340b-instruct
1277
±5
19,913
Nvidia
NVIDIA Open Model
183
173◄─►190
Coherecommand-r-plus-08-2024
1277
±7
9,932
Cohere
CC-BY-NC-4.0
184
177◄─►189
Metallama-3-70b-instruct
1276
±4
158,907
Meta
Llama 3 Community
185
177◄─►190
gpt-4-0613
1275
±4
89,612
OpenAI
Proprietary
186
177◄─►192
mistral-small-24b-instruct-2501
1274
±6
14,831
Mistral
Apache 2.0
187
177◄─►194
glm-4-0520
1273
±7
9,857
Zhipu AI
Proprietary
188
177◄─►194
reka-flash-20240904
1273
±7
7,585
Reka AI
Proprietary
189
178◄─►198
qwen2.5-coder-32b-instruct
1270
±8
5,452
Alibaba
Apache 2.0
190
184◄─►198
Coherec4ai-aya-expanse-32b
1267
±5
27,362
Cohere
CC-BY-NC-4.0
191
186◄─►198
gemma-2-9b-it
1265
±4
54,957
Gemma license
192
186◄─►200
deepseek-coder-v2
1264
±6
15,242
DeepSeek AI
DeepSeek License
193
187◄─►199
Coherecommand-r-plus
1263
±4
78,400
Cohere
CC-BY-NC-4.0
194
187◄─►200
qwen2-72b-instruct
1262
±5
37,688
Alibaba
Qianwen LICENSE
195
189◄─►200
Anthropicclaude-3-haiku-20240307
1261
±4
118,626
Anthropic
Proprietary
196
189◄─►200
amazon-nova-lite-v1.0
1260
±5
19,765
Amazon
Proprietary
197
189◄─►200
gemini-1.5-flash-8b-001
1259
±4
35,914
Proprietary
198
192◄─►200
Azurephi-4
1256
±5
24,355
Microsoft
MIT
199
189◄─►206
olmo-2-0325-32b-instruct
1252
±11
3,379
Allen AI
Apache-2.0
200
193◄─►205
Coherecommand-r-08-2024
1251
±7
10,229
Cohere
CC-BY-NC-4.0
201
199◄─►209
mistral-large-2402
1242
±5
63,404
Mistral
Proprietary
202
199◄─►209
amazon-nova-micro-v1.0
1241
±5
19,776
Amazon
Proprietary
203
199◄─►214
jamba-1.5-mini
1239
±7
8,918
AI21 Labs
Jamba Open
204
199◄─►217
ministral-8b-2410
1237
±9
4,833
Mistral
MRL
205
200◄─►217
gemini-pro-dev-api
1235
±7
18,454
Proprietary
206
201◄─►216
qwen1.5-110b-chat
1234
±5
26,679
Alibaba
Qianwen LICENSE
207
201◄─►217
qwen1.5-72b-chat
1234
±5
39,689
Alibaba
Qianwen LICENSE
208
199◄─►219
Tencenthunyuan-standard-256k
1234
±12
2,761
Tencent
Proprietary
209
201◄─►218
reka-flash-21b-20240226-online
1233
±7
15,606
Reka AI
Proprietary
210
203◄─►218
mixtral-8x22b-instruct-v0.1
1230
±4
52,214
Mistral
Apache 2.0
211
203◄─►219
Coherecommand-r
1228
±5
54,710
Cohere
CC-BY-NC-4.0
212
203◄─►219
reka-flash-21b-20240226
1227
±6
25,026
Reka AI
Proprietary
213
204◄─►220
gpt-3.5-turbo-0125
1224
±5
67,214
OpenAI
Proprietary
214
204◄─►221
Coherec4ai-aya-expanse-8b
1224
±7
9,922
Cohere
CC-BY-NC-4.0
215
205◄─►221
mistral-medium
1223
±5
34,893
Mistral
Proprietary
216
208◄─►220
Metallama-3-8b-instruct
1223
±4
106,055
Meta
Llama 3 Community
217
203◄─►224
gemini-pro
1222
±12
6,418
Proprietary
218
203◄─►223
llama-3.1-tulu-3-8b
1222
±11
2,943
Ai2
Llama 3.1
219
210◄─►225
HuggingFacezephyr-orpo-141b-A35b-v0.1
1214
±11
4,712
HuggingFace
Apache 2.0
220
215◄─►224
01.AIyi-1.5-34b-chat
1213
±5
24,417
01 AI
Apache-2.0
221
217◄─►224
Metallama-3.1-8b-instruct
1211
±4
50,235
Meta
Llama 3.1 Community
222
213◄─►230
granite-3.1-8b-instruct
1210
±11
3,142
IBM
Apache 2.0
223
217◄─►229
qwen1.5-32b-chat
1205
±6
22,068
Alibaba
Qianwen LICENSE
224
218◄─►232
gpt-3.5-turbo-1106
1202
±9
16,760
OpenAI
Proprietary
225
222◄─►232
gemma-2-2b-it
1199
±4
46,900
Gemma license
226
221◄─►232
Azurephi-3-medium-4k-instruct
1198
±5
25,301
Microsoft
MIT
227
222◄─►232
mixtral-8x7b-instruct-v0.1
1198
±4
74,303
Mistral
Apache 2.0
228
222◄─►237
dbrx-instruct-preview
1196
±6
32,760
Databricks
DBRX LICENSE
229
222◄─►241
qwen1.5-14b-chat
1193
±7
18,066
Alibaba
Qianwen LICENSE
230
223◄─►241
InternLMinternlm2_5-20b-chat
1192
±7
10,038
InternLM
Other
231
224◄─►247
Azurewizardlm-70b
1185
±9
8,270
Microsoft
Llama 2 Community
232
224◄─►248
deepseek-llm-67b-chat
1184
±12
4,950
DeepSeek AI
DeepSeek License
233
228◄─►245
01.AIyi-34b-chat
1184
±7
15,624
01 AI
Yi License
234
228◄─►247
granite-3.0-8b-instruct
1184
±9
6,727
IBM
Apache 2.0
235
228◄─►247
OpenChatopenchat-3.5-0106
1183
±8
12,712
OpenChat
Apache-2.0
236
228◄─►248
OpenChatopenchat-3.5
1182
±10
8,009
OpenChat
Apache-2.0
237
228◄─►249
granite-3.1-2b-instruct
1181
±11
3,236
IBM
Apache 2.0
238
229◄─►248
gemma-1.1-7b-it
1180
±6
24,327
Gemma license
239
229◄─►247
Snowflakesnowflake-arctic-instruct
1180
±6
33,272
Snowflake
Apache 2.0
240
229◄─►250
tulu-2-dpo-70b
1179
±10
6,579
AllenAI/UW
AI2 ImpACT Low-risk
241
229◄─►252
openhermes-2.5-mistral-7b
1176
±10
5,026
NousResearch
Apache-2.0
242
231◄─►251
vicuna-33b
1174
±6
22,613
LMSYS
Non-commercial
243
231◄─►252
starling-lm-7b-beta
1173
±7
16,190
Nexusflow
Apache-2.0
244
231◄─►252
Azurephi-3-small-8k-instruct
1172
±6
17,983
Microsoft
MIT
245
232◄─►252
Metallama-2-70b-chat
1172
±5
38,767
Meta
Llama 2 Community
246
232◄─►255
starling-lm-7b-alpha
1168
±8
10,267
UC Berkeley
CC-BY-NC-4.0
247
236◄─►256
Metallama-3.2-3b-instruct
1167
±8
8,043
Meta
Llama 3.2
248
231◄─►257
nous-hermes-2-mixtral-8x7b-dpo
1166
±12
3,792
NousResearch
Apache-2.0
249
239◄─►262
qwq-32b-preview
1159
±11
3,256
Alibaba
Apache 2.0
250
240◄─►266
Nvidiallama2-70b-steerlm-chat
1157
±13
3,605
Nvidia
Llama 2 Community
251
246◄─►262
granite-3.0-2b-instruct
1157
±8
6,922
IBM
Apache 2.0
252
242◄─►266
solar-10.7b-instruct-v1.0
1154
±13
4,187
Upstage AI
CC-BY-NC-4.0
253
241◄─►270
dolphin-2.2.1-mistral-7b
1152
±15
1,685
Cognitive Computations
Apache-2.0
254
246◄─►268
mpt-30b-chat
1151
±12
2,606
MosaicML
CC-BY-NC-SA-4.0
255
248◄─►266
mistral-7b-instruct-v0.2
1151
±7
19,603
Mistral
Apache-2.0
256
247◄─►266
Azurewizardlm-13b
1150
±9
7,122
Microsoft
Llama 2 Community
257
246◄─►273
falcon-180b-chat
1148
±17
1,312
TII
Falcon-180B TII License
258
249◄─►271
qwen1.5-7b-chat
1144
±10
4,782
Alibaba
Qianwen LICENSE
259
249◄─►270
Azurephi-3-mini-4k-instruct-june-2024
1143
±6
12,414
Microsoft
MIT
260
249◄─►270
Metallama-2-13b-chat
1143
±7
19,357
Meta
Llama 2 Community
261
249◄─►271
vicuna-13b
1142
±7
19,539
LMSYS
Llama 2 Community
262
249◄─►273
qwen-14b-chat
1139
±11
5,004
Alibaba
Qianwen LICENSE
263
251◄─►273
palm-2
1137
±9
8,634
Proprietary
264
251◄─►273
Metacodellama-34b-instruct
1137
±9
7,417
Meta
Llama 2 Community
265
251◄─►273
gemma-7b-it
1136
±9
9,034
Gemma license
266
255◄─►274
HuggingFacezephyr-7b-beta
1132
±9
11,220
HuggingFace
MIT
267
256◄─►274
Azurephi-3-mini-128k-instruct
1131
±7
21,024
Microsoft
MIT
268
259◄─►274
Azurephi-3-mini-4k-instruct
1129
±6
20,539
Microsoft
MIT
269
251◄─►278
HuggingFacezephyr-7b-alpha
1129
±16
1,803
HuggingFace
MIT
270
255◄─►277
guanaco-33b
1128
±12
2,955
UW
Non-commercial
271
261◄─►278
stripedhyena-nous-7b
1121
±11
5,214
Together AI
Apache 2.0
272
256◄─►279
Metacodellama-70b-instruct
1119
±18
1,151
Meta
Llama 2 Community
273
261◄─►278
HuggingFacesmollm2-1.7b-instruct
1119
±14
2,244
HuggingFace
Apache 2.0
274
266◄─►278
vicuna-7b
1116
±9
6,972
LMSYS
Llama 2 Community
275
269◄─►278
gemma-1.1-2b-it
1115
±8
11,035
Gemma license
276
269◄─►278
Metallama-3.2-1b-instruct
1113
±8
8,166
Meta
Llama 3.2
277
269◄─►279
mistral-7b-instruct
1111
±9
9,042
Mistral
Apache 2.0
278
270◄─►279
Metallama-2-7b-chat
1109
±7
14,272
Meta
Llama 2 Community
279
276◄─►283
gemma-2b-it
1092
±12
4,817
Gemma license
280
279◄─►281
qwen1.5-4b-chat
1091
±9
7,662
Alibaba
Qianwen LICENSE
281
279◄─►286
olmo-7b-instruct
1074
±11
6,412
Allen AI
Apache-2.0
282
280◄─►286
koala-13b
1071
±10
6,998
UC Berkeley
Non-commercial
283
281◄─►286
alpaca-13b
1067
±11
5,828
Stanford
Non-commercial
284
280◄─►287
gpt4all-13b-snoozy
1066
±15
1,773
Nomic AI
Non-commercial
285
281◄─►287
mpt-7b-chat
1062
±12
3,977
MosaicML
CC-BY-NC-SA-4.0
286
281◄─►287
chatglm3-6b
1057
±12
4,692
Tsinghua
Apache-2.0
287
284◄─►289
RWKVRWKV-4-Raven-14B
1042
±11
4,898
RWKV
Apache 2.0
288
287◄─►289
chatglm2-6b
1026
±14
2,683
Tsinghua
Apache-2.0
289
287◄─►289
oasst-pythia-12b
1023
±11
6,343
OpenAssistant
Apache 2.0
290
290◄─►293
chatglm-6b
996
±13
4,968
Tsinghua
Non-commercial
291
290◄─►293
fastchat-t5-3b
992
±12
4,270
LMSYS
Apache 2.0
292
290◄─►293
dolly-v2-12b
980
±14
3,471
Databricks
MIT
293
290◄─►294
Metallama-13b
972
±16
2,441
Meta
Non-commercial
294
293◄─►294
Stabilitystablelm-tuned-alpha-7b
953
±13
3,325
Stability AI
CC-BY-NC-SA-4.0
说明
排名 (UB):基于 Bradley-Terry 模型计算的排名。此排名反映了模型在竞技场中的综合表现,并提供了其 Elo 分数的 上界 估计,帮助理解模型的潜在竞争力。
模型:大型语言模型 (LLM) 的名称。部分模型名称可能已嵌入相关链接。
分数:模型在竞技场中通过用户投票获得的 Elo 评分。Elo 评分是一种相对排名系统,分数越高表示模型表现越好。
95% 置信区间 (±):模型 Elo 评分的95%置信区间(例如:
±6)。这个区间越小,表示模型的评分越稳定和可靠。票数:该模型在竞技场中收到的总投票数量。投票数越多,通常意味着其评分的统计可靠性越高。
组织/公司:提供该模型的组织或公司。
许可证:模型的许可协议类型,例如专有 (Proprietary)、Apache 2.0、MIT 等。
数据来源与更新频率
本排行榜数据由自动化脚本直接从 1 2 官方网站获取。此排行榜由 GitHub Actions 每天自动更新。
免责声明
本报告仅供参考。排行榜数据是动态变化的,并基于特定时间段内用户在 Chatbot Arena 上的偏好投票。数据的完整性和准确性取决于上游数据源。不同模型可能采用不同的许可协议,使用时请务必参考模型提供商的官方说明。
最后更新于
这有帮助吗?