模型榜单
这是一个基于 Chatbot Arena (lmarena.ai) 数据的排行榜,通过自动化流程生成。
数据更新时间: 2026-01-30 08:12:42 UTC / 2026-01-30 16:12:42 CST (北京时间)
排行榜
1
1◄─►1
gemini-3-pro
1487
±5
32,205
Proprietary
2
2◄─►5
grok-4.1-thinking
1475
±5
32,194
xAI
Proprietary
3
2◄─►7
gemini-3-flash
1471
±5
22,683
Proprietary
4
2◄─►8
Anthropicclaude-opus-4-5-20251101-thinking-32k
1468
±5
23,990
Anthropic
Proprietary
5
2◄─►8
Anthropicclaude-opus-4-5-20251101
1466
±5
28,752
Anthropic
Proprietary
6
3◄─►8
grok-4.1
1466
±4
36,363
xAI
Proprietary
7
3◄─►10
gemini-3-flash (thinking-minimal)
1463
±6
13,989
Proprietary
8
4◄─►12
gpt-5.1-high
1459
±5
28,296
OpenAI
Proprietary
9
7◄─►21
ernie-5.0-0110
1453Preliminary
±7
8,274
Baidu
Proprietary
10
8◄─►19
Anthropicclaude-sonnet-4-5-20250929-thinking-32k
1450
±4
42,538
Anthropic
Proprietary
11
9◄─►21
Anthropicclaude-sonnet-4-5-20250929
1450
±4
40,121
Anthropic
Proprietary
12
7◄─►21
MoonshotAIkimi-k2.5-thinking
1450
±9
4,901
Moonshot
Modified MIT
13
9◄─►19
gemini-2.5-pro
1450
±3
91,734
Proprietary
14
8◄─►21
ernie-5.0-preview-1203
1449
±7
9,786
Baidu
Proprietary
15
9◄─►21
Anthropicclaude-opus-4-1-20250805-thinking-16k
1448
±4
49,975
Anthropic
Proprietary
16
9◄─►21
Anthropicclaude-opus-4-1-20250805
1445
±3
71,681
Anthropic
Proprietary
17
9◄─►23
gpt-4.5-preview-2025-02-27
1444
±6
14,549
OpenAI
Proprietary
18
11◄─►22
chatgpt-4o-latest-20250326
1443
±3
78,991
OpenAI
Proprietary
19
9◄─►26
glm-4.7
1440
±6
12,027
Z.ai
MIT
20
9◄─►27
gpt-5.2
1440
±7
9,387
OpenAI
Proprietary
21
11◄─►27
gpt-5.2-high
1440
±7
12,869
OpenAI
Proprietary
22
17◄─►28
gpt-5.1
1435
±5
30,355
OpenAI
Proprietary
23
18◄─►29
gpt-5-high
1435
±5
32,651
OpenAI
Proprietary
24
19◄─►32
qwen3-max-preview
1434
±5
27,857
Alibaba
Proprietary
25
19◄─►32
o3-2025-04-16
1433
±4
61,365
OpenAI
Proprietary
26
19◄─►37
grok-4-1-fast-reasoning
1430
±5
25,264
xAI
Proprietary
27
20◄─►39
MoonshotAIkimi-k2-thinking-turbo
1429
±4
30,011
Moonshot
Modified MIT
28
23◄─►45
gpt-5-chat
1426
±4
31,849
OpenAI
Proprietary
29
22◄─►47
qwen3-max-2025-09-23
1425
±6
9,223
Alibaba
Proprietary
30
26◄─►46
glm-4.6
1425
±4
35,373
Z.ai
MIT
31
26◄─►46
Anthropicclaude-opus-4-20250514-thinking-16k
1424
±4
37,990
Anthropic
Proprietary
32
24◄─►49
deepseek-v3.2-exp
1423
±6
11,774
DeepSeek
MIT
33
24◄─►49
deepseek-v3.2-exp-thinking
1423
±7
9,000
DeepSeek
MIT
34
27◄─►47
qwen3-235b-a22b-instruct-2507
1422
±3
66,045
Alibaba
Apache 2.0
35
24◄─►51
grok-4-fast-chat
1422
±8
6,992
xAI
Proprietary
36
27◄─►51
deepseek-v3.2-thinking
1420
±5
19,713
DeepSeek
MIT
37
28◄─►51
deepseek-v3.2
1420
±5
24,571
DeepSeek
MIT
38
28◄─►53
deepseek-r1-0528
1418
±6
19,289
DeepSeek
MIT
39
28◄─►54
MoonshotAIkimi-k2-0905-preview
1418
±6
11,977
Moonshot
Modified MIT
40
26◄─►56
ernie-5.0-preview-1022
1418
±9
4,630
Baidu
Proprietary
41
28◄─►54
MoonshotAIkimi-k2-0711-preview
1417
±5
28,666
Moonshot
Modified MIT
42
28◄─►54
deepseek-v3.1
1417
±6
15,304
DeepSeek
MIT
43
28◄─►54
deepseek-v3.1-thinking
1417
±7
11,986
DeepSeek
MIT
44
26◄─►58
deepseek-v3.1-terminus
1416
±10
3,762
DeepSeek
MIT
45
26◄─►59
deepseek-v3.1-terminus-thinking
1416
±10
3,547
DeepSeek
MIT
46
29◄─►56
qwen3-vl-235b-a22b-instruct
1415
±6
11,684
Alibaba
Apache 2.0
47
31◄─►54
mistral-large-3
1414
±5
20,796
Mistral
Apache 2.0
48
33◄─►54
Anthropicclaude-opus-4-20250514
1414
±4
45,572
Anthropic
Proprietary
49
33◄─►54
gpt-4.1-2025-04-14
1413
±4
52,220
OpenAI
Proprietary
50
35◄─►56
mistral-medium-2508
1412
±3
59,880
Mistral
Proprietary
51
35◄─►58
grok-3-preview-02-24
1411
±4
33,980
xAI
Proprietary
52
38◄─►59
grok-4-0709
1410
±4
42,174
xAI
Proprietary
53
38◄─►62
glm-4.5
1410
±5
24,803
Z.ai
MIT
54
39◄─►58
gemini-2.5-flash
1409
±3
90,980
Proprietary
55
46◄─►67
gemini-2.5-flash-preview-09-2025
1405
±4
32,903
Proprietary
56
46◄─►68
grok-4-fast-reasoning
1404
±5
18,646
xAI
Proprietary
57
49◄─►68
Anthropicclaude-haiku-4-5-20251001
1403
±4
41,166
Anthropic
Proprietary
58
49◄─►68
o1-2024-12-17
1402
±4
27,822
OpenAI
Proprietary
59
54◄─►70
qwen3-235b-a22b-no-thinking
1401
±4
39,554
Alibaba
Apache 2.0
60
52◄─►70
qwen3-next-80b-a3b-instruct
1401
±5
22,874
Alibaba
Apache 2.0
61
55◄─►70
Anthropicclaude-sonnet-4-20250514-thinking-32k
1400
±4
36,212
Anthropic
Proprietary
62
54◄─►76
longcat-flash-chat
1399
±6
11,554
Meituan
MIT
63
55◄─►76
qwen3-235b-a22b-thinking-2507
1398
±6
9,248
Alibaba
Apache 2.0
64
55◄─►76
deepseek-r1
1398
±5
18,537
DeepSeek
MIT
65
55◄─►83
amazon-nova-experimental-chat-12-10
1395
±10
3,722
Amazon
Proprietary
66
55◄─►82
qwen3-vl-235b-a22b-thinking
1395
±7
7,974
Alibaba
Apache 2.0
67
56◄─►80
mimo-v2-flash (non-thinking)
1395
±6
13,410
Xiaomi
MIT
68
59◄─►77
deepseek-v3-0324
1394
±4
46,699
DeepSeek
MIT
69
54◄─►84
Tencenthunyuan-vision-1.5-thinking
1393
±12
2,222
Tencent
Proprietary
70
59◄─►83
mai-1-preview
1391
±5
18,137
Microsoft AI
Proprietary
71
62◄─►82
o4-mini-2025-04-16
1391
±4
46,681
OpenAI
Proprietary
72
62◄─►83
gpt-5-mini-high
1390
±5
27,174
OpenAI
Proprietary
73
62◄─►83
Anthropicclaude-sonnet-4-20250514
1390
±4
41,643
Anthropic
Proprietary
74
62◄─►83
Anthropicclaude-3-7-sonnet-20250219-thinking-32k
1389
±4
39,890
Anthropic
Proprietary
75
62◄─►83
o1-preview
1388
±5
31,120
OpenAI
Proprietary
76
62◄─►86
Tencenthunyuan-t1-20250711
1387
±9
4,801
Tencent
Proprietary
77
65◄─►84
qwen3-coder-480b-a35b-instruct
1387
±5
26,648
Alibaba
Apache 2.0
78
66◄─►84
mistral-medium-2505
1384
±5
34,547
Mistral
Proprietary
79
67◄─►86
qwen3-30b-a3b-instruct-2507
1383
±5
24,093
Alibaba
Apache 2.0
80
67◄─►87
Minimaxminimax-m2.1-preview
1383
±6
12,907
MiniMax
MIT
81
66◄─►89
Tencenthunyuan-turbos-20250416
1382
±6
11,050
Tencent
Proprietary
82
69◄─►87
gpt-4.1-mini-2025-04-14
1382
±4
40,533
OpenAI
Proprietary
83
75◄─►89
gemini-2.5-flash-lite-preview-09-2025-no-thinking
1379
±4
44,231
Proprietary
84
66◄─►97
glm-4.6v
1378
±11
2,822
Z.ai
MIT
85
78◄─►94
gemini-2.5-flash-lite-preview-06-17-thinking
1375
±4
33,931
Proprietary
86
78◄─►94
qwen3-235b-a22b
1374
±5
27,156
Alibaba
Apache 2.0
87
80◄─►94
qwen2.5-max
1374
±4
33,288
Alibaba
Proprietary
88
82◄─►94
Anthropicclaude-3-5-sonnet-20241022
1373
±3
89,457
Anthropic
Proprietary
89
82◄─►96
Anthropicclaude-3-7-sonnet-20250219
1372
±4
44,434
Anthropic
Proprietary
90
84◄─►97
glm-4.5-air
1371
±4
31,396
Z.ai
MIT
91
84◄─►100
qwen3-next-80b-a3b-thinking
1369
±6
13,871
Alibaba
Apache 2.0
92
84◄─►100
Minimaxminimax-m1
1367
±4
36,843
MiniMax
Apache 2.0
93
84◄─►103
amazon-nova-experimental-chat-11-10
1367
±6
13,391
Amazon
Proprietary
94
88◄─►100
gemma-3-27b-it
1365
±4
48,686
Gemma
95
88◄─►104
o3-mini-high
1364
±5
18,584
OpenAI
Proprietary
96
89◄─►108
grok-3-mini-high
1362
±5
17,525
xAI
Proprietary
97
84◄─►116
glm-4.7-flash
1362
±9
4,090
Z.ai
MIT
98
91◄─►108
gemini-2.0-flash-001
1361
±4
44,815
Proprietary
99
91◄─►114
deepseek-v3
1359
±5
21,788
DeepSeek
DeepSeek
100
94◄─►119
grok-3-mini-beta
1357
±5
23,763
xAI
Proprietary
101
94◄─►120
mistral-small-2506
1356
±5
18,340
Mistral
Apache 2.0
102
91◄─►122
intellect-3
1356
±8
5,325
Prime Intellect
MIT
103
96◄─►121
gpt-oss-120b
1354
±4
30,977
OpenAI
Apache 2.0
104
96◄─►122
gemini-2.0-flash-lite-preview-02-05
1353
±4
24,951
Proprietary
105
94◄─►124
glm-4.5v
1353
±8
4,977
Z.ai
MIT
106
98◄─►122
Coherecommand-a-03-2025
1353
±3
57,447
Cohere
CC-BY-NC-4.0
107
98◄─►122
gemini-1.5-pro-002
1352
±3
55,607
Proprietary
108
95◄─►134
Tencenthunyuan-turbos-20250226
1349
±12
2,226
Tencent
Proprietary
109
100◄─►124
o3-mini
1348
±3
58,645
OpenAI
Proprietary
110
98◄─►126
amazon-nova-experimental-chat-10-20
1348
±6
11,439
Amazon
Proprietary
111
96◄─►135
Nvidiallama-3.1-nemotron-ultra-253b-v1
1347
±12
2,546
Nvidia
Nvidia Open Model
112
96◄─►134
amazon-nova-experimental-chat-10-09
1347
±11
2,892
Amazon
Proprietary
113
98◄─►134
qwen3-32b
1347
±9
3,932
Alibaba
Apache 2.0
114
98◄─►132
Minimaxminimax-m2
1347
±8
6,749
MiniMax
Apache 2.0
115
99◄─►131
ling-flash-2.0
1346
±7
7,044
Ant Group
MIT
116
99◄─►132
Stepfunstep-3
1346
±7
6,595
StepFun
Apache 2.0
117
98◄─►133
qwen-plus-0125
1346
±8
5,823
Alibaba
Proprietary
118
103◄─►124
gpt-4o-2024-05-13
1346
±3
112,863
OpenAI
Proprietary
119
100◄─►135
glm-4-plus-0111
1343
±8
5,760
Zhipu
Proprietary
120
107◄─►129
Anthropicclaude-3-5-sonnet-20240620
1343
±3
82,417
Anthropic
Proprietary
121
101◄─►138
gemma-3-12b-it
1342
±10
3,829
Gemma
122
102◄─►140
Nvidianvidia-llama-3.3-nemotron-super-49b-v1.5
1341
±10
3,430
Nvidia
Nvidia Open
123
100◄─►141
Tencenthunyuan-turbo-0110
1340
±12
2,295
Tencent
Proprietary
124
107◄─►139
gpt-5-nano-high
1338
±7
8,402
OpenAI
Proprietary
125
111◄─►136
o1-mini
1337
±4
51,986
OpenAI
Proprietary
126
110◄─►140
nova-2-lite
1336
±6
12,274
Amazon
Proprietary
127
112◄─►137
Metallama-3.1-405b-instruct-bf16
1336
±4
41,392
Meta
Llama 3.1 Community
128
111◄─►139
qwq-32b
1336
±4
26,123
Alibaba
Apache 2.0
129
112◄─►139
gpt-4o-2024-08-06
1335
±4
45,498
OpenAI
Proprietary
130
111◄─►140
gemini-advanced-0514
1335
±5
50,142
Proprietary
131
114◄─►139
grok-2-2024-08-13
1335
±4
63,495
xAI
Proprietary
132
116◄─►140
Metallama-3.1-405b-instruct-fp8
1334
±4
59,655
Meta
Llama 3.1 Community
133
110◄─►148
Stepfunstep-2-16k-exp-202412
1334
±9
4,829
StepFun
Proprietary
134
121◄─►153
01.AIyi-lightning
1329
±5
27,340
01 AI
Proprietary
135
122◄─►154
qwen3-30b-a3b
1328
±5
27,428
Alibaba
Apache 2.0
136
123◄─►153
Metallama-4-maverick-17b-128e-instruct
1328
±4
41,136
Meta
Llama 4
137
113◄─►162
Nvidiallama-3.3-nemotron-49b-super-v1
1327
±12
2,230
Nvidia
Nvidia
138
118◄─►162
Tencenthunyuan-large-2025-02-10
1327
±10
3,738
Tencent
Proprietary
139
133◄─►158
gpt-4-turbo-2024-04-09
1325
±4
98,130
OpenAI
Proprietary
140
133◄─►158
Anthropicclaude-3-5-haiku-20241022
1324
±3
71,179
Anthropic
Proprietary
141
128◄─►162
olmo-3.1-32b-instruct
1324
±7
8,310
Allen AI
Apache 2.0
142
124◄─►162
deepseek-v2.5-1210
1323
±8
6,793
DeepSeek
DeepSeek
143
133◄─►158
gemini-1.5-pro-001
1323
±4
79,132
Proprietary
144
133◄─►160
Metallama-4-scout-17b-16e-instruct
1323
±5
31,231
Meta
Llama
145
133◄─►158
Anthropicclaude-3-opus-20240229
1323
±3
194,904
Anthropic
Proprietary
146
132◄─►163
gpt-4.1-nano-2025-04-14
1322
±8
6,107
OpenAI
Proprietary
147
133◄─►162
Stepfunstep-1o-turbo-202506
1321
±7
9,687
StepFun
Proprietary
148
133◄─►164
ring-flash-2.0
1320
±7
7,199
Ant Group
MIT
149
136◄─►162
Metallama-3.3-70b-instruct
1320
±3
55,561
Meta
Llama-3.3
150
134◄─►162
glm-4-plus
1319
±5
26,134
Zhipu AI
Proprietary
151
134◄─►163
gemma-3n-e4b-it
1319
±5
23,347
Gemma
152
134◄─►164
qwen-max-0919
1318
±6
16,479
Alibaba
Qwen
153
137◄─►162
gpt-4o-mini-2024-07-18
1318
±3
68,801
OpenAI
Proprietary
154
134◄─►169
gpt-oss-20b
1318
±6
10,807
OpenAI
Apache 2.0
155
137◄─►169
Nvidianvidia-nemotron-3-nano-30b-a3b-bf16
1317
±6
15,510
Nvidia
NVIDIA Open Model
156
137◄─►170
qwen2.5-plus-1127
1315
±6
10,179
Alibaba
Proprietary
157
141◄─►169
athene-v2-chat
1314
±4
24,746
NexusFlow
NexusFlow
158
141◄─►169
mistral-large-2407
1314
±4
45,460
Mistral
Mistral Research
159
142◄─►169
gpt-4-1106-preview
1314
±4
100,107
OpenAI
Proprietary
160
142◄─►169
gpt-4-0125-preview
1314
±4
93,439
OpenAI
Proprietary
161
137◄─►174
Tencenthunyuan-standard-2025-02-10
1312
±10
3,905
Tencent
Proprietary
162
134◄─►178
mercury
1310
±14
1,921
Inception AI
Proprietary
163
149◄─►173
gemini-1.5-flash-002
1310
±4
34,909
Proprietary
164
154◄─►174
grok-2-mini-2024-08-13
1308
±4
52,574
xAI
Proprietary
165
154◄─►174
deepseek-v2.5
1307
±5
24,574
DeepSeek
DeepSeek
166
154◄─►174
athene-70b-0725
1306
±6
19,622
NexusFlow
CC-BY-NC-4.0
167
154◄─►174
magistral-medium-2506
1305
±6
12,061
Mistral
Proprietary
168
160◄─►174
mistral-large-2411
1305
±4
28,081
Mistral
MRL
169
152◄─►177
olmo-3-32b-think
1305
±8
5,891
Allen AI
Apache 2.0
170
161◄─►174
mistral-small-3.1-24b-instruct-2503
1305
±4
34,132
Mistral
Apache 2.0
171
154◄─►181
gemma-3-4b-it
1303
±9
4,177
Gemma
172
161◄─►174
qwen2.5-72b-instruct
1303
±4
39,409
Alibaba
Qwen
173
161◄─►184
Nvidiallama-3.1-nemotron-70b-instruct
1299
±8
7,136
Nvidia
Llama 3.1
174
162◄─►185
Tencenthunyuan-large-vision
1296
±9
5,601
Tencent
Proprietary
175
170◄─►185
Metallama-3.1-70b-instruct
1294
±4
55,234
Meta
Llama 3.1 Community
176
172◄─►186
amazon-nova-pro-v1.0
1290
±5
24,753
Amazon
Proprietary
177
172◄─►189
jamba-1.5-large
1289
±7
8,659
AI21 Labs
Jamba Open
178
171◄─►192
ibm-granite-h-small
1288
±8
5,675
IBM
Apache 2.0
179
173◄─►187
gemma-2-27b-it
1288
±3
75,764
Gemma license
180
172◄─►189
reka-core-20240904
1288
±7
7,309
Reka AI
Proprietary
181
173◄─►189
gpt-4-0314
1288
±5
54,167
OpenAI
Proprietary
182
170◄─►195
Nvidiallama-3.1-nemotron-51b-instruct
1287
±10
3,749
Nvidia
Llama 3.1
183
170◄─►195
llama-3.1-tulu-3-70b
1287
±10
2,846
Ai2
Llama 3.1
184
174◄─►189
gemini-1.5-flash-001
1286
±4
62,823
Proprietary
185
173◄─►195
olmo-3.1-32b-think
1285
±7
8,498
Allen AI
Apache 2.0
186
177◄─►195
Anthropicclaude-3-sonnet-20240229
1282
±4
109,289
Anthropic
Proprietary
187
176◄─►195
gemma-2-9b-it-simpo
1280
±7
10,069
Princeton
MIT
188
178◄─►195
Nvidianemotron-4-340b-instruct
1278
±5
19,661
Nvidia
NVIDIA Open Model
189
178◄─►197
Coherecommand-r-plus-08-2024
1277
±7
9,869
Cohere
CC-BY-NC-4.0
190
182◄─►195
Metallama-3-70b-instruct
1277
±4
156,880
Meta
Llama 3 Community
191
182◄─►196
gpt-4-0613
1276
±4
88,721
OpenAI
Proprietary
192
183◄─►198
mistral-small-24b-instruct-2501
1274
±6
14,677
Mistral
Apache 2.0
193
182◄─►200
glm-4-0520
1274
±7
9,788
Zhipu AI
Proprietary
194
183◄─►201
reka-flash-20240904
1273
±7
7,537
Reka AI
Proprietary
195
183◄─►204
qwen2.5-coder-32b-instruct
1271
±8
5,430
Alibaba
Apache 2.0
196
190◄─►204
Coherec4ai-aya-expanse-32b
1267
±5
27,123
Cohere
CC-BY-NC-4.0
197
192◄─►204
gemma-2-9b-it
1266
±4
54,615
Gemma license
198
191◄─►205
deepseek-coder-v2
1265
±6
15,147
DeepSeek
DeepSeek License
199
193◄─►205
Coherecommand-r-plus
1263
±4
77,556
Cohere
CC-BY-NC-4.0
200
193◄─►205
qwen2-72b-instruct
1263
±5
37,325
Alibaba
Qianwen LICENSE
201
195◄─►205
Anthropicclaude-3-haiku-20240307
1262
±4
117,705
Anthropic
Proprietary
202
194◄─►206
amazon-nova-lite-v1.0
1261
±5
19,376
Amazon
Proprietary
203
195◄─►206
gemini-1.5-flash-8b-001
1259
±4
35,556
Proprietary
204
198◄─►206
Azurephi-4
1256
±5
24,126
Microsoft
MIT
205
195◄─►212
olmo-2-0325-32b-instruct
1253
±11
3,335
Allen AI
Apache-2.0
206
202◄─►211
Coherecommand-r-08-2024
1251
±7
10,141
Cohere
CC-BY-NC-4.0
207
205◄─►215
mistral-large-2402
1243
±5
62,437
Mistral
Proprietary
208
205◄─►215
amazon-nova-micro-v1.0
1241
±5
19,355
Amazon
Proprietary
209
205◄─►219
jamba-1.5-mini
1239
±7
8,854
AI21 Labs
Jamba Open
210
205◄─►223
ministral-8b-2410
1237
±9
4,780
Mistral
MRL
211
206◄─►223
gemini-pro-dev-api
1236
±7
18,352
Proprietary
212
207◄─►223
qwen1.5-110b-chat
1235
±6
26,191
Alibaba
Qianwen LICENSE
213
207◄─►224
reka-flash-21b-20240226-online
1234
±7
15,451
Reka AI
Proprietary
214
207◄─►223
qwen1.5-72b-chat
1234
±5
39,296
Alibaba
Qianwen LICENSE
215
205◄─►225
Tencenthunyuan-standard-256k
1234
±12
2,729
Tencent
Proprietary
216
209◄─►224
mixtral-8x22b-instruct-v0.1
1230
±5
51,417
Mistral
Apache 2.0
217
209◄─►225
Coherecommand-r
1228
±5
54,038
Cohere
CC-BY-NC-4.0
218
209◄─►225
reka-flash-21b-20240226
1227
±6
24,806
Reka AI
Proprietary
219
210◄─►226
gpt-3.5-turbo-0125
1225
±5
66,191
OpenAI
Proprietary
220
210◄─►227
mistral-medium
1224
±5
34,552
Mistral
Proprietary
221
214◄─►226
Metallama-3-8b-instruct
1224
±4
104,636
Meta
Llama 3 Community
222
210◄─►227
Coherec4ai-aya-expanse-8b
1223
±7
9,827
Cohere
CC-BY-NC-4.0
223
209◄─►230
gemini-pro
1223
±12
6,390
Proprietary
224
210◄─►230
llama-3.1-tulu-3-8b
1221
±11
2,895
Ai2
Llama 3.1
225
221◄─►230
01.AIyi-1.5-34b-chat
1214
±5
24,142
01 AI
Apache-2.0
226
216◄─►231
HuggingFacezephyr-orpo-141b-A35b-v0.1
1214
±11
4,653
HuggingFace
Apache 2.0
227
223◄─►230
Metallama-3.1-8b-instruct
1212
±4
49,605
Meta
Llama 3.1 Community
228
219◄─►236
granite-3.1-8b-instruct
1209
±11
3,092
IBM
Apache 2.0
229
223◄─►236
qwen1.5-32b-chat
1205
±6
21,744
Alibaba
Qianwen LICENSE
230
223◄─►238
gpt-3.5-turbo-1106
1204
±9
16,616
OpenAI
Proprietary
231
227◄─►238
Azurephi-3-medium-4k-instruct
1199
±5
25,055
Microsoft
MIT
232
228◄─►238
gemma-2-2b-it
1199
±4
46,618
Gemma license
233
228◄─►238
mixtral-8x7b-instruct-v0.1
1198
±4
73,505
Mistral
Apache 2.0
234
228◄─►243
dbrx-instruct-preview
1196
±6
32,196
Databricks
DBRX LICENSE
235
228◄─►247
InternLMinternlm2_5-20b-chat
1192
±7
9,902
InternLM
Other
236
228◄─►247
qwen1.5-14b-chat
1192
±7
17,841
Alibaba
Qianwen LICENSE
237
230◄─►253
Azurewizardlm-70b
1186
±9
8,214
Microsoft
Llama 2 Community
238
230◄─►254
deepseek-llm-67b-chat
1185
±12
4,933
DeepSeek
DeepSeek License
239
234◄─►250
01.AIyi-34b-chat
1185
±7
15,483
01 AI
Yi License
240
234◄─►253
OpenChatopenchat-3.5-0106
1183
±8
12,636
OpenChat
Apache-2.0
241
234◄─►254
OpenChatopenchat-3.5
1183
±10
7,967
OpenChat
Apache-2.0
242
234◄─►254
granite-3.0-8b-instruct
1183
±9
6,643
IBM
Apache 2.0
243
235◄─►254
gemma-1.1-7b-it
1181
±6
23,893
Gemma license
244
235◄─►254
Snowflakesnowflake-arctic-instruct
1180
±6
32,836
Snowflake
Apache 2.0
245
234◄─►256
granite-3.1-2b-instruct
1180
±11
3,191
IBM
Apache 2.0
246
235◄─►254
tulu-2-dpo-70b
1179
±10
6,534
AllenAI/UW
AI2 ImpACT Low-risk
247
235◄─►258
openhermes-2.5-mistral-7b
1176
±10
5,006
NousResearch
Apache-2.0
248
237◄─►257
vicuna-33b
1174
±6
22,479
LMSYS
Non-commercial
249
237◄─►260
starling-lm-7b-beta
1172
±7
16,057
Nexusflow
Apache-2.0
250
237◄─►258
Azurephi-3-small-8k-instruct
1172
±6
17,763
Microsoft
MIT
251
238◄─►258
Metallama-2-70b-chat
1172
±5
38,491
Meta
Llama 2 Community
252
238◄─►261
starling-lm-7b-alpha
1168
±8
10,224
UC Berkeley
CC-BY-NC-4.0
253
240◄─►261
Metallama-3.2-3b-instruct
1167
±8
7,936
Meta
Llama 3.2
254
238◄─►264
nous-hermes-2-mixtral-8x7b-dpo
1166
±12
3,776
NousResearch
Apache-2.0
255
246◄─►270
qwq-32b-preview
1158
±12
3,233
Alibaba
Apache 2.0
256
251◄─►268
granite-3.0-2b-instruct
1157
±8
6,837
IBM
Apache 2.0
257
246◄─►271
Nvidiallama2-70b-steerlm-chat
1156
±13
3,584
Nvidia
Llama 2 Community
258
248◄─►274
solar-10.7b-instruct-v1.0
1153
±13
4,155
Upstage AI
CC-BY-NC-4.0
259
247◄─►276
dolphin-2.2.1-mistral-7b
1153
±15
1,679
Cognitive Computations
Apache-2.0
260
252◄─►274
mpt-30b-chat
1151
±12
2,571
MosaicML
CC-BY-NC-SA-4.0
261
254◄─►271
mistral-7b-instruct-v0.2
1151
±7
19,402
Mistral
Apache-2.0
262
254◄─►273
Azurewizardlm-13b
1150
±9
7,046
Microsoft
Llama 2 Community
263
251◄─►278
falcon-180b-chat
1148
±17
1,295
TII
Falcon-180B TII License
264
254◄─►277
qwen1.5-7b-chat
1145
±10
4,735
Alibaba
Qianwen LICENSE
265
255◄─►276
Azurephi-3-mini-4k-instruct-june-2024
1144
±6
12,296
Microsoft
MIT
266
255◄─►277
Metallama-2-13b-chat
1142
±7
19,171
Meta
Llama 2 Community
267
255◄─►277
vicuna-13b
1142
±7
19,366
LMSYS
Llama 2 Community
268
255◄─►279
qwen-14b-chat
1140
±11
4,964
Alibaba
Qianwen LICENSE
269
256◄─►279
palm-2
1138
±9
8,554
Proprietary
270
256◄─►279
Metacodellama-34b-instruct
1138
±9
7,363
Meta
Llama 2 Community
271
257◄─►279
gemma-7b-it
1137
±10
8,925
Gemma license
272
259◄─►280
HuggingFacezephyr-7b-beta
1132
±9
11,116
HuggingFace
MIT
273
262◄─►280
Azurephi-3-mini-128k-instruct
1130
±7
20,691
Microsoft
MIT
274
264◄─►280
Azurephi-3-mini-4k-instruct
1129
±6
20,115
Microsoft
MIT
275
260◄─►283
guanaco-33b
1128
±12
2,921
UW
Non-commercial
276
259◄─►284
HuggingFacezephyr-7b-alpha
1128
±16
1,785
HuggingFace
MIT
277
267◄─►284
stripedhyena-nous-7b
1122
±11
5,184
Together AI
Apache 2.0
278
262◄─►285
Metacodellama-70b-instruct
1120
±18
1,143
Meta
Llama 2 Community
279
272◄─►284
vicuna-7b
1116
±9
6,923
LMSYS
Llama 2 Community
280
268◄─►285
HuggingFacesmollm2-1.7b-instruct
1115
±14
2,201
HuggingFace
Apache 2.0
281
275◄─►284
gemma-1.1-2b-it
1115
±8
10,853
Gemma license
282
275◄─►284
Metallama-3.2-1b-instruct
1112
±8
8,045
Meta
Llama 3.2
283
275◄─►285
mistral-7b-instruct
1111
±9
8,977
Mistral
Apache 2.0
284
276◄─►285
Metallama-2-7b-chat
1109
±7
14,148
Meta
Llama 2 Community
285
281◄─►289
gemma-2b-it
1092
±12
4,779
Gemma license
286
285◄─►288
qwen1.5-4b-chat
1091
±9
7,598
Alibaba
Qianwen LICENSE
287
285◄─►292
olmo-7b-instruct
1075
±11
6,329
Allen AI
Apache-2.0
288
286◄─►292
koala-13b
1071
±10
6,964
UC Berkeley
Non-commercial
289
287◄─►292
alpaca-13b
1068
±12
5,745
Stanford
Non-commercial
290
285◄─►293
gpt4all-13b-snoozy
1067
±15
1,743
Nomic AI
Non-commercial
291
287◄─►293
mpt-7b-chat
1063
±12
3,925
MosaicML
CC-BY-NC-SA-4.0
292
287◄─►293
chatglm3-6b
1057
±12
4,658
Tsinghua
Apache-2.0
293
290◄─►295
RWKVRWKV-4-Raven-14B
1042
±11
4,845
RWKV
Apache 2.0
294
293◄─►295
chatglm2-6b
1025
±14
2,657
Tsinghua
Apache-2.0
295
293◄─►295
oasst-pythia-12b
1023
±11
6,311
OpenAssistant
Apache 2.0
296
296◄─►299
chatglm-6b
997
±13
4,914
Tsinghua
Non-commercial
297
296◄─►299
fastchat-t5-3b
992
±12
4,203
LMSYS
Apache 2.0
298
296◄─►299
dolly-v2-12b
981
±14
3,412
Databricks
MIT
299
296◄─►300
Metallama-13b
973
±16
2,391
Meta
Non-commercial
300
299◄─►300
Stabilitystablelm-tuned-alpha-7b
954
±13
3,287
Stability AI
CC-BY-NC-SA-4.0
说明
排名 (UB):基于 Bradley-Terry 模型计算的排名。此排名反映了模型在竞技场中的综合表现,并提供了其 Elo 分数的 上界 估计,帮助理解模型的潜在竞争力。
模型:大型语言模型 (LLM) 的名称。部分模型名称可能已嵌入相关链接。
分数:模型在竞技场中通过用户投票获得的 Elo 评分。Elo 评分是一种相对排名系统,分数越高表示模型表现越好。
95% 置信区间 (±):模型 Elo 评分的95%置信区间(例如:
±6)。这个区间越小,表示模型的评分越稳定和可靠。票数:该模型在竞技场中收到的总投票数量。投票数越多,通常意味着其评分的统计可靠性越高。
组织/公司:提供该模型的组织或公司。
许可证:模型的许可协议类型,例如专有 (Proprietary)、Apache 2.0、MIT 等。
数据来源与更新频率
本排行榜数据由自动化脚本直接从 1 2 官方网站获取。此排行榜由 GitHub Actions 每天自动更新。
免责声明
本报告仅供参考。排行榜数据是动态变化的,并基于特定时间段内用户在 Chatbot Arena 上的偏好投票。数据的完整性和准确性取决于上游数据源。不同模型可能采用不同的许可协议,使用时请务必参考模型提供商的官方说明。
最后更新于
这有帮助吗?