Model Leaderboard
This leaderboard is generated by an automated pipeline from Chatbot Arena (lmarena.ai) data.
Data updated: 2025-11-22 08:07:09 UTC / 2025-11-22 16:07:09 CST (Beijing time)
Leaderboard
| Rank (UB) | Rank spread | Model | Score | 95% CI (±) | Votes | Organization | License |
| --- | --- | --- | --- | --- | --- | --- | --- |
| 1 | 1◄─►2 | gemini-3-pro | 1495 (Preliminary) | ±9 | 5,471 | | Proprietary |
| 2 | 1◄─►2 | grok-4.1-thinking | 1481 (Preliminary) | ±9 | 5,822 | xAI | Proprietary |
| 3 | 3◄─►6 | grok-4.1 | 1462 (Preliminary) | ±9 | 5,825 | xAI | Proprietary |
| 4 | 3◄─►9 | gpt-5.1-high | 1454 | ±9 | 4,980 | OpenAI | Proprietary |
| 5 | 3◄─►9 | gemini-2.5-pro | 1451 | ±4 | 67,956 | | Proprietary |
| 6 | 3◄─►11 | claude-sonnet-4-5-20250929-thinking-32k | 1449 | ±5 | 19,073 | Anthropic | Proprietary |
| 7 | 4◄─►9 | claude-opus-4-1-20250805-thinking-16k | 1449 | ±4 | 34,691 | Anthropic | Proprietary |
| 8 | 4◄─►13 | claude-sonnet-4-5-20250929 | 1444 | ±6 | 13,765 | Anthropic | Proprietary |
| 9 | 4◄─►16 | gpt-4.5-preview-2025-02-27 | 1442 | ±6 | 14,644 | OpenAI | Proprietary |
| 10 | 7◄─►16 | claude-opus-4-1-20250805 | 1440 | ±4 | 47,504 | Anthropic | Proprietary |
| 11 | 8◄─►16 | chatgpt-4o-latest-20250326 | 1439 | ±4 | 53,924 | OpenAI | Proprietary |
| 12 | 8◄─►16 | gpt-5-high | 1437 | ±5 | 32,910 | OpenAI | Proprietary |
| 13 | 7◄─►23 | gpt-5.1 | 1435 | ±9 | 5,135 | OpenAI | Proprietary |
| 14 | 9◄─►17 | o3-2025-04-16 | 1434 | ±4 | 61,635 | OpenAI | Proprietary |
| 15 | 9◄─►19 | qwen3-max-preview | 1433 | ±5 | 28,154 | Alibaba | Proprietary |
| 16 | 9◄─►30 | kimi-k2-thinking | 1429 | ±7 | 8,126 | Moonshot | Modified MIT |
| 17 | 13◄─►32 | glm-4.6 | 1426 | ±5 | 16,268 | Z.ai | MIT |
| 18 | 14◄─►31 | gpt-5-chat | 1425 | ±4 | 32,171 | OpenAI | Proprietary |
| 19 | 14◄─►33 | qwen3-max-2025-09-23 | 1423 | ±6 | 9,266 | Alibaba | Proprietary |
| 20 | 15◄─►33 | claude-opus-4-20250514-thinking-16k | 1423 | ±4 | 37,905 | Anthropic | Proprietary |
| 21 | 16◄─►33 | qwen3-235b-a22b-instruct-2507 | 1421 | ±4 | 42,468 | Alibaba | Apache 2.0 |
| 22 | 15◄─►35 | deepseek-v3.2-exp-thinking | 1421 | ±7 | 9,228 | DeepSeek AI | MIT |
| 23 | 15◄─►39 | grok-4-fast | 1420 | ±8 | 7,067 | xAI | Proprietary |
| 24 | 15◄─►41 | ernie-5.0-preview-1022 | 1418 (Preliminary) | ±9 | 4,714 | Baidu | Proprietary |
| 25 | 16◄─►39 | deepseek-r1-0528 | 1418 | ±6 | 19,246 | DeepSeek | MIT |
| 26 | 16◄─►41 | kimi-k2-0905-preview | 1416 | ±7 | 10,652 | Moonshot | Modified MIT |
| 27 | 16◄─►41 | deepseek-v3.1 | 1416 | ±6 | 15,261 | DeepSeek | MIT |
| 28 | 16◄─►41 | deepseek-v3.1-thinking | 1416 | ±7 | 11,997 | DeepSeek | MIT |
| 29 | 18◄─►40 | kimi-k2-0711-preview | 1416 | ±5 | 28,204 | Moonshot | Modified MIT |
| 30 | 16◄─►44 | deepseek-v3.1-terminus | 1415 | ±10 | 3,751 | DeepSeek AI | MIT |
| 31 | 17◄─►41 | qwen3-vl-235b-a22b-instruct | 1415 | ±7 | 8,539 | Alibaba | Apache 2.0 |
| 32 | 16◄─►46 | deepseek-v3.1-terminus-thinking | 1414 | ±10 | 3,525 | DeepSeek AI | MIT |
| 33 | 19◄─►43 | deepseek-v3.2-exp | 1412 | ±6 | 11,053 | DeepSeek AI | MIT |
| 34 | 22◄─►42 | claude-opus-4-20250514 | 1412 | ±4 | 45,701 | Anthropic | Proprietary |
| 35 | 22◄─►42 | gpt-4.1-2025-04-14 | 1412 | ±4 | 52,608 | OpenAI | Proprietary |
| 36 | 23◄─►43 | mistral-medium-2508 | 1410 | ±4 | 36,634 | Mistral | Proprietary |
| 37 | 23◄─►44 | grok-3-preview-02-24 | 1410 | ±4 | 34,147 | xAI | Proprietary |
| 38 | 23◄─►46 | grok-4-0709 | 1409 | ±4 | 41,963 | xAI | Proprietary |
| 39 | 23◄─►46 | glm-4.5 | 1408 | ±5 | 24,834 | Z.ai | MIT |
| 40 | 25◄─►46 | gemini-2.5-flash | 1407 | ±4 | 67,181 | | Proprietary |
| 41 | 26◄─►50 | gemini-2.5-flash-preview-09-2025 | 1405 | ±5 | 20,319 | | Proprietary |
| 42 | 31◄─►52 | grok-4-fast-reasoning | 1403 | ±5 | 18,235 | xAI | Proprietary |
| 43 | 33◄─►53 | claude-haiku-4-5-20251001 | 1402 | ±5 | 16,895 | Anthropic | Proprietary |
| 44 | 37◄─►53 | o1-2024-12-17 | 1400 | ±4 | 28,039 | OpenAI | Proprietary |
| 45 | 37◄─►55 | qwen3-next-80b-a3b-instruct | 1400 | ±5 | 23,138 | Alibaba | Apache 2.0 |
| 46 | 34◄─►58 | longcat-flash-chat | 1399 | ±6 | 11,509 | Meituan | MIT |
| 47 | 40◄─►56 | claude-sonnet-4-20250514-thinking-32k | 1399 | ±4 | 36,237 | Anthropic | Proprietary |
| 48 | 41◄─►56 | qwen3-235b-a22b-no-thinking | 1399 | ±5 | 39,395 | Alibaba | Apache 2.0 |
| 49 | 41◄─►60 | qwen3-235b-a22b-thinking-2507 | 1397 | ±6 | 9,349 | Alibaba | Apache 2.0 |
| 50 | 42◄─►60 | deepseek-r1 | 1395 | ±5 | 18,718 | DeepSeek | MIT |
| 51 | 42◄─►63 | qwen3-vl-235b-a22b-thinking | 1393 | ±7 | 7,991 | Alibaba | Apache 2.0 |
| 52 | 43◄─►61 | gpt-5-mini-high | 1393 | ±5 | 27,464 | OpenAI | Proprietary |
| 53 | 45◄─►61 | deepseek-v3-0324 | 1391 | ±4 | 46,819 | DeepSeek | MIT |
| 54 | 46◄─►62 | o4-mini-2025-04-16 | 1391 | ±4 | 46,870 | OpenAI | Proprietary |
| 55 | 41◄─►67 | hunyuan-vision-1.5-thinking | 1391 | ±12 | 2,217 | Tencent | Proprietary |
| 56 | 45◄─►65 | mai-1-preview | 1390 | ±5 | 18,203 | Microsoft AI | Proprietary |
| 57 | 48◄─►65 | claude-sonnet-4-20250514 | 1389 | ±4 | 41,674 | Anthropic | Proprietary |
| 58 | 49◄─►66 | o1-preview | 1387 | ±5 | 31,505 | OpenAI | Proprietary |
| 59 | 49◄─►66 | claude-3-7-sonnet-20250219-thinking-32k | 1387 | ±4 | 39,928 | Anthropic | Proprietary |
| 60 | 51◄─►66 | qwen3-coder-480b-a35b-instruct | 1385 | ±5 | 23,162 | Alibaba | Apache 2.0 |
| 61 | 48◄─►68 | hunyuan-t1-20250711 | 1385 | ±9 | 4,821 | Tencent | Proprietary |
| 62 | 53◄─►68 | mistral-medium-2505 | 1383 | ±5 | 34,534 | Mistral | Proprietary |
| 63 | 54◄─►68 | qwen3-30b-a3b-instruct-2507 | 1382 | ±5 | 24,219 | Alibaba | Apache 2.0 |
| 64 | 57◄─►69 | gpt-4.1-mini-2025-04-14 | 1380 | ±4 | 40,507 | OpenAI | Proprietary |
| 65 | 55◄─►72 | hunyuan-turbos-20250416 | 1380 | ±6 | 11,138 | Tencent | Proprietary |
| 66 | 55◄─►69 | gemini-2.5-flash-lite-preview-09-2025-no-thinking | 1380 | ±5 | 20,164 | | Proprietary |
| 67 | 60◄─►73 | gemini-2.5-flash-lite-preview-06-17-thinking | 1375 | ±5 | 33,996 | | Proprietary |
| 68 | 61◄─►74 | qwen3-235b-a22b | 1374 | ±5 | 27,179 | Alibaba | Apache 2.0 |
| 69 | 64◄─►74 | qwen2.5-max | 1372 | ±4 | 33,551 | Alibaba | Proprietary |
| 70 | 66◄─►74 | claude-3-5-sonnet-20241022 | 1372 | ±3 | 89,858 | Anthropic | Proprietary |
| 71 | 66◄─►77 | claude-3-7-sonnet-20250219 | 1371 | ±4 | 44,569 | Anthropic | Proprietary |
| 72 | 66◄─►77 | glm-4.5-air | 1370 | ±4 | 31,688 | Z.ai | MIT |
| 73 | 67◄─►80 | qwen3-next-80b-a3b-thinking | 1367 | ±6 | 13,838 | Alibaba | Apache 2.0 |
| 74 | 68◄─►80 | minimax-m1 | 1365 | ±4 | 36,890 | MiniMax | Apache 2.0 |
| 75 | 71◄─►80 | gemma-3-27b-it | 1364 | ±4 | 49,341 | | Gemma |
| 76 | 71◄─►84 | o3-mini-high | 1362 | ±5 | 18,735 | OpenAI | Proprietary |
| 77 | 71◄─►84 | grok-3-mini-high | 1362 | ±5 | 17,595 | xAI | Proprietary |
| 78 | 73◄─►87 | gemini-2.0-flash-001 | 1360 | ±4 | 45,120 | | Proprietary |
| 79 | 73◄─►94 | deepseek-v3 | 1357 | ±5 | 21,994 | DeepSeek | DeepSeek |
| 80 | 73◄─►95 | grok-3-mini-beta | 1357 | ±5 | 23,812 | xAI | Proprietary |
| 81 | 76◄─►100 | mistral-small-2506 | 1354 | ±5 | 18,337 | Mistral | Apache 2.0 |
| 82 | 78◄─►101 | gpt-oss-120b | 1352 | ±4 | 31,298 | OpenAI | Apache 2.0 |
| 83 | 78◄─►101 | gemini-2.0-flash-lite-preview-02-05 | 1352 | ±4 | 25,215 | | Proprietary |
| 84 | 79◄─►100 | command-a-03-2025 | 1352 | ±3 | 57,871 | Cohere | CC-BY-NC-4.0 |
| 85 | 76◄─►103 | glm-4.5v | 1352 | ±8 | 4,984 | Z.ai | MIT |
| 86 | 79◄─►101 | gemini-1.5-pro-002 | 1351 | ±3 | 56,012 | | Proprietary |
| 87 | 76◄─►105 | amazon-nova-experimental-chat-10-20 | 1349 (Preliminary) | ±10 | 3,952 | Amazon | Proprietary |
| 88 | 81◄─►103 | o3-mini | 1348 | ±3 | 58,828 | OpenAI | Proprietary |
| 89 | 79◄─►108 | minimax-m2 | 1346 | ±8 | 7,132 | MiniMax | Apache 2.0 |
| 90 | 79◄─►107 | ling-flash-2.0 | 1346 | ±7 | 7,160 | Ant Group | MIT |
| 91 | 76◄─►112 | hunyuan-turbos-20250226 | 1346 | ±12 | 2,250 | Tencent | Proprietary |
| 92 | 78◄─►113 | llama-3.1-nemotron-ultra-253b-v1 | 1345 | ±12 | 2,573 | Nvidia | Nvidia Open Model |
| 93 | 79◄─►108 | step-3 | 1345 | ±7 | 6,652 | StepFun | Apache 2.0 |
| 94 | 82◄─►103 | gpt-4o-2024-05-13 | 1345 | ±3 | 113,568 | OpenAI | Proprietary |
| 95 | 79◄─►113 | amazon-nova-experimental-chat-10-09 | 1345 | ±11 | 2,895 | Amazon | Proprietary |
| 96 | 79◄─►112 | qwen3-32b | 1344 | ±9 | 3,943 | Alibaba | Apache 2.0 |
| 97 | 80◄─►112 | qwen-plus-0125 | 1344 | ±8 | 5,861 | Alibaba | Proprietary |
| 98 | 81◄─►112 | glm-4-plus-0111 | 1343 | ±8 | 5,806 | Zhipu | Proprietary |
| 99 | 86◄─►104 | claude-3-5-sonnet-20240620 | 1343 | ±3 | 82,864 | Anthropic | Proprietary |
| 100 | 81◄─►115 | gemma-3-12b-it | 1340 | ±9 | 3,866 | | Gemma |
| 101 | 81◄─►117 | nvidia-llama-3.3-nemotron-super-49b-v1.5 | 1340 | ±10 | 3,496 | Nvidia | Nvidia Open |
| 102 | 86◄─►115 | gpt-5-nano-high | 1338 | ±7 | 8,396 | OpenAI | Proprietary |
| 103 | 81◄─►120 | hunyuan-turbo-0110 | 1338 | ±11 | 2,322 | Tencent | Proprietary |
| 104 | 91◄─►114 | llama-3.1-405b-instruct-bf16 | 1335 | ±4 | 41,932 | Meta | Llama 3.1 Community |
| 105 | 91◄─►114 | o1-mini | 1335 | ±4 | 52,301 | OpenAI | Proprietary |
| 106 | 92◄─►116 | gpt-4o-2024-08-06 | 1334 | ±4 | 45,787 | OpenAI | Proprietary |
| 107 | 90◄─►117 | gemini-advanced-0514 | 1334 | ±5 | 50,654 | | Proprietary |
| 108 | 94◄─►116 | grok-2-2024-08-13 | 1334 | ±4 | 63,725 | xAI | Proprietary |
| 109 | 94◄─►116 | llama-3.1-405b-instruct-fp8 | 1334 | ±3 | 60,272 | Meta | Llama 3.1 Community |
| 110 | 94◄─►117 | qwq-32b | 1333 | ±4 | 26,277 | Alibaba | Apache 2.0 |
| 111 | 89◄─►127 | step-2-16k-exp-202412 | 1332 | ±9 | 4,895 | StepFun | Proprietary |
| 112 | 100◄─►128 | yi-lightning | 1328 | ±5 | 27,624 | 01 AI | Proprietary |
| 113 | 102◄─►128 | llama-4-maverick-17b-128e-instruct | 1327 | ±4 | 41,227 | Meta | Llama 4 |
| 114 | 104◄─►131 | qwen3-30b-a3b | 1326 | ±5 | 27,509 | Alibaba | Apache 2.0 |
| 115 | 94◄─►139 | llama-3.3-nemotron-49b-super-v1 | 1325 | ±12 | 2,243 | Nvidia | Nvidia |
| 116 | 98◄─►138 | hunyuan-large-2025-02-10 | 1324 | ±10 | 3,760 | Tencent | Proprietary |
| 117 | 110◄─►131 | gpt-4-turbo-2024-04-09 | 1324 | ±4 | 98,965 | OpenAI | Proprietary |
| 118 | 111◄─►133 | claude-3-opus-20240229 | 1322 | ±3 | 196,368 | Anthropic | Proprietary |
| 119 | 110◄─►136 | llama-4-scout-17b-16e-instruct | 1322 | ±5 | 31,219 | Meta | Llama |
| 120 | 111◄─►134 | claude-3-5-haiku-20241022 | 1322 | ±3 | 71,407 | Anthropic | Proprietary |
| 121 | 111◄─►134 | gemini-1.5-pro-001 | 1322 | ±4 | 79,769 | | Proprietary |
| 122 | 107◄─►139 | deepseek-v2.5-1210 | 1322 | ±8 | 6,877 | DeepSeek | DeepSeek |
| 123 | 110◄─►139 | gpt-4.1-nano-2025-04-14 | 1320 | ±8 | 6,143 | OpenAI | Proprietary |
| 124 | 111◄─►139 | ring-flash-2.0 | 1319 | ±7 | 7,286 | Ant Group | MIT |
| 125 | 111◄─►139 | step-1o-turbo-202506 | 1319 | ±7 | 9,671 | StepFun | Proprietary |
| 126 | 114◄─►138 | llama-3.3-70b-instruct | 1319 | ±3 | 56,021 | Meta | Llama-3.3 |
| 127 | 114◄─►139 | glm-4-plus | 1318 | ±5 | 26,342 | Zhipu AI | Proprietary |
| 128 | 112◄─►139 | gemma-3n-e4b-it | 1318 | ±5 | 23,483 | | Gemma |
| 129 | 111◄─►140 | gpt-oss-20b | 1318 | ±6 | 10,858 | OpenAI | Apache 2.0 |
| 130 | 114◄─►140 | qwen-max-0919 | 1317 | ±6 | 16,598 | Alibaba | Qwen |
| 131 | 116◄─►139 | gpt-4o-mini-2024-07-18 | 1316 | ±3 | 69,291 | OpenAI | Proprietary |
| 132 | 119◄─►144 | gpt-4-1106-preview | 1314 | ±4 | 101,117 | OpenAI | Proprietary |
| 133 | 119◄─►144 | gpt-4-0125-preview | 1314 | ±4 | 94,534 | OpenAI | Proprietary |
| 134 | 116◄─►145 | qwen2.5-plus-1127 | 1314 | ±6 | 10,252 | Alibaba | Proprietary |
| 135 | 120◄─►144 | mistral-large-2407 | 1313 | ±4 | 45,968 | Mistral | Mistral Research |
| 136 | 120◄─►145 | athene-v2-chat | 1313 | ±4 | 24,880 | NexusFlow | NexusFlow |
| 137 | 111◄─►149 | mercury | 1312 | ±14 | 1,974 | Inception AI | Proprietary |
| 138 | 122◄─►146 | gemini-1.5-flash-002 | 1310 | ±4 | 35,180 | | Proprietary |
| 139 | 117◄─►149 | hunyuan-standard-2025-02-10 | 1310 | ±10 | 3,920 | Tencent | Proprietary |
| 140 | 132◄─►149 | grok-2-mini-2024-08-13 | 1307 | ±4 | 52,789 | xAI | Proprietary |
| 141 | 132◄─►149 | deepseek-v2.5 | 1306 | ±5 | 24,839 | DeepSeek | DeepSeek |
| 142 | 132◄─►149 | athene-70b-0725 | 1305 | ±6 | 19,796 | NexusFlow | CC-BY-NC-4.0 |
| 143 | 132◄─►149 | magistral-medium-2506 | 1305 | ±6 | 12,001 | Mistral | Proprietary |
| 144 | 135◄─►149 | mistral-large-2411 | 1304 | ±4 | 28,455 | Mistral | MRL |
| 145 | 137◄─►149 | mistral-small-3.1-24b-instruct-2503 | 1303 | ±4 | 34,153 | Mistral | Apache 2.0 |
| 146 | 130◄─►154 | gemma-3-4b-it | 1302 | ±9 | 4,195 | | Gemma |
| 147 | 138◄─►149 | qwen2.5-72b-instruct | 1302 | ±4 | 39,632 | Alibaba | Qwen |
| 148 | 138◄─►157 | llama-3.1-nemotron-70b-instruct | 1297 | ±8 | 7,216 | Nvidia | Llama 3.1 |
| 149 | 138◄─►159 | hunyuan-large-vision | 1295 | ±9 | 5,600 | Tencent | Proprietary |
| 150 | 147◄─►157 | llama-3.1-70b-instruct | 1293 | ±4 | 56,003 | Meta | Llama 3.1 Community |
| 151 | 147◄─►162 | jamba-1.5-large | 1288 | ±7 | 8,730 | AI21 Labs | Jamba Open |
| 152 | 148◄─►161 | amazon-nova-pro-v1.0 | 1288 | ±4 | 25,218 | Amazon | Proprietary |
| 153 | 148◄─►162 | gpt-4-0314 | 1287 | ±5 | 54,754 | OpenAI | Proprietary |
| 154 | 147◄─►163 | reka-core-20240904 | 1287 | ±7 | 7,380 | Reka AI | Proprietary |
| 155 | 148◄─►160 | gemma-2-27b-it | 1287 | ±3 | 76,195 | | Gemma license |
| 156 | 147◄─►168 | llama-3.1-nemotron-51b-instruct | 1286 | ±10 | 3,777 | Nvidia | Llama 3.1 |
| 157 | 147◄─►168 | llama-3.1-tulu-3-70b | 1286 | ±10 | 2,881 | Ai2 | Llama 3.1 |
| 158 | 150◄─►163 | gemini-1.5-flash-001 | 1285 | ±4 | 63,418 | | Proprietary |
| 159 | 150◄─►167 | claude-3-sonnet-20240229 | 1282 | ±4 | 110,173 | Anthropic | Proprietary |
| 160 | 150◄─►168 | gemma-2-9b-it-simpo | 1279 | ±7 | 10,108 | Princeton | MIT |
| 161 | 153◄─►168 | nemotron-4-340b-instruct | 1278 | ±5 | 19,913 | Nvidia | NVIDIA Open Model |
| 162 | 152◄─►169 | command-r-plus-08-2024 | 1277 | ±6 | 9,931 | Cohere | CC-BY-NC-4.0 |
| 163 | 157◄─►168 | llama-3-70b-instruct | 1276 | ±3 | 158,908 | Meta | Llama 3 Community |
| 164 | 157◄─►168 | gpt-4-0613 | 1276 | ±4 | 89,612 | OpenAI | Proprietary |
| 165 | 155◄─►173 | glm-4-0520 | 1274 | ±7 | 9,857 | Zhipu AI | Proprietary |
| 166 | 157◄─►172 | mistral-small-24b-instruct-2501 | 1273 | ±6 | 14,830 | Mistral | Apache 2.0 |
| 167 | 157◄─►173 | reka-flash-20240904 | 1273 | ±7 | 7,583 | Reka AI | Proprietary |
| 168 | 158◄─►177 | qwen2.5-coder-32b-instruct | 1269 | ±8 | 5,452 | Alibaba | Apache 2.0 |
| 169 | 164◄─►177 | c4ai-aya-expanse-32b | 1267 | ±5 | 27,362 | Cohere | CC-BY-NC-4.0 |
| 170 | 165◄─►177 | gemma-2-9b-it | 1264 | ±4 | 54,954 | | Gemma license |
| 171 | 165◄─►179 | deepseek-coder-v2 | 1264 | ±6 | 15,242 | DeepSeek AI | DeepSeek License |
| 172 | 165◄─►178 | command-r-plus | 1264 | ±4 | 78,401 | Cohere | CC-BY-NC-4.0 |
| 173 | 166◄─►179 | qwen2-72b-instruct | 1262 | ±5 | 37,688 | Alibaba | Qianwen LICENSE |
| 174 | 168◄─►178 | claude-3-haiku-20240307 | 1262 | ±4 | 118,626 | Anthropic | Proprietary |
| 175 | 168◄─►179 | amazon-nova-lite-v1.0 | 1259 | ±5 | 19,760 | Amazon | Proprietary |
| 176 | 168◄─►179 | gemini-1.5-flash-8b-001 | 1259 | ±4 | 35,914 | | Proprietary |
| 177 | 171◄─►179 | phi-4 | 1255 | ±4 | 24,354 | Microsoft | MIT |
| 178 | 168◄─►186 | olmo-2-0325-32b-instruct | 1252 | ±11 | 3,377 | Allen AI | Apache-2.0 |
| 179 | 172◄─►183 | command-r-08-2024 | 1252 | ±7 | 10,229 | Cohere | CC-BY-NC-4.0 |
| 180 | 178◄─►188 | mistral-large-2402 | 1243 | ±5 | 63,404 | Mistral | Proprietary |
| 181 | 178◄─►188 | amazon-nova-micro-v1.0 | 1241 | ±5 | 19,774 | Amazon | Proprietary |
| 182 | 178◄─►193 | jamba-1.5-mini | 1239 | ±7 | 8,918 | AI21 Labs | Jamba Open |
| 183 | 178◄─►196 | ministral-8b-2410 | 1237 | ±9 | 4,833 | Mistral | MRL |
| 184 | 179◄─►196 | gemini-pro-dev-api | 1235 | ±7 | 18,454 | | Proprietary |
| 185 | 180◄─►195 | qwen1.5-110b-chat | 1235 | ±5 | 26,679 | Alibaba | Qianwen LICENSE |
| 186 | 180◄─►196 | qwen1.5-72b-chat | 1235 | ±5 | 39,689 | Alibaba | Qianwen LICENSE |
| 187 | 179◄─►197 | reka-flash-21b-20240226-online | 1234 | ±7 | 15,606 | Reka AI | Proprietary |
| 188 | 179◄─►198 | hunyuan-standard-256k | 1233 | ±12 | 2,761 | Tencent | Proprietary |
| 189 | 182◄─►197 | mixtral-8x22b-instruct-v0.1 | 1230 | ±4 | 52,214 | Mistral | Apache 2.0 |
| 190 | 182◄─►198 | command-r | 1229 | ±5 | 54,710 | Cohere | CC-BY-NC-4.0 |
| 191 | 182◄─►198 | reka-flash-21b-20240226 | 1228 | ±6 | 25,026 | Reka AI | Proprietary |
| 192 | 184◄─►199 | gpt-3.5-turbo-0125 | 1225 | ±5 | 67,214 | OpenAI | Proprietary |
| 193 | 183◄─►199 | mistral-medium | 1225 | ±5 | 34,893 | Mistral | Proprietary |
| 194 | 187◄─►199 | llama-3-8b-instruct | 1224 | ±4 | 106,055 | Meta | Llama 3 Community |
| 195 | 183◄─►200 | c4ai-aya-expanse-8b | 1223 | ±7 | 9,922 | Cohere | CC-BY-NC-4.0 |
| 196 | 182◄─►203 | gemini-pro | 1223 | ±12 | 6,418 | | Proprietary |
| 197 | 182◄─►203 | llama-3.1-tulu-3-8b | 1221 | ±11 | 2,943 | Ai2 | Llama 3.1 |
| 198 | 189◄─►204 | zephyr-orpo-141b-A35b-v0.1 | 1215 | ±11 | 4,712 | HuggingFace | Apache 2.0 |
| 199 | 195◄─►203 | yi-1.5-34b-chat | 1214 | ±5 | 24,417 | 01 AI | Apache-2.0 |
| 200 | 196◄─►203 | llama-3.1-8b-instruct | 1211 | ±4 | 50,234 | Meta | Llama 3.1 Community |
| 201 | 192◄─►209 | granite-3.1-8b-instruct | 1210 | ±11 | 3,142 | IBM | Apache 2.0 |
| 202 | 196◄─►208 | qwen1.5-32b-chat | 1206 | ±6 | 22,068 | Alibaba | Qianwen LICENSE |
| 203 | 196◄─►211 | gpt-3.5-turbo-1106 | 1203 | ±9 | 16,760 | OpenAI | Proprietary |
| 204 | 200◄─►211 | phi-3-medium-4k-instruct | 1199 | ±5 | 25,301 | Microsoft | MIT |
| 205 | 201◄─►211 | mixtral-8x7b-instruct-v0.1 | 1199 | ±4 | 74,303 | Mistral | Apache 2.0 |
| 206 | 201◄─►211 | gemma-2-2b-it | 1198 | ±4 | 46,901 | | Gemma license |
| 207 | 201◄─►216 | dbrx-instruct-preview | 1197 | ±6 | 32,760 | Databricks | DBRX LICENSE |
| 208 | 201◄─►219 | qwen1.5-14b-chat | 1194 | ±7 | 18,066 | Alibaba | Qianwen LICENSE |
| 209 | 202◄─►220 | internlm2_5-20b-chat | 1193 | ±7 | 10,038 | InternLM | Other |
| 210 | 203◄─►226 | wizardlm-70b | 1186 | ±9 | 8,270 | Microsoft | Llama 2 Community |
| 211 | 203◄─►227 | deepseek-llm-67b-chat | 1185 | ±11 | 4,950 | DeepSeek AI | DeepSeek License |
| 212 | 207◄─►224 | yi-34b-chat | 1185 | ±7 | 15,624 | 01 AI | Yi License |
| 213 | 207◄─►226 | openchat-3.5-0106 | 1184 | ±8 | 12,712 | OpenChat | Apache-2.0 |
| 214 | 207◄─►226 | granite-3.0-8b-instruct | 1184 | ±9 | 6,727 | IBM | Apache 2.0 |
| 215 | 207◄─►227 | openchat-3.5 | 1183 | ±10 | 8,009 | OpenChat | Apache-2.0 |
| 216 | 208◄─►226 | snowflake-arctic-instruct | 1181 | ±6 | 33,272 | Snowflake | Apache 2.0 |
| 217 | 207◄─►229 | granite-3.1-2b-instruct | 1181 | ±11 | 3,235 | IBM | Apache 2.0 |
| 218 | 209◄─►227 | gemma-1.1-7b-it | 1180 | ±6 | 24,327 | | Gemma license |
| 219 | 208◄─►229 | tulu-2-dpo-70b | 1180 | ±10 | 6,579 | AllenAI/UW | AI2 ImpACT Low-risk |
| 220 | 208◄─►231 | openhermes-2.5-mistral-7b | 1177 | ±10 | 5,026 | NousResearch | Apache-2.0 |
| 221 | 210◄─►230 | vicuna-33b | 1175 | ±6 | 22,613 | LMSYS | Non-commercial |
| 222 | 210◄─►231 | starling-lm-7b-beta | 1174 | ±7 | 16,190 | Nexusflow | Apache-2.0 |
| 223 | 210◄─►231 | phi-3-small-8k-instruct | 1173 | ±6 | 17,983 | Microsoft | MIT |
| 224 | 211◄─►231 | llama-2-70b-chat | 1173 | ±5 | 38,767 | Meta | Llama 2 Community |
| 225 | 211◄─►234 | starling-lm-7b-alpha | 1169 | ±8 | 10,267 | UC Berkeley | CC-BY-NC-4.0 |
| 226 | 215◄─►235 | llama-3.2-3b-instruct | 1167 | ±8 | 8,043 | Meta | Llama 3.2 |
| 227 | 210◄─►236 | nous-hermes-2-mixtral-8x7b-dpo | 1167 | ±12 | 3,792 | NousResearch | Apache-2.0 |
| 228 | 218◄─►241 | qwq-32b-preview | 1159 | ±11 | 3,256 | Alibaba | Apache 2.0 |
| 229 | 218◄─►244 | llama2-70b-steerlm-chat | 1158 | ±13 | 3,605 | Nvidia | Llama 2 Community |
| 230 | 225◄─►241 | granite-3.0-2b-instruct | 1157 | ±8 | 6,922 | IBM | Apache 2.0 |
| 231 | 221◄─►246 | solar-10.7b-instruct-v1.0 | 1155 | ±13 | 4,187 | Upstage AI | CC-BY-NC-4.0 |
| 232 | 220◄─►249 | dolphin-2.2.1-mistral-7b | 1153 | ±15 | 1,685 | Cognitive Computations | Apache-2.0 |
| 233 | 225◄─►247 | mpt-30b-chat | 1153 | ±12 | 2,606 | MosaicML | CC-BY-NC-SA-4.0 |
| 234 | 226◄─►245 | wizardlm-13b | 1152 | ±9 | 7,122 | Microsoft | Llama 2 Community |
| 235 | 227◄─►244 | mistral-7b-instruct-v0.2 | 1152 | ±7 | 19,603 | Mistral | Apache-2.0 |
| 236 | 225◄─►252 | falcon-180b-chat | 1149 | ±17 | 1,312 | TII | Falcon-180B TII License |
| 237 | 228◄─►250 | qwen1.5-7b-chat | 1145 | ±10 | 4,782 | Alibaba | Qianwen LICENSE |
| 238 | 228◄─►249 | llama-2-13b-chat | 1144 | ±7 | 19,357 | Meta | Llama 2 Community |
| 239 | 228◄─►249 | phi-3-mini-4k-instruct-june-2024 | 1144 | ±6 | 12,415 | Microsoft | MIT |
| 240 | 228◄─►249 | vicuna-13b | 1143 | ±7 | 19,539 | LMSYS | Llama 2 Community |
| 241 | 228◄─►252 | qwen-14b-chat | 1140 | ±11 | 5,004 | Alibaba | Qianwen LICENSE |
| 242 | 230◄─►252 | palm-2 | 1138 | ±9 | 8,634 | | Proprietary |
| 243 | 230◄─►252 | codellama-34b-instruct | 1138 | ±9 | 7,417 | Meta | Llama 2 Community |
| 244 | 232◄─►252 | gemma-7b-it | 1136 | ±9 | 9,034 | | Gemma license |
| 245 | 234◄─►253 | zephyr-7b-beta | 1133 | ±9 | 11,220 | HuggingFace | MIT |
| 246 | 235◄─►253 | phi-3-mini-128k-instruct | 1132 | ±7 | 21,024 | Microsoft | MIT |
| 247 | 233◄─►256 | guanaco-33b | 1130 | ±12 | 2,955 | UW | Non-commercial |
| 248 | 239◄─►253 | phi-3-mini-4k-instruct | 1130 | ±6 | 20,539 | Microsoft | MIT |
| 249 | 230◄─►257 | zephyr-7b-alpha | 1130 | ±16 | 1,803 | HuggingFace | MIT |
| 250 | 240◄─►257 | stripedhyena-nous-7b | 1122 | ±11 | 5,214 | Together AI | Apache 2.0 |
| 251 | 235◄─►258 | codellama-70b-instruct | 1120 | ±18 | 1,151 | Meta | Llama 2 Community |
| 252 | 240◄─►257 | smollm2-1.7b-instruct | 1119 | ±14 | 2,244 | HuggingFace | Apache 2.0 |
| 253 | 245◄─►257 | vicuna-7b | 1117 | ±9 | 6,972 | LMSYS | Llama 2 Community |
| 254 | 248◄─►257 | gemma-1.1-2b-it | 1114 | ±8 | 11,035 | | Gemma license |
| 255 | 248◄─►257 | llama-3.2-1b-instruct | 1113 | ±8 | 8,166 | Meta | Llama 3.2 |
| 256 | 248◄─►258 | mistral-7b-instruct | 1112 | ±9 | 9,042 | Mistral | Apache 2.0 |
| 257 | 249◄─►258 | llama-2-7b-chat | 1110 | ±7 | 14,272 | Meta | Llama 2 Community |
| 258 | 258◄─►260 | qwen1.5-4b-chat | 1092 | ±9 | 7,662 | Alibaba | Qianwen LICENSE |
| 259 | 255◄─►262 | gemma-2b-it | 1092 | ±12 | 4,817 | | Gemma license |
| 260 | 258◄─►265 | olmo-7b-instruct | 1076 | ±11 | 6,412 | Allen AI | Apache-2.0 |
| 261 | 259◄─►265 | koala-13b | 1073 | ±10 | 6,998 | UC Berkeley | Non-commercial |
| 262 | 260◄─►265 | alpaca-13b | 1068 | ±11 | 5,828 | Stanford | Non-commercial |
| 263 | 259◄─►266 | gpt4all-13b-snoozy | 1068 | ±15 | 1,773 | Nomic AI | Non-commercial |
| 264 | 260◄─►266 | mpt-7b-chat | 1064 | ±12 | 3,977 | MosaicML | CC-BY-NC-SA-4.0 |
| 265 | 260◄─►266 | chatglm3-6b | 1058 | ±12 | 4,692 | Tsinghua | Apache-2.0 |
| 266 | 263◄─►268 | RWKV-4-Raven-14B | 1044 | ±11 | 4,898 | RWKV | Apache 2.0 |
| 267 | 266◄─►268 | chatglm2-6b | 1027 | ±14 | 2,683 | Tsinghua | Apache-2.0 |
| 268 | 266◄─►268 | oasst-pythia-12b | 1024 | ±11 | 6,343 | OpenAssistant | Apache 2.0 |
| 269 | 269◄─►272 | chatglm-6b | 998 | ±13 | 4,968 | Tsinghua | Non-commercial |
| 270 | 269◄─►272 | fastchat-t5-3b | 994 | ±12 | 4,270 | LMSYS | Apache 2.0 |
| 271 | 269◄─►273 | dolly-v2-12b | 981 | ±14 | 3,471 | Databricks | MIT |
| 272 | 269◄─►273 | llama-13b | 973 | ±16 | 2,441 | Meta | Non-commercial |
| 273 | 271◄─►273 | stablelm-tuned-alpha-7b | 955 | ±13 | 3,325 | Stability AI | CC-BY-NC-SA-4.0 |
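Notes
Rank (UB): The rank computed from the Bradley-Terry model. It reflects a model's overall performance in the arena and provides an upper-bound estimate of its Elo score, which helps gauge the model's potential competitiveness.
Model: The name of the large language model (LLM). Some model names may carry embedded links.
Score: The Elo rating the model earned from user votes in the arena. Elo is a relative rating system; a higher score indicates better performance.
95% CI (±): The 95% confidence interval of the model's Elo rating (for example, ±6). The narrower the interval, the more stable and reliable the rating.
Votes: The total number of votes the model has received in the arena. More votes generally mean higher statistical reliability of the rating.
Organization: The organization or company that provides the model.
License: The model's license type, for example Proprietary, Apache 2.0, or MIT.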
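As a rough illustration of how the score and 95% confidence interval columns can be read together, the sketch below uses the conventional Elo logistic curve (base 10, scale 400) to turn a score gap into an expected head-to-head preference rate, and checks whether two confidence intervals overlap. The function names and the 400-point scale are assumptions made for illustration; the leaderboard's own Bradley-Terry fitting procedure is not described in this report and may differ in detail.

```python
def expected_win_probability(score_a: float, score_b: float, scale: float = 400.0) -> float:
    """Expected rate at which A is preferred over B under the standard Elo curve.

    Assumption: base-10 logistic with a 400-point scale (the classic Elo form).
    """
    return 1.0 / (1.0 + 10.0 ** ((score_b - score_a) / scale))


def intervals_overlap(score_a: float, ci_a: float, score_b: float, ci_b: float) -> bool:
    """True when [score - ci, score + ci] of the two models intersect."""
    return (score_a - ci_a) <= (score_b + ci_b) and (score_b - ci_b) <= (score_a + ci_a)


if __name__ == "__main__":
    # Values copied from the leaderboard: gemini-3-pro (1495 ±9),
    # grok-4.1-thinking (1481 ±9), gpt-5.1-high (1454 ±9).
    p = expected_win_probability(1495, 1454)
    print(f"gemini-3-pro preferred over gpt-5.1-high: {p:.3f}")
    print("overlaps grok-4.1-thinking:", intervals_overlap(1495, 9, 1481, 9))
    print("overlaps gpt-5.1-high:", intervals_overlap(1495, 9, 1454, 9))
```

When two intervals overlap, the current votes do not clearly separate the two models, so their relative order should be read with caution.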
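Data Source and Update Frequency
The leaderboard data is fetched by an automated script directly from the official Chatbot Arena (lmarena.ai) website. The leaderboard is updated automatically every day by GitHub Actions.
Disclaimer
This report is for reference only. Leaderboard data changes over time and is based on users' preference votes on Chatbot Arena during a specific period. The completeness and accuracy of the data depend on the upstream data source. Different models may be released under different licenses; always consult the model provider's official documentation before use.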