Model Leaderboard
This leaderboard is built from Chatbot Arena (lmarena.ai) data and is generated by an automated pipeline.
Data last updated: 2025-12-22 08:09:23 UTC / 2025-12-22 16:09:23 CST (Beijing time)
Leaderboard
Rank (UB) | Rank spread | Model | Score | 95% CI | Votes | Organization | License
1 | 1◄─►2 | gemini-3-pro | 1490 | ±6 | 19,199 | - | Proprietary
2 | 2◄─►6 | grok-4.1-thinking | 1477 | ±6 | 20,074 | xAI | Proprietary
3 | 1◄─►7 | gemini-3-flash | 1475 (Preliminary) | ±10 | 4,431 | - | Proprietary
4 | 2◄─►8 | claude-opus-4-5-20251101-thinking-32k | 1469 | ±6 | 12,829 | Anthropic | Proprietary
5 | 2◄─►8 | claude-opus-4-5-20251101 | 1465 | ±6 | 13,654 | Anthropic | Proprietary
6 | 3◄─►8 | grok-4.1 | 1465 | ±6 | 20,553 | xAI | Proprietary
7 | 2◄─►11 | gemini-3-flash (thinking-minimal) | 1463 (Preliminary) | ±9 | 5,605 | - | Proprietary
8 | 4◄─►11 | gpt-5.1-high | 1457 | ±6 | 17,084 | OpenAI | Proprietary
9 | 7◄─►15 | gemini-2.5-pro | 1451 | ±3 | 79,900 | - | Proprietary
10 | 7◄─►15 | claude-sonnet-4-5-20250929-thinking-32k | 1450 | ±4 | 31,225 | Anthropic | Proprietary
11 | 9◄─►15 | claude-opus-4-1-20250805-thinking-16k | 1448 | ±4 | 46,958 | Anthropic | Proprietary
12 | 9◄─►18 | claude-sonnet-4-5-20250929 | 1446 | ±5 | 26,513 | Anthropic | Proprietary
13 | 9◄─►22 | gpt-4.5-preview-2025-02-27 | 1442 | ±6 | 14,644 | OpenAI | Proprietary
14 | 7◄─►27 | gpt-5.2 | 1442 (Preliminary) | ±12 | 2,422 | OpenAI | Proprietary
15 | 12◄─►22 | claude-opus-4-1-20250805 | 1440 | ±4 | 60,044 | Anthropic | Proprietary
16 | 12◄─►22 | chatgpt-4o-latest-20250326 | 1440 | ±3 | 66,721 | OpenAI | Proprietary
17 | 9◄─►24 | gpt-5.2-high | 1440 (Preliminary) | ±9 | 6,192 | OpenAI | Proprietary
18 | 12◄─►24 | gpt-5.1 | 1437 | ±6 | 18,423 | OpenAI | Proprietary
19 | 13◄─►24 | gpt-5-high | 1436 | ±5 | 32,779 | OpenAI | Proprietary
20 | 13◄─►25 | o3-2025-04-16 | 1434 | ±4 | 61,385 | OpenAI | Proprietary
21 | 13◄─►29 | qwen3-max-preview | 1433 | ±5 | 28,039 | Alibaba | Proprietary
22 | 13◄─►33 | grok-4-1-fast-reasoning | 1431 | ±6 | 12,526 | xAI | Proprietary
23 | 16◄─►38 | kimi-k2-thinking-turbo | 1428 | ±5 | 18,724 | Moonshot | Modified MIT
24 | 16◄─►43 | ernie-5.0-preview-1103 | 1427 (Preliminary) | ±8 | 6,699 | Baidu | Proprietary
25 | 20◄─►43 | glm-4.6 | 1425 | ±5 | 27,300 | Z.ai | MIT
26 | 21◄─►43 | gpt-5-chat | 1425 | ±4 | 32,045 | OpenAI | Proprietary
27 | 19◄─►43 | qwen3-max-2025-09-23 | 1424 | ±6 | 9,227 | Alibaba | Proprietary
28 | 20◄─►43 | deepseek-v3.2-exp | 1423 | ±7 | 11,929 | DeepSeek AI | MIT
29 | 22◄─►43 | claude-opus-4-20250514-thinking-16k | 1423 | ±4 | 37,714 | Anthropic | Proprietary
30 | 21◄─►46 | deepseek-v3.2-thinking | 1422 | ±7 | 8,916 | DeepSeek AI | MIT
31 | 23◄─►43 | qwen3-235b-a22b-instruct-2507 | 1422 | ±4 | 54,369 | Alibaba | Apache 2.0
32 | 22◄─►46 | deepseek-v3.2-exp-thinking | 1421 | ±7 | 9,198 | DeepSeek AI | MIT
33 | 22◄─►49 | grok-4-fast-chat | 1420 | ±8 | 7,040 | xAI | Proprietary
34 | 22◄─►49 | mistral-large-3 | 1419 (Preliminary) | ±7 | 9,518 | Mistral | Apache 2.0
35 | 23◄─►49 | kimi-k2-0905-preview | 1418 | ±7 | 11,845 | Moonshot | Modified MIT
36 | 23◄─►49 | deepseek-r1-0528 | 1418 | ±6 | 19,162 | DeepSeek | MIT
37 | 24◄─►50 | deepseek-v3.1 | 1416 | ±6 | 15,213 | DeepSeek | MIT
38 | 24◄─►49 | kimi-k2-0711-preview | 1416 | ±5 | 28,532 | Moonshot | Modified MIT
39 | 24◄─►51 | deepseek-v3.1-thinking | 1416 | ±7 | 11,947 | DeepSeek | MIT
40 | 24◄─►51 | deepseek-v3.2 | 1415 | ±7 | 10,253 | DeepSeek AI | MIT
41 | 23◄─►54 | deepseek-v3.1-terminus | 1415 | ±10 | 3,737 | DeepSeek AI | MIT
42 | 24◄─►51 | qwen3-vl-235b-a22b-instruct | 1414 | ±7 | 9,011 | Alibaba | Apache 2.0
43 | 23◄─►57 | deepseek-v3.1-terminus-thinking | 1414 | ±10 | 3,511 | DeepSeek AI | MIT
44 | 31◄─►51 | gpt-4.1-2025-04-14 | 1412 | ±4 | 52,398 | OpenAI | Proprietary
45 | 31◄─►51 | claude-opus-4-20250514 | 1412 | ±4 | 45,477 | Anthropic | Proprietary
46 | 31◄─►51 | mistral-medium-2508 | 1411 | ±4 | 48,471 | Mistral | Proprietary
47 | 33◄─►54 | grok-3-preview-02-24 | 1410 | ±4 | 34,039 | xAI | Proprietary
48 | 33◄─►56 | grok-4-0709 | 1409 | ±4 | 42,410 | xAI | Proprietary
49 | 33◄─►57 | glm-4.5 | 1408 | ±5 | 24,736 | Z.ai | MIT
50 | 38◄─►56 | gemini-2.5-flash | 1408 | ±3 | 79,419 | - | Proprietary
51 | 39◄─►60 | gemini-2.5-flash-preview-09-2025 | 1405 | ±4 | 32,599 | - | Proprietary
52 | 45◄─►62 | claude-haiku-4-5-20251001 | 1402 | ±4 | 29,594 | Anthropic | Proprietary
53 | 45◄─►63 | grok-4-fast-reasoning | 1402 | ±5 | 18,821 | xAI | Proprietary
54 | 47◄─►63 | o1-2024-12-17 | 1400 | ±4 | 28,039 | OpenAI | Proprietary
55 | 47◄─►65 | qwen3-next-80b-a3b-instruct | 1400 | ±5 | 23,035 | Alibaba | Apache 2.0
56 | 45◄─►66 | longcat-flash-chat | 1400 | ±6 | 11,470 | Meituan | MIT
57 | 49◄─►65 | qwen3-235b-a22b-no-thinking | 1399 | ±4 | 39,246 | Alibaba | Apache 2.0
58 | 51◄─►66 | claude-sonnet-4-20250514-thinking-32k | 1399 | ±4 | 36,070 | Anthropic | Proprietary
59 | 51◄─►71 | qwen3-235b-a22b-thinking-2507 | 1397 | ±6 | 9,311 | Alibaba | Apache 2.0
60 | 52◄─►71 | deepseek-r1 | 1396 | ±5 | 18,718 | DeepSeek | MIT
61 | 52◄─►72 | qwen3-vl-235b-a22b-thinking | 1394 | ±7 | 7,965 | Alibaba | Apache 2.0
62 | 53◄─►72 | gpt-5-mini-high | 1392 | ±5 | 27,344 | OpenAI | Proprietary
63 | 55◄─►71 | deepseek-v3-0324 | 1392 | ±4 | 46,624 | DeepSeek | MIT
64 | 51◄─►79 | hunyuan-vision-1.5-thinking | 1391 | ±12 | 2,210 | Tencent | Proprietary
65 | 57◄─►72 | o4-mini-2025-04-16 | 1391 | ±4 | 46,703 | OpenAI | Proprietary
66 | 55◄─►74 | mai-1-preview | 1390 | ±5 | 18,116 | Microsoft AI | Proprietary
67 | 59◄─►75 | claude-sonnet-4-20250514 | 1389 | ±4 | 41,493 | Anthropic | Proprietary
68 | 59◄─►75 | o1-preview | 1387 | ±5 | 31,505 | OpenAI | Proprietary
69 | 59◄─►75 | claude-3-7-sonnet-20250219-thinking-32k | 1387 | ±4 | 39,817 | Anthropic | Proprietary
70 | 59◄─►77 | qwen3-coder-480b-a35b-instruct | 1386 | ±5 | 23,591 | Alibaba | Apache 2.0
71 | 59◄─►80 | hunyuan-t1-20250711 | 1385 | ±9 | 4,803 | Tencent | Proprietary
72 | 62◄─►79 | mistral-medium-2505 | 1383 | ±5 | 34,401 | Mistral | Proprietary
73 | 65◄─►79 | qwen3-30b-a3b-instruct-2507 | 1382 | ±5 | 24,120 | Alibaba | Apache 2.0
74 | 66◄─►80 | gpt-4.1-mini-2025-04-14 | 1380 | ±4 | 40,370 | OpenAI | Proprietary
75 | 65◄─►83 | hunyuan-turbos-20250416 | 1380 | ±6 | 11,094 | Tencent | Proprietary
76 | 69◄─►83 | gemini-2.5-flash-lite-preview-09-2025-no-thinking | 1378 | ±4 | 32,630 | - | Proprietary
77 | 70◄─►84 | gemini-2.5-flash-lite-preview-06-17-thinking | 1375 | ±4 | 33,818 | - | Proprietary
78 | 70◄─►85 | qwen3-235b-a22b | 1374 | ±5 | 27,077 | Alibaba | Apache 2.0
79 | 73◄─►85 | qwen2.5-max | 1373 | ±4 | 33,489 | Alibaba | Proprietary
80 | 75◄─►85 | claude-3-5-sonnet-20241022 | 1372 | ±3 | 89,727 | Anthropic | Proprietary
81 | 75◄─►88 | claude-3-7-sonnet-20250219 | 1371 | ±4 | 44,473 | Anthropic | Proprietary
82 | 69◄─►94 | amazon-nova-experimental-chat-11-10 | 1370 (Preliminary) | ±12 | 2,743 | Amazon | Proprietary
83 | 75◄─►88 | glm-4.5-air | 1370 | ±4 | 31,551 | Z.ai | MIT
84 | 77◄─►91 | qwen3-next-80b-a3b-thinking | 1367 | ±6 | 13,779 | Alibaba | Apache 2.0
85 | 78◄─►90 | minimax-m1 | 1366 | ±4 | 36,732 | MiniMax | Apache 2.0
86 | 81◄─►91 | gemma-3-27b-it | 1364 | ±4 | 49,181 | - | Gemma
87 | 81◄─►96 | o3-mini-high | 1362 | ±5 | 18,735 | OpenAI | Proprietary
88 | 81◄─►96 | grok-3-mini-high | 1362 | ±5 | 17,505 | xAI | Proprietary
89 | 83◄─►98 | gemini-2.0-flash-001 | 1360 | ±4 | 45,015 | - | Proprietary
90 | 83◄─►107 | deepseek-v3 | 1357 | ±5 | 21,994 | DeepSeek | DeepSeek
91 | 84◄─►107 | grok-3-mini-beta | 1357 | ±5 | 23,701 | xAI | Proprietary
92 | 86◄─►112 | mistral-small-2506 | 1355 | ±5 | 18,254 | Mistral | Apache 2.0
93 | 89◄─►113 | gpt-oss-120b | 1352 | ±4 | 31,145 | OpenAI | Apache 2.0
94 | 89◄─►112 | gemini-2.0-flash-lite-preview-02-05 | 1352 | ±4 | 25,215 | - | Proprietary
95 | 86◄─►115 | glm-4.5v | 1352 | ±8 | 4,971 | Z.ai | MIT
96 | 90◄─►112 | command-a-03-2025 | 1352 | ±3 | 57,648 | Cohere | CC-BY-NC-4.0
97 | 90◄─►113 | gemini-1.5-pro-002 | 1351 | ±3 | 56,012 | - | Proprietary
98 | 86◄─►116 | intellect-3 | 1351 | ±9 | 4,753 | Prime Intellect | MIT
99 | 90◄─►116 | amazon-nova-experimental-chat-10-20 | 1349 | ±6 | 10,971 | Amazon | Proprietary
100 | 92◄─►115 | o3-mini | 1348 | ±3 | 58,693 | OpenAI | Proprietary
101 | 87◄─►126 | hunyuan-turbos-20250226 | 1346 | ±12 | 2,250 | Tencent | Proprietary
102 | 90◄─►119 | ling-flash-2.0 | 1346 | ±7 | 7,126 | Ant Group | MIT
103 | 90◄─►122 | minimax-m2 | 1346 | ±8 | 7,093 | MiniMax | Apache 2.0
104 | 90◄─►121 | step-3 | 1346 | ±7 | 6,620 | StepFun | Apache 2.0
105 | 87◄─►127 | llama-3.1-nemotron-ultra-253b-v1 | 1346 | ±12 | 2,573 | Nvidia | Nvidia Open Model
106 | 89◄─►126 | amazon-nova-experimental-chat-10-09 | 1345 | ±11 | 2,878 | Amazon | Proprietary
107 | 94◄─►116 | gpt-4o-2024-05-13 | 1345 | ±3 | 113,568 | OpenAI | Proprietary
108 | 90◄─►126 | qwen3-32b | 1345 | ±9 | 3,943 | Alibaba | Apache 2.0
109 | 90◄─►126 | qwen-plus-0125 | 1344 | ±8 | 5,861 | Alibaba | Proprietary
110 | 92◄─►126 | glm-4-plus-0111 | 1343 | ±8 | 5,806 | Zhipu | Proprietary
111 | 97◄─►119 | claude-3-5-sonnet-20240620 | 1342 | ±3 | 82,864 | Anthropic | Proprietary
112 | 92◄─►129 | gemma-3-12b-it | 1340 | ±9 | 3,866 | - | Gemma
113 | 92◄─►131 | nvidia-llama-3.3-nemotron-super-49b-v1.5 | 1340 | ±10 | 3,469 | Nvidia | Nvidia Open
114 | 92◄─►133 | hunyuan-turbo-0110 | 1339 | ±11 | 2,322 | Tencent | Proprietary
115 | 97◄─►128 | gpt-5-nano-high | 1338 | ±7 | 8,369 | OpenAI | Proprietary
116 | 99◄─►131 | nova-2-lite | 1336 | ±7 | 8,997 | Amazon | Proprietary
117 | 102◄─►128 | o1-mini | 1335 | ±4 | 52,301 | OpenAI | Proprietary
118 | 104◄─►128 | llama-3.1-405b-instruct-bf16 | 1335 | ±4 | 41,932 | Meta | Llama 3.1 Community
119 | 104◄─►131 | gpt-4o-2024-08-06 | 1334 | ±4 | 45,787 | OpenAI | Proprietary
120 | 106◄─►129 | grok-2-2024-08-13 | 1334 | ±4 | 63,725 | xAI | Proprietary
121 | 105◄─►131 | qwq-32b | 1334 | ±4 | 26,196 | Alibaba | Apache 2.0
122 | 102◄─►131 | gemini-advanced-0514 | 1334 | ±5 | 50,654 | - | Proprietary
123 | 106◄─►131 | llama-3.1-405b-instruct-fp8 | 1333 | ±3 | 60,272 | Meta | Llama 3.1 Community
124 | 102◄─►141 | step-2-16k-exp-202412 | 1332 | ±9 | 4,895 | StepFun | Proprietary
125 | 112◄─►142 | yi-lightning | 1328 | ±5 | 27,624 | 01 AI | Proprietary
126 | 115◄─►143 | llama-4-maverick-17b-128e-instruct | 1327 | ±4 | 41,093 | Meta | Llama 4
127 | 106◄─►153 | nvidia-nemotron-3-nano-30b-a3b-bf16 | 1326 (Preliminary) | ±10 | 4,247 | Nvidia | NVIDIA Open Model
128 | 117◄─►145 | qwen3-30b-a3b | 1326 | ±5 | 27,405 | Alibaba | Apache 2.0
129 | 106◄─►154 | llama-3.3-nemotron-49b-super-v1 | 1325 | ±12 | 2,243 | Nvidia | Nvidia
130 | 111◄─►153 | hunyuan-large-2025-02-10 | 1325 | ±10 | 3,760 | Tencent | Proprietary
131 | 123◄─►146 | gpt-4-turbo-2024-04-09 | 1324 | ±4 | 98,965 | OpenAI | Proprietary
132 | 124◄─►148 | claude-3-5-haiku-20241022 | 1322 | ±3 | 71,241 | Anthropic | Proprietary
133 | 124◄─►148 | llama-4-scout-17b-16e-instruct | 1322 | ±5 | 31,074 | Meta | Llama
134 | 117◄─►154 | deepseek-v2.5-1210 | 1322 | ±8 | 6,877 | DeepSeek | DeepSeek
135 | 124◄─►148 | gemini-1.5-pro-001 | 1322 | ±4 | 79,769 | - | Proprietary
136 | 124◄─►148 | claude-3-opus-20240229 | 1322 | ±3 | 196,368 | Anthropic | Proprietary
137 | 123◄─►154 | gpt-4.1-nano-2025-04-14 | 1321 | ±8 | 6,143 | OpenAI | Proprietary
138 | 124◄─►154 | ring-flash-2.0 | 1320 | ±7 | 7,250 | Ant Group | MIT
139 | 124◄─►154 | step-1o-turbo-202506 | 1319 | ±7 | 9,635 | StepFun | Proprietary
140 | 127◄─►153 | llama-3.3-70b-instruct | 1319 | ±3 | 55,961 | Meta | Llama-3.3
141 | 125◄─►154 | gemma-3n-e4b-it | 1318 | ±5 | 23,421 | - | Gemma
142 | 126◄─►154 | glm-4-plus | 1318 | ±5 | 26,342 | Zhipu AI | Proprietary
143 | 124◄─►155 | gpt-oss-20b | 1318 | ±6 | 10,828 | OpenAI | Apache 2.0
144 | 127◄─►155 | qwen-max-0919 | 1317 | ±6 | 16,598 | Alibaba | Qwen
145 | 129◄─►154 | gpt-4o-mini-2024-07-18 | 1316 | ±3 | 69,290 | OpenAI | Proprietary
146 | 128◄─►161 | qwen2.5-plus-1127 | 1314 | ±6 | 10,252 | Alibaba | Proprietary
147 | 133◄─►159 | gpt-4-1106-preview | 1313 | ±4 | 101,117 | OpenAI | Proprietary
148 | 133◄─►159 | mistral-large-2407 | 1313 | ±4 | 45,968 | Mistral | Mistral Research
149 | 133◄─►159 | gpt-4-0125-preview | 1313 | ±4 | 94,534 | OpenAI | Proprietary
150 | 133◄─►160 | athene-v2-chat | 1313 | ±4 | 24,880 | NexusFlow | NexusFlow
151 | 124◄─►164 | mercury | 1311 | ±14 | 1,961 | Inception AI | Proprietary
152 | 129◄─►164 | hunyuan-standard-2025-02-10 | 1310 | ±10 | 3,920 | Tencent | Proprietary
153 | 136◄─►161 | gemini-1.5-flash-002 | 1310 | ±4 | 35,180 | - | Proprietary
154 | 133◄─►164 | olmo-3-32b-think | 1308 | ±9 | 5,307 | Allen AI | Apache 2.0
155 | 146◄─►163 | grok-2-mini-2024-08-13 | 1307 | ±4 | 52,789 | xAI | Proprietary
156 | 146◄─►164 | deepseek-v2.5 | 1306 | ±5 | 24,839 | DeepSeek | DeepSeek
157 | 146◄─►164 | athene-70b-0725 | 1305 | ±6 | 19,796 | NexusFlow | CC-BY-NC-4.0
158 | 149◄─►164 | mistral-large-2411 | 1304 | ±4 | 28,455 | Mistral | MRL
159 | 146◄─►164 | magistral-medium-2506 | 1304 | ±6 | 11,957 | Mistral | Proprietary
160 | 150◄─►164 | mistral-small-3.1-24b-instruct-2503 | 1303 | ±4 | 34,030 | Mistral | Apache 2.0
161 | 144◄─►170 | gemma-3-4b-it | 1302 | ±9 | 4,195 | - | Gemma
162 | 152◄─►164 | qwen2.5-72b-instruct | 1302 | ±4 | 39,632 | Alibaba | Qwen
163 | 152◄─►173 | llama-3.1-nemotron-70b-instruct | 1297 | ±8 | 7,216 | Nvidia | Llama 3.1
164 | 153◄─►175 | hunyuan-large-vision | 1294 | ±9 | 5,575 | Tencent | Proprietary
165 | 162◄─►173 | llama-3.1-70b-instruct | 1293 | ±4 | 56,003 | Meta | Llama 3.1 Community
166 | 162◄─►178 | jamba-1.5-large | 1288 | ±7 | 8,730 | AI21 Labs | Jamba Open
167 | 163◄─►176 | amazon-nova-pro-v1.0 | 1288 | ±4 | 25,218 | Amazon | Proprietary
168 | 162◄─►183 | ibm-granite-h-small | 1287 | ±8 | 5,690 | IBM | Apache 2.0
169 | 163◄─►176 | gemma-2-27b-it | 1287 | ±3 | 76,195 | - | Gemma license
170 | 162◄─►178 | reka-core-20240904 | 1287 | ±7 | 7,380 | Reka AI | Proprietary
171 | 163◄─►178 | gpt-4-0314 | 1286 | ±5 | 54,754 | OpenAI | Proprietary
172 | 162◄─►184 | llama-3.1-nemotron-51b-instruct | 1286 | ±10 | 3,777 | Nvidia | Llama 3.1
173 | 162◄─►184 | llama-3.1-tulu-3-70b | 1286 | ±10 | 2,881 | Ai2 | Llama 3.1
174 | 165◄─►178 | gemini-1.5-flash-001 | 1284 | ±4 | 63,418 | - | Proprietary
175 | 166◄─►183 | claude-3-sonnet-20240229 | 1281 | ±4 | 110,173 | Anthropic | Proprietary
176 | 165◄─►184 | gemma-2-9b-it-simpo | 1278 | ±7 | 10,108 | Princeton | MIT
177 | 168◄─►184 | nemotron-4-340b-instruct | 1277 | ±5 | 19,913 | Nvidia | NVIDIA Open Model
178 | 168◄─►185 | command-r-plus-08-2024 | 1277 | ±7 | 9,931 | Cohere | CC-BY-NC-4.0
179 | 172◄─►184 | llama-3-70b-instruct | 1276 | ±3 | 158,908 | Meta | Llama 3 Community
180 | 172◄─►185 | gpt-4-0613 | 1275 | ±4 | 89,612 | OpenAI | Proprietary
181 | 172◄─►187 | mistral-small-24b-instruct-2501 | 1273 | ±6 | 14,830 | Mistral | Apache 2.0
182 | 172◄─►189 | glm-4-0520 | 1273 | ±7 | 9,857 | Zhipu AI | Proprietary
183 | 172◄─►189 | reka-flash-20240904 | 1272 | ±7 | 7,583 | Reka AI | Proprietary
184 | 174◄─►193 | qwen2.5-coder-32b-instruct | 1269 | ±8 | 5,452 | Alibaba | Apache 2.0
185 | 179◄─►193 | c4ai-aya-expanse-32b | 1266 | ±5 | 27,362 | Cohere | CC-BY-NC-4.0
186 | 181◄─►193 | gemma-2-9b-it | 1264 | ±4 | 54,954 | - | Gemma license
187 | 181◄─►195 | deepseek-coder-v2 | 1264 | ±6 | 15,242 | DeepSeek AI | DeepSeek License
188 | 182◄─►194 | command-r-plus | 1263 | ±4 | 78,401 | Cohere | CC-BY-NC-4.0
189 | 182◄─►195 | qwen2-72b-instruct | 1262 | ±5 | 37,688 | Alibaba | Qianwen LICENSE
190 | 184◄─►195 | claude-3-haiku-20240307 | 1261 | ±4 | 118,626 | Anthropic | Proprietary
191 | 184◄─►195 | amazon-nova-lite-v1.0 | 1259 | ±5 | 19,760 | Amazon | Proprietary
192 | 184◄─►195 | gemini-1.5-flash-8b-001 | 1259 | ±4 | 35,914 | - | Proprietary
193 | 187◄─►195 | phi-4 | 1255 | ±4 | 24,354 | Microsoft | MIT
194 | 184◄─►201 | olmo-2-0325-32b-instruct | 1252 | ±11 | 3,377 | Allen AI | Apache-2.0
195 | 188◄─►200 | command-r-08-2024 | 1251 | ±7 | 10,229 | Cohere | CC-BY-NC-4.0
196 | 194◄─►204 | mistral-large-2402 | 1242 | ±5 | 63,404 | Mistral | Proprietary
197 | 194◄─►204 | amazon-nova-micro-v1.0 | 1241 | ±5 | 19,774 | Amazon | Proprietary
198 | 194◄─►209 | jamba-1.5-mini | 1239 | ±7 | 8,918 | AI21 Labs | Jamba Open
199 | 194◄─►212 | ministral-8b-2410 | 1237 | ±9 | 4,833 | Mistral | MRL
200 | 195◄─►212 | gemini-pro-dev-api | 1234 | ±7 | 18,454 | - | Proprietary
201 | 196◄─►212 | qwen1.5-110b-chat | 1234 | ±5 | 26,679 | Alibaba | Qianwen LICENSE
202 | 196◄─►212 | qwen1.5-72b-chat | 1234 | ±5 | 39,689 | Alibaba | Qianwen LICENSE
203 | 196◄─►213 | reka-flash-21b-20240226-online | 1233 | ±7 | 15,606 | Reka AI | Proprietary
204 | 194◄─►214 | hunyuan-standard-256k | 1233 | ±12 | 2,761 | Tencent | Proprietary
205 | 198◄─►213 | mixtral-8x22b-instruct-v0.1 | 1230 | ±4 | 52,214 | Mistral | Apache 2.0
206 | 198◄─►214 | command-r | 1228 | ±5 | 54,710 | Cohere | CC-BY-NC-4.0
207 | 198◄─►214 | reka-flash-21b-20240226 | 1227 | ±6 | 25,026 | Reka AI | Proprietary
208 | 199◄─►215 | gpt-3.5-turbo-0125 | 1224 | ±5 | 67,214 | OpenAI | Proprietary
209 | 199◄─►216 | mistral-medium | 1223 | ±5 | 34,893 | Mistral | Proprietary
210 | 199◄─►216 | c4ai-aya-expanse-8b | 1223 | ±7 | 9,922 | Cohere | CC-BY-NC-4.0
211 | 203◄─►215 | llama-3-8b-instruct | 1223 | ±4 | 106,055 | Meta | Llama 3 Community
212 | 198◄─►219 | gemini-pro | 1222 | ±12 | 6,418 | - | Proprietary
213 | 198◄─►218 | llama-3.1-tulu-3-8b | 1221 | ±11 | 2,943 | Ai2 | Llama 3.1
214 | 205◄─►220 | zephyr-orpo-141b-A35b-v0.1 | 1214 | ±11 | 4,712 | HuggingFace | Apache 2.0
215 | 210◄─►219 | yi-1.5-34b-chat | 1213 | ±5 | 24,417 | 01 AI | Apache-2.0
216 | 212◄─►219 | llama-3.1-8b-instruct | 1211 | ±4 | 50,234 | Meta | Llama 3.1 Community
217 | 208◄─►225 | granite-3.1-8b-instruct | 1210 | ±11 | 3,142 | IBM | Apache 2.0
218 | 212◄─►224 | qwen1.5-32b-chat | 1205 | ±6 | 22,068 | Alibaba | Qianwen LICENSE
219 | 213◄─►227 | gpt-3.5-turbo-1106 | 1202 | ±9 | 16,760 | OpenAI | Proprietary
220 | 216◄─►227 | phi-3-medium-4k-instruct | 1198 | ±5 | 25,301 | Microsoft | MIT
221 | 217◄─►227 | gemma-2-2b-it | 1198 | ±4 | 46,901 | - | Gemma license
222 | 217◄─►227 | mixtral-8x7b-instruct-v0.1 | 1198 | ±4 | 74,303 | Mistral | Apache 2.0
223 | 217◄─►232 | dbrx-instruct-preview | 1196 | ±6 | 32,760 | Databricks | DBRX LICENSE
224 | 217◄─►236 | qwen1.5-14b-chat | 1193 | ±7 | 18,066 | Alibaba | Qianwen LICENSE
225 | 218◄─►236 | internlm2_5-20b-chat | 1192 | ±7 | 10,038 | InternLM | Other
226 | 219◄─►242 | wizardlm-70b | 1185 | ±9 | 8,270 | Microsoft | Llama 2 Community
227 | 219◄─►243 | deepseek-llm-67b-chat | 1184 | ±12 | 4,950 | DeepSeek AI | DeepSeek License
228 | 223◄─►240 | yi-34b-chat | 1184 | ±7 | 15,624 | 01 AI | Yi License
229 | 223◄─►242 | granite-3.0-8b-instruct | 1183 | ±9 | 6,727 | IBM | Apache 2.0
230 | 223◄─►242 | openchat-3.5-0106 | 1183 | ±8 | 12,712 | OpenChat | Apache-2.0
231 | 223◄─►243 | openchat-3.5 | 1182 | ±10 | 8,009 | OpenChat | Apache-2.0
232 | 223◄─►244 | granite-3.1-2b-instruct | 1181 | ±11 | 3,235 | IBM | Apache 2.0
233 | 224◄─►242 | snowflake-arctic-instruct | 1180 | ±6 | 33,272 | Snowflake | Apache 2.0
234 | 224◄─►243 | gemma-1.1-7b-it | 1180 | ±6 | 24,327 | - | Gemma license
235 | 224◄─►245 | tulu-2-dpo-70b | 1179 | ±10 | 6,579 | AllenAI/UW | AI2 ImpACT Low-risk
236 | 224◄─►247 | openhermes-2.5-mistral-7b | 1176 | ±10 | 5,026 | NousResearch | Apache-2.0
237 | 226◄─►246 | vicuna-33b | 1173 | ±6 | 22,613 | LMSYS | Non-commercial
238 | 226◄─►247 | starling-lm-7b-beta | 1173 | ±7 | 16,190 | Nexusflow | Apache-2.0
239 | 226◄─►247 | phi-3-small-8k-instruct | 1172 | ±6 | 17,983 | Microsoft | MIT
240 | 227◄─►247 | llama-2-70b-chat | 1171 | ±5 | 38,767 | Meta | Llama 2 Community
241 | 227◄─►250 | starling-lm-7b-alpha | 1168 | ±8 | 10,267 | UC Berkeley | CC-BY-NC-4.0
242 | 231◄─►251 | llama-3.2-3b-instruct | 1167 | ±8 | 8,043 | Meta | Llama 3.2
243 | 226◄─►252 | nous-hermes-2-mixtral-8x7b-dpo | 1166 | ±12 | 3,792 | NousResearch | Apache-2.0
244 | 234◄─►257 | qwq-32b-preview | 1159 | ±11 | 3,256 | Alibaba | Apache 2.0
245 | 235◄─►260 | llama2-70b-steerlm-chat | 1157 | ±13 | 3,605 | Nvidia | Llama 2 Community
246 | 241◄─►257 | granite-3.0-2b-instruct | 1157 | ±8 | 6,922 | IBM | Apache 2.0
247 | 237◄─►261 | solar-10.7b-instruct-v1.0 | 1154 | ±13 | 4,187 | Upstage AI | CC-BY-NC-4.0
248 | 236◄─►265 | dolphin-2.2.1-mistral-7b | 1152 | ±15 | 1,685 | Cognitive Computations | Apache-2.0
249 | 241◄─►263 | mpt-30b-chat | 1151 | ±12 | 2,606 | MosaicML | CC-BY-NC-SA-4.0
250 | 243◄─►261 | mistral-7b-instruct-v0.2 | 1151 | ±7 | 19,603 | Mistral | Apache-2.0
251 | 242◄─►261 | wizardlm-13b | 1150 | ±9 | 7,122 | Microsoft | Llama 2 Community
252 | 241◄─►268 | falcon-180b-chat | 1147 | ±17 | 1,312 | TII | Falcon-180B TII License
253 | 244◄─►266 | qwen1.5-7b-chat | 1144 | ±10 | 4,782 | Alibaba | Qianwen LICENSE
254 | 244◄─►265 | phi-3-mini-4k-instruct-june-2024 | 1143 | ±6 | 12,415 | Microsoft | MIT
255 | 244◄─►265 | llama-2-13b-chat | 1143 | ±7 | 19,357 | Meta | Llama 2 Community
256 | 244◄─►266 | vicuna-13b | 1142 | ±7 | 19,539 | LMSYS | Llama 2 Community
257 | 244◄─►268 | qwen-14b-chat | 1139 | ±11 | 5,004 | Alibaba | Qianwen LICENSE
258 | 246◄─►268 | palm-2 | 1137 | ±9 | 8,634 | - | Proprietary
259 | 246◄─►268 | codellama-34b-instruct | 1137 | ±9 | 7,417 | Meta | Llama 2 Community
260 | 246◄─►268 | gemma-7b-it | 1135 | ±9 | 9,034 | - | Gemma license
261 | 250◄─►269 | zephyr-7b-beta | 1132 | ±9 | 11,220 | HuggingFace | MIT
262 | 251◄─►269 | phi-3-mini-128k-instruct | 1131 | ±7 | 21,024 | Microsoft | MIT
263 | 254◄─►269 | phi-3-mini-4k-instruct | 1129 | ±6 | 20,539 | Microsoft | MIT
264 | 247◄─►273 | zephyr-7b-alpha | 1128 | ±16 | 1,803 | HuggingFace | MIT
265 | 250◄─►272 | guanaco-33b | 1128 | ±12 | 2,955 | UW | Non-commercial
266 | 256◄─►273 | stripedhyena-nous-7b | 1121 | ±11 | 5,214 | Together AI | Apache 2.0
267 | 251◄─►274 | codellama-70b-instruct | 1119 | ±18 | 1,151 | Meta | Llama 2 Community
268 | 256◄─►273 | smollm2-1.7b-instruct | 1119 | ±14 | 2,244 | HuggingFace | Apache 2.0
269 | 261◄─►273 | vicuna-7b | 1115 | ±9 | 6,972 | LMSYS | Llama 2 Community
270 | 264◄─►273 | gemma-1.1-2b-it | 1114 | ±8 | 11,035 | - | Gemma license
271 | 264◄─►273 | llama-3.2-1b-instruct | 1113 | ±8 | 8,166 | Meta | Llama 3.2
272 | 264◄─►274 | mistral-7b-instruct | 1111 | ±9 | 9,042 | Mistral | Apache 2.0
273 | 265◄─►274 | llama-2-7b-chat | 1109 | ±7 | 14,272 | Meta | Llama 2 Community
274 | 271◄─►278 | gemma-2b-it | 1091 | ±12 | 4,817 | - | Gemma license
275 | 274◄─►276 | qwen1.5-4b-chat | 1091 | ±9 | 7,662 | Alibaba | Qianwen LICENSE
276 | 274◄─►281 | olmo-7b-instruct | 1074 | ±11 | 6,412 | Allen AI | Apache-2.0
277 | 275◄─►281 | koala-13b | 1071 | ±10 | 6,998 | UC Berkeley | Non-commercial
278 | 276◄─►281 | alpaca-13b | 1067 | ±11 | 5,828 | Stanford | Non-commercial
279 | 275◄─►282 | gpt4all-13b-snoozy | 1066 | ±15 | 1,773 | Nomic AI | Non-commercial
280 | 276◄─►282 | mpt-7b-chat | 1062 | ±12 | 3,977 | MosaicML | CC-BY-NC-SA-4.0
281 | 276◄─►282 | chatglm3-6b | 1057 | ±12 | 4,692 | Tsinghua | Apache-2.0
282 | 279◄─►284 | RWKV-4-Raven-14B | 1042 | ±11 | 4,898 | RWKV | Apache 2.0
283 | 282◄─►284 | chatglm2-6b | 1026 | ±14 | 2,683 | Tsinghua | Apache-2.0
284 | 282◄─►284 | oasst-pythia-12b | 1023 | ±11 | 6,343 | OpenAssistant | Apache 2.0
285 | 285◄─►288 | chatglm-6b | 996 | ±13 | 4,968 | Tsinghua | Non-commercial
286 | 285◄─►288 | fastchat-t5-3b | 992 | ±12 | 4,270 | LMSYS | Apache 2.0
287 | 285◄─►288 | dolly-v2-12b | 980 | ±14 | 3,471 | Databricks | MIT
288 | 285◄─►289 | llama-13b | 972 | ±16 | 2,441 | Meta | Non-commercial
289 | 288◄─►289 | stablelm-tuned-alpha-7b | 953 | ±13 | 3,325 | Stability AI | CC-BY-NC-SA-4.0
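The range shown next to each rank (for example 1◄─►2) is not defined on this page. The sketch below is one plausible reading, stated as an assumption rather than the upstream method: a model's best possible rank is one plus the number of models whose lower interval bound lies above its upper bound, and its worst possible rank is one plus the number of models whose upper bound lies above its lower bound. The `Entry` type and both helper functions are hypothetical names, and the data is the first four rows of the table above.

```python
from typing import NamedTuple, Sequence


class Entry(NamedTuple):
    model: str
    score: int
    ci: int  # half-width of the 95% interval (the "±" value)


# First four rows of the leaderboard above.
ENTRIES = [
    Entry("gemini-3-pro", 1490, 6),
    Entry("grok-4.1-thinking", 1477, 6),
    Entry("gemini-3-flash", 1475, 10),
    Entry("claude-opus-4-5-20251101-thinking-32k", 1469, 6),
]


def best_rank(target: Entry, entries: Sequence[Entry]) -> int:
    """1 + number of models that are clearly better (assumed definition)."""
    upper = target.score + target.ci
    return 1 + sum(
        1 for e in entries
        if e.model != target.model and e.score - e.ci > upper
    )


def worst_rank(target: Entry, entries: Sequence[Entry]) -> int:
    """1 + number of models that could plausibly be better (assumed definition)."""
    lower = target.score - target.ci
    return 1 + sum(
        1 for e in entries
        if e.model != target.model and e.score + e.ci > lower
    )


print([best_rank(e, ENTRIES) for e in ENTRIES])  # [1, 2, 1, 2]
print(worst_rank(ENTRIES[0], ENTRIES))           # 2
```

With these four rows the best ranks match the left end of each listed range, and gemini-3-pro's worst rank matches its right end. Worst ranks for the other rows need the full table, and because the published scores are rounded, a result can differ from the listed range by one.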
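Notes
Rank (UB): the rank computed from the Bradley-Terry model. It reflects the model's overall performance in the arena and provides an upper-bound estimate of its Elo score, which helps gauge the model's potential competitiveness.
Model: the name of the large language model (LLM). Some model names may carry embedded links.
Score: the Elo rating the model has earned from user votes in the arena. Elo is a relative rating system; a higher score indicates better performance.
95% confidence interval (±): the 95% confidence interval of the model's Elo rating (for example, ±6). The narrower the interval, the more stable and reliable the rating.
Votes: the total number of votes the model has received in the arena. More votes generally mean a statistically more reliable rating.
Organization: the organization or company that provides the model.
License: the model's license type, for example Proprietary, Apache 2.0, or MIT.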
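Because Elo is a relative scale, a score gap is easiest to read as an expected preference rate. The following is a minimal sketch, assuming the conventional Elo logistic curve (base 10, 400-point scale); this page does not state which scale the upstream scores use, so treat the `scale` default as an assumption. The function name is hypothetical.

```python
def expected_win_probability(score_a: float, score_b: float, scale: float = 400.0) -> float:
    """Probability that model A is preferred over model B under the standard Elo curve.

    The 400-point, base-10 scale is an assumption, not something this page confirms.
    """
    return 1.0 / (1.0 + 10.0 ** ((score_b - score_a) / scale))


# gemini-3-pro (1490) versus grok-4.1-thinking (1477): a 13-point gap.
p = expected_win_probability(1490, 1477)
print(f"{p:.3f}")
```

Under that assumption, the 13-point gap between the top two entries corresponds to roughly a 52% expected preference rate, which shows how close neighbouring scores are in practice.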
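Data Source and Update Frequency
The leaderboard data is fetched by an automated script directly from the official website. The leaderboard is refreshed automatically every day by GitHub Actions.
Disclaimer
This report is for reference only. The leaderboard data changes over time and is based on user preference votes cast on Chatbot Arena during a specific period. The completeness and accuracy of the data depend on the upstream source. Different models may be released under different licenses; always consult the model provider's official terms before use.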