模型榜单
这是一个基于 Chatbot Arena (lmarena.ai) 数据的排行榜,通过自动化流程生成。
数据更新时间: 2025-12-13 08:07:20 UTC / 2025-12-13 16:07:20 CST (北京时间)
排行榜
1
1◄─►1
gemini-3-pro
1492
±6
15,871
Proprietary
2
2◄─►4
grok-4.1-thinking
1478
±6
16,660
xAI
Proprietary
3
2◄─►6
Anthropicclaude-opus-4-5-20251101-thinking-32k
1470
±7
9,879
Anthropic
Proprietary
4
2◄─►6
Anthropicclaude-opus-4-5-20251101
1467
±7
10,659
Anthropic
Proprietary
5
3◄─►6
grok-4.1
1465
±6
16,501
xAI
Proprietary
6
3◄─►9
gpt-5.1-high
1457
±6
13,953
OpenAI
Proprietary
7
6◄─►11
gemini-2.5-pro
1451
±3
76,975
Proprietary
8
6◄─►11
Anthropicclaude-sonnet-4-5-20250929-thinking-32k
1450
±5
28,019
Anthropic
Proprietary
9
6◄─►12
Anthropicclaude-opus-4-1-20250805-thinking-16k
1448
±4
43,836
Anthropic
Proprietary
10
7◄─►15
Anthropicclaude-sonnet-4-5-20250929
1445
±5
23,185
Anthropic
Proprietary
11
7◄─►17
gpt-4.5-preview-2025-02-27
1443
±6
14,644
OpenAI
Proprietary
12
9◄─►16
Anthropicclaude-opus-4-1-20250805
1441
±4
56,830
Anthropic
Proprietary
13
10◄─►18
chatgpt-4o-latest-20250326
1440
±3
63,299
OpenAI
Proprietary
14
10◄─►20
gpt-5.1
1437
±6
15,063
OpenAI
Proprietary
15
10◄─►19
gpt-5-high
1436
±5
32,884
OpenAI
Proprietary
16
12◄─►20
o3-2025-04-16
1433
±4
61,581
OpenAI
Proprietary
17
11◄─►24
qwen3-max-preview
1433
±5
28,133
Alibaba
Proprietary
18
14◄─►39
grok-4-1-fast-reasoning
1428
±7
9,096
xAI
Proprietary
19
13◄─►39
ernie-5.0-preview-1103
1428Preliminary
±9
5,738
Baidu
Proprietary
20
15◄─►39
MoonshotAIkimi-k2-thinking-turbo
1426
±6
15,621
Moonshot
Modified MIT
21
17◄─►39
gpt-5-chat
1425
±4
32,128
OpenAI
Proprietary
22
17◄─►39
glm-4.6
1425
±5
24,502
Z.ai
MIT
23
17◄─►39
qwen3-max-2025-09-23
1423
±6
9,251
Alibaba
Proprietary
24
17◄─►39
deepseek-v3.2-exp
1423
±7
11,973
DeepSeek AI
MIT
25
18◄─►39
Anthropicclaude-opus-4-20250514-thinking-16k
1423
±4
37,863
Anthropic
Proprietary
26
18◄─►39
qwen3-235b-a22b-instruct-2507
1421
±4
51,250
Alibaba
Apache 2.0
27
18◄─►42
deepseek-v3.2-exp-thinking
1421
±7
9,220
DeepSeek AI
MIT
28
18◄─►45
grok-4-fast-chat
1420
±8
7,056
xAI
Proprietary
29
18◄─►46
deepseek-v3.2-thinking
1418
±9
5,948
DeepSeek AI
MIT
30
18◄─►45
MoonshotAIkimi-k2-0905-preview
1418
±7
11,836
Moonshot
Modified MIT
31
18◄─►45
deepseek-r1-0528
1418
±6
19,236
DeepSeek
MIT
32
18◄─►45
MoonshotAIkimi-k2-0711-preview
1417
±5
28,658
Moonshot
Modified MIT
33
18◄─►46
deepseek-v3.1
1416
±6
15,256
DeepSeek
MIT
34
18◄─►46
deepseek-v3.1-thinking
1416
±7
11,987
DeepSeek
MIT
35
18◄─►48
mistral-large-3
1415Preliminary
±8
6,377
Mistral
Apache 2.0
36
18◄─►47
qwen3-vl-235b-a22b-instruct
1415
±7
8,532
Alibaba
Apache 2.0
37
18◄─►50
deepseek-v3.1-terminus
1415
±10
3,745
DeepSeek AI
MIT
38
18◄─►51
deepseek-v3.2
1414
±8
6,494
DeepSeek AI
MIT
39
18◄─►52
deepseek-v3.1-terminus-thinking
1414
±10
3,520
DeepSeek AI
MIT
40
27◄─►47
Anthropicclaude-opus-4-20250514
1412
±4
45,672
Anthropic
Proprietary
41
27◄─►47
gpt-4.1-2025-04-14
1412
±4
52,571
OpenAI
Proprietary
42
27◄─►49
mistral-medium-2508
1411
±4
45,391
Mistral
Proprietary
43
28◄─►50
grok-3-preview-02-24
1410
±4
34,122
xAI
Proprietary
44
28◄─►52
grok-4-0709
1409
±4
42,568
xAI
Proprietary
45
28◄─►53
glm-4.5
1408
±5
24,820
Z.ai
MIT
46
32◄─►52
gemini-2.5-flash
1408
±3
76,434
Proprietary
47
35◄─►57
gemini-2.5-flash-preview-09-2025
1405
±4
29,427
Proprietary
48
38◄─►58
Anthropicclaude-haiku-4-5-20251001
1402
±5
26,365
Anthropic
Proprietary
49
38◄─►58
grok-4-fast-reasoning
1402
±5
18,876
xAI
Proprietary
50
42◄─►59
o1-2024-12-17
1401
±4
28,039
OpenAI
Proprietary
51
43◄─►61
qwen3-next-80b-a3b-instruct
1400
±5
23,111
Alibaba
Apache 2.0
52
40◄─►63
longcat-flash-chat
1400
±6
11,498
Meituan
MIT
53
46◄─►62
qwen3-235b-a22b-no-thinking
1399
±4
39,375
Alibaba
Apache 2.0
54
47◄─►62
Anthropicclaude-sonnet-4-20250514-thinking-32k
1399
±4
36,210
Anthropic
Proprietary
55
47◄─►66
qwen3-235b-a22b-thinking-2507
1397
±6
9,345
Alibaba
Apache 2.0
56
47◄─►66
deepseek-r1
1396
±5
18,718
DeepSeek
MIT
57
48◄─►68
qwen3-vl-235b-a22b-thinking
1394
±7
7,986
Alibaba
Apache 2.0
58
50◄─►68
gpt-5-mini-high
1392
±5
27,446
OpenAI
Proprietary
59
51◄─►67
deepseek-v3-0324
1392
±4
46,777
DeepSeek
MIT
60
47◄─►73
Tencenthunyuan-vision-1.5-thinking
1391
±12
2,210
Tencent
Proprietary
61
52◄─►68
o4-mini-2025-04-16
1391
±4
46,832
OpenAI
Proprietary
62
51◄─►71
mai-1-preview
1390
±5
18,178
Microsoft AI
Proprietary
63
55◄─►71
Anthropicclaude-sonnet-4-20250514
1389
±4
41,654
Anthropic
Proprietary
64
55◄─►71
o1-preview
1388
±5
31,505
OpenAI
Proprietary
65
55◄─►72
Anthropicclaude-3-7-sonnet-20250219-thinking-32k
1387
±4
39,905
Anthropic
Proprietary
66
57◄─►72
qwen3-coder-480b-a35b-instruct
1385
±5
23,148
Alibaba
Apache 2.0
67
54◄─►75
Tencenthunyuan-t1-20250711
1385
±9
4,818
Tencent
Proprietary
68
58◄─►74
mistral-medium-2505
1383
±5
34,522
Mistral
Proprietary
69
61◄─►74
qwen3-30b-a3b-instruct-2507
1382
±5
24,190
Alibaba
Apache 2.0
70
61◄─►75
gpt-4.1-mini-2025-04-14
1380
±4
40,485
OpenAI
Proprietary
71
61◄─►78
Tencenthunyuan-turbos-20250416
1380
±6
11,130
Tencent
Proprietary
72
64◄─►78
gemini-2.5-flash-lite-preview-09-2025-no-thinking
1378
±4
29,396
Proprietary
73
66◄─►79
gemini-2.5-flash-lite-preview-06-17-thinking
1375
±4
33,959
Proprietary
74
67◄─►80
qwen3-235b-a22b
1374
±5
27,167
Alibaba
Apache 2.0
75
69◄─►80
qwen2.5-max
1373
±4
33,545
Alibaba
Proprietary
76
71◄─►80
Anthropicclaude-3-5-sonnet-20241022
1372
±3
89,836
Anthropic
Proprietary
77
71◄─►83
Anthropicclaude-3-7-sonnet-20250219
1371
±4
44,557
Anthropic
Proprietary
78
71◄─►83
glm-4.5-air
1370
±4
31,668
Z.ai
MIT
79
73◄─►86
qwen3-next-80b-a3b-thinking
1367
±6
13,822
Alibaba
Apache 2.0
80
74◄─►86
Minimaxminimax-m1
1366
±4
36,877
MiniMax
Apache 2.0
81
77◄─►87
gemma-3-27b-it
1364
±4
49,312
Gemma
82
77◄─►90
o3-mini-high
1363
±5
18,735
OpenAI
Proprietary
83
77◄─►91
grok-3-mini-high
1362
±5
17,587
xAI
Proprietary
84
79◄─►92
gemini-2.0-flash-001
1360
±4
45,105
Proprietary
85
79◄─►100
deepseek-v3
1358
±5
21,994
DeepSeek
DeepSeek
86
79◄─►103
grok-3-mini-beta
1357
±5
23,791
xAI
Proprietary
87
82◄─►107
mistral-small-2506
1354
±5
18,325
Mistral
Apache 2.0
88
84◄─►108
gemini-2.0-flash-lite-preview-02-05
1353
±4
25,215
Proprietary
89
85◄─►108
gpt-oss-120b
1352
±4
31,256
OpenAI
Apache 2.0
90
85◄─►108
Coherecommand-a-03-2025
1352
±3
57,820
Cohere
CC-BY-NC-4.0
91
85◄─►108
gemini-1.5-pro-002
1352
±3
56,012
Proprietary
92
82◄─►110
glm-4.5v
1352
±8
4,973
Z.ai
MIT
93
85◄─►112
amazon-nova-experimental-chat-10-20
1348
±7
7,940
Amazon
Proprietary
94
87◄─►110
o3-mini
1348
±3
58,808
OpenAI
Proprietary
95
81◄─►121
intellect-3
1348
±13
2,020
Prime Intellect
MIT
96
82◄─►121
Tencenthunyuan-turbos-20250226
1346
±12
2,250
Tencent
Proprietary
97
85◄─►116
ling-flash-2.0
1346
±7
7,157
Ant Group
MIT
98
88◄─►111
gpt-4o-2024-05-13
1346
±3
113,568
OpenAI
Proprietary
99
86◄─►116
Stepfunstep-3
1346
±7
6,638
StepFun
Apache 2.0
100
85◄─►118
Minimaxminimax-m2
1345
±8
7,123
MiniMax
Apache 2.0
101
83◄─►121
Nvidiallama-3.1-nemotron-ultra-253b-v1
1345
±12
2,573
Nvidia
Nvidia Open Model
102
85◄─►121
amazon-nova-experimental-chat-10-09
1345
±11
2,891
Amazon
Proprietary
103
86◄─►120
qwen-plus-0125
1345
±8
5,861
Alibaba
Proprietary
104
85◄─►120
qwen3-32b
1344
±9
3,943
Alibaba
Apache 2.0
105
86◄─►120
glm-4-plus-0111
1343
±8
5,806
Zhipu
Proprietary
106
92◄─►112
Anthropicclaude-3-5-sonnet-20240620
1343
±3
82,864
Anthropic
Proprietary
107
87◄─►124
gemma-3-12b-it
1340
±9
3,866
Gemma
108
87◄─►125
Nvidianvidia-llama-3.3-nemotron-super-49b-v1.5
1340
±10
3,492
Nvidia
Nvidia Open
109
87◄─►127
Tencenthunyuan-turbo-0110
1339
±11
2,322
Tencent
Proprietary
110
92◄─►122
gpt-5-nano-high
1339
±7
8,390
OpenAI
Proprietary
111
97◄─►122
o1-mini
1336
±4
52,301
OpenAI
Proprietary
112
97◄─►122
Metallama-3.1-405b-instruct-bf16
1336
±4
41,932
Meta
Llama 3.1 Community
113
97◄─►123
gpt-4o-2024-08-06
1335
±4
45,787
OpenAI
Proprietary
114
97◄─►125
gemini-advanced-0514
1335
±5
50,654
Proprietary
115
99◄─►123
grok-2-2024-08-13
1335
±4
63,725
xAI
Proprietary
116
100◄─►124
Metallama-3.1-405b-instruct-fp8
1334
±3
60,272
Meta
Llama 3.1 Community
117
94◄─►133
nova-2-lite
1334
±9
5,799
Amazon
Proprietary
118
99◄─►125
qwq-32b
1334
±4
26,269
Alibaba
Apache 2.0
119
95◄─►134
Stepfunstep-2-16k-exp-202412
1333
±9
4,895
StepFun
Proprietary
120
107◄─►135
01.AIyi-lightning
1329
±5
27,624
01 AI
Proprietary
121
110◄─►138
Metallama-4-maverick-17b-128e-instruct
1327
±4
41,199
Meta
Llama 4
122
114◄─►139
qwen3-30b-a3b
1326
±5
27,485
Alibaba
Apache 2.0
123
100◄─►148
Nvidiallama-3.3-nemotron-49b-super-v1
1326
±12
2,243
Nvidia
Nvidia
124
103◄─►147
Tencenthunyuan-large-2025-02-10
1325
±10
3,760
Tencent
Proprietary
125
117◄─►139
gpt-4-turbo-2024-04-09
1325
±4
98,965
OpenAI
Proprietary
126
118◄─►142
Anthropicclaude-3-opus-20240229
1323
±3
196,368
Anthropic
Proprietary
127
118◄─►142
gemini-1.5-pro-001
1323
±4
79,769
Proprietary
128
118◄─►142
Anthropicclaude-3-5-haiku-20241022
1323
±3
71,373
Anthropic
Proprietary
129
112◄─►148
deepseek-v2.5-1210
1323
±8
6,877
DeepSeek
DeepSeek
130
118◄─►146
Metallama-4-scout-17b-16e-instruct
1322
±5
31,193
Meta
Llama
131
117◄─►148
gpt-4.1-nano-2025-04-14
1320
±8
6,143
OpenAI
Proprietary
132
118◄─►148
ring-flash-2.0
1320
±7
7,278
Ant Group
MIT
133
122◄─►147
Metallama-3.3-70b-instruct
1319
±3
56,008
Meta
Llama-3.3
134
118◄─►148
Stepfunstep-1o-turbo-202506
1319
±7
9,661
StepFun
Proprietary
135
121◄─►148
glm-4-plus
1319
±5
26,342
Zhipu AI
Proprietary
136
121◄─►148
gemma-3n-e4b-it
1318
±5
23,470
Gemma
137
121◄─►148
qwen-max-0919
1318
±6
16,598
Alibaba
Qwen
138
120◄─►149
gpt-oss-20b
1318
±6
10,848
OpenAI
Apache 2.0
139
124◄─►148
gpt-4o-mini-2024-07-18
1317
±3
69,291
OpenAI
Proprietary
140
127◄─►153
gpt-4-1106-preview
1314
±4
101,117
OpenAI
Proprietary
141
124◄─►154
qwen2.5-plus-1127
1314
±6
10,252
Alibaba
Proprietary
142
127◄─►153
gpt-4-0125-preview
1314
±4
94,534
OpenAI
Proprietary
143
127◄─►153
mistral-large-2407
1314
±4
45,968
Mistral
Mistral Research
144
127◄─►154
athene-v2-chat
1313
±4
24,880
NexusFlow
NexusFlow
145
130◄─►155
gemini-1.5-flash-002
1311
±4
35,180
Proprietary
146
119◄─►159
mercury
1311
±14
1,965
Inception AI
Proprietary
147
124◄─►158
Tencenthunyuan-standard-2025-02-10
1310
±10
3,920
Tencent
Proprietary
148
140◄─►157
grok-2-mini-2024-08-13
1307
±4
52,789
xAI
Proprietary
149
140◄─►158
deepseek-v2.5
1307
±5
24,839
DeepSeek
DeepSeek
150
128◄─►162
olmo-3-32b-think
1306
±11
3,405
Allen AI
Apache 2.0
151
140◄─►158
athene-70b-0725
1305
±6
19,796
NexusFlow
CC-BY-NC-4.0
152
143◄─►158
mistral-large-2411
1305
±4
28,455
Mistral
MRL
153
140◄─►158
magistral-medium-2506
1305
±6
11,995
Mistral
Proprietary
154
145◄─►158
mistral-small-3.1-24b-instruct-2503
1303
±4
34,136
Mistral
Apache 2.0
155
139◄─►163
gemma-3-4b-it
1302
±9
4,195
Gemma
156
146◄─►158
qwen2.5-72b-instruct
1302
±4
39,632
Alibaba
Qwen
157
146◄─►166
Nvidiallama-3.1-nemotron-70b-instruct
1298
±8
7,216
Nvidia
Llama 3.1
158
147◄─►169
Tencenthunyuan-large-vision
1295
±9
5,598
Tencent
Proprietary
159
154◄─►166
Metallama-3.1-70b-instruct
1294
±4
56,003
Meta
Llama 3.1 Community
160
155◄─►171
jamba-1.5-large
1289
±7
8,730
AI21 Labs
Jamba Open
161
157◄─►170
amazon-nova-pro-v1.0
1288
±4
25,218
Amazon
Proprietary
162
156◄─►172
reka-core-20240904
1288
±7
7,380
Reka AI
Proprietary
163
157◄─►169
gemma-2-27b-it
1288
±3
76,195
Gemma license
164
157◄─►171
gpt-4-0314
1288
±5
54,754
OpenAI
Proprietary
165
155◄─►177
Nvidiallama-3.1-nemotron-51b-instruct
1287
±10
3,777
Nvidia
Llama 3.1
166
155◄─►177
llama-3.1-tulu-3-70b
1286
±10
2,881
Ai2
Llama 3.1
167
159◄─►172
gemini-1.5-flash-001
1285
±4
63,418
Proprietary
168
159◄─►176
Anthropicclaude-3-sonnet-20240229
1282
±4
110,173
Anthropic
Proprietary
169
159◄─►177
gemma-2-9b-it-simpo
1279
±7
10,108
Princeton
MIT
170
162◄─►177
Nvidianemotron-4-340b-instruct
1279
±5
19,913
Nvidia
NVIDIA Open Model
171
161◄─►178
Coherecommand-r-plus-08-2024
1278
±7
9,931
Cohere
CC-BY-NC-4.0
172
166◄─►177
Metallama-3-70b-instruct
1277
±3
158,908
Meta
Llama 3 Community
173
166◄─►177
gpt-4-0613
1276
±4
89,612
OpenAI
Proprietary
174
164◄─►182
glm-4-0520
1274
±7
9,857
Zhipu AI
Proprietary
175
166◄─►181
mistral-small-24b-instruct-2501
1274
±6
14,830
Mistral
Apache 2.0
176
166◄─►182
reka-flash-20240904
1273
±7
7,583
Reka AI
Proprietary
177
167◄─►186
qwen2.5-coder-32b-instruct
1270
±8
5,452
Alibaba
Apache 2.0
178
173◄─►186
Coherec4ai-aya-expanse-32b
1267
±5
27,362
Cohere
CC-BY-NC-4.0
179
174◄─►186
gemma-2-9b-it
1265
±4
54,954
Gemma license
180
174◄─►188
deepseek-coder-v2
1265
±6
15,242
DeepSeek AI
DeepSeek License
181
174◄─►187
Coherecommand-r-plus
1264
±4
78,401
Cohere
CC-BY-NC-4.0
182
175◄─►188
qwen2-72b-instruct
1263
±5
37,688
Alibaba
Qianwen LICENSE
183
177◄─►188
Anthropicclaude-3-haiku-20240307
1262
±4
118,626
Anthropic
Proprietary
184
177◄─►188
amazon-nova-lite-v1.0
1260
±5
19,760
Amazon
Proprietary
185
177◄─►188
gemini-1.5-flash-8b-001
1260
±4
35,914
Proprietary
186
180◄─►188
Azurephi-4
1255
±4
24,354
Microsoft
MIT
187
177◄─►195
olmo-2-0325-32b-instruct
1252
±11
3,377
Allen AI
Apache-2.0
188
181◄─►192
Coherecommand-r-08-2024
1252
±7
10,229
Cohere
CC-BY-NC-4.0
189
187◄─►197
mistral-large-2402
1243
±5
63,404
Mistral
Proprietary
190
187◄─►197
amazon-nova-micro-v1.0
1242
±5
19,774
Amazon
Proprietary
191
187◄─►202
jamba-1.5-mini
1240
±7
8,918
AI21 Labs
Jamba Open
192
187◄─►205
ministral-8b-2410
1237
±9
4,833
Mistral
MRL
193
188◄─►205
gemini-pro-dev-api
1236
±7
18,454
Proprietary
194
189◄─►204
qwen1.5-110b-chat
1235
±5
26,679
Alibaba
Qianwen LICENSE
195
189◄─►205
qwen1.5-72b-chat
1235
±5
39,689
Alibaba
Qianwen LICENSE
196
188◄─►206
reka-flash-21b-20240226-online
1235
±7
15,606
Reka AI
Proprietary
197
188◄─►207
Tencenthunyuan-standard-256k
1234
±12
2,761
Tencent
Proprietary
198
191◄─►206
mixtral-8x22b-instruct-v0.1
1231
±4
52,214
Mistral
Apache 2.0
199
191◄─►207
Coherecommand-r
1229
±5
54,710
Cohere
CC-BY-NC-4.0
200
191◄─►207
reka-flash-21b-20240226
1228
±6
25,026
Reka AI
Proprietary
201
192◄─►208
gpt-3.5-turbo-0125
1225
±5
67,214
OpenAI
Proprietary
202
192◄─►208
mistral-medium
1225
±5
34,893
Mistral
Proprietary
203
196◄─►208
Metallama-3-8b-instruct
1224
±4
106,055
Meta
Llama 3 Community
204
192◄─►209
Coherec4ai-aya-expanse-8b
1224
±7
9,922
Cohere
CC-BY-NC-4.0
205
191◄─►212
gemini-pro
1223
±12
6,418
Proprietary
206
191◄─►212
llama-3.1-tulu-3-8b
1222
±11
2,943
Ai2
Llama 3.1
207
198◄─►213
HuggingFacezephyr-orpo-141b-A35b-v0.1
1215
±11
4,712
HuggingFace
Apache 2.0
208
204◄─►212
01.AIyi-1.5-34b-chat
1214
±5
24,417
01 AI
Apache-2.0
209
205◄─►212
Metallama-3.1-8b-instruct
1212
±4
50,234
Meta
Llama 3.1 Community
210
201◄─►218
granite-3.1-8b-instruct
1210
±11
3,142
IBM
Apache 2.0
211
205◄─►217
qwen1.5-32b-chat
1207
±6
22,068
Alibaba
Qianwen LICENSE
212
205◄─►220
gpt-3.5-turbo-1106
1203
±9
16,760
OpenAI
Proprietary
213
209◄─►220
Azurephi-3-medium-4k-instruct
1199
±5
25,301
Microsoft
MIT
214
210◄─►220
mixtral-8x7b-instruct-v0.1
1199
±4
74,303
Mistral
Apache 2.0
215
210◄─►220
gemma-2-2b-it
1199
±4
46,901
Gemma license
216
210◄─►225
dbrx-instruct-preview
1197
±6
32,760
Databricks
DBRX LICENSE
217
210◄─►229
qwen1.5-14b-chat
1194
±7
18,066
Alibaba
Qianwen LICENSE
218
211◄─►229
InternLMinternlm2_5-20b-chat
1193
±7
10,038
InternLM
Other
219
212◄─►235
Azurewizardlm-70b
1187
±9
8,270
Microsoft
Llama 2 Community
220
212◄─►236
deepseek-llm-67b-chat
1186
±12
4,950
DeepSeek AI
DeepSeek License
221
216◄─►233
01.AIyi-34b-chat
1185
±7
15,624
01 AI
Yi License
222
216◄─►235
granite-3.0-8b-instruct
1184
±9
6,727
IBM
Apache 2.0
223
216◄─►235
OpenChatopenchat-3.5-0106
1184
±8
12,712
OpenChat
Apache-2.0
224
216◄─►236
OpenChatopenchat-3.5
1184
±10
8,009
OpenChat
Apache-2.0
225
217◄─►235
Snowflakesnowflake-arctic-instruct
1182
±6
33,272
Snowflake
Apache 2.0
226
216◄─►238
granite-3.1-2b-instruct
1181
±11
3,235
IBM
Apache 2.0
227
217◄─►236
gemma-1.1-7b-it
1181
±6
24,327
Gemma license
228
217◄─►238
tulu-2-dpo-70b
1180
±10
6,579
AllenAI/UW
AI2 ImpACT Low-risk
229
217◄─►240
openhermes-2.5-mistral-7b
1178
±10
5,026
NousResearch
Apache-2.0
230
219◄─►239
vicuna-33b
1175
±6
22,613
LMSYS
Non-commercial
231
219◄─►240
starling-lm-7b-beta
1174
±7
16,190
Nexusflow
Apache-2.0
232
219◄─►240
Azurephi-3-small-8k-instruct
1173
±6
17,983
Microsoft
MIT
233
220◄─►240
Metallama-2-70b-chat
1173
±5
38,767
Meta
Llama 2 Community
234
220◄─►243
starling-lm-7b-alpha
1170
±8
10,267
UC Berkeley
CC-BY-NC-4.0
235
224◄─►244
Metallama-3.2-3b-instruct
1168
±8
8,043
Meta
Llama 3.2
236
219◄─►245
nous-hermes-2-mixtral-8x7b-dpo
1167
±12
3,792
NousResearch
Apache-2.0
237
227◄─►250
qwq-32b-preview
1160
±11
3,256
Alibaba
Apache 2.0
238
227◄─►253
Nvidiallama2-70b-steerlm-chat
1158
±13
3,605
Nvidia
Llama 2 Community
239
234◄─►250
granite-3.0-2b-instruct
1157
±8
6,922
IBM
Apache 2.0
240
230◄─►254
solar-10.7b-instruct-v1.0
1155
±13
4,187
Upstage AI
CC-BY-NC-4.0
241
229◄─►258
dolphin-2.2.1-mistral-7b
1154
±15
1,685
Cognitive Computations
Apache-2.0
242
234◄─►256
mpt-30b-chat
1153
±12
2,606
MosaicML
CC-BY-NC-SA-4.0
243
236◄─►254
mistral-7b-instruct-v0.2
1152
±7
19,603
Mistral
Apache-2.0
244
235◄─►254
Azurewizardlm-13b
1152
±9
7,122
Microsoft
Llama 2 Community
245
234◄─►261
falcon-180b-chat
1149
±17
1,312
TII
Falcon-180B TII License
246
237◄─►259
qwen1.5-7b-chat
1145
±10
4,782
Alibaba
Qianwen LICENSE
247
237◄─►258
Azurephi-3-mini-4k-instruct-june-2024
1144
±6
12,415
Microsoft
MIT
248
237◄─►258
Metallama-2-13b-chat
1144
±7
19,357
Meta
Llama 2 Community
249
237◄─►259
vicuna-13b
1143
±7
19,539
LMSYS
Llama 2 Community
250
237◄─►261
qwen-14b-chat
1141
±11
5,004
Alibaba
Qianwen LICENSE
251
239◄─►261
palm-2
1139
±9
8,634
Proprietary
252
239◄─►261
Metacodellama-34b-instruct
1138
±9
7,417
Meta
Llama 2 Community
253
240◄─►261
gemma-7b-it
1136
±9
9,034
Gemma license
254
243◄─►262
HuggingFacezephyr-7b-beta
1133
±9
11,220
HuggingFace
MIT
255
244◄─►262
Azurephi-3-mini-128k-instruct
1133
±7
21,024
Microsoft
MIT
256
247◄─►262
Azurephi-3-mini-4k-instruct
1130
±6
20,539
Microsoft
MIT
257
239◄─►266
HuggingFacezephyr-7b-alpha
1130
±16
1,803
HuggingFace
MIT
258
243◄─►265
guanaco-33b
1130
±12
2,955
UW
Non-commercial
259
249◄─►266
stripedhyena-nous-7b
1122
±11
5,214
Together AI
Apache 2.0
260
244◄─►267
Metacodellama-70b-instruct
1120
±18
1,151
Meta
Llama 2 Community
261
249◄─►266
HuggingFacesmollm2-1.7b-instruct
1120
±14
2,244
HuggingFace
Apache 2.0
262
254◄─►266
vicuna-7b
1117
±9
6,972
LMSYS
Llama 2 Community
263
257◄─►266
gemma-1.1-2b-it
1115
±8
11,035
Gemma license
264
257◄─►266
Metallama-3.2-1b-instruct
1114
±8
8,166
Meta
Llama 3.2
265
257◄─►267
mistral-7b-instruct
1112
±9
9,042
Mistral
Apache 2.0
266
258◄─►267
Metallama-2-7b-chat
1110
±7
14,272
Meta
Llama 2 Community
267
267◄─►269
qwen1.5-4b-chat
1092
±9
7,662
Alibaba
Qianwen LICENSE
268
264◄─►271
gemma-2b-it
1092
±12
4,817
Gemma license
269
267◄─►274
olmo-7b-instruct
1076
±11
6,412
Allen AI
Apache-2.0
270
268◄─►274
koala-13b
1073
±10
6,998
UC Berkeley
Non-commercial
271
269◄─►274
alpaca-13b
1069
±11
5,828
Stanford
Non-commercial
272
268◄─►275
gpt4all-13b-snoozy
1067
±15
1,773
Nomic AI
Non-commercial
273
269◄─►275
mpt-7b-chat
1064
±12
3,977
MosaicML
CC-BY-NC-SA-4.0
274
269◄─►275
chatglm3-6b
1058
±12
4,692
Tsinghua
Apache-2.0
275
272◄─►277
RWKVRWKV-4-Raven-14B
1044
±11
4,898
RWKV
Apache 2.0
276
275◄─►277
chatglm2-6b
1027
±14
2,683
Tsinghua
Apache-2.0
277
275◄─►277
oasst-pythia-12b
1024
±11
6,343
OpenAssistant
Apache 2.0
278
278◄─►281
chatglm-6b
998
±13
4,968
Tsinghua
Non-commercial
279
278◄─►281
fastchat-t5-3b
993
±12
4,270
LMSYS
Apache 2.0
280
278◄─►281
dolly-v2-12b
981
±14
3,471
Databricks
MIT
281
278◄─►282
Metallama-13b
973
±16
2,441
Meta
Non-commercial
282
281◄─►282
Stabilitystablelm-tuned-alpha-7b
954
±13
3,325
Stability AI
CC-BY-NC-SA-4.0
说明
排名 (UB):基于 Bradley-Terry 模型计算的排名。此排名反映了模型在竞技场中的综合表现,并提供了其 Elo 分数的 上界 估计,帮助理解模型的潜在竞争力。
模型:大型语言模型 (LLM) 的名称。部分模型名称可能已嵌入相关链接。
分数:模型在竞技场中通过用户投票获得的 Elo 评分。Elo 评分是一种相对排名系统,分数越高表示模型表现越好。
95% 置信区间 (±):模型 Elo 评分的95%置信区间(例如:
±6)。这个区间越小,表示模型的评分越稳定和可靠。票数:该模型在竞技场中收到的总投票数量。投票数越多,通常意味着其评分的统计可靠性越高。
组织/公司:提供该模型的组织或公司。
许可证:模型的许可协议类型,例如专有 (Proprietary)、Apache 2.0、MIT 等。
数据来源与更新频率
本排行榜数据由自动化脚本直接从 1 2 官方网站获取。此排行榜由 GitHub Actions 每天自动更新。
免责声明
本报告仅供参考。排行榜数据是动态变化的,并基于特定时间段内用户在 Chatbot Arena 上的偏好投票。数据的完整性和准确性取决于上游数据源。不同模型可能采用不同的许可协议,使用时请务必参考模型提供商的官方说明。
最后更新于
这有帮助吗?