Model Leaderboard
This leaderboard is generated by an automated pipeline from Chatbot Arena (lmarena.ai) data.
Data updated: 2026-03-22 08:11:05 UTC / 2026-03-22 16:11:05 CST (Beijing time)
Leaderboard
| Rank (UB) | Rank spread | Model | Organization | License | Score ± 95% CI | Votes | Price | Context |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 1 | 1-4 | claude-opus-4-6-thinking | Anthropic | Proprietary | 1502±6 | 11,801 | $5/$25 | 1M |
| 2 | 1-4 | claude-opus-4-6 | Anthropic | Proprietary | 1501±6 | 12,546 | $5/$25 | 1M |
| 3 | 1-8 | gemini-3.1-pro-preview | Google | Proprietary | 1493±6 | 14,677 | $2/$12 | 1M |
| 4 | 1-8 | grok-4.20-beta1 | xAI | Proprietary | 1492±7 (Preliminary) | 7,396 | N/A | N/A |
| 5 | 3-8 | gemini-3-pro | Google | Proprietary | 1486±4 | 41,762 | $2/$12 | 1M |
| 6 | 3-10 | gpt-5.4-high | OpenAI | Proprietary | 1485±9 | 4,965 | $2.50/$15 | 1.1M |
| 7 | 3-10 | gpt-5.2-chat-latest-20260210 | OpenAI | Proprietary | 1482±6 | 10,140 | $1.75/$14 | 128K |
| 8 | 3-13 | grok-4.20-beta-0309-reasoning | xAI | Proprietary | 1481±9 | 4,504 | $2/$6 | 2M |
| 9 | 6-16 | gemini-3-flash | Google | Proprietary | 1475±4 | 31,060 | $0.50/$3 | 1M |
| 10 | 6-16 | claude-opus-4-5-20251101-thinking-32k | Anthropic | Proprietary | 1474±4 | 37,036 | $5/$25 | 200K |
| 11 | 8-16 | grok-4.1-thinking | xAI | Proprietary | 1472±4 | 43,930 | $0.20/$0.50 | N/A |
| 12 | 8-18 | claude-opus-4-5-20251101 | Anthropic | Proprietary | 1469±4 | 41,976 | $5/$25 | 200K |
| 13 | 9-21 | claude-sonnet-4-6 | Anthropic | Proprietary | 1465±6 | 9,843 | $3/$15 | 1M |
| 14 | 8-27 | qwen3.5-max-preview | Alibaba | Proprietary | 1464±9 (Preliminary) | 4,252 | N/A | N/A |
| 15 | 9-23 | gpt-5.3-chat-latest | OpenAI | Proprietary | 1464±7 | 8,942 | $1.75/$14 | 128K |
| 16 | 12-21 | gemini-3-flash (thinking-minimal) | Google | Proprietary | 1463±4 | 27,448 | $0.50/$3 | 1M |
| 17 | 9-28 | gpt-5.4 | OpenAI | Proprietary | 1463±8 | 4,972 | $2.50/$15 | 1.1M |
| 18 | 12-27 | dola-seed-2.0-preview | Bytedance | Proprietary | 1462±6 (Preliminary) | 10,651 | N/A | N/A |
| 19 | 13-23 | grok-4.1 | xAI | Proprietary | 1461±4 | 47,757 | $0.20/$0.50 | N/A |
| 20 | 13-29 | gpt-5.1-high | OpenAI | Proprietary | 1455±4 | 40,759 | $1.25/$10 | 400K |
| 21 | 13-32 | glm-5 | Z.ai | MIT | 1455±6 | 11,093 | $0.72/$2.30 | 80K |
| 22 | 15-33 | kimi-k2.5-thinking | Moonshot | Modified MIT | 1453±5 | 16,262 | $0.60/$3 | N/A |
| 23 | 17-32 | claude-sonnet-4-5-20250929 | Anthropic | Proprietary | 1453±3 | 53,556 | $3/$15 | 200K |
| 24 | 17-32 | claude-sonnet-4-5-20250929-thinking-32k | Anthropic | Proprietary | 1453±3 | 55,811 | $3/$15 | 200K |
| 25 | 17-34 | ernie-5.0-0110 | Baidu | Proprietary | 1452±5 | 18,715 | N/A | N/A |
| 26 | 15-36 | qwen3.5-397b-a17b | Alibaba | Apache 2.0 | 1452±6 | 10,431 | $0.39/$2.34 | 262.1K |
| 27 | 17-37 | ernie-5.0-preview-1203 | Baidu | Proprietary | 1450±7 | 9,857 | N/A | N/A |
| 28 | 20-36 | claude-opus-4-1-20250805-thinking-16k | Anthropic | Proprietary | 1449±3 | 50,375 | $15/$75 | 200K |
| 29 | 21-36 | gemini-2.5-pro | Google | Proprietary | 1448±3 | 103,317 | $1.25/$10 | 1M |
| 30 | 21-37 | claude-opus-4-1-20250805 | Anthropic | Proprietary | 1447±3 | 78,224 | $15/$75 | 200K |
| 31 | 19-44 | mimo-v2-pro | Xiaomi | Proprietary | 1445±10 | 3,531 | $1/$3 | 1M |
| 32 | 21-41 | gpt-4.5-preview-2025-02-27 | OpenAI | Proprietary | 1444±6 | 14,547 | $75/$150 | 128K |
| 33 | 26-38 | chatgpt-4o-latest-20250326 | OpenAI | Proprietary | 1443±3 | 83,559 | $5/$15 | 128K |
| 34 | 24-42 | glm-4.7 | Z.ai | MIT | 1443±6 | 12,242 | $0.39/$1.75 | 202.8K |
| 35 | 26-42 | gpt-5.2-high | OpenAI | Proprietary | 1442±5 | 25,328 | $1.75/$14 | 400K |
| 36 | 29-42 | gpt-5.2 | OpenAI | Proprietary | 1440±5 | 22,231 | $1.75/$14 | 400K |
| 37 | 31-44 | gpt-5.1 | OpenAI | Proprietary | 1439±4 | 43,475 | $1.25/$10 | 400K |
| 38 | 25-52 | gemini-3.1-flash-lite-preview | Google | Proprietary | 1438±9 | 3,881 | $0.25/$1.50 | 1M |
| 39 | 32-48 | qwen3-max-preview | Alibaba | Proprietary | 1435±4 | 28,066 | $0.78/$3.90 | 262.1K |
| 40 | 33-50 | gpt-5-high | OpenAI | Proprietary | 1434±5 | 32,470 | $1.25/$10 | 400K |
| 41 | 32-55 | kimi-k2.5-instant | Moonshot | Modified MIT | 1433±7 | 8,257 | $0.45/$2.20 | 262.1K |
| 42 | 36-53 | o3-2025-04-16 | OpenAI | Proprietary | 1432±4 | 60,698 | $2/$8 | 200K |
| 43 | 36-54 | grok-4-1-fast-reasoning | xAI | Proprietary | 1431±4 | 37,473 | $0.20/$0.50 | 2M |
| 44 | 38-56 | kimi-k2-thinking-turbo | Moonshot | Modified MIT | 1430±4 | 41,738 | $1.15/$8 | 262.1K |
| 45 | 32-67 | amazon-nova-experimental-chat-26-02-10 | Amazon | Proprietary | 1429±10 (Preliminary) | 3,467 | N/A | N/A |
| 46 | 38-64 | gpt-5-chat | OpenAI | Proprietary | 1426±4 | 32,009 | $1.25/$10 | 128K |
| 47 | 39-65 | glm-4.6 | Z.ai | MIT | 1426±4 | 36,102 | $0.39/$1.90 | 204.8K |
| 48 | 38-67 | deepseek-v3.2-exp-thinking | DeepSeek | MIT | 1425±7 | 9,188 | $0.27/$0.41 | 163.8K |
| 49 | 40-66 | deepseek-v3.2 | DeepSeek | MIT | 1425±4 | 36,511 | $0.26/$0.38 | 163.8K |
| 50 | 38-67 | qwen3-max-2025-09-23 | Alibaba | Proprietary | 1424±6 | 9,273 | $0.78/$3.90 | 262.1K |
| 51 | 41-67 | claude-opus-4-20250514-thinking-16k | Anthropic | Proprietary | 1424±4 | 37,503 | $15/$75 | 200K |
| 52 | 39-69 | deepseek-v3.2-exp | DeepSeek | MIT | 1423±6 | 12,088 | $0.27/$0.41 | 163.8K |
| 53 | 45-67 | qwen3-235b-a22b-instruct-2507 | Alibaba | Apache 2.0 | 1422±3 | 77,683 | $0.26/$1.06 | N/A |
| 54 | 45-68 | deepseek-v3.2-thinking | DeepSeek | MIT | 1422±4 | 31,048 | $0.26/$0.38 | 163.8K |
| 55 | 43-72 | deepseek-r1-0528 | DeepSeek | MIT | 1421±6 | 18,831 | $0.45/$2.15 | 163.8K |
| 56 | 40-74 | grok-4-fast-chat | xAI | Proprietary | 1421±8 | 6,901 | $3/$15 | 256K |
| 57 | 42-77 | ernie-5.0-preview-1022 | Baidu | Proprietary | 1419±9 | 4,782 | N/A | N/A |
| 58 | 45-76 | deepseek-v3.1 | DeepSeek | MIT | 1418±6 | 15,150 | $1.23/$4.94 | N/A |
| 59 | 45-76 | kimi-k2-0905-preview | Moonshot | Modified MIT | 1418±6 | 11,924 | $0.60/$2.50 | 262.1K |
| 60 | 45-77 | qwen3.5-122b-a10b | Alibaba | Apache 2.0 | 1417±7 | 6,946 | $0.26/$2.08 | 262.1K |
| 61 | 47-76 | kimi-k2-0711-preview | Moonshot | Modified MIT | 1417±5 | 28,082 | $0.60/$2.50 | 131.1K |
| 62 | 45-77 | deepseek-v3.1-thinking | DeepSeek | MIT | 1417±7 | 11,885 | $1.23/$4.94 | N/A |
| 63 | 44-85 | deepseek-v3.1-terminus-thinking | DeepSeek | MIT | 1416±10 | 3,497 | $0.21/$0.78 | 163.8K |
| 64 | 48-76 | mistral-large-3 | Mistral | Apache 2.0 | 1416±4 | 33,200 | $0.50/$1.50 | N/A |
| 65 | 45-87 | deepseek-v3.1-terminus | DeepSeek | MIT | 1416±10 | 3,736 | $0.21/$0.78 | 163.8K |
| 66 | 46-80 | qwen3-vl-235b-a22b-instruct | Alibaba | Apache 2.0 | 1415±6 | 11,645 | $0.20/$0.88 | 262.1K |
| 67 | 45-89 | amazon-nova-experimental-chat-26-01-10 | Amazon | Proprietary | 1414±10 | 3,439 | N/A | N/A |
| 68 | 54-79 | gpt-4.1-2025-04-14 | OpenAI | Proprietary | 1413±4 | 51,831 | $2/$8 | 1M |
| 69 | 55-80 | claude-opus-4-20250514 | Anthropic | Proprietary | 1413±4 | 44,988 | $15/$75 | 200K |
| 70 | 55-83 | grok-3-preview-02-24 | xAI | Proprietary | 1412±4 | 33,374 | $3/$15 | 131.1K |
| 71 | 56-82 | gemini-2.5-flash | Google | Proprietary | 1411±3 | 102,736 | $0.30/$2.50 | 1M |
| 72 | 55-87 | glm-4.5 | Z.ai | MIT | 1411±5 | 24,640 | $0.60/$2.20 | 131.1K |
| 73 | 56-86 | grok-4-0709 | xAI | Proprietary | 1410±4 | 42,034 | $3/$15 | 256K |
| 74 | 57-84 | mistral-medium-2508 | Mistral | Proprietary | 1410±3 | 72,410 | $2.70/$8.10 | 32K |
| 75 | 53-94 | minimax-m2.7 | MiniMax | Proprietary | 1407±11 (Preliminary) | 2,981 | $0.30/$1.20 | 204.8K |
| 76 | 64-89 | claude-haiku-4-5-20251001 | Anthropic | Proprietary | 1407±3 | 54,261 | $1/$5 | 200K |
| 77 | 56-92 | qwen3.5-27b | Alibaba | Apache 2.0 | 1406±7 | 6,957 | $0.20/$1.56 | 262.1K |
| 78 | 61-92 | minimax-m2.5 | MiniMax | Modified MIT | 1405±6 | 11,909 | $0.20/$1.17 | 196.6K |
| 79 | 65-92 | gemini-2.5-flash-preview-09-2025 | Google | Proprietary | 1405±4 | 33,278 | $0.30/$2.50 | 1M |
| 80 | 64-92 | grok-4-fast-reasoning | xAI | Proprietary | 1405±5 | 18,993 | $0.20/$0.50 | 2M |
| 81 | 69-93 | qwen3-235b-a22b-no-thinking | Alibaba | Apache 2.0 | 1403±4 | 38,797 | $0.46/$1.82 | 131.1K |
| 82 | 72-94 | o1-2024-12-17 | OpenAI | Proprietary | 1402±4 | 27,807 | $15/$60 | 200K |
| 83 | 71-94 | qwen3-next-80b-a3b-instruct | Alibaba | Apache 2.0 | 1401±5 | 23,187 | $0.09/$1.10 | 262.1K |
| 84 | 67-96 | qwen3.5-flash | Alibaba | Proprietary | 1401±7 | 7,853 | N/A | N/A |
| 85 | 68-98 | qwen3.5-35b-a3b | Alibaba | Apache 2.0 | 1401±7 | 7,278 | $0.16/$1.30 | 262.1K |
| 86 | 70-98 | longcat-flash-chat | Meituan | MIT | 1400±6 | 11,517 | $0.20/$0.80 | 131.1K |
| 87 | 74-102 | qwen3-235b-a22b-thinking-2507 | Alibaba | Apache 2.0 | 1399±7 | 9,128 | $0.15/$1.50 | 131.1K |
| 88 | 76-96 | claude-sonnet-4-20250514-thinking-32k | Anthropic | Proprietary | 1399±4 | 35,733 | $3/$15 | 1M |
| 89 | 76-102 | deepseek-r1 | DeepSeek | MIT | 1398±5 | 18,524 | $0.70/$2.50 | 64K |
| 90 | 67-110 | hunyuan-vision-1.5-thinking | Tencent | Proprietary | 1396±12 | 2,235 | N/A | N/A |
| 91 | 76-108 | qwen3-vl-235b-a22b-thinking | Alibaba | Apache 2.0 | 1396±7 | 8,052 | $0.26/$2.60 | 131.1K |
| 92 | 74-110 | amazon-nova-experimental-chat-12-10 | Amazon | Proprietary | 1396±10 | 3,720 | N/A | N/A |
| 93 | 81-106 | deepseek-v3-0324 | DeepSeek | MIT | 1394±4 | 46,144 | $3/$4.50 | 32.8K |
| 94 | 80-109 | mai-1-preview | Microsoft AI | Proprietary | 1393±5 | 18,095 | N/A | N/A |
| 95 | 84-109 | mimo-v2-flash (non-thinking) | Xiaomi | MIT | 1392±4 | 25,427 | $0.09/$0.29 | 262.1K |
| 96 | 88-110 | o4-mini-2025-04-16 | OpenAI | Proprietary | 1390±4 | 46,166 | $1.10/$4.40 | 200K |
| 97 | 86-110 | gpt-5-mini-high | OpenAI | Proprietary | 1390±5 | 27,372 | $0.25/$2 | 400K |
| 98 | 88-110 | claude-sonnet-4-20250514 | Anthropic | Proprietary | 1389±4 | 41,021 | $3/$15 | 1M |
| 99 | 86-111 | step-3.5-flash | StepFun | Apache 2.0 | 1389±6 | 13,885 | $0.10/$0.30 | 256K |
| 100 | 88-111 | o1-preview | OpenAI | Proprietary | 1388±5 | 31,122 | $15/$60 | N/A |
| 101 | 88-112 | mimo-v2-flash (thinking) | Xiaomi | MIT | 1387±6 | 11,053 | $0.09/$0.29 | 262.1K |
| 102 | 90-112 | qwen3-coder-480b-a35b-instruct | Alibaba | Apache 2.0 | 1387±5 | 26,162 | $0.40/$1.60 | 262.1K |
| 103 | 84-115 | hunyuan-t1-20250711 | Tencent | Proprietary | 1387±9 | 4,742 | N/A | N/A |
| 104 | 90-111 | claude-3-7-sonnet-20250219-thinking-32k | Anthropic | Proprietary | 1387±4 | 39,307 | $3/$15 | 200K |
| 105 | 90-112 | mistral-medium-2505 | Mistral | Proprietary | 1386±5 | 33,805 | $0.40/$2 | 131.1K |
| 106 | 90-112 | minimax-m2.1-preview | MiniMax | MIT | 1386±5 | 17,271 | $0.27/$0.95 | 196.6K |
| 107 | 91-115 | hunyuan-turbos-20250416 | Tencent | Proprietary | 1383±6 | 10,871 | N/A | N/A |
| 108 | 92-115 | qwen3-30b-a3b-instruct-2507 | Alibaba | Apache 2.0 | 1383±5 | 24,086 | $0.09/$0.30 | 262.1K |
| 109 | 94-115 | gpt-4.1-mini-2025-04-14 | OpenAI | Proprietary | 1382±4 | 39,876 | $0.40/$1.60 | 1M |
| 110 | 99-116 | gemini-2.5-flash-lite-preview-09-2025-no-thinking | Google | Proprietary | 1380±3 | 47,802 | $0.10/$0.40 | 1M |
| 111 | 91-126 | glm-4.6v | Z.ai | MIT | 1378±11 | 2,836 | $0.30/$0.90 | 131.1K |
| 112 | 102-123 | trinity-large | Arcee AI | Apache 2.0 | 1376±7 | 8,033 | N/A | N/A |
| 113 | 106-122 | qwen3-235b-a22b | Alibaba | Apache 2.0 | 1375±5 | 26,679 | $0.46/$1.82 | 131.1K |
| 114 | 106-122 | qwen2.5-max | Alibaba | Proprietary | 1374±4 | 32,882 | N/A | N/A |
| 115 | 106-122 | gemini-2.5-flash-lite-preview-06-17-thinking | Google | Proprietary | 1374±4 | 33,445 | $0.10/$0.40 | 1M |
| 116 | 110-126 | glm-4.5-air | Z.ai | MIT | 1372±4 | 31,546 | $0.13/$0.85 | 131.1K |
| 117 | 111-125 | claude-3-5-sonnet-20241022 | Anthropic | Proprietary | 1372±3 | 88,853 | $6/$30 | 200K |
| 118 | 111-126 | claude-3-7-sonnet-20250219 | Anthropic | Proprietary | 1371±4 | 43,753 | $3/$15 | 200K |
| 119 | 111-128 | qwen3-next-80b-a3b-thinking | Alibaba | Apache 2.0 | 1369±6 | 13,906 | $0.10/$0.78 | 131.1K |
| 120 | 111-129 | glm-4.7-flash | Z.ai | MIT | 1368±6 | 11,857 | $0.06/$0.40 | 202.8K |
| 121 | 111-128 | amazon-nova-experimental-chat-11-10 | Amazon | Proprietary | 1368±5 | 24,655 | N/A | N/A |
| 122 | 114-131 | gemma-3-27b-it | Google | Gemma | 1365±4 | 48,195 | $0.08/$0.16 | 131.1K |
| 123 | 111-142 | nvidia-nemotron-3-super-120b-a12b | Nvidia | NVIDIA Open Model | 1365±10 (Preliminary) | 3,641 | N/A | N/A |
| 124 | 114-133 | minimax-m1 | MiniMax | Apache 2.0 | 1364±4 | 35,951 | $0.40/$2.20 | 1M |
| 125 | 115-135 | o3-mini-high | OpenAI | Proprietary | 1363±5 | 18,589 | $1.10/$4.40 | 200K |
| 126 | 116-135 | grok-3-mini-high | xAI | Proprietary | 1363±5 | 17,210 | $0.30/$0.50 | 131.1K |
| 127 | 119-139 | gemini-2.0-flash-001 | Google | Proprietary | 1360±4 | 44,240 | $0.10/$0.40 | 1M |
| 128 | 121-144 | deepseek-v3 | DeepSeek | DeepSeek | 1358±5 | 21,770 | $1.14/$4.56 | N/A |
| 129 | 122-146 | grok-3-mini-beta | xAI | Proprietary | 1358±5 | 23,162 | $0.30/$0.50 | 131.1K |
| 130 | 122-146 | mistral-small-2506 | Mistral | Apache 2.0 | 1357±5 | 18,015 | $0.10/$0.30 | 32K |
| 131 | 119-151 | intellect-3 | Prime Intellect | MIT | 1357±8 | 5,387 | $0.20/$1.10 | 131.1K |
| 132 | 124-150 | gpt-oss-120b | OpenAI | Apache 2.0 | 1354±4 | 31,077 | $0.04/$0.19 | 131.1K |
| 133 | 126-150 | command-a-03-2025 | Cohere | CC-BY-NC-4.0 | 1354±3 | 57,068 | $2.50/$10 | 256K |
| 134 | 123-153 | glm-4.5v | Z.ai | MIT | 1353±8 | 4,989 | $0.60/$1.80 | 65.5K |
| 135 | 126-151 | gemini-2.0-flash-lite-preview-02-05 | Google | Proprietary | 1353±4 | 24,955 | $0.07/$0.30 | 1M |
| 136 | 128-152 | gemini-1.5-pro-002 | Google | Proprietary | 1351±3 | 55,606 | $3.50/$10.50 | 2.1M |
| 137 | 126-153 | amazon-nova-experimental-chat-10-20 | Amazon | Proprietary | 1351±6 | 11,580 | N/A | N/A |
| 138 | 123-164 | hunyuan-turbos-20250226 | Tencent | Proprietary | 1349±12 | 2,220 | N/A | N/A |
| 139 | 127-156 | step-3 | StepFun | Apache 2.0 | 1348±7 | 6,600 | $0.57/$1.42 | 65.5K |
| 140 | 131-153 | o3-mini | OpenAI | Proprietary | 1348±3 | 57,950 | $1.10/$4.40 | 200K |
| 141 | 127-160 | minimax-m2 | MiniMax | Apache 2.0 | 1347±8 | 6,950 | $0.26/$1 | 196.6K |
| 142 | 127-163 | qwen3-32b | Alibaba | Apache 2.0 | 1347±9 | 3,926 | $0.08/$0.24 | 41K |
| 143 | 124-166 | llama-3.1-nemotron-ultra-253b-v1 | Nvidia | Nvidia Open Model | 1347±12 | 2,549 | $0.60/$1.80 | 131.1K |
| 144 | 125-165 | amazon-nova-experimental-chat-10-09 | Amazon | Proprietary | 1347±11 | 2,872 | N/A | N/A |
| 145 | 129-160 | ling-flash-2.0 | Ant Group | MIT | 1346±7 | 7,125 | N/A | N/A |
| 146 | 128-163 | qwen-plus-0125 | Alibaba | Proprietary | 1346±8 | 5,819 | $0.40/$1.20 | 131.1K |
| 147 | 133-156 | gpt-4o-2024-05-13 | OpenAI | Proprietary | 1345±3 | 112,881 | $5/$15 | 128K |
| 148 | 129-167 | nvidia-llama-3.3-nemotron-super-49b-v1.5 | Nvidia | Nvidia Open | 1343±10 | 3,392 | $0.10/$0.40 | 131.1K |
| 149 | 131-166 | glm-4-plus-0111 | Zhipu | Proprietary | 1343±8 | 5,760 | N/A | N/A |
| 150 | 136-162 | claude-3-5-sonnet-20240620 | Anthropic | Proprietary | 1342±3 | 82,419 | $6/$30 | 200K |
| 151 | 131-168 | gemma-3-12b-it | Google | Gemma | 1342±10 | 3,829 | $0.04/$0.13 | 131.1K |
| 152 | 131-171 | hunyuan-turbo-0110 | Tencent | Proprietary | 1340±12 | 2,290 | N/A | N/A |
| 153 | 139-168 | nova-2-lite | Amazon | Proprietary | 1338±6 | 12,343 | $0.30/$2.50 | 1M |
| 154 | 139-170 | gpt-5-nano-high | OpenAI | Proprietary | 1337±7 | 8,349 | $0.05/$0.40 | 400K |
| 155 | 141-167 | o1-mini | OpenAI | Proprietary | 1337±4 | 51,981 | $1.10/$4.40 | N/A |
| 156 | 141-169 | qwq-32b | Alibaba | Apache 2.0 | 1336±4 | 25,749 | $0.15/$0.58 | 131.1K |
| 157 | 143-169 | grok-2-2024-08-13 | xAI | Proprietary | 1335±4 | 63,498 | $2/$10 | 131.1K |
| 158 | 144-170 | llama-3.1-405b-instruct-bf16 | Meta | Llama 3.1 Community | 1335±4 | 41,375 | $4/$4 | 32.8K |
| 159 | 143-170 | gpt-4o-2024-08-06 | OpenAI | Proprietary | 1335±4 | 45,499 | $2.50/$10 | 128K |
| 160 | 141-170 | gemini-advanced-0514 | Google | Proprietary | 1334±5 | 50,148 | N/A | N/A |
| 161 | 139-177 | step-2-16k-exp-202412 | StepFun | Proprietary | 1334±9 | 4,833 | N/A | N/A |
| 162 | 147-171 | llama-3.1-405b-instruct-fp8 | Meta | Llama 3.1 Community | 1333±4 | 59,656 | $4/$4 | 32.8K |
| 163 | 146-177 | olmo-3.1-32b-instruct | Ai2 | Apache 2.0 | 1331±6 | 12,326 | $0.20/$0.60 | 65.5K |
| 164 | 150-182 | yi-lightning | 01 AI | Proprietary | 1328±5 | 27,332 | N/A | N/A |
| 165 | 152-183 | qwen3-30b-a3b | Alibaba | Apache 2.0 | 1328±5 | 26,908 | $0.08/$0.28 | 41K |
| 166 | 141-192 | llama-3.3-nemotron-49b-super-v1 | Nvidia | Nvidia | 1327±12 | 2,218 | N/A | N/A |
| 167 | 156-183 | llama-4-maverick-17b-128e-instruct | Meta | Llama 4 | 1327±4 | 40,529 | $0.63/$1.80 | 131.1K |
| 168 | 135-203 | molmo-2-8b | Ai2 | Apache 2.0 | 1326±21 | 811 | $0.20/$0.20 | 36.9K |
| 169 | 148-192 | hunyuan-large-2025-02-10 | Tencent | Proprietary | 1326±10 | 3,738 | N/A | N/A |
| 170 | 162-188 | gpt-4-turbo-2024-04-09 | OpenAI | Proprietary | 1324±4 | 98,114 | $10/$30 | 128K |
| 171 | 153-192 | deepseek-v2.5-1210 | DeepSeek | DeepSeek | 1323±8 | 6,795 | N/A | N/A |
| 172 | 162-188 | claude-3-5-haiku-20241022 | Anthropic | Proprietary | 1323±3 | 70,555 | $0.80/$4 | 200K |
| 173 | 162-188 | gemini-1.5-pro-001 | Google | Proprietary | 1323±4 | 79,138 | $3.50/$10.50 | 2.1M |
| 174 | 162-189 | llama-4-scout-17b-16e-instruct | Meta | Llama | 1322±5 | 30,740 | $0.40/$0.70 | 8.2K |
| 175 | 160-193 | gpt-4.1-nano-2025-04-14 | OpenAI | Proprietary | 1322±8 | 6,103 | $0.10/$0.40 | 1M |
| 176 | 162-192 | step-1o-turbo-202506 | StepFun | Proprietary | 1321±7 | 9,310 | N/A | N/A |
| 177 | 164-189 | claude-3-opus-20240229 | Anthropic | Proprietary | 1321±3 | 194,909 | $15/$75 | 200K |
| 178 | 162-193 | ring-flash-2.0 | Ant Group | MIT | 1321±7 | 7,252 | N/A | N/A |
| 179 | 164-193 | glm-4-plus | Zhipu AI | Proprietary | 1319±5 | 26,126 | $0.44/$1.76 | 204.8K |
| 180 | 164-194 | gemma-3n-e4b-it | Google | Gemma | 1318±5 | 22,915 | $0.02/$0.04 | 32.8K |
| 181 | 167-192 | llama-3.3-70b-instruct | Meta | Llama-3.3 | 1318±3 | 55,149 | $0.10/$0.32 | 131.1K |
| 182 | 164-195 | gpt-oss-20b | OpenAI | Apache 2.0 | 1318±6 | 10,750 | $0.03/$0.11 | 131.1K |
| 183 | 164-195 | nvidia-nemotron-3-nano-30b-a3b-bf16 | Nvidia | NVIDIA Open Model | 1318±6 | 15,650 | $0.06/$0.24 | 262.1K |
| 184 | 165-195 | qwen-max-0919 | Alibaba | Qwen | 1318±6 | 16,478 | $1.60/$6.40 | 32.8K |
| 185 | 167-193 | gpt-4o-mini-2024-07-18 | OpenAI | Proprietary | 1317±3 | 68,757 | $0.15/$0.60 | 128K |
| 186 | 167-200 | qwen2.5-plus-1127 | Alibaba | Proprietary | 1315±6 | 10,187 | N/A | N/A |
| 187 | 170-199 | athene-v2-chat | NexusFlow | NexusFlow | 1314±5 | 24,739 | N/A | N/A |
| 188 | 172-199 | mistral-large-2407 | Mistral | Mistral Research | 1314±4 | 45,459 | $2/$6 | 131.1K |
| 189 | 172-200 | gpt-4-0125-preview | OpenAI | Proprietary | 1313±4 | 93,439 | $10/$30 | 128K |
| 190 | 172-200 | gpt-4-1106-preview | OpenAI | Proprietary | 1312±4 | 100,105 | $10/$30 | 128K |
| 191 | 167-204 | hunyuan-standard-2025-02-10 | Tencent | Proprietary | 1311±10 | 3,904 | N/A | N/A |
| 192 | 181-203 | gemini-1.5-flash-002 | Google | Proprietary | 1309±4 | 34,902 | $0.07/$0.30 | 1M |
| 193 | 185-203 | grok-2-mini-2024-08-13 | xAI | Proprietary | 1308±4 | 52,567 | $2/$10 | 131.1K |
| 194 | 185-204 | deepseek-v2.5 | DeepSeek | DeepSeek | 1307±5 | 24,572 | N/A | N/A |
| 195 | 167-212 | mercury | Inception AI | Proprietary | 1306±14 | 1,988 | $0.25/$0.75 | 128K |
| 196 | 177-204 | olmo-3-32b-think | Ai2 | Apache 2.0 | 1306±8 | 6,023 | $0.15/$0.50 | 65.5K |
| 197 | 185-204 | athene-70b-0725 | NexusFlow | CC-BY-NC-4.0 | 1306±6 | 19,621 | N/A | N/A |
| 198 | 187-204 | mistral-large-2411 | Mistral | MRL | 1305±4 | 28,073 | $2/$6 | 131.1K |
| 199 | 185-204 | magistral-medium-2506 | Mistral | Proprietary | 1304±6 | 11,825 | $2/$5 | 40K |
| 200 | 182-211 | gemma-3-4b-it | Google | Gemma | 1303±9 | 4,171 | $0.04/$0.08 | 131.1K |
| 201 | 190-204 | mistral-small-3.1-24b-instruct-2503 | Mistral | Apache 2.0 | 1303±4 | 33,731 | $0.10/$0.30 | 32K |
| 202 | 190-204 | qwen2.5-72b-instruct | Alibaba | Qwen | 1302±4 | 39,406 | $1.20/$1.20 | N/A |
| 203 | 190-214 | llama-3.1-nemotron-70b-instruct | Nvidia | Llama 3.1 | 1299±8 | 7,140 | $1.20/$1.20 | 131.1K |
| 204 | 193-216 | hunyuan-large-vision | Tencent | Proprietary | 1294±9 | 5,451 | N/A | N/A |
| 205 | 201-215 | llama-3.1-70b-instruct | Meta | Llama 3.1 Community | 1293±4 | 55,240 | $0.40/$0.40 | 131.1K |
| 206 | 201-216 | amazon-nova-pro-v1.0 | Amazon | Proprietary | 1290±5 | 24,745 | $0.80/$3.20 | 300K |
| 207 | 201-219 | jamba-1.5-large | AI21 Labs | Jamba Open | 1288±7 | 8,662 | $2/$8 | 256K |
| 208 | 203-216 | gemma-2-27b-it | Google | Gemma license | 1288±3 | 75,754 | $0.65/$0.65 | 8.2K |
| 209 | 201-219 | reka-core-20240904 | Reka AI | Proprietary | 1287±7 | 7,312 | N/A | N/A |
| 210 | 201-221 | ibm-granite-h-small | IBM | Apache 2.0 | 1287±8 | 5,783 | N/A | N/A |
| 211 | 203-219 | gpt-4-0314 | OpenAI | Proprietary | 1286±5 | 54,173 | $30/$60 | 8.2K |
| 212 | 201-225 | llama-3.1-tulu-3-70b | Ai2 | Llama 3.1 | 1286±10 | 2,846 | N/A | N/A |
| 213 | 202-222 | olmo-3.1-32b-think | Ai2 | Apache 2.0 | 1286±7 | 8,575 | $0.15/$0.50 | 65.5K |
| 214 | 201-225 | llama-3.1-nemotron-51b-instruct | Nvidia | Llama 3.1 | 1286±10 | 3,749 | N/A | N/A |
| 215 | 204-219 | gemini-1.5-flash-001 | Google | Proprietary | 1285±4 | 62,833 | $0.07/$0.30 | 1M |
| 216 | 208-225 | claude-3-sonnet-20240229 | Anthropic | Proprietary | 1280±4 | 109,284 | $3/$15 | 200K |
| 217 | 205-225 | gemma-2-9b-it-simpo | Princeton | MIT | 1279±7 | 10,072 | $0.03/$0.09 | 8.2K |
| 218 | 208-226 | nemotron-4-340b-instruct | Nvidia | NVIDIA Open Model | 1277±5 | 19,659 | N/A | N/A |
| 219 | 208-227 | command-r-plus-08-2024 | Cohere | CC-BY-NC-4.0 | 1276±7 | 9,866 | $2.50/$10 | 128K |
| 220 | 213-225 | llama-3-70b-instruct | Meta | Llama 3 Community | 1275±4 | 156,876 | $0.51/$0.74 | 8.2K |
| 221 | 214-226 | gpt-4-0613 | OpenAI | Proprietary | 1274±4 | 88,723 | $30/$60 | 8.2K |
| 222 | 212-228 | mistral-small-24b-instruct-2501 | Mistral | Apache 2.0 | 1274±6 | 14,681 | $0.05/$0.08 | 32.8K |
| 223 | 212-229 | glm-4-0520 | Zhipu AI | Proprietary | 1273±7 | 9,788 | N/A | N/A |
| 224 | 214-231 | reka-flash-20240904 | Reka AI | Proprietary | 1271±7 | 7,536 | N/A | N/A |
| 225 | 214-234 | qwen2.5-coder-32b-instruct | Alibaba | Apache 2.0 | 1270±8 | 5,432 | $0.87/$0.87 | 32K |
| 226 | 219-234 | c4ai-aya-expanse-32b | Cohere | CC-BY-NC-4.0 | 1267±5 | 27,124 | N/A | N/A |
| 227 | 222-234 | gemma-2-9b-it | Google | Gemma license | 1265±4 | 54,611 | $0.03/$0.09 | 8.2K |
| 228 | 221-235 | deepseek-coder-v2 | DeepSeek | DeepSeek License | 1264±6 | 15,147 | $0.14/$0.28 | 128K |
| 229 | 224-235 | command-r-plus | Cohere | CC-BY-NC-4.0 | 1261±4 | 77,554 | $2.50/$10 | 128K |
| 230 | 223-235 | qwen2-72b-instruct | Alibaba | Qianwen LICENSE | 1261±5 | 37,325 | $0.90/$0.90 | 32.8K |
| 231 | 225-235 | claude-3-haiku-20240307 | Anthropic | Proprietary | 1260±4 | 117,701 | $0.25/$1.25 | 200K |
| 232 | 224-236 | amazon-nova-lite-v1.0 | Amazon | Proprietary | 1260±5 | 19,372 | $0.06/$0.24 | 300K |
| 233 | 225-236 | gemini-1.5-flash-8b-001 | Google | Proprietary | 1258±4 | 35,558 | $0.07/$0.30 | 1M |
| 234 | 228-236 | phi-4 | Microsoft | MIT | 1256±5 | 24,126 | $0.07/$0.14 | 16.4K |
| 235 | 225-242 | olmo-2-0325-32b-instruct | Ai2 | Apache-2.0 | 1252±11 | 3,334 | $0.05/$0.20 | 128K |
| 236 | 232-241 | command-r-08-2024 | Cohere | CC-BY-NC-4.0 | 1249±7 | 10,140 | $0.15/$0.60 | 128K |
| 237 | 235-245 | mistral-large-2402 | Mistral | Proprietary | 1242±5 | 62,436 | $4/$12 | 32K |
| 238 | 235-245 | amazon-nova-micro-v1.0 | Amazon | Proprietary | 1240±5 | 19,364 | $0.04/$0.14 | 128K |
| 239 | 235-248 | jamba-1.5-mini | AI21 Labs | Jamba Open | 1239±7 | 8,858 | $0.20/$0.40 | 256K |
| 240 | 235-253 | ministral-8b-2410 | Mistral | MRL | 1237±9 | 4,781 | $0.10/$0.10 | 131.1K |
| 241 | 236-253 | gemini-pro-dev-api | Google | Proprietary | 1234±7 | 18,354 | $0.35/$1.05 | 32.8K |
| 242 | 237-252 | qwen1.5-110b-chat | Alibaba | Qianwen LICENSE | 1233±6 | 26,195 | N/A | N/A |
| 243 | 235-255 | hunyuan-standard-256k | Tencent | Proprietary | 1233±12 | 2,728 | N/A | N/A |
| 244 | 237-254 | reka-flash-21b-20240226-online | Reka AI | Proprietary | 1233±7 | 15,450 | N/A | N/A |
| 245 | 237-253 | qwen1.5-72b-chat | Alibaba | Qianwen LICENSE | 1232±5 | 39,302 | N/A | N/A |
| 246 | 239-254 | mixtral-8x22b-instruct-v0.1 | Mistral | Apache 2.0 | 1229±5 | 51,416 | $0.90/$0.90 | 65.5K |
| 247 | 240-255 | command-r | Cohere | CC-BY-NC-4.0 | 1226±5 | 54,036 | $0.15/$0.60 | 128K |
| 248 | 239-255 | reka-flash-21b-20240226 | Reka AI | Proprietary | 1226±6 | 24,806 | N/A | N/A |
| 249 | 240-256 | gpt-3.5-turbo-0125 | OpenAI | Proprietary | 1223±5 | 66,207 | $0.50/$1.50 | 16.4K |
| 250 | 244-255 | llama-3-8b-instruct | Meta | Llama 3 Community | 1223±4 | 104,642 | $0.03/$0.04 | 8.2K |
| 251 | 240-257 | c4ai-aya-expanse-8b | Cohere | CC-BY-NC-4.0 | 1222±7 | 9,818 | N/A | N/A |
| 252 | 241-257 | mistral-medium | Mistral | Proprietary | 1222±6 | 34,550 | $2.70/$8.10 | 32K |
| 253 | 239-259 | gemini-pro | Google | Proprietary | 1221±12 | 6,390 | $0.35/$1.05 | 32.8K |
| 254 | 240-259 | llama-3.1-tulu-3-8b | Ai2 | Llama 3.1 | 1221±11 | 2,896 | N/A | N/A |
| 255 | 251-260 | yi-1.5-34b-chat | 01 AI | Apache-2.0 | 1213±5 | 24,146 | N/A | N/A |
| 256 | 246-262 | zephyr-orpo-141b-A35b-v0.1 | HuggingFace | Apache 2.0 | 1212±11 | 4,652 | N/A | N/A |
| 257 | 253-260 | llama-3.1-8b-instruct | Meta | Llama 3.1 Community | 1211±4 | 49,605 | $0.02/$0.05 | 16.4K |
| 258 | 249-266 | granite-3.1-8b-instruct | IBM | Apache 2.0 | 1208±11 | 3,090 | N/A | N/A |
| 259 | 255-266 | qwen1.5-32b-chat | Alibaba | Qianwen LICENSE | 1203±6 | 21,741 | N/A | N/A |
| 260 | 253-268 | gpt-3.5-turbo-1106 | OpenAI | Proprietary | 1202±9 | 16,619 | $1/$2 | 16.4K |
| 261 | 257-267 | gemma-2-2b-it | Google | Gemma license | 1199±4 | 46,616 | N/A | N/A |
| 262 | 257-268 | phi-3-medium-4k-instruct | Microsoft | MIT | 1197±5 | 25,055 | $0.17/$0.68 | N/A |
| 263 | 258-268 | mixtral-8x7b-instruct-v0.1 | Mistral | Apache 2.0 | 1196±4 | 73,503 | $0.63/$0.63 | 32K |
| 264 | 258-273 | dbrx-instruct-preview | Databricks | DBRX LICENSE | 1194±6 | 32,191 | $0.60/$0.60 | 32.8K |
| 265 | 258-277 | internlm2_5-20b-chat | InternLM | Other | 1191±7 | 9,901 | $0/$0 | 32.8K |
| 266 | 258-277 | qwen1.5-14b-chat | Alibaba | Qianwen LICENSE | 1190±7 | 17,839 | $0.30/$0.30 | N/A |
| 267 | 261-283 | wizardlm-70b | Microsoft | Llama 2 Community | 1184±9 | 8,214 | N/A | N/A |
| 268 | 260-284 | deepseek-llm-67b-chat | DeepSeek | DeepSeek License | 1184±12 | 4,932 | N/A | N/A |
| 269 | 264-279 | yi-34b-chat | 01 AI | Yi License | 1183±7 | 15,483 | $0.90/$0.90 | 4.1K |
| 270 | 264-284 | openchat-3.5-0106 | OpenChat | Apache-2.0 | 1181±8 | 12,637 | N/A | N/A |
| 271 | 264-284 | openchat-3.5 | OpenChat | Apache-2.0 | 1181±10 | 7,968 | $0.20/$0.20 | N/A |
| 272 | 264-284 | granite-3.0-8b-instruct | IBM | Apache 2.0 | 1181±9 | 6,638 | N/A | N/A |
| 273 | 265-283 | gemma-1.1-7b-it | Google | Gemma license | 1180±6 | 23,893 | $0.03/$0.09 | 8.2K |
| 274 | 265-284 | snowflake-arctic-instruct | Snowflake | Apache 2.0 | 1179±6 | 32,832 | N/A | N/A |
| 275 | 264-286 | granite-3.1-2b-instruct | IBM | Apache 2.0 | 1178±11 | 3,188 | N/A | N/A |
| 276 | 265-285 | tulu-2-dpo-70b | AllenAI/UW | AI2 ImpACT Low-risk | 1177±10 | 6,535 | N/A | N/A |
| 277 | 265-288 | openhermes-2.5-mistral-7b | NousResearch | Apache-2.0 | 1174±10 | 5,006 | $0.17/$0.17 | N/A |
| 278 | 267-287 | vicuna-33b | LMSYS | Non-commercial | 1172±6 | 22,479 | $0/$0 | 2K |
| 279 | 267-289 | starling-lm-7b-beta | Nexusflow | Apache-2.0 | 1171±7 | 16,056 | N/A | N/A |
| 280 | 268-288 | phi-3-small-8k-instruct | Microsoft | MIT | 1170±6 | 17,766 | $0.15/$0.60 | N/A |
| 281 | 268-288 | llama-2-70b-chat | Meta | Llama 2 Community | 1170±6 | 38,492 | $0.70/$2.80 | 4.1K |
| 282 | 268-291 | starling-lm-7b-alpha | UC Berkeley | CC-BY-NC-4.0 | 1167±8 | 10,224 | N/A | N/A |
| 283 | 270-291 | llama-3.2-3b-instruct | Meta | Llama 3.2 | 1166±8 | 7,936 | $0.05/$0.34 | 80K |
| 284 | 268-294 | nous-hermes-2-mixtral-8x7b-dpo | NousResearch | Apache-2.0 | 1164±12 | 3,777 | $0.90/$0.90 | N/A |
| 285 | 275-300 | qwq-32b-preview | Alibaba | Apache 2.0 | 1156±12 | 3,231 | $0.15/$0.58 | 131.1K |
| 286 | 281-297 | granite-3.0-2b-instruct | IBM | Apache 2.0 | 1155±8 | 6,837 | N/A | N/A |
| 287 | 276-301 | llama2-70b-steerlm-chat | Nvidia | Llama 2 Community | 1155±13 | 3,585 | N/A | N/A |
| 288 | 278-304 | solar-10.7b-instruct-v1.0 | Upstage AI | CC-BY-NC-4.0 | 1152±13 | 4,155 | $0.30/$0.30 | N/A |
| 289 | 277-306 | dolphin-2.2.1-mistral-7b | Cognitive Computations | Apache-2.0 | 1151±15 | 1,679 | $0.50/$0.50 | 16.4K |
| 290 | 282-304 | mpt-30b-chat | MosaicML | CC-BY-NC-SA-4.0 | 1149±12 | 2,572 | N/A | N/A |
| 291 | 284-301 | mistral-7b-instruct-v0.2 | Mistral | Apache-2.0 | 1149±7 | 19,402 | $0.20/$0.20 | 32.8K |
| 292 | 284-302 | wizardlm-13b | Microsoft | Llama 2 Community | 1148±9 | 7,044 | $0.30/$0.30 | N/A |
| 293 | 282-308 | falcon-180b-chat | TII | Falcon-180B TII License | 1146±17 | 1,295 | N/A | N/A |
| 294 | 284-307 | qwen1.5-7b-chat | Alibaba | Qianwen LICENSE | 1143±10 | 4,737 | $0.20/$0.20 | N/A |
| 295 | 285-305 | phi-3-mini-4k-instruct-june-2024 | Microsoft | MIT | 1142±6 | 12,297 | $0.13/$0.52 | 4.1K |
| 296 | 285-307 | llama-2-13b-chat | Meta | Llama 2 Community | 1141±7 | 19,174 | $0.25/$0.25 | 4.1K |
| 297 | 286-307 | vicuna-13b | LMSYS | Llama 2 Community | 1140±7 | 19,367 | $0.30/$0.30 | N/A |
| 298 | 285-309 | qwen-14b-chat | Alibaba | Qianwen LICENSE | 1138±11 | 4,964 | N/A | N/A |
| 299 | 286-309 | palm-2 | Google | Proprietary | 1136±9 | 8,554 | $0.50/$0.50 | 25.8K |
| 300 | 287-309 | codellama-34b-instruct | Meta | Llama 2 Community | 1136±9 | 7,366 | $0.35/$1.40 | 16.4K |
| 301 | 286-309 | gemma-7b-it | Google | Gemma license | 1136±10 | 8,925 | $0.05/$0.08 | 8.2K |
| 302 | 290-311 | zephyr-7b-beta | HuggingFace | MIT | 1130±9 | 11,118 | $0.15/$0.15 | 16.4K |
| 303 | 293-311 | phi-3-mini-128k-instruct | Microsoft | MIT | 1128±7 | 20,685 | $0.13/$0.52 | N/A |
| 304 | 294-311 | phi-3-mini-4k-instruct | Microsoft | MIT | 1128±6 | 20,118 | $0.13/$0.52 | N/A |
| 305 | 290-314 | guanaco-33b | UW | Non-commercial | 1126±12 | 2,921 | N/A | N/A |
| 306 | 289-314 | zephyr-7b-alpha | HuggingFace | MIT | 1126±16 | 1,785 | N/A | N/A |
| 307 | 297-314 | stripedhyena-nous-7b | Together AI | Apache 2.0 | 1120±11 | 5,182 | $0.20/$0.20 | N/A |
| 308 | 292-315 | codellama-70b-instruct | Meta | Llama 2 Community | 1118±18 | 1,143 | $0.70/$2.80 | 16.4K |
| 309 | 302-314 | vicuna-7b | LMSYS | Llama 2 Community | 1114±9 | 6,923 | $0.20/$0.20 | N/A |
| 310 | 302-314 | gemma-1.1-2b-it | Google | Gemma license | 1114±8 | 10,854 | N/A | N/A |
| 311 | 298-315 | smollm2-1.7b-instruct | HuggingFace | Apache 2.0 | 1114±14 | 2,199 | N/A | N/A |
| 312 | 305-314 | llama-3.2-1b-instruct | Meta | Llama 3.2 | 1111±8 | 8,045 | $0.03/$0.20 | 60K |
| 313 | 305-315 | mistral-7b-instruct | Mistral | Apache 2.0 | 1109±9 | 8,977 | $0.07/$0.28 | 4.1K |
| 314 | 305-315 | llama-2-7b-chat | Meta | Llama 2 Community | 1107±7 | 14,148 | $0.15/$0.15 | 4.1K |
| 315 | 311-318 | gemma-2b-it | Google | Gemma license | 1091±12 | 4,780 | $0.10/$0.10 | N/A |
| 316 | 315-318 | qwen1.5-4b-chat | Alibaba | Qianwen LICENSE | 1089±9 | 7,597 | $0.10/$0.10 | N/A |
| 317 | 315-322 | olmo-7b-instruct | Ai2 | Apache-2.0 | 1074±11 | 6,328 | $0.20/$0.20 | N/A |
| 318 | 317-322 | koala-13b | UC Berkeley | Non-commercial | 1070±10 | 6,965 | N/A | N/A |
| 319 | 317-322 | alpaca-13b | Stanford | Non-commercial | 1067±12 | 5,745 | N/A | N/A |
| 320 | 315-323 | gpt4all-13b-snoozy | Nomic AI | Non-commercial | 1065±15 | 1,743 | N/A | N/A |
| 321 | 317-323 | mpt-7b-chat | MosaicML | CC-BY-NC-SA-4.0 | 1061±12 | 3,924 | N/A | N/A |
| 322 | 317-323 | chatglm3-6b | Tsinghua | Apache-2.0 | 1055±12 | 4,658 | N/A | N/A |
| 323 | 320-325 | RWKV-4-Raven-14B | RWKV | Apache 2.0 | 1040±11 | 4,845 | N/A | N/A |
| 324 | 323-325 | chatglm2-6b | Tsinghua | Apache-2.0 | 1023±14 | 2,658 | N/A | N/A |
| 325 | 323-325 | oasst-pythia-12b | OpenAssistant | Apache 2.0 | 1021±11 | 6,310 | N/A | N/A |
| 326 | 326-329 | chatglm-6b | Tsinghua | Non-commercial | 995±13 | 4,914 | N/A | N/A |
| 327 | 326-329 | fastchat-t5-3b | LMSYS | Apache 2.0 | 990±12 | 4,203 | N/A | N/A |
| 328 | 326-329 | dolly-v2-12b | Databricks | MIT | 979±14 | 3,412 | N/A | N/A |
| 329 | 326-330 | llama-13b | Meta | Non-commercial | 971±16 | 2,391 | $0.23/$0.23 | N/A |
| 330 | 329-330 | stablelm-tuned-alpha-7b | Stability AI | CC-BY-NC-SA-4.0 | 952±13 | 3,287 | N/A | N/A |
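Notes
Rank (UB): The rank computed from the Bradley-Terry model. It reflects a model's overall performance in the arena and provides an upper-bound (UB) estimate for its Elo score, which helps gauge the model's potential competitiveness.
Model: The name of the large language model (LLM). Some model names may carry an embedded link.
Score: The Elo rating the model earned from user votes in the arena. Elo is a relative rating system; a higher score indicates better performance.
95% confidence interval (±): The 95% confidence interval of the model's Elo rating (for example, ±6). The narrower the interval, the more stable and reliable the rating.
Votes: The total number of votes the model has received in the arena. More votes generally mean a statistically more reliable rating.
Organization/company: The organization or company that provides the model.
License: The model's license type, such as Proprietary, Apache 2.0, or MIT.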
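To make the score and confidence-interval columns concrete, here is a minimal Python sketch. It assumes the conventional Elo mapping (base-10 logistic with a scale of 400) applies to these scores, which this page does not state; the function names `parse_score`, `expected_win_probability`, and `intervals_overlap` are hypothetical helpers, not part of any Chatbot Arena tooling.

```python
import re


def parse_score(cell: str) -> tuple[int, int]:
    """Split a score cell such as '1502±6' into (score, ci_half_width).

    Trailing tags such as 'Preliminary' are ignored.
    """
    match = re.match(r"\s*(\d+)\s*±\s*(\d+)", cell)
    if match is None:
        raise ValueError(f"unrecognized score cell: {cell!r}")
    return int(match.group(1)), int(match.group(2))


def expected_win_probability(score_a: float, score_b: float,
                             scale: float = 400.0) -> float:
    """Elo-style expected win rate of A over B.

    Assumption: base-10 logistic with scale 400 (the conventional Elo mapping).
    """
    return 1.0 / (1.0 + 10.0 ** ((score_b - score_a) / scale))


def intervals_overlap(cell_a: str, cell_b: str) -> bool:
    """Return True when the two 95% intervals share at least one point."""
    score_a, ci_a = parse_score(cell_a)
    score_b, ci_b = parse_score(cell_b)
    return (score_a - ci_a) <= (score_b + ci_b) and (score_b - ci_b) <= (score_a + ci_a)


if __name__ == "__main__":
    first, second = "1502±6", "1501±6"  # the top two rows of the leaderboard
    p = expected_win_probability(parse_score(first)[0], parse_score(second)[0])
    print(f"expected win rate of #1 over #2: {p:.3f}")
    print("intervals overlap:", intervals_overlap(first, second))
```

Under these assumptions a one-point gap is close to a coin flip, and overlapping intervals suggest that the order of two adjacent models should not be read as a significant difference.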
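Data Source and Update Frequency
The leaderboard data is fetched directly from the official website by an automated script. The leaderboard is updated automatically every day by GitHub Actions.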
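As an illustration of how an automated update might stamp the "Data updated" line shown at the top of this page, the sketch below formats one instant in both UTC and Beijing time. It is not the actual script behind this leaderboard; `format_update_time` and `BEIJING` are hypothetical names, and Beijing time is modeled as a fixed UTC+8 offset.

```python
from datetime import datetime, timedelta, timezone

# Assumption: Beijing time is a fixed UTC+8 offset with no daylight saving.
BEIJING = timezone(timedelta(hours=8), name="CST")


def format_update_time(moment: datetime) -> str:
    """Render a timezone-aware instant as 'UTC / CST (Beijing time)'."""
    fmt = "%Y-%m-%d %H:%M:%S"
    utc_text = moment.astimezone(timezone.utc).strftime(fmt)
    beijing_text = moment.astimezone(BEIJING).strftime(fmt)
    return f"{utc_text} UTC / {beijing_text} CST (Beijing time)"


if __name__ == "__main__":
    # A daily job would typically pass the current time.
    print(format_update_time(datetime.now(timezone.utc)))
```

Passing a timezone-aware `datetime` keeps the two renderings consistent regardless of the time zone of the machine that runs the job.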
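Disclaimer
This report is for reference only. The leaderboard data changes over time and is based on users' preference votes on Chatbot Arena during a specific period. The completeness and accuracy of the data depend on the upstream data source. Different models may be released under different licenses; always consult the model provider's official documentation before use.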