Model Leaderboard
This leaderboard is based on Chatbot Arena (lmarena.ai) data and is generated by an automated pipeline.
Data updated: 2026-02-08 08:10:17 UTC / 2026-02-08 16:10:17 CST (Beijing time)
Leaderboard

| Rank (UB) | Rank spread | Model | Score | 95% CI (±) | Votes | Organization | License |
|---|---|---|---|---|---|---|---|
| 1 | 1-2 | claude-opus-4-6 | 1496 | ±11 | 2,829 | Anthropic | Proprietary |
| 2 | 1-2 | gemini-3-pro | 1486 | ±4 | 34,419 | | Proprietary |
| 3 | 3-6 | grok-4.1-thinking | 1475 | ±4 | 34,455 | xAI | Proprietary |
| 4 | 3-8 | gemini-3-flash | 1470 | ±5 | 25,085 | | Proprietary |
| 5 | 3-8 | claude-opus-4-5-20251101-thinking-32k | 1468 | ±5 | 26,178 | Anthropic | Proprietary |
| 6 | 3-8 | claude-opus-4-5-20251101 | 1467 | ±5 | 31,069 | Anthropic | Proprietary |
| 7 | 4-9 | grok-4.1 | 1465 | ±4 | 38,605 | xAI | Proprietary |
| 8 | 4-10 | gemini-3-flash (thinking-minimal) | 1463 | ±6 | 16,255 | | Proprietary |
| 9 | 7-14 | gpt-5.1-high | 1458 | ±5 | 30,500 | OpenAI | Proprietary |
| 10 | 8-19 | ernie-5.0-0110 | 1452 (Preliminary) | ±7 | 10,184 | Baidu | Proprietary |
| 11 | 9-19 | claude-sonnet-4-5-20250929 | 1450 | ±4 | 42,437 | Anthropic | Proprietary |
| 12 | 9-19 | claude-sonnet-4-5-20250929-thinking-32k | 1450 | ±4 | 44,799 | Anthropic | Proprietary |
| 13 | 10-19 | gemini-2.5-pro | 1450 | ±3 | 93,835 | | Proprietary |
| 14 | 9-23 | ernie-5.0-preview-1203 | 1449 | ±7 | 9,775 | Baidu | Proprietary |
| 15 | 9-24 | kimi-k2.5-thinking | 1449 | ±7 | 7,085 | Moonshot | Modified MIT |
| 16 | 10-19 | claude-opus-4-1-20250805-thinking-16k | 1449 | ±4 | 49,956 | Anthropic | Proprietary |
| 17 | 10-23 | claude-opus-4-1-20250805 | 1445 | ±3 | 73,888 | Anthropic | Proprietary |
| 18 | 10-26 | gpt-4.5-preview-2025-02-27 | 1444 | ±6 | 14,549 | OpenAI | Proprietary |
| 19 | 15-24 | chatgpt-4o-latest-20250326 | 1442 | ±3 | 81,283 | OpenAI | Proprietary |
| 20 | 10-28 | glm-4.7 | 1441 | ±6 | 12,021 | Z.ai | MIT |
| 21 | 15-29 | gpt-5.2-high | 1438 | ±6 | 15,062 | OpenAI | Proprietary |
| 22 | 17-28 | gpt-5.1 | 1437 | ±4 | 32,684 | OpenAI | Proprietary |
| 23 | 15-30 | gpt-5.2 | 1437 | ±6 | 11,695 | OpenAI | Proprietary |
| 24 | 19-31 | gpt-5-high | 1434 | ±5 | 32,626 | OpenAI | Proprietary |
| 25 | 19-34 | qwen3-max-preview | 1434 | ±5 | 27,843 | Alibaba | Proprietary |
| 26 | 15-46 | kimi-k2.5-instant | 1433 | ±11 | 2,752 | Moonshot | Modified MIT |
| 27 | 20-34 | o3-2025-04-16 | 1433 | ±4 | 61,361 | OpenAI | Proprietary |
| 28 | 20-37 | grok-4-1-fast-reasoning | 1430 | ±5 | 27,088 | xAI | Proprietary |
| 29 | 22-44 | kimi-k2-thinking-turbo | 1428 | ±4 | 32,101 | Moonshot | Modified MIT |
| 30 | 24-46 | gpt-5-chat | 1426 | ±4 | 31,831 | OpenAI | Proprietary |
| 31 | 27-48 | glm-4.6 | 1425 | ±4 | 35,339 | Z.ai | MIT |
| 32 | 23-49 | qwen3-max-2025-09-23 | 1425 | ±6 | 9,221 | Alibaba | Proprietary |
| 33 | 27-48 | claude-opus-4-20250514-thinking-16k | 1424 | ±4 | 37,974 | Anthropic | Proprietary |
| 34 | 25-51 | deepseek-v3.2-exp | 1423 | ±6 | 11,767 | DeepSeek | MIT |
| 35 | 25-51 | deepseek-v3.2-exp-thinking | 1423 | ±7 | 9,002 | DeepSeek | MIT |
| 36 | 28-48 | qwen3-235b-a22b-instruct-2507 | 1422 | ±3 | 68,201 | Alibaba | Apache 2.0 |
| 37 | 25-54 | grok-4-fast-chat | 1422 | ±8 | 6,989 | xAI | Proprietary |
| 38 | 28-51 | deepseek-v3.2-thinking | 1420 | ±5 | 21,792 | DeepSeek | MIT |
| 39 | 28-53 | deepseek-v3.2 | 1419 | ±5 | 26,704 | DeepSeek | MIT |
| 40 | 28-56 | deepseek-r1-0528 | 1418 | ±6 | 19,290 | DeepSeek | MIT |
| 41 | 27-56 | ernie-5.0-preview-1022 | 1418 | ±9 | 4,619 | Baidu | Proprietary |
| 42 | 29-56 | deepseek-v3.1 | 1418 | ±6 | 15,299 | DeepSeek | MIT |
| 43 | 28-56 | kimi-k2-0905-preview | 1418 | ±6 | 11,974 | Moonshot | Modified MIT |
| 44 | 29-56 | deepseek-v3.1-thinking | 1417 | ±7 | 11,983 | DeepSeek | MIT |
| 45 | 31-56 | kimi-k2-0711-preview | 1417 | ±5 | 28,662 | Moonshot | Modified MIT |
| 46 | 28-59 | deepseek-v3.1-terminus | 1416 | ±10 | 3,761 | DeepSeek | MIT |
| 47 | 28-62 | deepseek-v3.1-terminus-thinking | 1416 | ±10 | 3,549 | DeepSeek | MIT |
| 48 | 31-58 | qwen3-vl-235b-a22b-instruct | 1415 | ±6 | 11,683 | Alibaba | Apache 2.0 |
| 49 | 34-56 | mistral-large-3 | 1414 | ±5 | 23,001 | Mistral | Apache 2.0 |
| 50 | 35-56 | claude-opus-4-20250514 | 1414 | ±4 | 45,579 | Anthropic | Proprietary |
| 51 | 35-56 | gpt-4.1-2025-04-14 | 1413 | ±4 | 52,220 | OpenAI | Proprietary |
| 52 | 38-58 | mistral-medium-2508 | 1411 | ±3 | 62,020 | Mistral | Proprietary |
| 53 | 38-59 | grok-3-preview-02-24 | 1411 | ±4 | 33,974 | xAI | Proprietary |
| 54 | 40-59 | gemini-2.5-flash | 1410 | ±3 | 93,104 | | Proprietary |
| 55 | 39-64 | glm-4.5 | 1410 | ±5 | 24,794 | Z.ai | MIT |
| 56 | 40-62 | grok-4-0709 | 1410 | ±4 | 42,162 | xAI | Proprietary |
| 57 | 49-69 | gemini-2.5-flash-preview-09-2025 | 1405 | ±4 | 32,880 | | Proprietary |
| 58 | 51-69 | claude-haiku-4-5-20251001 | 1404 | ±4 | 43,455 | Anthropic | Proprietary |
| 59 | 49-70 | grok-4-fast-reasoning | 1404 | ±5 | 18,640 | xAI | Proprietary |
| 60 | 54-71 | o1-2024-12-17 | 1402 | ±4 | 27,822 | OpenAI | Proprietary |
| 61 | 54-71 | qwen3-235b-a22b-no-thinking | 1401 | ±4 | 39,549 | Alibaba | Apache 2.0 |
| 62 | 54-72 | qwen3-next-80b-a3b-instruct | 1401 | ±5 | 22,856 | Alibaba | Apache 2.0 |
| 63 | 57-72 | claude-sonnet-4-20250514-thinking-32k | 1400 | ±4 | 36,209 | Anthropic | Proprietary |
| 64 | 56-78 | longcat-flash-chat | 1399 | ±6 | 11,546 | Meituan | MIT |
| 65 | 57-78 | qwen3-235b-a22b-thinking-2507 | 1398 | ±6 | 9,252 | Alibaba | Apache 2.0 |
| 66 | 57-78 | deepseek-r1 | 1398 | ±5 | 18,537 | DeepSeek | MIT |
| 67 | 57-84 | qwen3-vl-235b-a22b-thinking | 1395 | ±7 | 7,970 | Alibaba | Apache 2.0 |
| 68 | 59-80 | mimo-v2-flash (non-thinking) | 1395 | ±5 | 15,575 | Xiaomi | MIT |
| 69 | 57-85 | amazon-nova-experimental-chat-12-10 | 1394 | ±10 | 3,717 | Amazon | Proprietary |
| 70 | 60-80 | deepseek-v3-0324 | 1394 | ±4 | 46,691 | DeepSeek | MIT |
| 71 | 56-86 | hunyuan-vision-1.5-thinking | 1393 | ±12 | 2,225 | Tencent | Proprietary |
| 72 | 62-85 | mai-1-preview | 1391 | ±5 | 18,137 | Microsoft AI | Proprietary |
| 73 | 64-84 | o4-mini-2025-04-16 | 1391 | ±4 | 46,676 | OpenAI | Proprietary |
| 74 | 64-85 | gpt-5-mini-high | 1390 | ±5 | 27,148 | OpenAI | Proprietary |
| 75 | 64-85 | claude-sonnet-4-20250514 | 1390 | ±4 | 41,644 | Anthropic | Proprietary |
| 76 | 64-85 | claude-3-7-sonnet-20250219-thinking-32k | 1389 | ±4 | 39,891 | Anthropic | Proprietary |
| 77 | 64-85 | o1-preview | 1388 | ±5 | 31,120 | OpenAI | Proprietary |
| 78 | 64-88 | hunyuan-t1-20250711 | 1387 | ±9 | 4,802 | Tencent | Proprietary |
| 79 | 67-86 | qwen3-coder-480b-a35b-instruct | 1387 | ±5 | 26,653 | Alibaba | Apache 2.0 |
| 80 | 67-87 | minimax-m2.1-preview | 1385 | ±6 | 15,063 | MiniMax | MIT |
| 81 | 69-86 | mistral-medium-2505 | 1385 | ±5 | 34,551 | Mistral | Proprietary |
| 82 | 69-88 | qwen3-30b-a3b-instruct-2507 | 1383 | ±5 | 24,095 | Alibaba | Apache 2.0 |
| 83 | 69-89 | hunyuan-turbos-20250416 | 1383 | ±6 | 11,053 | Tencent | Proprietary |
| 84 | 71-89 | gpt-4.1-mini-2025-04-14 | 1382 | ±4 | 40,530 | OpenAI | Proprietary |
| 85 | 77-89 | gemini-2.5-flash-lite-preview-09-2025-no-thinking | 1380 | ±4 | 46,388 | | Proprietary |
| 86 | 69-99 | glm-4.6v | 1378 | ±11 | 2,819 | Z.ai | MIT |
| 87 | 81-95 | gemini-2.5-flash-lite-preview-06-17-thinking | 1375 | ±4 | 33,936 | | Proprietary |
| 88 | 80-95 | qwen3-235b-a22b | 1375 | ±5 | 27,166 | Alibaba | Apache 2.0 |
| 89 | 83-95 | qwen2.5-max | 1374 | ±4 | 33,294 | Alibaba | Proprietary |
| 90 | 86-95 | claude-3-5-sonnet-20241022 | 1373 | ±3 | 89,471 | Anthropic | Proprietary |
| 91 | 86-97 | claude-3-7-sonnet-20250219 | 1372 | ±4 | 44,437 | Anthropic | Proprietary |
| 92 | 86-99 | glm-4.5-air | 1371 | ±4 | 31,386 | Z.ai | MIT |
| 93 | 86-102 | qwen3-next-80b-a3b-thinking | 1369 | ±6 | 13,857 | Alibaba | Apache 2.0 |
| 94 | 86-102 | minimax-m1 | 1367 | ±4 | 36,841 | MiniMax | Apache 2.0 |
| 95 | 86-104 | amazon-nova-experimental-chat-11-10 | 1367 | ±6 | 15,442 | Amazon | Proprietary |
| 96 | 90-103 | gemma-3-27b-it | 1365 | ±4 | 48,696 | | Gemma |
| 97 | 90-107 | o3-mini-high | 1364 | ±5 | 18,584 | OpenAI | Proprietary |
| 98 | 91-110 | grok-3-mini-high | 1363 | ±5 | 17,525 | xAI | Proprietary |
| 99 | 93-110 | gemini-2.0-flash-001 | 1361 | ±4 | 44,827 | | Proprietary |
| 100 | 91-118 | glm-4.7-flash | 1360 | ±8 | 6,255 | Z.ai | MIT |
| 101 | 93-116 | deepseek-v3 | 1359 | ±5 | 21,788 | DeepSeek | DeepSeek |
| 102 | 95-120 | grok-3-mini-beta | 1357 | ±5 | 23,767 | xAI | Proprietary |
| 103 | 97-122 | mistral-small-2506 | 1356 | ±5 | 18,341 | Mistral | Apache 2.0 |
| 104 | 93-124 | intellect-3 | 1356 | ±8 | 5,326 | Prime Intellect | MIT |
| 105 | 98-123 | gpt-oss-120b | 1354 | ±4 | 30,971 | OpenAI | Apache 2.0 |
| 106 | 98-124 | gemini-2.0-flash-lite-preview-02-05 | 1353 | ±4 | 24,951 | | Proprietary |
| 107 | 96-125 | glm-4.5v | 1353 | ±8 | 4,974 | Z.ai | MIT |
| 108 | 100-124 | command-a-03-2025 | 1353 | ±3 | 57,445 | Cohere | CC-BY-NC-4.0 |
| 109 | 100-124 | gemini-1.5-pro-002 | 1352 | ±3 | 55,607 | | Proprietary |
| 110 | 97-136 | hunyuan-turbos-20250226 | 1349 | ±12 | 2,226 | Tencent | Proprietary |
| 111 | 100-128 | amazon-nova-experimental-chat-10-20 | 1349 | ±6 | 11,442 | Amazon | Proprietary |
| 112 | 102-125 | o3-mini | 1348 | ±3 | 58,642 | OpenAI | Proprietary |
| 113 | 98-136 | amazon-nova-experimental-chat-10-09 | 1347 | ±11 | 2,889 | Amazon | Proprietary |
| 114 | 97-137 | llama-3.1-nemotron-ultra-253b-v1 | 1347 | ±12 | 2,546 | Nvidia | Nvidia Open Model |
| 115 | 100-136 | qwen3-32b | 1347 | ±9 | 3,932 | Alibaba | Apache 2.0 |
| 116 | 100-134 | minimax-m2 | 1347 | ±8 | 6,746 | MiniMax | Apache 2.0 |
| 117 | 101-132 | ling-flash-2.0 | 1347 | ±7 | 7,040 | Ant Group | MIT |
| 118 | 101-134 | step-3 | 1346 | ±7 | 6,592 | StepFun | Apache 2.0 |
| 119 | 100-135 | qwen-plus-0125 | 1346 | ±8 | 5,823 | Alibaba | Proprietary |
| 120 | 105-127 | gpt-4o-2024-05-13 | 1346 | ±3 | 112,863 | OpenAI | Proprietary |
| 121 | 103-137 | glm-4-plus-0111 | 1343 | ±8 | 5,760 | Zhipu | Proprietary |
| 122 | 109-131 | claude-3-5-sonnet-20240620 | 1343 | ±3 | 82,417 | Anthropic | Proprietary |
| 123 | 103-141 | gemma-3-12b-it | 1342 | ±10 | 3,829 | | Gemma |
| 124 | 103-142 | nvidia-llama-3.3-nemotron-super-49b-v1.5 | 1341 | ±10 | 3,432 | Nvidia | Nvidia Open |
| 125 | 102-143 | hunyuan-turbo-0110 | 1341 | ±12 | 2,295 | Tencent | Proprietary |
| 126 | 111-142 | gpt-5-nano-high | 1338 | ±7 | 8,399 | OpenAI | Proprietary |
| 127 | 113-138 | o1-mini | 1337 | ±4 | 51,986 | OpenAI | Proprietary |
| 128 | 111-142 | nova-2-lite | 1337 | ±6 | 12,264 | Amazon | Proprietary |
| 129 | 113-142 | qwq-32b | 1336 | ±4 | 26,118 | Alibaba | Apache 2.0 |
| 130 | 114-140 | llama-3.1-405b-instruct-bf16 | 1336 | ±4 | 41,392 | Meta | Llama 3.1 Community |
| 131 | 114-142 | gpt-4o-2024-08-06 | 1335 | ±4 | 45,498 | OpenAI | Proprietary |
| 132 | 117-142 | grok-2-2024-08-13 | 1335 | ±4 | 63,495 | xAI | Proprietary |
| 133 | 113-142 | gemini-advanced-0514 | 1335 | ±5 | 50,142 | | Proprietary |
| 134 | 112-150 | step-2-16k-exp-202412 | 1334 | ±9 | 4,829 | StepFun | Proprietary |
| 135 | 118-142 | llama-3.1-405b-instruct-fp8 | 1334 | ±4 | 59,655 | Meta | Llama 3.1 Community |
| 136 | 123-155 | yi-lightning | 1329 | ±5 | 27,340 | 01 AI | Proprietary |
| 137 | 124-155 | qwen3-30b-a3b | 1328 | ±5 | 27,435 | Alibaba | Apache 2.0 |
| 138 | 124-155 | llama-4-maverick-17b-128e-instruct | 1328 | ±4 | 41,137 | Meta | Llama 4 |
| 139 | 115-164 | llama-3.3-nemotron-49b-super-v1 | 1327 | ±12 | 2,230 | Nvidia | Nvidia |
| 140 | 121-164 | hunyuan-large-2025-02-10 | 1327 | ±10 | 3,738 | Tencent | Proprietary |
| 141 | 124-160 | olmo-3.1-32b-instruct | 1327 | ±7 | 10,567 | Allen AI | Apache 2.0 |
| 142 | 135-160 | gpt-4-turbo-2024-04-09 | 1325 | ±4 | 98,130 | OpenAI | Proprietary |
| 143 | 135-160 | claude-3-5-haiku-20241022 | 1324 | ±3 | 71,181 | Anthropic | Proprietary |
| 144 | 126-164 | deepseek-v2.5-1210 | 1323 | ±8 | 6,793 | DeepSeek | DeepSeek |
| 145 | 135-160 | gemini-1.5-pro-001 | 1323 | ±4 | 79,132 | | Proprietary |
| 146 | 135-162 | llama-4-scout-17b-16e-instruct | 1323 | ±5 | 31,236 | Meta | Llama |
| 147 | 135-160 | claude-3-opus-20240229 | 1323 | ±3 | 194,904 | Anthropic | Proprietary |
| 148 | 134-165 | gpt-4.1-nano-2025-04-14 | 1322 | ±8 | 6,107 | OpenAI | Proprietary |
| 149 | 135-164 | step-1o-turbo-202506 | 1321 | ±7 | 9,689 | StepFun | Proprietary |
| 150 | 135-166 | ring-flash-2.0 | 1320 | ±7 | 7,205 | Ant Group | MIT |
| 151 | 139-164 | llama-3.3-70b-instruct | 1320 | ±3 | 55,575 | Meta | Llama-3.3 |
| 152 | 136-164 | glm-4-plus | 1319 | ±5 | 26,134 | Zhipu AI | Proprietary |
| 153 | 136-165 | gemma-3n-e4b-it | 1319 | ±5 | 23,342 | | Gemma |
| 154 | 136-166 | qwen-max-0919 | 1318 | ±6 | 16,479 | Alibaba | Qwen |
| 155 | 139-164 | gpt-4o-mini-2024-07-18 | 1318 | ±3 | 68,801 | OpenAI | Proprietary |
| 156 | 136-171 | gpt-oss-20b | 1318 | ±6 | 10,810 | OpenAI | Apache 2.0 |
| 157 | 139-171 | nvidia-nemotron-3-nano-30b-a3b-bf16 | 1317 | ±6 | 15,505 | Nvidia | NVIDIA Open Model |
| 158 | 139-173 | qwen2.5-plus-1127 | 1315 | ±6 | 10,179 | Alibaba | Proprietary |
| 159 | 144-171 | athene-v2-chat | 1315 | ±4 | 24,746 | NexusFlow | NexusFlow |
| 160 | 144-171 | mistral-large-2407 | 1314 | ±4 | 45,460 | Mistral | Mistral Research |
| 161 | 145-172 | gpt-4-0125-preview | 1314 | ±4 | 93,439 | OpenAI | Proprietary |
| 162 | 145-171 | gpt-4-1106-preview | 1314 | ±4 | 100,107 | OpenAI | Proprietary |
| 163 | 139-176 | hunyuan-standard-2025-02-10 | 1312 | ±10 | 3,905 | Tencent | Proprietary |
| 164 | 136-180 | mercury | 1310 | ±14 | 1,920 | Inception AI | Proprietary |
| 165 | 152-175 | gemini-1.5-flash-002 | 1310 | ±4 | 34,909 | | Proprietary |
| 166 | 156-176 | grok-2-mini-2024-08-13 | 1308 | ±4 | 52,574 | xAI | Proprietary |
| 167 | 156-176 | deepseek-v2.5 | 1307 | ±5 | 24,574 | DeepSeek | DeepSeek |
| 168 | 156-176 | athene-70b-0725 | 1306 | ±6 | 19,622 | NexusFlow | CC-BY-NC-4.0 |
| 169 | 154-178 | olmo-3-32b-think | 1305 | ±8 | 5,891 | Allen AI | Apache 2.0 |
| 170 | 161-176 | mistral-large-2411 | 1305 | ±4 | 28,081 | Mistral | MRL |
| 171 | 156-176 | magistral-medium-2506 | 1305 | ±6 | 12,064 | Mistral | Proprietary |
| 172 | 162-176 | mistral-small-3.1-24b-instruct-2503 | 1305 | ±4 | 34,131 | Mistral | Apache 2.0 |
| 173 | 156-183 | gemma-3-4b-it | 1303 | ±9 | 4,177 | | Gemma |
| 174 | 163-176 | qwen2.5-72b-instruct | 1303 | ±4 | 39,409 | Alibaba | Qwen |
| 175 | 163-186 | llama-3.1-nemotron-70b-instruct | 1299 | ±8 | 7,136 | Nvidia | Llama 3.1 |
| 176 | 164-187 | hunyuan-large-vision | 1296 | ±9 | 5,600 | Tencent | Proprietary |
| 177 | 172-187 | llama-3.1-70b-instruct | 1294 | ±4 | 55,234 | Meta | Llama 3.1 Community |
| 178 | 174-188 | amazon-nova-pro-v1.0 | 1290 | ±5 | 24,753 | Amazon | Proprietary |
| 179 | 174-191 | jamba-1.5-large | 1289 | ±7 | 8,659 | AI21 Labs | Jamba Open |
| 180 | 173-193 | ibm-granite-h-small | 1288 | ±8 | 5,682 | IBM | Apache 2.0 |
| 181 | 175-189 | gemma-2-27b-it | 1288 | ±3 | 75,764 | | Gemma License |
| 182 | 174-191 | reka-core-20240904 | 1288 | ±7 | 7,309 | Reka AI | Proprietary |
| 183 | 175-191 | gpt-4-0314 | 1288 | ±5 | 54,167 | OpenAI | Proprietary |
| 184 | 172-197 | llama-3.1-tulu-3-70b | 1287 | ±10 | 2,846 | Ai2 | Llama 3.1 |
| 185 | 173-197 | llama-3.1-nemotron-51b-instruct | 1287 | ±10 | 3,749 | Nvidia | Llama 3.1 |
| 186 | 176-191 | gemini-1.5-flash-001 | 1286 | ±4 | 62,823 | | Proprietary |
| 187 | 175-197 | olmo-3.1-32b-think | 1286 | ±7 | 8,486 | Allen AI | Apache 2.0 |
| 188 | 179-197 | claude-3-sonnet-20240229 | 1282 | ±4 | 109,289 | Anthropic | Proprietary |
| 189 | 178-197 | gemma-2-9b-it-simpo | 1280 | ±7 | 10,069 | Princeton | MIT |
| 190 | 180-197 | nemotron-4-340b-instruct | 1278 | ±5 | 19,661 | Nvidia | NVIDIA Open Model |
| 191 | 180-199 | command-r-plus-08-2024 | 1277 | ±7 | 9,869 | Cohere | CC-BY-NC-4.0 |
| 192 | 184-197 | llama-3-70b-instruct | 1276 | ±4 | 156,880 | Meta | Llama 3 Community |
| 193 | 185-198 | gpt-4-0613 | 1276 | ±4 | 88,721 | OpenAI | Proprietary |
| 194 | 185-200 | mistral-small-24b-instruct-2501 | 1274 | ±6 | 14,677 | Mistral | Apache 2.0 |
| 195 | 184-202 | glm-4-0520 | 1274 | ±7 | 9,788 | Zhipu AI | Proprietary |
| 196 | 185-203 | reka-flash-20240904 | 1273 | ±7 | 7,537 | Reka AI | Proprietary |
| 197 | 185-206 | qwen2.5-coder-32b-instruct | 1271 | ±8 | 5,430 | Alibaba | Apache 2.0 |
| 198 | 192-206 | c4ai-aya-expanse-32b | 1267 | ±5 | 27,123 | Cohere | CC-BY-NC-4.0 |
| 199 | 194-206 | gemma-2-9b-it | 1266 | ±4 | 54,615 | | Gemma License |
| 200 | 193-207 | deepseek-coder-v2 | 1265 | ±6 | 15,147 | DeepSeek | DeepSeek License |
| 201 | 195-207 | command-r-plus | 1263 | ±4 | 77,556 | Cohere | CC-BY-NC-4.0 |
| 202 | 195-207 | qwen2-72b-instruct | 1262 | ±5 | 37,325 | Alibaba | Qianwen License |
| 203 | 197-207 | claude-3-haiku-20240307 | 1262 | ±4 | 117,705 | Anthropic | Proprietary |
| 204 | 196-208 | amazon-nova-lite-v1.0 | 1261 | ±5 | 19,376 | Amazon | Proprietary |
| 205 | 197-208 | gemini-1.5-flash-8b-001 | 1259 | ±4 | 35,556 | | Proprietary |
| 206 | 200-208 | phi-4 | 1256 | ±5 | 24,126 | Microsoft | MIT |
| 207 | 197-214 | olmo-2-0325-32b-instruct | 1252 | ±11 | 3,335 | Allen AI | Apache-2.0 |
| 208 | 204-213 | command-r-08-2024 | 1251 | ±7 | 10,141 | Cohere | CC-BY-NC-4.0 |
| 209 | 207-217 | mistral-large-2402 | 1243 | ±5 | 62,437 | Mistral | Proprietary |
| 210 | 207-217 | amazon-nova-micro-v1.0 | 1241 | ±5 | 19,355 | Amazon | Proprietary |
| 211 | 207-221 | jamba-1.5-mini | 1239 | ±7 | 8,854 | AI21 Labs | Jamba Open |
| 212 | 207-225 | ministral-8b-2410 | 1237 | ±9 | 4,780 | Mistral | MRL |
| 213 | 208-225 | gemini-pro-dev-api | 1236 | ±7 | 18,352 | | Proprietary |
| 214 | 209-225 | qwen1.5-110b-chat | 1235 | ±6 | 26,191 | Alibaba | Qianwen License |
| 215 | 209-226 | reka-flash-21b-20240226-online | 1234 | ±7 | 15,451 | Reka AI | Proprietary |
| 216 | 209-225 | qwen1.5-72b-chat | 1234 | ±5 | 39,296 | Alibaba | Qianwen License |
| 217 | 207-227 | hunyuan-standard-256k | 1234 | ±12 | 2,729 | Tencent | Proprietary |
| 218 | 211-226 | mixtral-8x22b-instruct-v0.1 | 1230 | ±5 | 51,417 | Mistral | Apache 2.0 |
| 219 | 211-227 | command-r | 1228 | ±5 | 54,038 | Cohere | CC-BY-NC-4.0 |
| 220 | 211-227 | reka-flash-21b-20240226 | 1227 | ±6 | 24,806 | Reka AI | Proprietary |
| 221 | 212-228 | gpt-3.5-turbo-0125 | 1225 | ±5 | 66,191 | OpenAI | Proprietary |
| 222 | 212-229 | mistral-medium | 1224 | ±5 | 34,552 | Mistral | Proprietary |
| 223 | 216-228 | llama-3-8b-instruct | 1224 | ±4 | 104,636 | Meta | Llama 3 Community |
| 224 | 212-229 | c4ai-aya-expanse-8b | 1223 | ±7 | 9,827 | Cohere | CC-BY-NC-4.0 |
| 225 | 211-232 | gemini-pro | 1222 | ±12 | 6,390 | | Proprietary |
| 226 | 212-232 | llama-3.1-tulu-3-8b | 1221 | ±11 | 2,895 | Ai2 | Llama 3.1 |
| 227 | 223-232 | yi-1.5-34b-chat | 1214 | ±5 | 24,142 | 01 AI | Apache-2.0 |
| 228 | 218-234 | zephyr-orpo-141b-A35b-v0.1 | 1213 | ±11 | 4,653 | HuggingFace | Apache 2.0 |
| 229 | 225-232 | llama-3.1-8b-instruct | 1212 | ±4 | 49,605 | Meta | Llama 3.1 Community |
| 230 | 221-238 | granite-3.1-8b-instruct | 1209 | ±11 | 3,092 | IBM | Apache 2.0 |
| 231 | 225-238 | qwen1.5-32b-chat | 1205 | ±6 | 21,744 | Alibaba | Qianwen License |
| 232 | 225-240 | gpt-3.5-turbo-1106 | 1203 | ±9 | 16,616 | OpenAI | Proprietary |
| 233 | 229-239 | gemma-2-2b-it | 1199 | ±4 | 46,618 | | Gemma License |
| 234 | 229-240 | phi-3-medium-4k-instruct | 1199 | ±5 | 25,055 | Microsoft | MIT |
| 235 | 230-240 | mixtral-8x7b-instruct-v0.1 | 1198 | ±4 | 73,505 | Mistral | Apache 2.0 |
| 236 | 230-245 | dbrx-instruct-preview | 1196 | ±6 | 32,196 | Databricks | DBRX License |
| 237 | 230-249 | internlm2_5-20b-chat | 1192 | ±7 | 9,902 | InternLM | Other |
| 238 | 230-249 | qwen1.5-14b-chat | 1191 | ±7 | 17,841 | Alibaba | Qianwen License |
| 239 | 233-255 | wizardlm-70b | 1185 | ±9 | 8,214 | Microsoft | Llama 2 Community |
| 240 | 232-256 | deepseek-llm-67b-chat | 1185 | ±12 | 4,933 | DeepSeek | DeepSeek License |
| 241 | 236-252 | yi-34b-chat | 1184 | ±7 | 15,483 | 01 AI | Yi License |
| 242 | 236-255 | openchat-3.5-0106 | 1183 | ±8 | 12,636 | OpenChat | Apache-2.0 |
| 243 | 236-256 | openchat-3.5 | 1183 | ±10 | 7,967 | OpenChat | Apache-2.0 |
| 244 | 236-256 | granite-3.0-8b-instruct | 1183 | ±9 | 6,643 | IBM | Apache 2.0 |
| 245 | 237-256 | gemma-1.1-7b-it | 1181 | ±6 | 23,893 | | Gemma License |
| 246 | 237-256 | snowflake-arctic-instruct | 1180 | ±6 | 32,836 | Snowflake | Apache 2.0 |
| 247 | 236-258 | granite-3.1-2b-instruct | 1180 | ±11 | 3,191 | IBM | Apache 2.0 |
| 248 | 237-256 | tulu-2-dpo-70b | 1179 | ±10 | 6,534 | AllenAI/UW | AI2 ImpACT Low-risk |
| 249 | 237-260 | openhermes-2.5-mistral-7b | 1176 | ±10 | 5,006 | NousResearch | Apache-2.0 |
| 250 | 239-259 | vicuna-33b | 1173 | ±6 | 22,479 | LMSYS | Non-commercial |
| 251 | 239-262 | starling-lm-7b-beta | 1172 | ±7 | 16,057 | Nexusflow | Apache-2.0 |
| 252 | 239-260 | phi-3-small-8k-instruct | 1172 | ±6 | 17,763 | Microsoft | MIT |
| 253 | 240-260 | llama-2-70b-chat | 1171 | ±5 | 38,491 | Meta | Llama 2 Community |
| 254 | 240-263 | starling-lm-7b-alpha | 1168 | ±8 | 10,224 | UC Berkeley | CC-BY-NC-4.0 |
| 255 | 242-263 | llama-3.2-3b-instruct | 1167 | ±8 | 7,936 | Meta | Llama 3.2 |
| 256 | 240-266 | nous-hermes-2-mixtral-8x7b-dpo | 1165 | ±12 | 3,776 | NousResearch | Apache-2.0 |
| 257 | 248-273 | qwq-32b-preview | 1157 | ±12 | 3,233 | Alibaba | Apache 2.0 |
| 258 | 253-270 | granite-3.0-2b-instruct | 1157 | ±8 | 6,837 | IBM | Apache 2.0 |
| 259 | 248-273 | llama2-70b-steerlm-chat | 1156 | ±13 | 3,584 | Nvidia | Llama 2 Community |
| 260 | 250-276 | solar-10.7b-instruct-v1.0 | 1153 | ±13 | 4,155 | Upstage AI | CC-BY-NC-4.0 |
| 261 | 249-278 | dolphin-2.2.1-mistral-7b | 1153 | ±15 | 1,679 | Cognitive Computations | Apache-2.0 |
| 262 | 254-276 | mpt-30b-chat | 1151 | ±12 | 2,571 | MosaicML | CC-BY-NC-SA-4.0 |
| 263 | 256-273 | mistral-7b-instruct-v0.2 | 1150 | ±7 | 19,402 | Mistral | Apache-2.0 |
| 264 | 256-275 | wizardlm-13b | 1150 | ±9 | 7,046 | Microsoft | Llama 2 Community |
| 265 | 253-280 | falcon-180b-chat | 1148 | ±17 | 1,295 | TII | Falcon-180B TII License |
| 266 | 256-279 | qwen1.5-7b-chat | 1144 | ±10 | 4,735 | Alibaba | Qianwen License |
| 267 | 257-278 | phi-3-mini-4k-instruct-june-2024 | 1144 | ±6 | 12,296 | Microsoft | MIT |
| 268 | 257-279 | llama-2-13b-chat | 1142 | ±7 | 19,171 | Meta | Llama 2 Community |
| 269 | 257-279 | vicuna-13b | 1142 | ±7 | 19,366 | LMSYS | Llama 2 Community |
| 270 | 257-281 | qwen-14b-chat | 1139 | ±11 | 4,964 | Alibaba | Qianwen License |
| 271 | 258-281 | palm-2 | 1138 | ±9 | 8,554 | | Proprietary |
| 272 | 258-281 | codellama-34b-instruct | 1137 | ±9 | 7,363 | Meta | Llama 2 Community |
| 273 | 258-281 | gemma-7b-it | 1136 | ±10 | 8,925 | | Gemma License |
| 274 | 261-282 | zephyr-7b-beta | 1132 | ±9 | 11,116 | HuggingFace | MIT |
| 275 | 264-282 | phi-3-mini-128k-instruct | 1130 | ±7 | 20,691 | Microsoft | MIT |
| 276 | 266-282 | phi-3-mini-4k-instruct | 1129 | ±6 | 20,115 | Microsoft | MIT |
| 277 | 262-285 | guanaco-33b | 1128 | ±12 | 2,921 | UW | Non-commercial |
| 278 | 261-286 | zephyr-7b-alpha | 1128 | ±16 | 1,785 | HuggingFace | MIT |
| 279 | 269-286 | stripedhyena-nous-7b | 1122 | ±11 | 5,184 | Together AI | Apache 2.0 |
| 280 | 264-287 | codellama-70b-instruct | 1120 | ±18 | 1,143 | Meta | Llama 2 Community |
| 281 | 274-286 | vicuna-7b | 1115 | ±9 | 6,923 | LMSYS | Llama 2 Community |
| 282 | 270-287 | smollm2-1.7b-instruct | 1115 | ±14 | 2,201 | HuggingFace | Apache 2.0 |
| 283 | 277-286 | gemma-1.1-2b-it | 1115 | ±8 | 10,853 | | Gemma License |
| 284 | 277-286 | llama-3.2-1b-instruct | 1112 | ±8 | 8,045 | Meta | Llama 3.2 |
| 285 | 277-287 | mistral-7b-instruct | 1110 | ±9 | 8,977 | Mistral | Apache 2.0 |
| 286 | 278-287 | llama-2-7b-chat | 1109 | ±7 | 14,148 | Meta | Llama 2 Community |
| 287 | 283-291 | gemma-2b-it | 1092 | ±12 | 4,779 | | Gemma License |
| 288 | 287-290 | qwen1.5-4b-chat | 1091 | ±9 | 7,598 | Alibaba | Qianwen License |
| 289 | 287-294 | olmo-7b-instruct | 1075 | ±11 | 6,329 | Allen AI | Apache-2.0 |
| 290 | 288-294 | koala-13b | 1071 | ±10 | 6,964 | UC Berkeley | Non-commercial |
| 291 | 289-294 | alpaca-13b | 1068 | ±12 | 5,745 | Stanford | Non-commercial |
| 292 | 287-295 | gpt4all-13b-snoozy | 1067 | ±15 | 1,743 | Nomic AI | Non-commercial |
| 293 | 289-295 | mpt-7b-chat | 1063 | ±12 | 3,925 | MosaicML | CC-BY-NC-SA-4.0 |
| 294 | 289-295 | chatglm3-6b | 1057 | ±12 | 4,658 | Tsinghua | Apache-2.0 |
| 295 | 292-297 | RWKV-4-Raven-14B | 1042 | ±11 | 4,845 | RWKV | Apache 2.0 |
| 296 | 295-297 | chatglm2-6b | 1025 | ±14 | 2,657 | Tsinghua | Apache-2.0 |
| 297 | 295-297 | oasst-pythia-12b | 1023 | ±11 | 6,311 | OpenAssistant | Apache 2.0 |
| 298 | 298-301 | chatglm-6b | 996 | ±13 | 4,914 | Tsinghua | Non-commercial |
| 299 | 298-301 | fastchat-t5-3b | 992 | ±12 | 4,203 | LMSYS | Apache 2.0 |
| 300 | 298-301 | dolly-v2-12b | 981 | ±14 | 3,412 | Databricks | MIT |
| 301 | 298-302 | llama-13b | 973 | ±16 | 2,391 | Meta | Non-commercial |
| 302 | 301-302 | stablelm-tuned-alpha-7b | 953 | ±13 | 3,287 | Stability AI | CC-BY-NC-SA-4.0 |
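Notes
Rank (UB): The rank computed from the Bradley-Terry model. It reflects the model's overall performance in the arena; "UB" denotes an upper-bound estimate, which helps gauge the model's potential competitiveness.
Model: The name of the large language model (LLM). Some model names may carry an embedded link.
Score: The Elo rating the model has earned from user votes in the arena. Elo is a relative rating system: a higher score means the model performs better.
95% CI (±): The 95% confidence interval of the model's Elo rating (for example, ±6). The narrower the interval, the more stable and reliable the rating.
Votes: The total number of votes the model has received in the arena. More votes generally mean a statistically more reliable rating.
Organization: The organization or company that provides the model.
License: The type of license the model is released under, such as Proprietary, Apache 2.0, or MIT.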
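To make the score and ± columns concrete, here is a minimal Python sketch of how two rows can be compared. It assumes the conventional Elo logistic curve (base 10, scale 400); this document does not state the leaderboard's exact scaling, so the win rate is only illustrative. The `Entry` class and both helper functions are hypothetical names introduced for this example; the three sample rows are taken from the top of the leaderboard.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    """One leaderboard row: model name, score, and 95% CI half-width."""
    model: str
    score: float
    ci: float  # the "±" value from the leaderboard

    @property
    def interval(self) -> tuple[float, float]:
        # Lower and upper end of the 95% confidence interval.
        return (self.score - self.ci, self.score + self.ci)


def expected_win_rate(a: Entry, b: Entry, scale: float = 400.0) -> float:
    """Expected probability that `a` is preferred over `b`.

    Assumption: standard Elo logistic curve (base 10, scale 400).
    """
    return 1.0 / (1.0 + 10.0 ** ((b.score - a.score) / scale))


def intervals_overlap(a: Entry, b: Entry) -> bool:
    """True when the two 95% intervals overlap, so the ordering is not clear-cut."""
    a_lo, a_hi = a.interval
    b_lo, b_hi = b.interval
    return a_lo <= b_hi and b_lo <= a_hi


top = Entry("claude-opus-4-6", 1496, 11)
second = Entry("gemini-3-pro", 1486, 4)
third = Entry("grok-4.1-thinking", 1475, 4)

print(f"{expected_win_rate(top, second):.3f}")  # 0.514
print(intervals_overlap(top, second))           # True
print(intervals_overlap(top, third))            # False
```

Under these assumptions a 10-point gap corresponds to a win rate only slightly above 50%, and the overlapping intervals of the first two rows show why a small score difference combined with a wide ± should not be read as a decisive lead.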
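Data Source and Update Frequency
The leaderboard data is fetched by an automated script directly from the official Chatbot Arena website (lmarena.ai). The leaderboard is refreshed automatically every day by GitHub Actions.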
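The update script itself is not shown in this report. As a small illustration of one step such a daily job needs, the sketch below renders a single instant in both UTC and Beijing time (UTC+8), in the same layout as the update timestamp at the top of the page. The function name `format_update_stamp` is hypothetical, and this is not the project's actual code.

```python
from datetime import datetime, timedelta, timezone

# Beijing time is UTC+8 with no daylight saving; labeled "CST" as in the page header.
BEIJING = timezone(timedelta(hours=8))


def format_update_stamp(now_utc: datetime) -> str:
    """Render one instant as 'YYYY-MM-DD HH:MM:SS UTC / YYYY-MM-DD HH:MM:SS CST'."""
    if now_utc.tzinfo is None:
        raise ValueError("expected a timezone-aware datetime")
    fmt = "%Y-%m-%d %H:%M:%S"
    utc = now_utc.astimezone(timezone.utc)
    local = now_utc.astimezone(BEIJING)
    return f"{utc.strftime(fmt)} UTC / {local.strftime(fmt)} CST"


stamp = format_update_stamp(datetime(2026, 2, 8, 8, 10, 17, tzinfo=timezone.utc))
print(stamp)  # 2026-02-08 08:10:17 UTC / 2026-02-08 16:10:17 CST
```

A fixed UTC+8 offset is used instead of a named time zone so the example runs without any time zone database installed.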
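Disclaimer
This report is for reference only. Leaderboard data changes over time and is based on users' preference votes on Chatbot Arena during a specific period. The completeness and accuracy of the data depend on the upstream data source. Different models may be released under different license agreements; always consult the model provider's official documentation before use.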