模型榜单

这是一个基于 Chatbot Arena (lmarena.ai) 数据的排行榜,通过自动化流程生成。

数据更新时间: 2026-02-25 08:18:37 UTC / 2026-02-25 16:18:37 CST (北京时间)

排行榜

Rank
Rank Spread
模型
分数
票数

1

14

Anthropicclaude-opus-4-6Anthropic · Proprietary

1504±7

7,042

2

14

Anthropicclaude-opus-4-6-thinkingAnthropic · Proprietary

1503±8

6,181

3

15

gemini-3.1-pro-previewGoogle · Proprietary

1500±9Preliminary

4,060

4

16

grok-4.20-beta1xAI · Proprietary

1492±10Preliminary

3,695

5

46

gemini-3-proGoogle · Proprietary

1486±4

37,854

6

311

gpt-5.2-chat-latest-20260210OpenAI · Proprietary

1481±11

3,093

7

611

gemini-3-flashGoogle · Proprietary

1473±5

28,847

8

611

grok-4.1-thinkingxAI · Proprietary

1473±4

37,167

9

614

Bytedancedola-seed-2.0-previewBytedance · Proprietary

1472±9Preliminary

4,180

10

611

Anthropicclaude-opus-4-5-20251101-thinking-32kAnthropic · Proprietary

1472±4

30,067

11

614

Anthropicclaude-opus-4-5-20251101Anthropic · Proprietary

1467±4

35,014

12

1018

grok-4.1xAI · Proprietary

1463±4

41,343

13

1018

gemini-3-flash (thinking-minimal)Google · Proprietary

1462±5

20,204

14

1027

Anthropicclaude-sonnet-4-6Anthropic · Proprietary

1457±10

3,294

15

1222

gpt-5.1-highOpenAI · Proprietary

1457±4

33,984

16

1227

glm-5Z.ai · MIT

1454±8

6,061

17

1227

ernie-5.0-0110Baidu · Proprietary

1453±6

13,419

18

1231

qwen3.5-397b-a17bAlibaba · Apache 2.0

1451±9

4,541

19

1429

MoonshotAIkimi-k2.5-thinkingMoonshot · Modified MIT

1450±6

10,687

20

1427

Anthropicclaude-sonnet-4-5-20250929-thinking-32kAnthropic · Proprietary

1450±3

48,477

21

1427

Anthropicclaude-sonnet-4-5-20250929Anthropic · Proprietary

1450±4

46,264

22

1527

gemini-2.5-proGoogle · Proprietary

1449±3

96,876

23

1429

ernie-5.0-preview-1203Baidu · Proprietary

1449±7

9,723

24

1529

Anthropicclaude-opus-4-1-20250805-thinking-16kAnthropic · Proprietary

1449±4

49,618

25

1529

Anthropicclaude-opus-4-1-20250805Anthropic · Proprietary

1446±3

77,242

26

1534

gpt-4.5-preview-2025-02-27OpenAI · Proprietary

1444±6

14,549

27

2132

chatgpt-4o-latest-20250326OpenAI · Proprietary

1443±3

82,954

28

1536

glm-4.7Z.ai · MIT

1441±6

11,936

29

2135

gpt-5.2-highOpenAI · Proprietary

1440±5

18,792

30

2637

gpt-5.1OpenAI · Proprietary

1437±4

36,267

31

2537

gpt-5.2OpenAI · Proprietary

1437±6

15,640

32

2544

MoonshotAIkimi-k2.5-instantMoonshot · Modified MIT

1435±7

6,722

33

2741

qwen3-max-previewAlibaba · Proprietary

1434±5

27,649

34

2741

gpt-5-highOpenAI · Proprietary

1434±5

32,357

35

2843

o3-2025-04-16OpenAI · Proprietary

1433±4

60,965

36

2945

grok-4-1-fast-reasoningxAI · Proprietary

1431±4

30,701

37

3049

MoonshotAIkimi-k2-thinking-turboMoonshot · Modified MIT

1429±4

35,751

38

3255

gpt-5-chatOpenAI · Proprietary

1426±4

31,614

39

3457

glm-4.6Z.ai · MIT

1425±4

35,129

40

3258

qwen3-max-2025-09-23Alibaba · Proprietary

1425±6

9,172

41

3558

Anthropicclaude-opus-4-20250514-thinking-16kAnthropic · Proprietary

1424±4

37,728

42

3260

deepseek-v3.2-expDeepSeek · MIT

1424±6

11,685

43

3260

deepseek-v3.2-exp-thinkingDeepSeek · MIT

1423±7

8,944

44

3758

qwen3-235b-a22b-instruct-2507Alibaba · Apache 2.0

1422±3

71,166

45

3463

grok-4-fast-chatxAI · Proprietary

1422±8

6,962

46

3760

deepseek-v3.2-thinkingDeepSeek · MIT

1421±5

25,287

47

3862

deepseek-v3.2DeepSeek · MIT

1419±4

30,323

48

3864

deepseek-r1-0528DeepSeek · MIT

1419±6

19,178

49

3665

ernie-5.0-preview-1022Baidu · Proprietary

1418±9

4,563

50

3865

deepseek-v3.1DeepSeek · MIT

1418±6

15,198

51

3865

MoonshotAIkimi-k2-0905-previewMoonshot · Modified MIT

1417±6

11,915

52

3865

deepseek-v3.1-thinkingDeepSeek · MIT

1417±7

11,919

53

3965

MoonshotAIkimi-k2-0711-previewMoonshot · Modified MIT

1417±5

28,445

54

3769

deepseek-v3.1-terminusDeepSeek · MIT

1416±10

3,746

55

3774

deepseek-v3.1-terminus-thinkingDeepSeek · MIT

1416±10

3,538

56

4065

mistral-large-3Mistral · Apache 2.0

1415±4

26,713

57

3968

qwen3-vl-235b-a22b-instructAlibaba · Apache 2.0

1415±6

11,599

58

3875

amazon-nova-experimental-chat-26-01-10Amazon · Proprietary

1415±10

3,252

59

4365

gpt-4.1-2025-04-14OpenAI · Proprietary

1413±4

51,856

60

4365

Anthropicclaude-opus-4-20250514Anthropic · Proprietary

1413±4

45,310

61

4669

grok-3-preview-02-24xAI · Proprietary

1411±4

33,845

62

4768

mistral-medium-2508Mistral · Proprietary

1411±3

65,199

63

4868

gemini-2.5-flashGoogle · Proprietary

1411±3

96,163

64

4674

glm-4.5Z.ai · MIT

1410±5

24,608

65

4974

grok-4-0709xAI · Proprietary

1409±4

41,771

66

5779

Anthropicclaude-haiku-4-5-20251001Anthropic · Proprietary

1405±4

46,912

67

5779

gemini-2.5-flash-preview-09-2025Google · Proprietary

1404±4

32,547

68

5779

grok-4-fast-reasoningxAI · Proprietary

1404±5

18,446

69

6281

o1-2024-12-17OpenAI · Proprietary

1402±4

27,822

70

6281

qwen3-235b-a22b-no-thinkingAlibaba · Apache 2.0

1402±4

39,301

71

6282

qwen3-next-80b-a3b-instructAlibaba · Apache 2.0

1402±5

22,675

72

6287

longcat-flash-chatMeituan · MIT

1400±6

11,492

73

6682

Anthropicclaude-sonnet-4-20250514-thinking-32kAnthropic · Proprietary

1400±4

35,984

74

6091

Minimaxminimax-m2.5MiniMax · Modified MIT

1399±8

5,631

75

6590

qwen3-235b-a22b-thinking-2507Alibaba · Apache 2.0

1399±6

9,189

76

6689

deepseek-r1DeepSeek · MIT

1398±5

18,537

77

6697

qwen3-vl-235b-a22b-thinkingAlibaba · Apache 2.0

1395±7

7,926

78

6698

amazon-nova-experimental-chat-12-10Amazon · Proprietary

1395±10

3,700

79

6992

deepseek-v3-0324DeepSeek · MIT

1394±4

46,442

80

6299

Tencenthunyuan-vision-1.5-thinkingTencent · Proprietary

1394±12

2,216

81

7197

mai-1-previewMicrosoft AI · Proprietary

1392±5

18,020

82

7397

o4-mini-2025-04-16OpenAI · Proprietary

1391±4

46,384

83

6998

Stepfunstep-3.5-flashStepFun · Apache 2.0

1391±7

8,214

84

7398

gpt-5-mini-highOpenAI · Proprietary

1390±5

26,951

85

7398

mimo-v2-flash (non-thinking)Xiaomi · MIT

1390±5

19,235

86

7398

Anthropicclaude-sonnet-4-20250514Anthropic · Proprietary

1390±4

41,371

87

7598

Anthropicclaude-3-7-sonnet-20250219-thinking-32kAnthropic · Proprietary

1388±4

39,733

88

7498

o1-previewOpenAI · Proprietary

1388±5

31,120

89

7499

mimo-v2-flash (thinking)Xiaomi · MIT

1387±6

10,877

90

73102

Tencenthunyuan-t1-20250711Tencent · Proprietary

1387±9

4,767

91

7699

qwen3-coder-480b-a35b-instructAlibaba · Apache 2.0

1387±5

26,414

92

7799

Minimaxminimax-m2.1-previewMiniMax · MIT

1386±5

17,099

93

7899

mistral-medium-2505Mistral · Proprietary

1385±5

34,390

94

78101

qwen3-30b-a3b-instruct-2507Alibaba · Apache 2.0

1384±5

23,952

95

78102

Tencenthunyuan-turbos-20250416Tencent · Proprietary

1383±6

11,000

96

81102

gpt-4.1-mini-2025-04-14OpenAI · Proprietary

1382±4

40,321

97

88102

gemini-2.5-flash-lite-preview-09-2025-no-thinkingGoogle · Proprietary

1380±4

46,869

98

78112

glm-4.6vZ.ai · MIT

1378±11

2,785

99

78116

trinity-largeArcee AI · Apache 2.0

1376±14

1,690

100

93109

qwen3-235b-a22bAlibaba · Apache 2.0

1375±5

27,026

101

93109

gemini-2.5-flash-lite-preview-06-17-thinkingGoogle · Proprietary

1375±4

33,681

102

94109

qwen2.5-maxAlibaba · Proprietary

1374±4

33,204

103

98109

Anthropicclaude-3-5-sonnet-20241022Anthropic · Proprietary

1372±3

89,297

104

98112

glm-4.5-airZ.ai · MIT

1372±4

31,168

105

98112

Anthropicclaude-3-7-sonnet-20250219Anthropic · Proprietary

1371±4

44,277

106

98115

qwen3-next-80b-a3b-thinkingAlibaba · Apache 2.0

1369±6

13,768

107

98115

Minimaxminimax-m1MiniMax · Apache 2.0

1367±4

36,573

108

98119

amazon-nova-experimental-chat-11-10Amazon · Proprietary

1365±5

18,678

109

102117

gemma-3-27b-itGoogle · Gemma

1365±4

48,462

110

98122

glm-4.7-flashZ.ai · MIT

1364±6

10,247

111

102120

o3-mini-highOpenAI · Proprietary

1364±5

18,584

112

102123

grok-3-mini-highxAI · Proprietary

1363±5

17,418

113

105123

gemini-2.0-flash-001Google · Proprietary

1361±4

44,689

114

105131

deepseek-v3DeepSeek · DeepSeek

1359±5

21,788

115

107132

grok-3-mini-betaxAI · Proprietary

1357±5

23,619

116

105138

intellect-3Prime Intellect · MIT

1356±8

5,289

117

109136

mistral-small-2506Mistral · Apache 2.0

1356±5

18,240

118

111137

gpt-oss-120bOpenAI · Apache 2.0

1354±4

30,767

119

112138

gemini-2.0-flash-lite-preview-02-05Google · Proprietary

1353±4

24,951

120

108140

glm-4.5vZ.ai · MIT

1353±8

4,954

121

114137

Coherecommand-a-03-2025Cohere · CC-BY-NC-4.0

1353±3

57,108

122

114138

gemini-1.5-pro-002Google · Proprietary

1351±3

55,607

123

114140

amazon-nova-experimental-chat-10-20Amazon · Proprietary

1350±6

11,338

124

109149

Tencenthunyuan-turbos-20250226Tencent · Proprietary

1349±12

2,226

125

116140

o3-miniOpenAI · Proprietary

1348±3

58,461

126

111151

amazon-nova-experimental-chat-10-09Amazon · Proprietary

1347±11

2,877

127

110152

Nvidiallama-3.1-nemotron-ultra-253b-v1Nvidia · Nvidia Open Model

1347±12

2,546

128

114147

Minimaxminimax-m2MiniMax · Apache 2.0

1347±8

6,688

129

114149

qwen3-32bAlibaba · Apache 2.0

1347±9

3,932

130

114145

ling-flash-2.0Ant Group · MIT

1347±7

6,998

131

114147

Stepfunstep-3StepFun · Apache 2.0

1346±7

6,568

132

114149

qwen-plus-0125Alibaba · Proprietary

1346±8

5,823

133

119142

gpt-4o-2024-05-13OpenAI · Proprietary

1346±3

112,863

134

116152

glm-4-plus-0111Zhipu · Proprietary

1343±8

5,760

135

122147

Anthropicclaude-3-5-sonnet-20240620Anthropic · Proprietary

1343±3

82,417

136

116154

gemma-3-12b-itGoogle · Gemma

1342±10

3,829

137

116155

Nvidianvidia-llama-3.3-nemotron-super-49b-v1.5Nvidia · Nvidia Open

1342±10

3,402

138

115157

Tencenthunyuan-turbo-0110Tencent · Proprietary

1340±12

2,295

139

122156

gpt-5-nano-highOpenAI · Proprietary

1338±7

8,355

140

125156

nova-2-liteAmazon · Proprietary

1337±6

12,112

141

126153

o1-miniOpenAI · Proprietary

1337±4

51,986

142

126156

qwq-32bAlibaba · Apache 2.0

1336±4

26,003

143

130155

Metallama-3.1-405b-instruct-bf16Meta · Llama 3.1 Community

1335±4

41,392

144

130156

grok-2-2024-08-13xAI · Proprietary

1335±4

63,495

145

127156

gpt-4o-2024-08-06OpenAI · Proprietary

1335±4

45,498

146

126156

gemini-advanced-0514Google · Proprietary

1335±5

50,142

147

125163

Stepfunstep-2-16k-exp-202412StepFun · Proprietary

1334±9

4,829

148

133156

Metallama-3.1-405b-instruct-fp8Meta · Llama 3.1 Community

1334±4

59,655

149

133165

olmo-3.1-32b-instructAllen AI · Apache 2.0

1330±6

12,252

150

117187

molmo-2-8bAI2 · Apache 2.0

1329±21

816

151

136167

01.AIyi-lightning01 AI · Proprietary

1329±5

27,340

152

137167

qwen3-30b-a3bAlibaba · Apache 2.0

1328±5

27,289

153

138167

Metallama-4-maverick-17b-128e-instructMeta · Llama 4

1328±4

40,938

154

127178

Nvidiallama-3.3-nemotron-49b-super-v1Nvidia · Nvidia

1327±12

2,230

155

134178

Tencenthunyuan-large-2025-02-10Tencent · Proprietary

1327±10

3,738

156

148174

gpt-4-turbo-2024-04-09OpenAI · Proprietary

1324±4

98,130

157

148174

Anthropicclaude-3-5-haiku-20241022Anthropic · Proprietary

1324±3

70,983

158

140178

deepseek-v2.5-1210DeepSeek · DeepSeek

1323±8

6,793

159

148174

gemini-1.5-pro-001Google · Proprietary

1323±4

79,132

160

148176

Metallama-4-scout-17b-16e-instructMeta · Llama

1323±5

31,061

161

149174

Anthropicclaude-3-opus-20240229Anthropic · Proprietary

1322±3

194,904

162

148178

Stepfunstep-1o-turbo-202506StepFun · Proprietary

1322±7

9,622

163

147178

gpt-4.1-nano-2025-04-14OpenAI · Proprietary

1322±8

6,107

164

148180

ring-flash-2.0Ant Group · MIT

1320±7

7,149

165

153178

Metallama-3.3-70b-instructMeta · Llama-3.3

1319±3

55,456

166

150178

glm-4-plusZhipu AI · Proprietary

1319±5

26,134

167

149178

gemma-3n-e4b-itGoogle · Gemma

1319±5

23,197

168

150181

qwen-max-0919Alibaba · Qwen

1318±6

16,479

169

153178

gpt-4o-mini-2024-07-18OpenAI · Proprietary

1318±3

68,794

170

153185

gpt-oss-20bOpenAI · Apache 2.0

1317±6

10,760

171

153184

Nvidianvidia-nemotron-3-nano-30b-a3b-bf16Nvidia · NVIDIA Open Model

1317±6

15,408

172

153186

qwen2.5-plus-1127Alibaba · Proprietary

1315±6

10,179

173

157185

athene-v2-chatNexusFlow · NexusFlow

1315±5

24,746

174

157185

mistral-large-2407Mistral · Mistral Research

1314±4

45,460

175

158186

gpt-4-0125-previewOpenAI · Proprietary

1313±4

93,439

176

158186

gpt-4-1106-previewOpenAI · Proprietary

1313±4

100,107

177

153190

Tencenthunyuan-standard-2025-02-10Tencent · Proprietary

1312±10

3,905

178

167189

gemini-1.5-flash-002Google · Proprietary

1310±4

34,909

179

153196

mercuryInception AI · Proprietary

1309±14

1,887

180

169190

grok-2-mini-2024-08-13xAI · Proprietary

1308±4

52,574

181

169190

deepseek-v2.5DeepSeek · DeepSeek

1307±5

24,574

182

169190

athene-70b-0725NexusFlow · CC-BY-NC-4.0

1306±6

19,622

183

167190

olmo-3-32b-thinkAllen AI · Apache 2.0

1306±8

5,869

184

173190

mistral-large-2411Mistral · MRL

1305±4

28,081

185

170190

magistral-medium-2506Mistral · Proprietary

1305±6

11,991

186

176190

mistral-small-3.1-24b-instruct-2503Mistral · Apache 2.0

1304±4

33,907

187

168197

gemma-3-4b-itGoogle · Gemma

1303±9

4,177

188

177190

qwen2.5-72b-instructAlibaba · Qwen

1303±4

39,409

189

177200

Nvidiallama-3.1-nemotron-70b-instructNvidia · Llama 3.1

1299±8

7,136

190

178201

Tencenthunyuan-large-visionTencent · Proprietary

1296±9

5,565

191

187201

Metallama-3.1-70b-instructMeta · Llama 3.1 Community

1294±4

55,234

192

188202

amazon-nova-pro-v1.0Amazon · Proprietary

1290±5

24,753

193

187205

jamba-1.5-largeAI21 Labs · Jamba Open

1289±7

8,659

194

189203

gemma-2-27b-itGoogle · Gemma license

1288±3

75,764

195

187205

reka-core-20240904Reka AI · Proprietary

1288±7

7,309

196

187211

ibm-granite-h-smallIBM · Apache 2.0

1287±8

5,622

197

189205

gpt-4-0314OpenAI · Proprietary

1287±5

54,167

198

187211

llama-3.1-tulu-3-70bAi2 · Llama 3.1

1287±10

2,846

199

187211

Nvidiallama-3.1-nemotron-51b-instructNvidia · Llama 3.1

1286±10

3,749

200

190205

gemini-1.5-flash-001Google · Proprietary

1286±4

62,823

201

189211

olmo-3.1-32b-thinkAllen AI · Apache 2.0

1285±7

8,437

202

193211

Anthropicclaude-3-sonnet-20240229Anthropic · Proprietary

1281±4

109,289

203

192211

gemma-2-9b-it-simpoPrinceton · MIT

1279±7

10,069

204

194211

Nvidianemotron-4-340b-instructNvidia · NVIDIA Open Model

1278±5

19,661

205

194213

Coherecommand-r-plus-08-2024Cohere · CC-BY-NC-4.0

1277±7

9,869

206

198211

Metallama-3-70b-instructMeta · Llama 3 Community

1276±4

156,880

207

198212

gpt-4-0613OpenAI · Proprietary

1275±4

88,721

208

198214

mistral-small-24b-instruct-2501Mistral · Apache 2.0

1274±6

14,677

209

198215

glm-4-0520Zhipu AI · Proprietary

1273±7

9,788

210

198217

reka-flash-20240904Reka AI · Proprietary

1272±7

7,537

211

198220

qwen2.5-coder-32b-instructAlibaba · Apache 2.0

1271±8

5,430

212

206220

Coherec4ai-aya-expanse-32bCohere · CC-BY-NC-4.0

1267±5

27,123

213

208220

gemma-2-9b-itGoogle · Gemma license

1266±4

54,615

214

207221

deepseek-coder-v2DeepSeek · DeepSeek License

1264±6

15,147

215

210221

Coherecommand-r-plusCohere · CC-BY-NC-4.0

1262±4

77,556

216

209221

qwen2-72b-instructAlibaba · Qianwen LICENSE

1262±5

37,325

217

211221

Anthropicclaude-3-haiku-20240307Anthropic · Proprietary

1261±4

117,705

218

210222

amazon-nova-lite-v1.0Amazon · Proprietary

1261±5

19,376

219

211222

gemini-1.5-flash-8b-001Google · Proprietary

1259±4

35,556

220

214222

Azurephi-4Microsoft · MIT

1256±5

24,126

221

211228

olmo-2-0325-32b-instructAllen AI · Apache-2.0

1252±11

3,335

222

218227

Coherecommand-r-08-2024Cohere · CC-BY-NC-4.0

1250±7

10,141

223

221231

mistral-large-2402Mistral · Proprietary

1243±5

62,437

224

221231

amazon-nova-micro-v1.0Amazon · Proprietary

1241±5

19,355

225

221235

jamba-1.5-miniAI21 Labs · Jamba Open

1239±7

8,854

226

221239

ministral-8b-2410Mistral · MRL

1237±9

4,780

227

222239

gemini-pro-dev-apiGoogle · Proprietary

1235±7

18,352

228

223239

qwen1.5-110b-chatAlibaba · Qianwen LICENSE

1234±6

26,191

229

221241

Tencenthunyuan-standard-256kTencent · Proprietary

1233±12

2,729

230

223240

reka-flash-21b-20240226-onlineReka AI · Proprietary

1233±7

15,451

231

223239

qwen1.5-72b-chatAlibaba · Qianwen LICENSE

1233±5

39,296

232

225240

mixtral-8x22b-instruct-v0.1Mistral · Apache 2.0

1230±5

51,417

233

225241

Coherecommand-rCohere · CC-BY-NC-4.0

1227±5

54,038

234

225241

reka-flash-21b-20240226Reka AI · Proprietary

1227±6

24,806

235

226242

gpt-3.5-turbo-0125OpenAI · Proprietary

1224±5

66,191

236

230242

Metallama-3-8b-instructMeta · Llama 3 Community

1223±4

104,636

237

226243

mistral-mediumMistral · Proprietary

1223±6

34,552

238

226243

Coherec4ai-aya-expanse-8bCohere · CC-BY-NC-4.0

1223±7

9,827

239

225246

gemini-proGoogle · Proprietary

1222±12

6,390

240

226246

llama-3.1-tulu-3-8bAi2 · Llama 3.1

1221±11

2,895

241

237246

01.AIyi-1.5-34b-chat01 AI · Apache-2.0

1213±5

24,142

242

232248

HuggingFacezephyr-orpo-141b-A35b-v0.1HuggingFace · Apache 2.0

1213±11

4,653

243

239246

Metallama-3.1-8b-instructMeta · Llama 3.1 Community

1212±4

49,605

244

235252

granite-3.1-8b-instructIBM · Apache 2.0

1209±11

3,092

245

239252

qwen1.5-32b-chatAlibaba · Qianwen LICENSE

1204±6

21,744

246

239254

gpt-3.5-turbo-1106OpenAI · Proprietary

1203±9

16,616

247

243253

gemma-2-2b-itGoogle · Gemma license

1199±4

46,618

248

243254

Azurephi-3-medium-4k-instructMicrosoft · MIT

1198±5

25,055

249

244254

mixtral-8x7b-instruct-v0.1Mistral · Apache 2.0

1197±4

73,505

250

244259

dbrx-instruct-previewDatabricks · DBRX LICENSE

1195±6

32,196

251

244263

InternLMinternlm2_5-20b-chatInternLM · Other

1191±7

9,902

252

244263

qwen1.5-14b-chatAlibaba · Qianwen LICENSE

1191±7

17,841

253

247269

Azurewizardlm-70bMicrosoft · Llama 2 Community

1185±9

8,214

254

246270

deepseek-llm-67b-chatDeepSeek · DeepSeek License

1185±12

4,933

255

250266

01.AIyi-34b-chat01 AI · Yi License

1184±7

15,483

256

250269

OpenChatopenchat-3.5-0106OpenChat · Apache-2.0

1182±8

12,636

257

250270

OpenChatopenchat-3.5OpenChat · Apache-2.0

1182±10

7,967

258

250270

granite-3.0-8b-instructIBM · Apache 2.0

1182±9

6,643

259

251270

gemma-1.1-7b-itGoogle · Gemma license

1180±6

23,893

260

251270

Snowflakesnowflake-arctic-instructSnowflake · Apache 2.0

1180±6

32,836

261

250271

granite-3.1-2b-instructIBM · Apache 2.0

1179±11

3,191

262

251271

tulu-2-dpo-70bAllenAI/UW · AI2 ImpACT Low-risk

1178±10

6,534

263

251274

openhermes-2.5-mistral-7bNousResearch · Apache-2.0

1175±10

5,006

264

253273

vicuna-33bLMSYS · Non-commercial

1173±6

22,479

265

253276

starling-lm-7b-betaNexusflow · Apache-2.0

1172±7

16,057

266

253274

Azurephi-3-small-8k-instructMicrosoft · MIT

1171±6

17,763

267

254274

Metallama-2-70b-chatMeta · Llama 2 Community

1171±6

38,491

268

254277

starling-lm-7b-alphaUC Berkeley · CC-BY-NC-4.0

1168±8

10,224

269

256277

Metallama-3.2-3b-instructMeta · Llama 3.2

1167±8

7,936

270

254280

nous-hermes-2-mixtral-8x7b-dpoNousResearch · Apache-2.0

1165±12

3,776

271

261286

qwq-32b-previewAlibaba · Apache 2.0

1157±12

3,233

272

267284

granite-3.0-2b-instructIBM · Apache 2.0

1156±8

6,837

273

263287

Nvidiallama2-70b-steerlm-chatNvidia · Llama 2 Community

1155±13

3,584

274

264290

solar-10.7b-instruct-v1.0Upstage AI · CC-BY-NC-4.0

1153±13

4,155

275

263292

dolphin-2.2.1-mistral-7bCognitive Computations · Apache-2.0

1152±15

1,679

276

268290

mpt-30b-chatMosaicML · CC-BY-NC-SA-4.0

1150±12

2,571

277

270287

mistral-7b-instruct-v0.2Mistral · Apache-2.0

1150±7

19,402

278

270289

Azurewizardlm-13bMicrosoft · Llama 2 Community

1149±9

7,046

279

267294

falcon-180b-chatTII · Falcon-180B TII License

1147±17

1,295

280

270293

qwen1.5-7b-chatAlibaba · Qianwen LICENSE

1144±10

4,735

281

271292

Azurephi-3-mini-4k-instruct-june-2024Microsoft · MIT

1143±6

12,296

282

271293

Metallama-2-13b-chatMeta · Llama 2 Community

1142±7

19,171

283

271293

vicuna-13bLMSYS · Llama 2 Community

1141±7

19,366

284

271295

qwen-14b-chatAlibaba · Qianwen LICENSE

1139±11

4,964

285

272295

palm-2Google · Proprietary

1137±9

8,554

286

272295

Metacodellama-34b-instructMeta · Llama 2 Community

1137±9

7,363

287

273295

gemma-7b-itGoogle · Gemma license

1136±10

8,925

288

275296

HuggingFacezephyr-7b-betaHuggingFace · MIT

1131±9

11,116

289

278297

Azurephi-3-mini-128k-instructMicrosoft · MIT

1129±7

20,691

290

280296

Azurephi-3-mini-4k-instructMicrosoft · MIT

1129±6

20,115

291

276300

guanaco-33bUW · Non-commercial

1127±12

2,921

292

275300

HuggingFacezephyr-7b-alphaHuggingFace · MIT

1127±16

1,785

293

283300

stripedhyena-nous-7bTogether AI · Apache 2.0

1121±11

5,184

294

278301

Metacodellama-70b-instructMeta · Llama 2 Community

1119±18

1,143

295

288300

vicuna-7bLMSYS · Llama 2 Community

1115±9

6,923

296

284301

HuggingFacesmollm2-1.7b-instructHuggingFace · Apache 2.0

1114±14

2,201

297

290300

gemma-1.1-2b-itGoogle · Gemma license

1114±8

10,853

298

291300

Metallama-3.2-1b-instructMeta · Llama 3.2

1111±8

8,045

299

291301

mistral-7b-instructMistral · Apache 2.0

1110±9

8,977

300

291301

Metallama-2-7b-chatMeta · Llama 2 Community

1108±7

14,148

301

297305

gemma-2b-itGoogle · Gemma license

1092±12

4,779

302

301304

qwen1.5-4b-chatAlibaba · Qianwen LICENSE

1090±9

7,598

303

301308

olmo-7b-instructAllen AI · Apache-2.0

1075±11

6,329

304

302308

koala-13bUC Berkeley · Non-commercial

1071±10

6,964

305

303308

alpaca-13bStanford · Non-commercial

1068±12

5,745

306

301309

gpt4all-13b-snoozyNomic AI · Non-commercial

1066±15

1,743

307

303309

mpt-7b-chatMosaicML · CC-BY-NC-SA-4.0

1062±12

3,925

308

303309

chatglm3-6bTsinghua · Apache-2.0

1056±12

4,658

309

306311

RWKVRWKV-4-Raven-14BRWKV · Apache 2.0

1041±11

4,845

310

309311

chatglm2-6bTsinghua · Apache-2.0

1024±14

2,657

311

309311

oasst-pythia-12bOpenAssistant · Apache 2.0

1022±11

6,311

312

312315

chatglm-6bTsinghua · Non-commercial

996±13

4,914

313

312315

fastchat-t5-3bLMSYS · Apache 2.0

991±12

4,203

314

312315

dolly-v2-12bDatabricks · MIT

980±14

3,412

315

312316

Metallama-13bMeta · Non-commercial

972±16

2,391

316

315316

Stabilitystablelm-tuned-alpha-7bStability AI · CC-BY-NC-SA-4.0

953±13

3,287

说明

  • 排名 (UB):基于 Bradley-Terry 模型计算的排名。此排名反映了模型在竞技场中的综合表现,并提供了其 Elo 分数的 上界 估计,帮助理解模型的潜在竞争力。

  • 模型:大型语言模型 (LLM) 的名称。部分模型名称可能已嵌入相关链接。

  • 分数:模型在竞技场中通过用户投票获得的 Elo 评分。Elo 评分是一种相对排名系统,分数越高表示模型表现越好。

  • 95% 置信区间 (±):模型 Elo 评分的95%置信区间(例如:±6)。这个区间越小,表示模型的评分越稳定和可靠。

  • 票数:该模型在竞技场中收到的总投票数量。投票数越多,通常意味着其评分的统计可靠性越高。

  • 组织/公司:提供该模型的组织或公司。

  • 许可证:模型的许可协议类型,例如专有 (Proprietary)、Apache 2.0、MIT 等。

数据来源与更新频率

本排行榜数据由自动化脚本直接从 1 2arrow-up-right 官方网站获取。此排行榜由 GitHub Actions 每天自动更新。

免责声明

本报告仅供参考。排行榜数据是动态变化的,并基于特定时间段内用户在 Chatbot Arena 上的偏好投票。数据的完整性和准确性取决于上游数据源。不同模型可能采用不同的许可协议,使用时请务必参考模型提供商的官方说明。

最后更新于

这有帮助吗?