模型榜单

这是一个基于 Chatbot Arena (lmarena.ai) 数据的排行榜,通过自动化流程生成。

数据更新时间: 2026-03-09 08:17:09 UTC / 2026-03-09 16:17:09 CST (北京时间)

排行榜

Rank
Rank Spread
模型
分数
票数

1

14

Anthropicclaude-opus-4-6Anthropic · Proprietary

1504±7

9,170

2

14

Anthropicclaude-opus-4-6-thinkingAnthropic · Proprietary

1502±7

8,313

3

14

gemini-3.1-pro-previewGoogle · Proprietary

1500±9Preliminary

4,041

4

17

grok-4.20-beta1xAI · Proprietary

1491±8Preliminary

5,280

5

47

gemini-3-proGoogle · Proprietary

1485±4

39,923

6

412

gpt-5.4-highOpenAI · Proprietary

1479±10Preliminary

3,503

7

412

gpt-5.2-chat-latest-20260210OpenAI · Proprietary

1479±8

5,786

8

612

gemini-3-flashGoogle · Proprietary

1473±4

30,600

9

612

grok-4.1-thinkingxAI · Proprietary

1473±4

39,309

10

615

Anthropicclaude-opus-4-5-20251101-thinking-32kAnthropic · Proprietary

1470±4

32,516

11

616

Anthropicclaude-opus-4-5-20251101Anthropic · Proprietary

1467±4

37,462

12

619

Bytedancedola-seed-2.0-previewBytedance · Proprietary

1465±8Preliminary

6,712

13

1018

grok-4.1xAI · Proprietary

1462±4

43,536

14

1020

gemini-3-flash (thinking-minimal)Google · Proprietary

1462±5

22,846

15

1030

gpt-5.4OpenAI · Proprietary

1457±10Preliminary

3,417

16

1128

Anthropicclaude-sonnet-4-6Anthropic · Proprietary

1457±8

5,509

17

1226

gpt-5.1-highOpenAI · Proprietary

1455±4

36,204

18

1231

glm-5Z.ai · MIT

1452±7

8,095

19

1431

MoonshotAIkimi-k2.5-thinkingMoonshot · Modified MIT

1451±6

12,710

20

1531

ernie-5.0-0110Baidu · Proprietary

1451±6

15,402

21

1530

Anthropicclaude-sonnet-4-5-20250929Anthropic · Proprietary

1451±4

48,736

22

1333

qwen3.5-397b-a17bAlibaba · Apache 2.0

1451±7

6,836

23

1530

Anthropicclaude-sonnet-4-5-20250929-thinking-32kAnthropic · Proprietary

1450±3

50,801

24

1533

ernie-5.0-preview-1203Baidu · Proprietary

1449±7

9,712

25

1530

gemini-2.5-proGoogle · Proprietary

1449±3

99,103

26

1531

Anthropicclaude-opus-4-1-20250805-thinking-16kAnthropic · Proprietary

1448±3

49,560

27

1633

Anthropicclaude-opus-4-1-20250805Anthropic · Proprietary

1446±3

77,173

28

1636

gpt-4.5-preview-2025-02-27OpenAI · Proprietary

1444±6

14,549

29

2135

chatgpt-4o-latest-20250326OpenAI · Proprietary

1443±3

82,864

30

1737

gpt-5.2-highOpenAI · Proprietary

1442±5

21,161

31

1738

glm-4.7Z.ai · MIT

1441±6

11,936

32

2539

gpt-5.2OpenAI · Proprietary

1439±5

18,159

33

2839

gpt-5.1OpenAI · Proprietary

1438±4

38,793

34

2549

gemini-3.1-flash-lite-previewGoogle · Proprietary

1435±9Preliminary

3,829

35

2944

qwen3-max-previewAlibaba · Proprietary

1434±5

27,631

36

3045

gpt-5-highOpenAI · Proprietary

1434±5

32,325

37

2849

MoonshotAIkimi-k2.5-instantMoonshot · Modified MIT

1434±6

8,823

38

3146

o3-2025-04-16OpenAI · Proprietary

1432±4

60,909

39

3249

grok-4-1-fast-reasoningxAI · Proprietary

1430±4

33,097

40

3451

MoonshotAIkimi-k2-thinking-turboMoonshot · Modified MIT

1429±4

37,861

41

3459

gpt-5-chatOpenAI · Proprietary

1426±4

31,590

42

3661

glm-4.6Z.ai · MIT

1425±4

35,102

43

3463

qwen3-max-2025-09-23Alibaba · Proprietary

1425±6

9,168

44

3463

deepseek-v3.2-expDeepSeek · MIT

1424±6

11,672

45

3762

Anthropicclaude-opus-4-20250514-thinking-16kAnthropic · Proprietary

1424±4

37,678

46

3465

deepseek-v3.2-exp-thinkingDeepSeek · MIT

1423±7

8,942

47

4162

qwen3-235b-a22b-instruct-2507Alibaba · Apache 2.0

1422±3

73,293

48

3567

grok-4-fast-chatxAI · Proprietary

1422±8

6,964

49

4165

deepseek-v3.2DeepSeek · MIT

1421±4

32,541

50

4165

deepseek-v3.2-thinkingDeepSeek · MIT

1420±4

27,370

51

4167

deepseek-r1-0528DeepSeek · MIT

1419±6

19,153

52

3771

ernie-5.0-preview-1022Baidu · Proprietary

1419±9

4,555

53

4170

deepseek-v3.1DeepSeek · MIT

1418±6

15,192

54

3775

qwen3.5-122b-a10bAlibaba · Apache 2.0

1418±10

3,251

55

4171

deepseek-v3.1-thinkingDeepSeek · MIT

1417±7

11,916

56

4171

MoonshotAIkimi-k2-0905-previewMoonshot · Modified MIT

1417±6

11,907

57

4270

MoonshotAIkimi-k2-0711-previewMoonshot · Modified MIT

1417±5

28,423

58

4078

deepseek-v3.1-terminusDeepSeek · MIT

1416±10

3,744

59

4080

deepseek-v3.1-terminus-thinkingDeepSeek · MIT

1416±10

3,535

60

4275

qwen3-vl-235b-a22b-instructAlibaba · Apache 2.0

1415±6

11,596

61

4571

mistral-large-3Mistral · Apache 2.0

1414±4

29,075

62

4184

amazon-nova-experimental-chat-26-01-10Amazon · Proprietary

1414±10

3,387

63

4772

gpt-4.1-2025-04-14OpenAI · Proprietary

1413±4

51,800

64

4774

Anthropicclaude-opus-4-20250514Anthropic · Proprietary

1413±4

45,279

65

5075

grok-3-preview-02-24xAI · Proprietary

1411±4

33,823

66

5275

gemini-2.5-flashGoogle · Proprietary

1411±3

98,369

67

4386

qwen3.5-27bAlibaba · Apache 2.0

1411±10

3,368

68

5275

mistral-medium-2508Mistral · Proprietary

1410±3

67,631

69

5081

glm-4.5Z.ai · MIT

1410±5

24,595

70

5280

grok-4-0709xAI · Proprietary

1409±4

41,749

71

5884

Anthropicclaude-haiku-4-5-20251001Anthropic · Proprietary

1406±3

49,326

72

5986

gemini-2.5-flash-preview-09-2025Google · Proprietary

1404±4

32,518

73

5986

grok-4-fast-reasoningxAI · Proprietary

1403±5

18,431

74

5494

qwen3.5-flashAlibaba · Proprietary

1402±9

4,330

75

5991

Minimaxminimax-m2.5MiniMax · Modified MIT

1402±7

8,017

76

6587

qwen3-235b-a22b-no-thinkingAlibaba · Apache 2.0

1402±4

39,264

77

6588

qwen3-next-80b-a3b-instructAlibaba · Apache 2.0

1402±5

22,670

78

6688

o1-2024-12-17OpenAI · Proprietary

1402±4

27,822

79

6594

longcat-flash-chatMeituan · MIT

1400±6

11,486

80

6989

Anthropicclaude-sonnet-4-20250514-thinking-32kAnthropic · Proprietary

1400±4

35,948

81

6996

qwen3-235b-a22b-thinking-2507Alibaba · Apache 2.0

1399±6

9,177

82

7196

deepseek-r1DeepSeek · MIT

1398±5

18,537

83

68104

qwen3.5-35b-a3bAlibaba · Apache 2.0

1395±10

3,427

84

71103

qwen3-vl-235b-a22b-thinkingAlibaba · Apache 2.0

1395±7

7,919

85

69104

amazon-nova-experimental-chat-12-10Amazon · Proprietary

1395±10

3,696

86

7499

deepseek-v3-0324DeepSeek · MIT

1394±4

46,409

87

66106

Tencenthunyuan-vision-1.5-thinkingTencent · Proprietary

1394±12

2,216

88

75103

mai-1-previewMicrosoft AI · Proprietary

1392±5

18,005

89

77103

mimo-v2-flash (non-thinking)Xiaomi · MIT

1392±5

21,546

90

79103

o4-mini-2025-04-16OpenAI · Proprietary

1391±4

46,343

91

79104

gpt-5-mini-highOpenAI · Proprietary

1390±5

26,935

92

79104

Anthropicclaude-sonnet-4-20250514Anthropic · Proprietary

1389±4

41,340

93

78106

Stepfunstep-3.5-flashStepFun · Apache 2.0

1389±6

10,401

94

81106

o1-previewOpenAI · Proprietary

1388±5

31,120

95

83104

Anthropicclaude-3-7-sonnet-20250219-thinking-32kAnthropic · Proprietary

1388±4

39,706

96

81106

mimo-v2-flash (thinking)Xiaomi · MIT

1387±6

10,861

97

78109

Tencenthunyuan-t1-20250711Tencent · Proprietary

1387±9

4,764

98

83106

qwen3-coder-480b-a35b-instructAlibaba · Apache 2.0

1386±5

26,393

99

83106

Minimaxminimax-m2.1-previewMiniMax · MIT

1385±5

17,078

100

84106

mistral-medium-2505Mistral · Proprietary

1385±5

34,358

101

84108

qwen3-30b-a3b-instruct-2507Alibaba · Apache 2.0

1384±5

23,933

102

84109

Tencenthunyuan-turbos-20250416Tencent · Proprietary

1383±6

10,995

103

88109

gpt-4.1-mini-2025-04-14OpenAI · Proprietary

1382±4

40,288

104

93109

gemini-2.5-flash-lite-preview-09-2025-no-thinkingGoogle · Proprietary

1380±4

46,840

105

84119

glm-4.6vZ.ai · MIT

1378±11

2,784

106

100116

gemini-2.5-flash-lite-preview-06-17-thinkingGoogle · Proprietary

1375±4

33,641

107

100116

qwen3-235b-a22bAlibaba · Apache 2.0

1375±5

27,000

108

101116

qwen2.5-maxAlibaba · Proprietary

1374±4

33,189

109

93119

trinity-largeArcee AI · Apache 2.0

1374±9

4,191

110

105118

Anthropicclaude-3-5-sonnet-20241022Anthropic · Proprietary

1372±3

89,270

111

105119

glm-4.5-airZ.ai · MIT

1372±4

31,132

112

105119

Anthropicclaude-3-7-sonnet-20250219Anthropic · Proprietary

1371±4

44,242

113

105122

qwen3-next-80b-a3b-thinkingAlibaba · Apache 2.0

1369±6

13,767

114

105125

glm-4.7-flashZ.ai · MIT

1367±6

11,734

115

105122

Minimaxminimax-m1MiniMax · Apache 2.0

1367±4

36,424

116

105125

amazon-nova-experimental-chat-11-10Amazon · Proprietary

1366±5

20,934

117

108123

gemma-3-27b-itGoogle · Gemma

1365±4

48,420

118

108127

o3-mini-highOpenAI · Proprietary

1364±5

18,584

119

109129

grok-3-mini-highxAI · Proprietary

1363±5

17,400

120

113130

gemini-2.0-flash-001Google · Proprietary

1361±4

44,666

121

113138

deepseek-v3DeepSeek · DeepSeek

1358±5

21,788

122

115138

grok-3-mini-betaxAI · Proprietary

1357±5

23,585

123

113145

intellect-3Prime Intellect · MIT

1356±8

5,285

124

116143

mistral-small-2506Mistral · Apache 2.0

1356±5

18,219

125

119144

gpt-oss-120bOpenAI · Apache 2.0

1354±4

30,747

126

120145

gemini-2.0-flash-lite-preview-02-05Google · Proprietary

1353±4

24,951

127

121144

Coherecommand-a-03-2025Cohere · CC-BY-NC-4.0

1353±3

57,059

128

116146

glm-4.5vZ.ai · MIT

1353±8

4,947

129

121145

gemini-1.5-pro-002Google · Proprietary

1351±3

55,607

130

121146

amazon-nova-experimental-chat-10-20Amazon · Proprietary

1351±6

11,317

131

118156

Tencenthunyuan-turbos-20250226Tencent · Proprietary

1349±12

2,226

132

123146

o3-miniOpenAI · Proprietary

1348±3

58,415

133

119158

amazon-nova-experimental-chat-10-09Amazon · Proprietary

1347±11

2,873

134

121151

ling-flash-2.0Ant Group · MIT

1347±7

6,988

135

118159

Nvidiallama-3.1-nemotron-ultra-253b-v1Nvidia · Nvidia Open Model

1347±12

2,546

136

121156

qwen3-32bAlibaba · Apache 2.0

1347±9

3,932

137

121153

Minimaxminimax-m2MiniMax · Apache 2.0

1347±8

6,684

138

121152

Stepfunstep-3StepFun · Apache 2.0

1347±7

6,565

139

121156

qwen-plus-0125Alibaba · Proprietary

1346±8

5,823

140

126149

gpt-4o-2024-05-13OpenAI · Proprietary

1346±3

112,863

141

123159

glm-4-plus-0111Zhipu · Proprietary

1343±8

5,760

142

129154

Anthropicclaude-3-5-sonnet-20240620Anthropic · Proprietary

1342±3

82,417

143

123161

gemma-3-12b-itGoogle · Gemma

1342±10

3,829

144

123163

Nvidianvidia-llama-3.3-nemotron-super-49b-v1.5Nvidia · Nvidia Open

1341±10

3,398

145

122164

Tencenthunyuan-turbo-0110Tencent · Proprietary

1340±12

2,295

146

132163

nova-2-liteAmazon · Proprietary

1338±6

12,099

147

132163

gpt-5-nano-highOpenAI · Proprietary

1337±7

8,348

148

133160

o1-miniOpenAI · Proprietary

1337±4

51,986

149

133162

qwq-32bAlibaba · Apache 2.0

1336±4

25,985

150

136163

grok-2-2024-08-13xAI · Proprietary

1335±4

63,495

151

137163

Metallama-3.1-405b-instruct-bf16Meta · Llama 3.1 Community

1335±4

41,392

152

136163

gpt-4o-2024-08-06OpenAI · Proprietary

1335±4

45,498

153

134163

gemini-advanced-0514Google · Proprietary

1335±5

50,142

154

132170

Stepfunstep-2-16k-exp-202412StepFun · Proprietary

1334±9

4,829

155

140163

Metallama-3.1-405b-instruct-fp8Meta · Llama 3.1 Community

1333±4

59,655

156

140171

olmo-3.1-32b-instructAi2 · Apache 2.0

1331±6

12,238

157

124194

molmo-2-8bAi2 · Apache 2.0

1329±21

812

158

143174

01.AIyi-lightning01 AI · Proprietary

1328±5

27,340

159

144174

qwen3-30b-a3bAlibaba · Apache 2.0

1328±5

27,260

160

145175

Metallama-4-maverick-17b-128e-instructMeta · Llama 4

1327±4

40,913

161

135185

Nvidiallama-3.3-nemotron-49b-super-v1Nvidia · Nvidia

1327±12

2,230

162

141185

Tencenthunyuan-large-2025-02-10Tencent · Proprietary

1326±10

3,738

163

155181

gpt-4-turbo-2024-04-09OpenAI · Proprietary

1324±4

98,130

164

146185

deepseek-v2.5-1210DeepSeek · DeepSeek

1323±8

6,793

165

155181

Anthropicclaude-3-5-haiku-20241022Anthropic · Proprietary

1323±3

70,951

166

155181

gemini-1.5-pro-001Google · Proprietary

1323±4

79,132

167

155183

Metallama-4-scout-17b-16e-instructMeta · Llama

1322±5

31,023

168

155185

Stepfunstep-1o-turbo-202506StepFun · Proprietary

1322±7

9,606

169

154186

gpt-4.1-nano-2025-04-14OpenAI · Proprietary

1322±8

6,107

170

156182

Anthropicclaude-3-opus-20240229Anthropic · Proprietary

1322±3

194,904

171

155187

ring-flash-2.0Ant Group · MIT

1320±7

7,147

172

157186

gemma-3n-e4b-itGoogle · Gemma

1319±5

23,170

173

157185

glm-4-plusZhipu AI · Proprietary

1319±5

26,134

174

160185

Metallama-3.3-70b-instructMeta · Llama-3.3

1319±3

55,436

175

157188

qwen-max-0919Alibaba · Qwen

1318±6

16,479

176

160186

gpt-4o-mini-2024-07-18OpenAI · Proprietary

1318±3

68,794

177

160188

Nvidianvidia-nemotron-3-nano-30b-a3b-bf16Nvidia · NVIDIA Open Model

1317±6

15,394

178

159192

gpt-oss-20bOpenAI · Apache 2.0

1317±6

10,751

179

160193

qwen2.5-plus-1127Alibaba · Proprietary

1315±6

10,179

180

163192

athene-v2-chatNexusFlow · NexusFlow

1314±5

24,746

181

164192

mistral-large-2407Mistral · Mistral Research

1314±4

45,460

182

165193

gpt-4-0125-previewOpenAI · Proprietary

1313±4

93,439

183

165193

gpt-4-1106-previewOpenAI · Proprietary

1313±4

100,107

184

160197

Tencenthunyuan-standard-2025-02-10Tencent · Proprietary

1311±10

3,905

185

174196

gemini-1.5-flash-002Google · Proprietary

1310±4

34,909

186

160204

mercuryInception AI · Proprietary

1308±14

1,884

187

177197

grok-2-mini-2024-08-13xAI · Proprietary

1308±4

52,574

188

177197

deepseek-v2.5DeepSeek · DeepSeek

1307±5

24,574

189

171197

olmo-3-32b-thinkAi2 · Apache 2.0

1306±8

5,863

190

177197

athene-70b-0725NexusFlow · CC-BY-NC-4.0

1306±6

19,622

191

180197

mistral-large-2411Mistral · MRL

1305±4

28,081

192

177197

magistral-medium-2506Mistral · Proprietary

1305±6

11,974

193

183197

mistral-small-3.1-24b-instruct-2503Mistral · Apache 2.0

1304±4

33,870

194

175204

gemma-3-4b-itGoogle · Gemma

1303±9

4,177

195

184197

qwen2.5-72b-instructAlibaba · Qwen

1303±4

39,409

196

184207

Nvidiallama-3.1-nemotron-70b-instructNvidia · Llama 3.1

1299±8

7,136

197

185208

Tencenthunyuan-large-visionTencent · Proprietary

1296±9

5,563

198

194208

Metallama-3.1-70b-instructMeta · Llama 3.1 Community

1293±4

55,234

199

194209

amazon-nova-pro-v1.0Amazon · Proprietary

1290±5

24,753

200

194212

jamba-1.5-largeAI21 Labs · Jamba Open

1288±7

8,659

201

196210

gemma-2-27b-itGoogle · Gemma license

1288±3

75,764

202

194212

reka-core-20240904Reka AI · Proprietary

1288±7

7,309

203

194218

ibm-granite-h-smallIBM · Apache 2.0

1287±8

5,619

204

196212

gpt-4-0314OpenAI · Proprietary

1287±5

54,167

205

194218

llama-3.1-tulu-3-70bAi2 · Llama 3.1

1286±10

2,846

206

194218

Nvidiallama-3.1-nemotron-51b-instructNvidia · Llama 3.1

1286±10

3,749

207

197212

gemini-1.5-flash-001Google · Proprietary

1285±4

62,823

208

196218

olmo-3.1-32b-thinkAi2 · Apache 2.0

1285±7

8,441

209

200218

Anthropicclaude-3-sonnet-20240229Anthropic · Proprietary

1281±4

109,289

210

199218

gemma-2-9b-it-simpoPrinceton · MIT

1279±7

10,069

211

201218

Nvidianemotron-4-340b-instructNvidia · NVIDIA Open Model

1277±5

19,661

212

201220

Coherecommand-r-plus-08-2024Cohere · CC-BY-NC-4.0

1276±7

9,869

213

205218

Metallama-3-70b-instructMeta · Llama 3 Community

1276±4

156,880

214

205219

gpt-4-0613OpenAI · Proprietary

1275±4

88,721

215

205221

mistral-small-24b-instruct-2501Mistral · Apache 2.0

1274±6

14,677

216

205222

glm-4-0520Zhipu AI · Proprietary

1273±7

9,788

217

205224

reka-flash-20240904Reka AI · Proprietary

1272±7

7,537

218

205227

qwen2.5-coder-32b-instructAlibaba · Apache 2.0

1270±8

5,430

219

213227

Coherec4ai-aya-expanse-32bCohere · CC-BY-NC-4.0

1267±5

27,123

220

215227

gemma-2-9b-itGoogle · Gemma license

1265±4

54,615

221

214228

deepseek-coder-v2DeepSeek · DeepSeek License

1264±6

15,147

222

217228

Coherecommand-r-plusCohere · CC-BY-NC-4.0

1262±4

77,556

223

216228

qwen2-72b-instructAlibaba · Qianwen LICENSE

1261±5

37,325

224

218228

Anthropicclaude-3-haiku-20240307Anthropic · Proprietary

1261±4

117,705

225

217229

amazon-nova-lite-v1.0Amazon · Proprietary

1261±5

19,376

226

218229

gemini-1.5-flash-8b-001Google · Proprietary

1259±4

35,556

227

221229

Azurephi-4Microsoft · MIT

1256±5

24,126

228

218235

olmo-2-0325-32b-instructAi2 · Apache-2.0

1252±11

3,335

229

225234

Coherecommand-r-08-2024Cohere · CC-BY-NC-4.0

1250±7

10,141

230

228238

mistral-large-2402Mistral · Proprietary

1242±5

62,437

231

228238

amazon-nova-micro-v1.0Amazon · Proprietary

1241±5

19,355

232

228241

jamba-1.5-miniAI21 Labs · Jamba Open

1239±7

8,854

233

228246

ministral-8b-2410Mistral · MRL

1237±9

4,780

234

229246

gemini-pro-dev-apiGoogle · Proprietary

1235±7

18,352

235

230245

qwen1.5-110b-chatAlibaba · Qianwen LICENSE

1234±6

26,191

236

228248

Tencenthunyuan-standard-256kTencent · Proprietary

1233±12

2,729

237

230247

reka-flash-21b-20240226-onlineReka AI · Proprietary

1233±7

15,451

238

230246

qwen1.5-72b-chatAlibaba · Qianwen LICENSE

1233±5

39,296

239

232247

mixtral-8x22b-instruct-v0.1Mistral · Apache 2.0

1229±5

51,417

240

233248

Coherecommand-rCohere · CC-BY-NC-4.0

1227±5

54,038

241

232248

reka-flash-21b-20240226Reka AI · Proprietary

1226±6

24,806

242

233249

gpt-3.5-turbo-0125OpenAI · Proprietary

1224±5

66,191

243

237249

Metallama-3-8b-instructMeta · Llama 3 Community

1223±4

104,636

244

233250

Coherec4ai-aya-expanse-8bCohere · CC-BY-NC-4.0

1223±7

9,827

245

234250

mistral-mediumMistral · Proprietary

1223±6

34,552

246

232253

gemini-proGoogle · Proprietary

1221±12

6,390

247

233252

llama-3.1-tulu-3-8bAi2 · Llama 3.1

1220±11

2,895

248

244253

01.AIyi-1.5-34b-chat01 AI · Apache-2.0

1213±5

24,142

249

239255

HuggingFacezephyr-orpo-141b-A35b-v0.1HuggingFace · Apache 2.0

1212±11

4,653

250

246253

Metallama-3.1-8b-instructMeta · Llama 3.1 Community

1211±4

49,605

251

242259

granite-3.1-8b-instructIBM · Apache 2.0

1208±11

3,092

252

247259

qwen1.5-32b-chatAlibaba · Qianwen LICENSE

1204±6

21,744

253

246261

gpt-3.5-turbo-1106OpenAI · Proprietary

1202±9

16,616

254

250260

gemma-2-2b-itGoogle · Gemma license

1199±4

46,618

255

250261

Azurephi-3-medium-4k-instructMicrosoft · MIT

1198±5

25,055

256

251261

mixtral-8x7b-instruct-v0.1Mistral · Apache 2.0

1197±4

73,505

257

251266

dbrx-instruct-previewDatabricks · DBRX LICENSE

1195±6

32,196

258

251270

InternLMinternlm2_5-20b-chatInternLM · Other

1191±7

9,902

259

251270

qwen1.5-14b-chatAlibaba · Qianwen LICENSE

1190±7

17,841

260

254276

Azurewizardlm-70bMicrosoft · Llama 2 Community

1184±9

8,214

261

253277

deepseek-llm-67b-chatDeepSeek · DeepSeek License

1184±12

4,933

262

257273

01.AIyi-34b-chat01 AI · Yi License

1183±7

15,483

263

257277

OpenChatopenchat-3.5-0106OpenChat · Apache-2.0

1182±8

12,636

264

257277

OpenChatopenchat-3.5OpenChat · Apache-2.0

1182±10

7,967

265

257277

granite-3.0-8b-instructIBM · Apache 2.0

1182±9

6,643

266

258277

gemma-1.1-7b-itGoogle · Gemma license

1180±6

23,893

267

258277

Snowflakesnowflake-arctic-instructSnowflake · Apache 2.0

1179±6

32,836

268

257278

granite-3.1-2b-instructIBM · Apache 2.0

1179±11

3,191

269

258278

tulu-2-dpo-70bAllenAI/UW · AI2 ImpACT Low-risk

1178±10

6,534

270

258281

openhermes-2.5-mistral-7bNousResearch · Apache-2.0

1175±10

5,006

271

260280

vicuna-33bLMSYS · Non-commercial

1172±6

22,479

272

260282

starling-lm-7b-betaNexusflow · Apache-2.0

1171±7

16,057

273

260281

Azurephi-3-small-8k-instructMicrosoft · MIT

1171±6

17,763

274

261281

Metallama-2-70b-chatMeta · Llama 2 Community

1170±6

38,491

275

261284

starling-lm-7b-alphaUC Berkeley · CC-BY-NC-4.0

1167±8

10,224

276

262284

Metallama-3.2-3b-instructMeta · Llama 3.2

1166±8

7,936

277

261287

nous-hermes-2-mixtral-8x7b-dpoNousResearch · Apache-2.0

1164±12

3,776

278

268292

qwq-32b-previewAlibaba · Apache 2.0

1157±12

3,233

279

274291

granite-3.0-2b-instructIBM · Apache 2.0

1156±8

6,837

280

270294

Nvidiallama2-70b-steerlm-chatNvidia · Llama 2 Community

1155±13

3,584

281

271297

solar-10.7b-instruct-v1.0Upstage AI · CC-BY-NC-4.0

1152±13

4,155

282

270299

dolphin-2.2.1-mistral-7bCognitive Computations · Apache-2.0

1152±15

1,679

283

275297

mpt-30b-chatMosaicML · CC-BY-NC-SA-4.0

1150±12

2,571

284

277294

mistral-7b-instruct-v0.2Mistral · Apache-2.0

1149±7

19,402

285

277296

Azurewizardlm-13bMicrosoft · Llama 2 Community

1149±9

7,046

286

275301

falcon-180b-chatTII · Falcon-180B TII License

1147±17

1,295

287

277300

qwen1.5-7b-chatAlibaba · Qianwen LICENSE

1144±10

4,735

288

278298

Azurephi-3-mini-4k-instruct-june-2024Microsoft · MIT

1143±6

12,296

289

278300

Metallama-2-13b-chatMeta · Llama 2 Community

1141±7

19,171

290

278300

vicuna-13bLMSYS · Llama 2 Community

1140±7

19,366

291

278302

qwen-14b-chatAlibaba · Qianwen LICENSE

1138±11

4,964

292

279302

palm-2Google · Proprietary

1137±9

8,554

293

280302

Metacodellama-34b-instructMeta · Llama 2 Community

1136±9

7,363

294

280302

gemma-7b-itGoogle · Gemma license

1136±10

8,925

295

282303

HuggingFacezephyr-7b-betaHuggingFace · MIT

1131±9

11,116

296

286304

Azurephi-3-mini-128k-instructMicrosoft · MIT

1129±7

20,691

297

287303

Azurephi-3-mini-4k-instructMicrosoft · MIT

1128±6

20,115

298

283307

guanaco-33bUW · Non-commercial

1127±12

2,921

299

282307

HuggingFacezephyr-7b-alphaHuggingFace · MIT

1127±16

1,785

300

290307

stripedhyena-nous-7bTogether AI · Apache 2.0

1121±11

5,184

301

285308

Metacodellama-70b-instructMeta · Llama 2 Community

1119±18

1,143

302

295307

vicuna-7bLMSYS · Llama 2 Community

1114±9

6,923

303

291308

HuggingFacesmollm2-1.7b-instructHuggingFace · Apache 2.0

1114±14

2,201

304

297307

gemma-1.1-2b-itGoogle · Gemma license

1114±8

10,853

305

298307

Metallama-3.2-1b-instructMeta · Llama 3.2

1111±8

8,045

306

298308

mistral-7b-instructMistral · Apache 2.0

1109±9

8,977

307

298308

Metallama-2-7b-chatMeta · Llama 2 Community

1108±7

14,148

308

304312

gemma-2b-itGoogle · Gemma license

1091±12

4,779

309

308311

qwen1.5-4b-chatAlibaba · Qianwen LICENSE

1090±9

7,598

310

308315

olmo-7b-instructAi2 · Apache-2.0

1074±11

6,329

311

309315

koala-13bUC Berkeley · Non-commercial

1070±10

6,964

312

310315

alpaca-13bStanford · Non-commercial

1067±12

5,745

313

308316

gpt4all-13b-snoozyNomic AI · Non-commercial

1065±15

1,743

314

310316

mpt-7b-chatMosaicML · CC-BY-NC-SA-4.0

1061±12

3,925

315

310316

chatglm3-6bTsinghua · Apache-2.0

1056±12

4,658

316

313318

RWKVRWKV-4-Raven-14BRWKV · Apache 2.0

1041±11

4,845

317

316318

chatglm2-6bTsinghua · Apache-2.0

1024±14

2,657

318

316318

oasst-pythia-12bOpenAssistant · Apache 2.0

1022±11

6,311

319

319322

chatglm-6bTsinghua · Non-commercial

995±13

4,914

320

319322

fastchat-t5-3bLMSYS · Apache 2.0

991±12

4,203

321

319322

dolly-v2-12bDatabricks · MIT

980±14

3,412

322

319323

Metallama-13bMeta · Non-commercial

972±16

2,391

323

322323

Stabilitystablelm-tuned-alpha-7bStability AI · CC-BY-NC-SA-4.0

952±13

3,287

说明

  • 排名 (UB):基于 Bradley-Terry 模型计算的排名。此排名反映了模型在竞技场中的综合表现,并提供了其 Elo 分数的 上界 估计,帮助理解模型的潜在竞争力。

  • 模型:大型语言模型 (LLM) 的名称。部分模型名称可能已嵌入相关链接。

  • 分数:模型在竞技场中通过用户投票获得的 Elo 评分。Elo 评分是一种相对排名系统,分数越高表示模型表现越好。

  • 95% 置信区间 (±):模型 Elo 评分的95%置信区间(例如:±6)。这个区间越小,表示模型的评分越稳定和可靠。

  • 票数:该模型在竞技场中收到的总投票数量。投票数越多,通常意味着其评分的统计可靠性越高。

  • 组织/公司:提供该模型的组织或公司。

  • 许可证:模型的许可协议类型,例如专有 (Proprietary)、Apache 2.0、MIT 等。

数据来源与更新频率

本排行榜数据由自动化脚本直接从 1 2arrow-up-right 官方网站获取。此排行榜由 GitHub Actions 每天自动更新。

免责声明

本报告仅供参考。排行榜数据是动态变化的,并基于特定时间段内用户在 Chatbot Arena 上的偏好投票。数据的完整性和准确性取决于上游数据源。不同模型可能采用不同的许可协议,使用时请务必参考模型提供商的官方说明。

最后更新于

这有帮助吗?