Model Leaderboard

This leaderboard is based on data from Chatbot Arena (lmarena.ai) and is generated by an automated pipeline.

Data updated: 2026-01-23 08:09:03 UTC / 2026-01-23 16:09:03 CST (Beijing time)

Leaderboard
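
The two timestamps above denote the same instant in UTC and in Beijing time (UTC+8). As a minimal sketch of that conversion, assuming the timestamp is available as a plain `YYYY-MM-DD HH:MM:SS` string (the function name `to_beijing` is hypothetical and not part of the original pipeline):

```python
from datetime import datetime, timedelta, timezone

# Beijing time is a fixed UTC+8 offset with no daylight saving time,
# so a fixed-offset timezone is sufficient here.
BEIJING = timezone(timedelta(hours=8), name="CST")


def to_beijing(utc_text: str) -> str:
    """Convert a 'YYYY-MM-DD HH:MM:SS' UTC timestamp string to Beijing time."""
    utc_dt = datetime.strptime(utc_text, "%Y-%m-%d %H:%M:%S").replace(tzinfo=timezone.utc)
    return utc_dt.astimezone(BEIJING).strftime("%Y-%m-%d %H:%M:%S")


print(to_beijing("2026-01-23 08:09:03"))  # 2026-01-23 16:09:03
```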

| Rank | Rank Spread | Model | Score | 95% CI (±) | Votes | Organization | License |
|---|---|---|---|---|---|---|---|
| 1 | 1◄─►1 | gemini-3-pro | 1490 | ±5 | 27,827 | Google | Proprietary |
| 2 | 2◄─►4 | grok-4.1-thinking | 1477 | ±5 | 27,985 | xAI | Proprietary |
| 3 | 2◄─►8 | gemini-3-flash | 1472 | ±6 | 13,245 | Google | Proprietary |
| 4 | 2◄─►8 | claude-opus-4-5-20251101-thinking-32k | 1470 | ±5 | 19,898 | Anthropic | Proprietary |
| 5 | 3◄─►9 | claude-opus-4-5-20251101 | 1467 | ±5 | 21,241 | Anthropic | Proprietary |
| 6 | 3◄─►9 | grok-4.1 | 1465 | ±5 | 32,015 | xAI | Proprietary |
| 7 | 3◄─►10 | gemini-3-flash (thinking-minimal) | 1462 | ±7 | 9,644 | Google | Proprietary |
| 8 | 3◄─►15 | ernie-5.0-0110 | 1459 (Preliminary) | ±9 | 4,829 | Baidu | Proprietary |
| 9 | 5◄─►14 | gpt-5.1-high | 1458 | ±5 | 24,439 | OpenAI | Proprietary |
| 10 | 8◄─►18 | gemini-2.5-pro | 1451 | ±3 | 87,641 | Google | Proprietary |
| 11 | 8◄─►18 | claude-sonnet-4-5-20250929-thinking-32k | 1451 | ±4 | 38,441 | Anthropic | Proprietary |
| 12 | 7◄─►20 | ernie-5.0-preview-1203 | 1450 | ±7 | 9,709 | Baidu | Proprietary |
| 13 | 8◄─►18 | claude-sonnet-4-5-20250929 | 1450 | ±4 | 35,025 | Anthropic | Proprietary |
| 14 | 9◄─►18 | claude-opus-4-1-20250805-thinking-16k | 1449 | ±4 | 50,061 | Anthropic | Proprietary |
| 15 | 10◄─►20 | claude-opus-4-1-20250805 | 1445 | ±3 | 67,599 | Anthropic | Proprietary |
| 16 | 8◄─►24 | gpt-5.2 | 1445 | ±9 | 5,187 | OpenAI | Proprietary |
| 17 | 10◄─►22 | gpt-4.5-preview-2025-02-27 | 1444 | ±6 | 14,549 | OpenAI | Proprietary |
| 18 | 14◄─►22 | chatgpt-4o-latest-20250326 | 1442 | ±3 | 74,853 | OpenAI | Proprietary |
| 19 | 10◄─►25 | glm-4.7 | 1441 (Preliminary) | ±7 | 9,556 | Z.ai | MIT |
| 20 | 14◄─►33 | gpt-5.2-high | 1436 | ±8 | 8,594 | OpenAI | Proprietary |
| 21 | 16◄─►27 | gpt-5.1 | 1435 | ±5 | 26,241 | OpenAI | Proprietary |
| 22 | 16◄─►28 | gpt-5-high | 1435 | ±5 | 32,688 | OpenAI | Proprietary |
| 23 | 18◄─►31 | qwen3-max-preview | 1434 | ±5 | 27,894 | Alibaba | Proprietary |
| 24 | 18◄─►32 | o3-2025-04-16 | 1433 | ±4 | 61,435 | OpenAI | Proprietary |
| 25 | 19◄─►35 | grok-4-1-fast-reasoning | 1430 | ±5 | 21,151 | xAI | Proprietary |
| 26 | 20◄─►38 | kimi-k2-thinking-turbo | 1429 | ±5 | 26,054 | Moonshot | Modified MIT |
| 27 | 21◄─►44 | gpt-5-chat | 1426 | ±4 | 31,883 | OpenAI | Proprietary |
| 28 | 23◄─►45 | glm-4.6 | 1425 | ±4 | 33,537 | Z.ai | MIT |
| 29 | 20◄─►45 | qwen3-max-2025-09-23 | 1424 | ±6 | 9,225 | Alibaba | Proprietary |
| 30 | 24◄─►45 | claude-opus-4-20250514-thinking-16k | 1424 | ±4 | 38,020 | Anthropic | Proprietary |
| 31 | 22◄─►48 | deepseek-v3.2-exp | 1423 | ±7 | 11,812 | DeepSeek | MIT |
| 32 | 22◄─►48 | deepseek-v3.2-exp-thinking | 1423 | ±7 | 9,017 | DeepSeek | MIT |
| 33 | 26◄─►45 | qwen3-235b-a22b-instruct-2507 | 1422 | ±3 | 62,099 | Alibaba | Apache 2.0 |
| 34 | 22◄─►50 | grok-4-fast-chat | 1422 | ±8 | 7,001 | xAI | Proprietary |
| 35 | 26◄─►51 | deepseek-v3.2-thinking | 1420 | ±6 | 15,802 | DeepSeek | MIT |
| 36 | 27◄─►52 | deepseek-v3.2 | 1418 | ±5 | 20,503 | DeepSeek | MIT |
| 37 | 27◄─►52 | deepseek-r1-0528 | 1418 | ±6 | 19,306 | DeepSeek | MIT |
| 38 | 27◄─►53 | kimi-k2-0905-preview | 1418 | ±6 | 11,981 | Moonshot | Modified MIT |
| 39 | 25◄─►55 | ernie-5.0-preview-1022 | 1417 | ±9 | 4,643 | Baidu | Proprietary |
| 40 | 27◄─►52 | kimi-k2-0711-preview | 1417 | ±5 | 28,683 | Moonshot | Modified MIT |
| 41 | 27◄─►53 | deepseek-v3.1-thinking | 1417 | ±7 | 11,984 | DeepSeek | MIT |
| 42 | 27◄─►53 | deepseek-v3.1 | 1417 | ±6 | 15,294 | DeepSeek | MIT |
| 43 | 26◄─►57 | deepseek-v3.1-terminus | 1416 | ±10 | 3,764 | DeepSeek | MIT |
| 44 | 25◄─►57 | deepseek-v3.1-terminus-thinking | 1416 | ±10 | 3,552 | DeepSeek | MIT |
| 45 | 28◄─►55 | qwen3-vl-235b-a22b-instruct | 1415 | ±6 | 11,700 | Alibaba | Apache 2.0 |
| 46 | 32◄─►53 | claude-opus-4-20250514 | 1413 | ±4 | 45,596 | Anthropic | Proprietary |
| 47 | 32◄─►53 | gpt-4.1-2025-04-14 | 1413 | ±4 | 52,274 | OpenAI | Proprietary |
| 48 | 34◄─►55 | mistral-medium-2508 | 1412 | ±3 | 55,868 | Mistral | Proprietary |
| 49 | 32◄─►57 | mistral-large-3 | 1411 | ±5 | 16,782 | Mistral | Apache 2.0 |
| 50 | 34◄─►57 | grok-3-preview-02-24 | 1410 | ±4 | 33,991 | xAI | Proprietary |
| 51 | 36◄─►59 | grok-4-0709 | 1409 | ±4 | 42,229 | xAI | Proprietary |
| 52 | 35◄─►63 | glm-4.5 | 1409 | ±5 | 24,811 | Z.ai | MIT |
| 53 | 39◄─►57 | gemini-2.5-flash | 1409 | ±3 | 86,990 | Google | Proprietary |
| 54 | 44◄─►67 | gemini-2.5-flash-preview-09-2025 | 1404 | ±4 | 32,974 | Google | Proprietary |
| 55 | 44◄─►67 | grok-4-fast-reasoning | 1404 | ±5 | 18,700 | xAI | Proprietary |
| 56 | 47◄─►67 | claude-haiku-4-5-20251001 | 1403 | ±4 | 37,029 | Anthropic | Proprietary |
| 57 | 47◄─►67 | o1-2024-12-17 | 1402 | ±4 | 27,822 | OpenAI | Proprietary |
| 58 | 53◄─►69 | qwen3-235b-a22b-no-thinking | 1401 | ±4 | 39,576 | Alibaba | Apache 2.0 |
| 59 | 52◄─►69 | qwen3-next-80b-a3b-instruct | 1401 | ±5 | 22,903 | Alibaba | Apache 2.0 |
| 60 | 53◄─►69 | claude-sonnet-4-20250514-thinking-32k | 1400 | ±4 | 36,250 | Anthropic | Proprietary |
| 61 | 53◄─►75 | longcat-flash-chat | 1399 | ±6 | 11,571 | Meituan | MIT |
| 62 | 54◄─►75 | qwen3-235b-a22b-thinking-2507 | 1398 | ±6 | 9,258 | Alibaba | Apache 2.0 |
| 63 | 54◄─►75 | deepseek-r1 | 1397 | ±5 | 18,537 | DeepSeek | MIT |
| 64 | 52◄─►82 | amazon-nova-experimental-chat-12-10 | 1396 | ±10 | 3,398 | Amazon | Proprietary |
| 65 | 54◄─►80 | mimo-v2-flash (non-thinking) | 1395 | ±7 | 9,293 | Xiaomi | MIT |
| 66 | 54◄─►80 | qwen3-vl-235b-a22b-thinking | 1395 | ±7 | 7,981 | Alibaba | Apache 2.0 |
| 67 | 58◄─►76 | deepseek-v3-0324 | 1394 | ±4 | 46,730 | DeepSeek | MIT |
| 68 | 53◄─►83 | hunyuan-vision-1.5-thinking | 1393 | ±12 | 2,225 | Tencent | Proprietary |
| 69 | 58◄─►82 | mai-1-preview | 1391 | ±5 | 18,164 | Microsoft AI | Proprietary |
| 70 | 61◄─►81 | o4-mini-2025-04-16 | 1391 | ±4 | 46,724 | OpenAI | Proprietary |
| 71 | 61◄─►82 | gpt-5-mini-high | 1390 | ±5 | 27,213 | OpenAI | Proprietary |
| 72 | 61◄─►82 | claude-sonnet-4-20250514 | 1390 | ±4 | 41,675 | Anthropic | Proprietary |
| 73 | 61◄─►82 | claude-3-7-sonnet-20250219-thinking-32k | 1389 | ±4 | 39,913 | Anthropic | Proprietary |
| 74 | 61◄─►82 | o1-preview | 1388 | ±5 | 31,120 | OpenAI | Proprietary |
| 75 | 61◄─►85 | hunyuan-t1-20250711 | 1387 | ±9 | 4,806 | Tencent | Proprietary |
| 76 | 64◄─►83 | qwen3-coder-480b-a35b-instruct | 1386 | ±5 | 26,653 | Alibaba | Apache 2.0 |
| 77 | 65◄─►83 | mistral-medium-2505 | 1384 | ±5 | 34,576 | Mistral | Proprietary |
| 78 | 67◄─►85 | qwen3-30b-a3b-instruct-2507 | 1383 | ±5 | 24,110 | Alibaba | Apache 2.0 |
| 79 | 65◄─►88 | hunyuan-turbos-20250416 | 1382 | ±6 | 11,060 | Tencent | Proprietary |
| 80 | 68◄─►86 | gpt-4.1-mini-2025-04-14 | 1382 | ±4 | 40,558 | OpenAI | Proprietary |
| 81 | 65◄─►89 | minimax-m2.1-preview | 1382 (Preliminary) | ±7 | 8,722 | MiniMax | MIT |
| 82 | 74◄─►90 | gemini-2.5-flash-lite-preview-09-2025-no-thinking | 1378 | ±4 | 40,079 | Google | Proprietary |
| 83 | 65◄─►95 | glm-4.6v | 1378 | ±11 | 2,831 | Z.ai | MIT |
| 84 | 77◄─►92 | gemini-2.5-flash-lite-preview-06-17-thinking | 1375 | ±4 | 33,956 | Google | Proprietary |
| 85 | 77◄─►92 | qwen3-235b-a22b | 1374 | ±5 | 27,180 | Alibaba | Apache 2.0 |
| 86 | 79◄─►92 | qwen2.5-max | 1374 | ±4 | 33,297 | Alibaba | Proprietary |
| 87 | 80◄─►92 | claude-3-5-sonnet-20241022 | 1373 | ±3 | 89,483 | Anthropic | Proprietary |
| 88 | 80◄─►94 | claude-3-7-sonnet-20250219 | 1372 | ±4 | 44,462 | Anthropic | Proprietary |
| 89 | 81◄─►95 | glm-4.5-air | 1371 | ±4 | 31,456 | Z.ai | MIT |
| 90 | 82◄─►98 | qwen3-next-80b-a3b-thinking | 1369 | ±6 | 13,862 | Alibaba | Apache 2.0 |
| 91 | 83◄─►98 | minimax-m1 | 1367 | ±4 | 36,897 | MiniMax | Apache 2.0 |
| 92 | 87◄─►98 | gemma-3-27b-it | 1365 | ±4 | 48,730 | Google | Gemma |
| 93 | 87◄─►102 | o3-mini-high | 1364 | ±5 | 18,584 | OpenAI | Proprietary |
| 94 | 83◄─►106 | amazon-nova-experimental-chat-11-10 | 1364 | ±7 | 9,583 | Amazon | Proprietary |
| 95 | 88◄─►106 | grok-3-mini-high | 1362 | ±5 | 17,542 | xAI | Proprietary |
| 96 | 90◄─►106 | gemini-2.0-flash-001 | 1361 | ±4 | 44,828 | Google | Proprietary |
| 97 | 90◄─►111 | deepseek-v3 | 1358 | ±5 | 21,788 | DeepSeek | DeepSeek |
| 98 | 93◄─►117 | grok-3-mini-beta | 1356 | ±5 | 23,789 | xAI | Proprietary |
| 99 | 93◄─►118 | mistral-small-2506 | 1356 | ±5 | 18,366 | Mistral | Apache 2.0 |
| 100 | 90◄─►120 | intellect-3 | 1355 | ±8 | 5,341 | Prime Intellect | MIT |
| 101 | 94◄─►120 | gpt-oss-120b | 1354 | ±4 | 31,010 | OpenAI | Apache 2.0 |
| 102 | 94◄─►120 | gemini-2.0-flash-lite-preview-02-05 | 1353 | ±4 | 24,951 | Google | Proprietary |
| 103 | 93◄─►121 | glm-4.5v | 1353 | ±8 | 4,983 | Z.ai | MIT |
| 104 | 97◄─►120 | command-a-03-2025 | 1353 | ±3 | 57,503 | Cohere | CC-BY-NC-4.0 |
| 105 | 97◄─►120 | gemini-1.5-pro-002 | 1352 | ±3 | 55,607 | Google | Proprietary |
| 106 | 93◄─►132 | hunyuan-turbos-20250226 | 1349 | ±12 | 2,226 | Tencent | Proprietary |
| 107 | 98◄─►121 | o3-mini | 1348 | ±3 | 58,662 | OpenAI | Proprietary |
| 108 | 94◄─►132 | amazon-nova-experimental-chat-10-09 | 1347 | ±11 | 2,900 | Amazon | Proprietary |
| 109 | 94◄─►133 | llama-3.1-nemotron-ultra-253b-v1 | 1347 | ±12 | 2,546 | Nvidia | Nvidia Open Model |
| 110 | 98◄─►124 | amazon-nova-experimental-chat-10-20 | 1347 | ±6 | 11,466 | Amazon | Proprietary |
| 111 | 97◄─►132 | qwen3-32b | 1346 | ±9 | 3,932 | Alibaba | Apache 2.0 |
| 112 | 98◄─►129 | ling-flash-2.0 | 1346 | ±7 | 7,051 | Ant Group | MIT |
| 113 | 98◄─►130 | step-3 | 1346 | ±7 | 6,600 | StepFun | Apache 2.0 |
| 114 | 97◄─►131 | minimax-m2 | 1346 | ±8 | 6,767 | MiniMax | Apache 2.0 |
| 115 | 100◄─►122 | gpt-4o-2024-05-13 | 1346 | ±3 | 112,863 | OpenAI | Proprietary |
| 116 | 97◄─►132 | qwen-plus-0125 | 1346 | ±8 | 5,823 | Alibaba | Proprietary |
| 117 | 98◄─►133 | glm-4-plus-0111 | 1343 | ±8 | 5,760 | Zhipu | Proprietary |
| 118 | 105◄─►127 | claude-3-5-sonnet-20240620 | 1343 | ±3 | 82,417 | Anthropic | Proprietary |
| 119 | 99◄─►136 | gemma-3-12b-it | 1342 | ±10 | 3,829 | Google | Gemma |
| 120 | 100◄─►137 | nvidia-llama-3.3-nemotron-super-49b-v1.5 | 1340 | ±10 | 3,430 | Nvidia | Nvidia Open |
| 121 | 98◄─►139 | hunyuan-turbo-0110 | 1340 | ±12 | 2,295 | Tencent | Proprietary |
| 122 | 107◄─►137 | gpt-5-nano-high | 1338 | ±7 | 8,401 | OpenAI | Proprietary |
| 123 | 109◄─►134 | o1-mini | 1337 | ±4 | 51,986 | OpenAI | Proprietary |
| 124 | 108◄─►138 | nova-2-lite | 1336 | ±6 | 12,298 | Amazon | Proprietary |
| 125 | 110◄─►134 | llama-3.1-405b-instruct-bf16 | 1336 | ±4 | 41,392 | Meta | Llama 3.1 Community |
| 126 | 109◄─►137 | qwq-32b | 1336 | ±4 | 26,141 | Alibaba | Apache 2.0 |
| 127 | 110◄─►137 | gpt-4o-2024-08-06 | 1335 | ±4 | 45,498 | OpenAI | Proprietary |
| 128 | 109◄─►138 | gemini-advanced-0514 | 1335 | ±5 | 50,142 | Google | Proprietary |
| 129 | 112◄─►136 | grok-2-2024-08-13 | 1335 | ±4 | 63,495 | xAI | Proprietary |
| 130 | 113◄─►137 | llama-3.1-405b-instruct-fp8 | 1334 | ±4 | 59,655 | Meta | Llama 3.1 Community |
| 131 | 108◄─►146 | step-2-16k-exp-202412 | 1334 | ±9 | 4,829 | StepFun | Proprietary |
| 132 | 119◄─►150 | yi-lightning | 1329 | ±5 | 27,340 | 01 AI | Proprietary |
| 133 | 121◄─►150 | llama-4-maverick-17b-128e-instruct | 1328 | ±4 | 41,173 | Meta | Llama 4 |
| 134 | 120◄─►152 | qwen3-30b-a3b | 1328 | ±5 | 27,448 | Alibaba | Apache 2.0 |
| 135 | 111◄─►160 | llama-3.3-nemotron-49b-super-v1 | 1327 | ±12 | 2,230 | Nvidia | Nvidia |
| 136 | 117◄─►160 | hunyuan-large-2025-02-10 | 1326 | ±10 | 3,738 | Tencent | Proprietary |
| 137 | 131◄─►156 | gpt-4-turbo-2024-04-09 | 1325 | ±4 | 98,130 | OpenAI | Proprietary |
| 138 | 131◄─►156 | claude-3-5-haiku-20241022 | 1324 | ±3 | 71,211 | Anthropic | Proprietary |
| 139 | 131◄─►156 | gemini-1.5-pro-001 | 1323 | ±4 | 79,132 | Google | Proprietary |
| 140 | 123◄─►160 | deepseek-v2.5-1210 | 1323 | ±8 | 6,793 | DeepSeek | DeepSeek |
| 141 | 131◄─►158 | llama-4-scout-17b-16e-instruct | 1323 | ±5 | 31,266 | Meta | Llama |
| 142 | 131◄─►156 | claude-3-opus-20240229 | 1323 | ±3 | 194,904 | Anthropic | Proprietary |
| 143 | 130◄─►161 | gpt-4.1-nano-2025-04-14 | 1322 | ±8 | 6,107 | OpenAI | Proprietary |
| 144 | 131◄─►161 | step-1o-turbo-202506 | 1321 | ±7 | 9,701 | StepFun | Proprietary |
| 145 | 128◄─►167 | olmo-3.1-32b-instruct | 1321 | ±10 | 4,161 | Allen AI | Apache 2.0 |
| 146 | 131◄─►162 | ring-flash-2.0 | 1320 | ±7 | 7,207 | Ant Group | MIT |
| 147 | 134◄─►160 | llama-3.3-70b-instruct | 1320 | ±3 | 55,593 | Meta | Llama-3.3 |
| 148 | 132◄─►160 | glm-4-plus | 1319 | ±5 | 26,134 | Zhipu AI | Proprietary |
| 149 | 132◄─►161 | gemma-3n-e4b-it | 1319 | ±5 | 23,367 | Google | Gemma |
| 150 | 132◄─►163 | qwen-max-0919 | 1318 | ±6 | 16,479 | Alibaba | Qwen |
| 151 | 132◄─►167 | gpt-oss-20b | 1318 | ±6 | 10,812 | OpenAI | Apache 2.0 |
| 152 | 135◄─►161 | gpt-4o-mini-2024-07-18 | 1318 | ±3 | 68,801 | OpenAI | Proprietary |
| 153 | 135◄─►167 | nvidia-nemotron-3-nano-30b-a3b-bf16 | 1317 | ±6 | 11,861 | Nvidia | NVIDIA Open Model |
| 154 | 135◄─►169 | qwen2.5-plus-1127 | 1315 | ±6 | 10,179 | Alibaba | Proprietary |
| 155 | 139◄─►167 | mistral-large-2407 | 1314 | ±4 | 45,460 | Mistral | Mistral Research |
| 156 | 139◄─►167 | athene-v2-chat | 1314 | ±4 | 24,746 | NexusFlow | NexusFlow |
| 157 | 140◄─►167 | gpt-4-1106-preview | 1314 | ±4 | 100,107 | OpenAI | Proprietary |
| 158 | 140◄─►167 | gpt-4-0125-preview | 1314 | ±4 | 93,439 | OpenAI | Proprietary |
| 159 | 135◄─►172 | hunyuan-standard-2025-02-10 | 1312 | ±10 | 3,905 | Tencent | Proprietary |
| 160 | 145◄─►171 | gemini-1.5-flash-002 | 1310 | ±4 | 34,909 | Google | Proprietary |
| 161 | 134◄─►177 | mercury | 1309 | ±14 | 1,928 | Inception AI | Proprietary |
| 162 | 151◄─►172 | grok-2-mini-2024-08-13 | 1308 | ±4 | 52,574 | xAI | Proprietary |
| 163 | 151◄─►172 | deepseek-v2.5 | 1307 | ±5 | 24,574 | DeepSeek | DeepSeek |
| 164 | 151◄─►172 | athene-70b-0725 | 1306 | ±6 | 19,622 | NexusFlow | CC-BY-NC-4.0 |
| 165 | 151◄─►172 | magistral-medium-2506 | 1305 | ±6 | 12,077 | Mistral | Proprietary |
| 166 | 158◄─►172 | mistral-large-2411 | 1305 | ±4 | 28,081 | Mistral | MRL |
| 167 | 149◄─►176 | olmo-3-32b-think | 1305 | ±8 | 5,916 | Allen AI | Apache 2.0 |
| 168 | 158◄─►172 | mistral-small-3.1-24b-instruct-2503 | 1304 | ±4 | 34,152 | Mistral | Apache 2.0 |
| 169 | 150◄─►179 | gemma-3-4b-it | 1303 | ±9 | 4,177 | Google | Gemma |
| 170 | 159◄─►172 | qwen2.5-72b-instruct | 1303 | ±4 | 39,409 | Alibaba | Qwen |
| 171 | 159◄─►182 | llama-3.1-nemotron-70b-instruct | 1298 | ±8 | 7,136 | Nvidia | Llama 3.1 |
| 172 | 160◄─►183 | hunyuan-large-vision | 1296 | ±9 | 5,605 | Tencent | Proprietary |
| 173 | 168◄─►183 | llama-3.1-70b-instruct | 1294 | ±4 | 55,234 | Meta | Llama 3.1 Community |
| 174 | 170◄─►185 | amazon-nova-pro-v1.0 | 1290 | ±5 | 24,753 | Amazon | Proprietary |
| 175 | 169◄─►187 | jamba-1.5-large | 1289 | ±7 | 8,659 | AI21 Labs | Jamba Open |
| 176 | 171◄─►185 | gemma-2-27b-it | 1288 | ±3 | 75,764 | Google | Gemma license |
| 177 | 168◄─►190 | ibm-granite-h-small | 1288 | ±8 | 5,688 | IBM | Apache 2.0 |
| 178 | 170◄─►187 | reka-core-20240904 | 1288 | ±7 | 7,309 | Reka AI | Proprietary |
| 179 | 171◄─►187 | gpt-4-0314 | 1288 | ±5 | 54,167 | OpenAI | Proprietary |
| 180 | 168◄─►193 | llama-3.1-nemotron-51b-instruct | 1287 | ±10 | 3,749 | Nvidia | Llama 3.1 |
| 181 | 168◄─►193 | llama-3.1-tulu-3-70b | 1287 | ±10 | 2,846 | Ai2 | Llama 3.1 |
| 182 | 172◄─►187 | gemini-1.5-flash-001 | 1286 | ±4 | 62,823 | Google | Proprietary |
| 183 | 171◄─►193 | olmo-3.1-32b-think | 1285 | ±7 | 8,429 | Allen AI | Apache 2.0 |
| 184 | 174◄─►193 | claude-3-sonnet-20240229 | 1282 | ±4 | 109,289 | Anthropic | Proprietary |
| 185 | 174◄─►193 | gemma-2-9b-it-simpo | 1280 | ±7 | 10,069 | Princeton | MIT |
| 186 | 176◄─►193 | nemotron-4-340b-instruct | 1278 | ±5 | 19,661 | Nvidia | NVIDIA Open Model |
| 187 | 176◄─►195 | command-r-plus-08-2024 | 1277 | ±7 | 9,869 | Cohere | CC-BY-NC-4.0 |
| 188 | 180◄─►193 | llama-3-70b-instruct | 1277 | ±4 | 156,880 | Meta | Llama 3 Community |
| 189 | 180◄─►194 | gpt-4-0613 | 1276 | ±4 | 88,721 | OpenAI | Proprietary |
| 190 | 181◄─►196 | mistral-small-24b-instruct-2501 | 1274 | ±6 | 14,677 | Mistral | Apache 2.0 |
| 191 | 180◄─►198 | glm-4-0520 | 1274 | ±7 | 9,788 | Zhipu AI | Proprietary |
| 192 | 181◄─►199 | reka-flash-20240904 | 1273 | ±7 | 7,537 | Reka AI | Proprietary |
| 193 | 181◄─►202 | qwen2.5-coder-32b-instruct | 1271 | ±8 | 5,430 | Alibaba | Apache 2.0 |
| 194 | 188◄─►202 | c4ai-aya-expanse-32b | 1267 | ±5 | 27,123 | Cohere | CC-BY-NC-4.0 |
| 195 | 190◄─►202 | gemma-2-9b-it | 1266 | ±4 | 54,615 | Google | Gemma license |
| 196 | 189◄─►203 | deepseek-coder-v2 | 1265 | ±6 | 15,147 | DeepSeek | DeepSeek License |
| 197 | 191◄─►203 | command-r-plus | 1263 | ±4 | 77,556 | Cohere | CC-BY-NC-4.0 |
| 198 | 191◄─►203 | qwen2-72b-instruct | 1263 | ±5 | 37,325 | Alibaba | Qianwen LICENSE |
| 199 | 193◄─►203 | claude-3-haiku-20240307 | 1262 | ±4 | 117,705 | Anthropic | Proprietary |
| 200 | 192◄─►204 | amazon-nova-lite-v1.0 | 1261 | ±5 | 19,376 | Amazon | Proprietary |
| 201 | 193◄─►204 | gemini-1.5-flash-8b-001 | 1259 | ±4 | 35,556 | Google | Proprietary |
| 202 | 196◄─►204 | phi-4 | 1256 | ±5 | 24,126 | Microsoft | MIT |
| 203 | 193◄─►210 | olmo-2-0325-32b-instruct | 1252 | ±11 | 3,335 | Allen AI | Apache-2.0 |
| 204 | 200◄─►209 | command-r-08-2024 | 1251 | ±7 | 10,141 | Cohere | CC-BY-NC-4.0 |
| 205 | 203◄─►213 | mistral-large-2402 | 1243 | ±5 | 62,437 | Mistral | Proprietary |
| 206 | 203◄─►213 | amazon-nova-micro-v1.0 | 1241 | ±5 | 19,355 | Amazon | Proprietary |
| 207 | 203◄─►217 | jamba-1.5-mini | 1239 | ±7 | 8,854 | AI21 Labs | Jamba Open |
| 208 | 203◄─►221 | ministral-8b-2410 | 1237 | ±9 | 4,780 | Mistral | MRL |
| 209 | 204◄─►221 | gemini-pro-dev-api | 1236 | ±7 | 18,352 | Google | Proprietary |
| 210 | 205◄─►221 | qwen1.5-110b-chat | 1235 | ±6 | 26,191 | Alibaba | Qianwen LICENSE |
| 211 | 205◄─►222 | reka-flash-21b-20240226-online | 1234 | ±7 | 15,451 | Reka AI | Proprietary |
| 212 | 205◄─►221 | qwen1.5-72b-chat | 1234 | ±5 | 39,296 | Alibaba | Qianwen LICENSE |
| 213 | 203◄─►223 | hunyuan-standard-256k | 1233 | ±12 | 2,729 | Tencent | Proprietary |
| 214 | 207◄─►222 | mixtral-8x22b-instruct-v0.1 | 1231 | ±5 | 51,417 | Mistral | Apache 2.0 |
| 215 | 207◄─►223 | command-r | 1228 | ±5 | 54,038 | Cohere | CC-BY-NC-4.0 |
| 216 | 207◄─►223 | reka-flash-21b-20240226 | 1227 | ±6 | 24,806 | Reka AI | Proprietary |
| 217 | 208◄─►224 | gpt-3.5-turbo-0125 | 1225 | ±5 | 66,191 | OpenAI | Proprietary |
| 218 | 208◄─►225 | mistral-medium | 1224 | ±5 | 34,552 | Mistral | Proprietary |
| 219 | 212◄─►224 | llama-3-8b-instruct | 1224 | ±4 | 104,636 | Meta | Llama 3 Community |
| 220 | 208◄─►225 | c4ai-aya-expanse-8b | 1223 | ±7 | 9,827 | Cohere | CC-BY-NC-4.0 |
| 221 | 207◄─►228 | gemini-pro | 1223 | ±12 | 6,390 | Google | Proprietary |
| 222 | 208◄─►228 | llama-3.1-tulu-3-8b | 1221 | ±11 | 2,895 | Ai2 | Llama 3.1 |
| 223 | 219◄─►228 | yi-1.5-34b-chat | 1214 | ±5 | 24,142 | 01 AI | Apache-2.0 |
| 224 | 214◄─►229 | zephyr-orpo-141b-A35b-v0.1 | 1214 | ±11 | 4,653 | HuggingFace | Apache 2.0 |
| 225 | 221◄─►228 | llama-3.1-8b-instruct | 1212 | ±4 | 49,605 | Meta | Llama 3.1 Community |
| 226 | 217◄─►234 | granite-3.1-8b-instruct | 1209 | ±11 | 3,092 | IBM | Apache 2.0 |
| 227 | 221◄─►234 | qwen1.5-32b-chat | 1205 | ±6 | 21,744 | Alibaba | Qianwen LICENSE |
| 228 | 221◄─►236 | gpt-3.5-turbo-1106 | 1204 | ±9 | 16,616 | OpenAI | Proprietary |
| 229 | 225◄─►236 | phi-3-medium-4k-instruct | 1199 | ±5 | 25,055 | Microsoft | MIT |
| 230 | 226◄─►236 | gemma-2-2b-it | 1199 | ±4 | 46,618 | Google | Gemma license |
| 231 | 226◄─►236 | mixtral-8x7b-instruct-v0.1 | 1198 | ±4 | 73,505 | Mistral | Apache 2.0 |
| 232 | 226◄─►241 | dbrx-instruct-preview | 1196 | ±6 | 32,196 | Databricks | DBRX LICENSE |
| 233 | 226◄─►245 | internlm2_5-20b-chat | 1192 | ±7 | 9,902 | InternLM | Other |
| 234 | 226◄─►245 | qwen1.5-14b-chat | 1192 | ±7 | 17,841 | Alibaba | Qianwen LICENSE |
| 235 | 228◄─►251 | wizardlm-70b | 1186 | ±9 | 8,214 | Microsoft | Llama 2 Community |
| 236 | 228◄─►252 | deepseek-llm-67b-chat | 1185 | ±12 | 4,933 | DeepSeek | DeepSeek License |
| 237 | 232◄─►248 | yi-34b-chat | 1185 | ±7 | 15,483 | 01 AI | Yi License |
| 238 | 232◄─►251 | openchat-3.5-0106 | 1183 | ±8 | 12,636 | OpenChat | Apache-2.0 |
| 239 | 232◄─►252 | openchat-3.5 | 1183 | ±10 | 7,967 | OpenChat | Apache-2.0 |
| 240 | 232◄─►252 | granite-3.0-8b-instruct | 1183 | ±9 | 6,643 | IBM | Apache 2.0 |
| 241 | 233◄─►252 | gemma-1.1-7b-it | 1181 | ±6 | 23,893 | Google | Gemma license |
| 242 | 233◄─►252 | snowflake-arctic-instruct | 1181 | ±6 | 32,836 | Snowflake | Apache 2.0 |
| 243 | 232◄─►254 | granite-3.1-2b-instruct | 1180 | ±11 | 3,191 | IBM | Apache 2.0 |
| 244 | 233◄─►252 | tulu-2-dpo-70b | 1179 | ±10 | 6,534 | AllenAI/UW | AI2 ImpACT Low-risk |
| 245 | 233◄─►256 | openhermes-2.5-mistral-7b | 1176 | ±10 | 5,006 | NousResearch | Apache-2.0 |
| 246 | 235◄─►255 | vicuna-33b | 1174 | ±6 | 22,479 | LMSYS | Non-commercial |
| 247 | 235◄─►258 | starling-lm-7b-beta | 1172 | ±7 | 16,057 | Nexusflow | Apache-2.0 |
| 248 | 235◄─►256 | phi-3-small-8k-instruct | 1172 | ±6 | 17,763 | Microsoft | MIT |
| 249 | 236◄─►256 | llama-2-70b-chat | 1172 | ±5 | 38,491 | Meta | Llama 2 Community |
| 250 | 236◄─►259 | starling-lm-7b-alpha | 1168 | ±8 | 10,224 | UC Berkeley | CC-BY-NC-4.0 |
| 251 | 238◄─►259 | llama-3.2-3b-instruct | 1167 | ±8 | 7,936 | Meta | Llama 3.2 |
| 252 | 236◄─►262 | nous-hermes-2-mixtral-8x7b-dpo | 1166 | ±12 | 3,776 | NousResearch | Apache-2.0 |
| 253 | 244◄─►269 | qwq-32b-preview | 1157 | ±12 | 3,233 | Alibaba | Apache 2.0 |
| 254 | 249◄─►266 | granite-3.0-2b-instruct | 1157 | ±8 | 6,837 | IBM | Apache 2.0 |
| 255 | 244◄─►269 | llama2-70b-steerlm-chat | 1156 | ±13 | 3,584 | Nvidia | Llama 2 Community |
| 256 | 246◄─►272 | solar-10.7b-instruct-v1.0 | 1153 | ±13 | 4,155 | Upstage AI | CC-BY-NC-4.0 |
| 257 | 245◄─►274 | dolphin-2.2.1-mistral-7b | 1153 | ±15 | 1,679 | Cognitive Computations | Apache-2.0 |
| 258 | 250◄─►272 | mpt-30b-chat | 1151 | ±12 | 2,571 | MosaicML | CC-BY-NC-SA-4.0 |
| 259 | 252◄─►269 | mistral-7b-instruct-v0.2 | 1151 | ±7 | 19,402 | Mistral | Apache-2.0 |
| 260 | 252◄─►271 | wizardlm-13b | 1150 | ±9 | 7,046 | Microsoft | Llama 2 Community |
| 261 | 249◄─►276 | falcon-180b-chat | 1148 | ±17 | 1,295 | TII | Falcon-180B TII License |
| 262 | 252◄─►275 | qwen1.5-7b-chat | 1145 | ±10 | 4,735 | Alibaba | Qianwen LICENSE |
| 263 | 253◄─►274 | phi-3-mini-4k-instruct-june-2024 | 1144 | ±6 | 12,296 | Microsoft | MIT |
| 264 | 253◄─►275 | llama-2-13b-chat | 1142 | ±7 | 19,171 | Meta | Llama 2 Community |
| 265 | 253◄─►275 | vicuna-13b | 1142 | ±7 | 19,366 | LMSYS | Llama 2 Community |
| 266 | 253◄─►277 | qwen-14b-chat | 1140 | ±11 | 4,964 | Alibaba | Qianwen LICENSE |
| 267 | 254◄─►277 | palm-2 | 1138 | ±9 | 8,554 | Google | Proprietary |
| 268 | 254◄─►277 | codellama-34b-instruct | 1138 | ±9 | 7,363 | Meta | Llama 2 Community |
| 269 | 254◄─►277 | gemma-7b-it | 1137 | ±10 | 8,925 | Google | Gemma license |
| 270 | 257◄─►278 | zephyr-7b-beta | 1132 | ±9 | 11,116 | HuggingFace | MIT |
| 271 | 260◄─►278 | phi-3-mini-128k-instruct | 1130 | ±7 | 20,691 | Microsoft | MIT |
| 272 | 262◄─►278 | phi-3-mini-4k-instruct | 1129 | ±6 | 20,115 | Microsoft | MIT |
| 273 | 258◄─►281 | guanaco-33b | 1128 | ±12 | 2,921 | UW | Non-commercial |
| 274 | 257◄─►282 | zephyr-7b-alpha | 1128 | ±16 | 1,785 | HuggingFace | MIT |
| 275 | 265◄─►282 | stripedhyena-nous-7b | 1122 | ±11 | 5,184 | Together AI | Apache 2.0 |
| 276 | 260◄─►283 | codellama-70b-instruct | 1120 | ±18 | 1,143 | Meta | Llama 2 Community |
| 277 | 270◄─►282 | vicuna-7b | 1116 | ±9 | 6,923 | LMSYS | Llama 2 Community |
| 278 | 266◄─►283 | smollm2-1.7b-instruct | 1115 | ±14 | 2,201 | HuggingFace | Apache 2.0 |
| 279 | 273◄─►282 | gemma-1.1-2b-it | 1115 | ±8 | 10,853 | Google | Gemma license |
| 280 | 273◄─►282 | llama-3.2-1b-instruct | 1112 | ±8 | 8,045 | Meta | Llama 3.2 |
| 281 | 273◄─►283 | mistral-7b-instruct | 1111 | ±9 | 8,977 | Mistral | Apache 2.0 |
| 282 | 274◄─►283 | llama-2-7b-chat | 1109 | ±7 | 14,148 | Meta | Llama 2 Community |
| 283 | 279◄─►287 | gemma-2b-it | 1092 | ±12 | 4,779 | Google | Gemma license |
| 284 | 283◄─►286 | qwen1.5-4b-chat | 1091 | ±9 | 7,598 | Alibaba | Qianwen LICENSE |
| 285 | 283◄─►290 | olmo-7b-instruct | 1075 | ±11 | 6,329 | Allen AI | Apache-2.0 |
| 286 | 284◄─►290 | koala-13b | 1071 | ±10 | 6,964 | UC Berkeley | Non-commercial |
| 287 | 285◄─►290 | alpaca-13b | 1069 | ±12 | 5,745 | Stanford | Non-commercial |
| 288 | 283◄─►291 | gpt4all-13b-snoozy | 1067 | ±15 | 1,743 | Nomic AI | Non-commercial |
| 289 | 285◄─►291 | mpt-7b-chat | 1063 | ±12 | 3,925 | MosaicML | CC-BY-NC-SA-4.0 |
| 290 | 285◄─►291 | chatglm3-6b | 1057 | ±12 | 4,658 | Tsinghua | Apache-2.0 |
| 291 | 288◄─►293 | RWKV-4-Raven-14B | 1042 | ±11 | 4,845 | RWKV | Apache 2.0 |
| 292 | 291◄─►293 | chatglm2-6b | 1025 | ±14 | 2,657 | Tsinghua | Apache-2.0 |
| 293 | 291◄─►293 | oasst-pythia-12b | 1023 | ±11 | 6,311 | OpenAssistant | Apache 2.0 |
| 294 | 294◄─►297 | chatglm-6b | 997 | ±13 | 4,914 | Tsinghua | Non-commercial |
| 295 | 294◄─►297 | fastchat-t5-3b | 992 | ±12 | 4,203 | LMSYS | Apache 2.0 |
| 296 | 294◄─►297 | dolly-v2-12b | 981 | ±14 | 3,412 | Databricks | MIT |
| 297 | 294◄─►298 | llama-13b | 973 | ±16 | 2,391 | Meta | Non-commercial |
| 298 | 297◄─►298 | stablelm-tuned-alpha-7b | 954 | ±13 | 3,287 | Stability AI | CC-BY-NC-SA-4.0 |

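Notes

  • Rank (UB): The rank computed from the Bradley-Terry model. It reflects the model's overall performance in the arena and provides an upper-bound estimate based on its Elo score, which helps gauge the model's potential competitiveness.

  • Rank Spread: The range of ranks the model could plausibly hold given the uncertainty in its score, written as best◄─►worst (for example, 2◄─►4).

  • Model: The name of the large language model (LLM). Some model names may have had links embedded in the original page.

  • Score: The Elo rating the model earned from user votes in the arena. Elo is a relative rating system: a higher score indicates better performance. Scores marked "Preliminary" are labeled as such by the upstream leaderboard.

  • 95% CI (±): The 95% confidence interval of the model's Elo rating (for example, ±6). A narrower interval means the rating is more stable and reliable.

  • Votes: The total number of votes the model has received in the arena. More votes generally mean a statistically more reliable rating.

  • Organization: The organization or company that provides the model.

  • License: The type of license the model is released under, such as Proprietary, Apache 2.0, or MIT.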
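
To make the Score and confidence-interval columns concrete, here is a minimal sketch of how a score gap can be read as an expected head-to-head win rate, and how to check whether two models' intervals overlap. It assumes the conventional Elo formula with base 10 and scale 400, and it treats each ± value as a symmetric half-width; neither assumption is confirmed by this report, and both function names are hypothetical.

```python
def expected_win_rate(score_a: float, score_b: float, scale: float = 400.0) -> float:
    """Expected probability that model A is preferred over model B.

    Uses the conventional Elo logistic formula (base 10, scale 400),
    which is an assumption about how the scores are scaled.
    """
    return 1.0 / (1.0 + 10 ** ((score_b - score_a) / scale))


def intervals_overlap(score_a: float, ci_a: float, score_b: float, ci_b: float) -> bool:
    """True if [score - ci, score + ci] for A and B share at least one point."""
    return abs(score_a - score_b) <= ci_a + ci_b


# gemini-3-pro (1490 ±5) vs grok-4.1-thinking (1477 ±5), values from the table
p = expected_win_rate(1490, 1477)
print(f"{p:.3f}")                            # 0.519
print(intervals_overlap(1490, 5, 1477, 5))   # False: the gap of 13 exceeds 5 + 5
# grok-4.1-thinking (1477 ±5) vs gemini-3-flash (1472 ±6)
print(intervals_overlap(1477, 5, 1472, 6))   # True: the gap of 5 is within 5 + 6
```

Under these assumptions, a 13-point gap at the top of the table corresponds to roughly a 52% expected preference rate, which is why overlapping intervals and wide rank spreads are common among adjacent models.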

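Data Source and Update Frequency

The leaderboard data is fetched by an automated script directly from the official Chatbot Arena (lmarena.ai) website. The leaderboard is updated automatically every day by GitHub Actions.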
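
As an illustration of the rendering half of such a pipeline, the sketch below formats one leaderboard record as a pipe-delimited text row. The `Entry` type, its field names, and `render_row` are hypothetical; the fetch step is omitted because this report does not describe how the upstream data is retrieved.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass
class Entry:
    """One leaderboard record (hypothetical structure, for illustration only)."""
    rank: int
    spread: Tuple[int, int]  # (best rank, worst rank)
    model: str
    score: int
    ci: int                  # half-width of the 95% confidence interval
    votes: int
    org: str
    license: str


def render_row(e: Entry) -> str:
    """Format an entry as a pipe-delimited row with thousands separators on votes."""
    cells = [
        str(e.rank),
        f"{e.spread[0]}◄─►{e.spread[1]}",
        e.model,
        str(e.score),
        f"±{e.ci}",
        f"{e.votes:,}",
        e.org,
        e.license,
    ]
    return "| " + " | ".join(cells) + " |"


top = Entry(1, (1, 1), "gemini-3-pro", 1490, 5, 27827, "Google", "Proprietary")
print(render_row(top))
# | 1 | 1◄─►1 | gemini-3-pro | 1490 | ±5 | 27,827 | Google | Proprietary |
```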

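Disclaimer

This report is for reference only. Leaderboard data changes over time and is based on user preference votes cast on Chatbot Arena during a specific period. The completeness and accuracy of the data depend on the upstream data source. Models may be released under different licenses; before using a model, always consult the provider's official documentation.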
