Model Leaderboard

This leaderboard is generated by an automated pipeline from Chatbot Arena (lmarena.ai) data.

Data updated: 2026-01-30 08:12:42 UTC / 2026-01-30 16:12:42 CST (Beijing time)

Leaderboard

Rank
Rank Spread
Model
Score
95% CI (±)
Votes
Organization
License

1

1◄─►1

gemini-3-pro

1487

±5

32,205

Google

Proprietary

2

2◄─►5

grok-4.1-thinking

1475

±5

32,194

xAI

Proprietary

3

2◄─►7

gemini-3-flash

1471

±5

22,683

Google

Proprietary

4

2◄─►8

claude-opus-4-5-20251101-thinking-32k

1468

±5

23,990

Anthropic

Proprietary

5

2◄─►8

claude-opus-4-5-20251101

1466

±5

28,752

Anthropic

Proprietary

6

3◄─►8

grok-4.1

1466

±4

36,363

xAI

Proprietary

7

3◄─►10

gemini-3-flash (thinking-minimal)

1463

±6

13,989

Google

Proprietary

8

4◄─►12

gpt-5.1-high

1459

±5

28,296

OpenAI

Proprietary

9

7◄─►21

ernie-5.0-0110

1453 (Preliminary)

±7

8,274

Baidu

Proprietary

10

8◄─►19

claude-sonnet-4-5-20250929-thinking-32k

1450

±4

42,538

Anthropic

Proprietary

11

9◄─►21

claude-sonnet-4-5-20250929

1450

±4

40,121

Anthropic

Proprietary

12

7◄─►21

kimi-k2.5-thinking

1450

±9

4,901

Moonshot

Modified MIT

13

9◄─►19

gemini-2.5-pro

1450

±3

91,734

Google

Proprietary

14

8◄─►21

ernie-5.0-preview-1203

1449

±7

9,786

Baidu

Proprietary

15

9◄─►21

claude-opus-4-1-20250805-thinking-16k

1448

±4

49,975

Anthropic

Proprietary

16

9◄─►21

claude-opus-4-1-20250805

1445

±3

71,681

Anthropic

Proprietary

17

9◄─►23

gpt-4.5-preview-2025-02-27

1444

±6

14,549

OpenAI

Proprietary

18

11◄─►22

chatgpt-4o-latest-20250326

1443

±3

78,991

OpenAI

Proprietary

19

9◄─►26

glm-4.7

1440

±6

12,027

Z.ai

MIT

20

9◄─►27

gpt-5.2

1440

±7

9,387

OpenAI

Proprietary

21

11◄─►27

gpt-5.2-high

1440

±7

12,869

OpenAI

Proprietary

22

17◄─►28

gpt-5.1

1435

±5

30,355

OpenAI

Proprietary

23

18◄─►29

gpt-5-high

1435

±5

32,651

OpenAI

Proprietary

24

19◄─►32

qwen3-max-preview

1434

±5

27,857

Alibaba

Proprietary

25

19◄─►32

o3-2025-04-16

1433

±4

61,365

OpenAI

Proprietary

26

19◄─►37

grok-4-1-fast-reasoning

1430

±5

25,264

xAI

Proprietary

27

20◄─►39

kimi-k2-thinking-turbo

1429

±4

30,011

Moonshot

Modified MIT

28

23◄─►45

gpt-5-chat

1426

±4

31,849

OpenAI

Proprietary

29

22◄─►47

qwen3-max-2025-09-23

1425

±6

9,223

Alibaba

Proprietary

30

26◄─►46

glm-4.6

1425

±4

35,373

Z.ai

MIT

31

26◄─►46

claude-opus-4-20250514-thinking-16k

1424

±4

37,990

Anthropic

Proprietary

32

24◄─►49

deepseek-v3.2-exp

1423

±6

11,774

DeepSeek

MIT

33

24◄─►49

deepseek-v3.2-exp-thinking

1423

±7

9,000

DeepSeek

MIT

34

27◄─►47

qwen3-235b-a22b-instruct-2507

1422

±3

66,045

Alibaba

Apache 2.0

35

24◄─►51

grok-4-fast-chat

1422

±8

6,992

xAI

Proprietary

36

27◄─►51

deepseek-v3.2-thinking

1420

±5

19,713

DeepSeek

MIT

37

28◄─►51

deepseek-v3.2

1420

±5

24,571

DeepSeek

MIT

38

28◄─►53

deepseek-r1-0528

1418

±6

19,289

DeepSeek

MIT

39

28◄─►54

kimi-k2-0905-preview

1418

±6

11,977

Moonshot

Modified MIT

40

26◄─►56

ernie-5.0-preview-1022

1418

±9

4,630

Baidu

Proprietary

41

28◄─►54

kimi-k2-0711-preview

1417

±5

28,666

Moonshot

Modified MIT

42

28◄─►54

deepseek-v3.1

1417

±6

15,304

DeepSeek

MIT

43

28◄─►54

deepseek-v3.1-thinking

1417

±7

11,986

DeepSeek

MIT

44

26◄─►58

deepseek-v3.1-terminus

1416

±10

3,762

DeepSeek

MIT

45

26◄─►59

deepseek-v3.1-terminus-thinking

1416

±10

3,547

DeepSeek

MIT

46

29◄─►56

qwen3-vl-235b-a22b-instruct

1415

±6

11,684

Alibaba

Apache 2.0

47

31◄─►54

mistral-large-3

1414

±5

20,796

Mistral

Apache 2.0

48

33◄─►54

claude-opus-4-20250514

1414

±4

45,572

Anthropic

Proprietary

49

33◄─►54

gpt-4.1-2025-04-14

1413

±4

52,220

OpenAI

Proprietary

50

35◄─►56

mistral-medium-2508

1412

±3

59,880

Mistral

Proprietary

51

35◄─►58

grok-3-preview-02-24

1411

±4

33,980

xAI

Proprietary

52

38◄─►59

grok-4-0709

1410

±4

42,174

xAI

Proprietary

53

38◄─►62

glm-4.5

1410

±5

24,803

Z.ai

MIT

54

39◄─►58

gemini-2.5-flash

1409

±3

90,980

Google

Proprietary

55

46◄─►67

gemini-2.5-flash-preview-09-2025

1405

±4

32,903

Google

Proprietary

56

46◄─►68

grok-4-fast-reasoning

1404

±5

18,646

xAI

Proprietary

57

49◄─►68

claude-haiku-4-5-20251001

1403

±4

41,166

Anthropic

Proprietary

58

49◄─►68

o1-2024-12-17

1402

±4

27,822

OpenAI

Proprietary

59

54◄─►70

qwen3-235b-a22b-no-thinking

1401

±4

39,554

Alibaba

Apache 2.0

60

52◄─►70

qwen3-next-80b-a3b-instruct

1401

±5

22,874

Alibaba

Apache 2.0

61

55◄─►70

claude-sonnet-4-20250514-thinking-32k

1400

±4

36,212

Anthropic

Proprietary

62

54◄─►76

longcat-flash-chat

1399

±6

11,554

Meituan

MIT

63

55◄─►76

qwen3-235b-a22b-thinking-2507

1398

±6

9,248

Alibaba

Apache 2.0

64

55◄─►76

deepseek-r1

1398

±5

18,537

DeepSeek

MIT

65

55◄─►83

amazon-nova-experimental-chat-12-10

1395

±10

3,722

Amazon

Proprietary

66

55◄─►82

qwen3-vl-235b-a22b-thinking

1395

±7

7,974

Alibaba

Apache 2.0

67

56◄─►80

mimo-v2-flash (non-thinking)

1395

±6

13,410

Xiaomi

MIT

68

59◄─►77

deepseek-v3-0324

1394

±4

46,699

DeepSeek

MIT

69

54◄─►84

hunyuan-vision-1.5-thinking

1393

±12

2,222

Tencent

Proprietary

70

59◄─►83

mai-1-preview

1391

±5

18,137

Microsoft AI

Proprietary

71

62◄─►82

o4-mini-2025-04-16

1391

±4

46,681

OpenAI

Proprietary

72

62◄─►83

gpt-5-mini-high

1390

±5

27,174

OpenAI

Proprietary

73

62◄─►83

claude-sonnet-4-20250514

1390

±4

41,643

Anthropic

Proprietary

74

62◄─►83

claude-3-7-sonnet-20250219-thinking-32k

1389

±4

39,890

Anthropic

Proprietary

75

62◄─►83

o1-preview

1388

±5

31,120

OpenAI

Proprietary

76

62◄─►86

hunyuan-t1-20250711

1387

±9

4,801

Tencent

Proprietary

77

65◄─►84

qwen3-coder-480b-a35b-instruct

1387

±5

26,648

Alibaba

Apache 2.0

78

66◄─►84

mistral-medium-2505

1384

±5

34,547

Mistral

Proprietary

79

67◄─►86

qwen3-30b-a3b-instruct-2507

1383

±5

24,093

Alibaba

Apache 2.0

80

67◄─►87

minimax-m2.1-preview

1383

±6

12,907

MiniMax

MIT

81

66◄─►89

hunyuan-turbos-20250416

1382

±6

11,050

Tencent

Proprietary

82

69◄─►87

gpt-4.1-mini-2025-04-14

1382

±4

40,533

OpenAI

Proprietary

83

75◄─►89

gemini-2.5-flash-lite-preview-09-2025-no-thinking

1379

±4

44,231

Google

Proprietary

84

66◄─►97

glm-4.6v

1378

±11

2,822

Z.ai

MIT

85

78◄─►94

gemini-2.5-flash-lite-preview-06-17-thinking

1375

±4

33,931

Google

Proprietary

86

78◄─►94

qwen3-235b-a22b

1374

±5

27,156

Alibaba

Apache 2.0

87

80◄─►94

qwen2.5-max

1374

±4

33,288

Alibaba

Proprietary

88

82◄─►94

claude-3-5-sonnet-20241022

1373

±3

89,457

Anthropic

Proprietary

89

82◄─►96

claude-3-7-sonnet-20250219

1372

±4

44,434

Anthropic

Proprietary

90

84◄─►97

glm-4.5-air

1371

±4

31,396

Z.ai

MIT

91

84◄─►100

qwen3-next-80b-a3b-thinking

1369

±6

13,871

Alibaba

Apache 2.0

92

84◄─►100

minimax-m1

1367

±4

36,843

MiniMax

Apache 2.0

93

84◄─►103

amazon-nova-experimental-chat-11-10

1367

±6

13,391

Amazon

Proprietary

94

88◄─►100

gemma-3-27b-it

1365

±4

48,686

Google

Gemma

95

88◄─►104

o3-mini-high

1364

±5

18,584

OpenAI

Proprietary

96

89◄─►108

grok-3-mini-high

1362

±5

17,525

xAI

Proprietary

97

84◄─►116

glm-4.7-flash

1362

±9

4,090

Z.ai

MIT

98

91◄─►108

gemini-2.0-flash-001

1361

±4

44,815

Google

Proprietary

99

91◄─►114

deepseek-v3

1359

±5

21,788

DeepSeek

DeepSeek

100

94◄─►119

grok-3-mini-beta

1357

±5

23,763

xAI

Proprietary

101

94◄─►120

mistral-small-2506

1356

±5

18,340

Mistral

Apache 2.0

102

91◄─►122

intellect-3

1356

±8

5,325

Prime Intellect

MIT

103

96◄─►121

gpt-oss-120b

1354

±4

30,977

OpenAI

Apache 2.0

104

96◄─►122

gemini-2.0-flash-lite-preview-02-05

1353

±4

24,951

Google

Proprietary

105

94◄─►124

glm-4.5v

1353

±8

4,977

Z.ai

MIT

106

98◄─►122

command-a-03-2025

1353

±3

57,447

Cohere

CC-BY-NC-4.0

107

98◄─►122

gemini-1.5-pro-002

1352

±3

55,607

Google

Proprietary

108

95◄─►134

hunyuan-turbos-20250226

1349

±12

2,226

Tencent

Proprietary

109

100◄─►124

o3-mini

1348

±3

58,645

OpenAI

Proprietary

110

98◄─►126

amazon-nova-experimental-chat-10-20

1348

±6

11,439

Amazon

Proprietary

111

96◄─►135

llama-3.1-nemotron-ultra-253b-v1

1347

±12

2,546

Nvidia

Nvidia Open Model

112

96◄─►134

amazon-nova-experimental-chat-10-09

1347

±11

2,892

Amazon

Proprietary

113

98◄─►134

qwen3-32b

1347

±9

3,932

Alibaba

Apache 2.0

114

98◄─►132

minimax-m2

1347

±8

6,749

MiniMax

Apache 2.0

115

99◄─►131

ling-flash-2.0

1346

±7

7,044

Ant Group

MIT

116

99◄─►132

step-3

1346

±7

6,595

StepFun

Apache 2.0

117

98◄─►133

qwen-plus-0125

1346

±8

5,823

Alibaba

Proprietary

118

103◄─►124

gpt-4o-2024-05-13

1346

±3

112,863

OpenAI

Proprietary

119

100◄─►135

glm-4-plus-0111

1343

±8

5,760

Zhipu

Proprietary

120

107◄─►129

claude-3-5-sonnet-20240620

1343

±3

82,417

Anthropic

Proprietary

121

101◄─►138

gemma-3-12b-it

1342

±10

3,829

Google

Gemma

122

102◄─►140

nvidia-llama-3.3-nemotron-super-49b-v1.5

1341

±10

3,430

Nvidia

Nvidia Open

123

100◄─►141

hunyuan-turbo-0110

1340

±12

2,295

Tencent

Proprietary

124

107◄─►139

gpt-5-nano-high

1338

±7

8,402

OpenAI

Proprietary

125

111◄─►136

o1-mini

1337

±4

51,986

OpenAI

Proprietary

126

110◄─►140

nova-2-lite

1336

±6

12,274

Amazon

Proprietary

127

112◄─►137

llama-3.1-405b-instruct-bf16

1336

±4

41,392

Meta

Llama 3.1 Community

128

111◄─►139

qwq-32b

1336

±4

26,123

Alibaba

Apache 2.0

129

112◄─►139

gpt-4o-2024-08-06

1335

±4

45,498

OpenAI

Proprietary

130

111◄─►140

gemini-advanced-0514

1335

±5

50,142

Google

Proprietary

131

114◄─►139

grok-2-2024-08-13

1335

±4

63,495

xAI

Proprietary

132

116◄─►140

llama-3.1-405b-instruct-fp8

1334

±4

59,655

Meta

Llama 3.1 Community

133

110◄─►148

step-2-16k-exp-202412

1334

±9

4,829

StepFun

Proprietary

134

121◄─►153

yi-lightning

1329

±5

27,340

01 AI

Proprietary

135

122◄─►154

qwen3-30b-a3b

1328

±5

27,428

Alibaba

Apache 2.0

136

123◄─►153

llama-4-maverick-17b-128e-instruct

1328

±4

41,136

Meta

Llama 4

137

113◄─►162

llama-3.3-nemotron-49b-super-v1

1327

±12

2,230

Nvidia

Nvidia

138

118◄─►162

hunyuan-large-2025-02-10

1327

±10

3,738

Tencent

Proprietary

139

133◄─►158

gpt-4-turbo-2024-04-09

1325

±4

98,130

OpenAI

Proprietary

140

133◄─►158

claude-3-5-haiku-20241022

1324

±3

71,179

Anthropic

Proprietary

141

128◄─►162

olmo-3.1-32b-instruct

1324

±7

8,310

Allen AI

Apache 2.0

142

124◄─►162

deepseek-v2.5-1210

1323

±8

6,793

DeepSeek

DeepSeek

143

133◄─►158

gemini-1.5-pro-001

1323

±4

79,132

Google

Proprietary

144

133◄─►160

llama-4-scout-17b-16e-instruct

1323

±5

31,231

Meta

Llama

145

133◄─►158

claude-3-opus-20240229

1323

±3

194,904

Anthropic

Proprietary

146

132◄─►163

gpt-4.1-nano-2025-04-14

1322

±8

6,107

OpenAI

Proprietary

147

133◄─►162

step-1o-turbo-202506

1321

±7

9,687

StepFun

Proprietary

148

133◄─►164

ring-flash-2.0

1320

±7

7,199

Ant Group

MIT

149

136◄─►162

llama-3.3-70b-instruct

1320

±3

55,561

Meta

Llama-3.3

150

134◄─►162

glm-4-plus

1319

±5

26,134

Zhipu AI

Proprietary

151

134◄─►163

gemma-3n-e4b-it

1319

±5

23,347

Google

Gemma

152

134◄─►164

qwen-max-0919

1318

±6

16,479

Alibaba

Qwen

153

137◄─►162

gpt-4o-mini-2024-07-18

1318

±3

68,801

OpenAI

Proprietary

154

134◄─►169

gpt-oss-20b

1318

±6

10,807

OpenAI

Apache 2.0

155

137◄─►169

nvidia-nemotron-3-nano-30b-a3b-bf16

1317

±6

15,510

Nvidia

NVIDIA Open Model

156

137◄─►170

qwen2.5-plus-1127

1315

±6

10,179

Alibaba

Proprietary

157

141◄─►169

athene-v2-chat

1314

±4

24,746

NexusFlow

NexusFlow

158

141◄─►169

mistral-large-2407

1314

±4

45,460

Mistral

Mistral Research

159

142◄─►169

gpt-4-1106-preview

1314

±4

100,107

OpenAI

Proprietary

160

142◄─►169

gpt-4-0125-preview

1314

±4

93,439

OpenAI

Proprietary

161

137◄─►174

hunyuan-standard-2025-02-10

1312

±10

3,905

Tencent

Proprietary

162

134◄─►178

mercury

1310

±14

1,921

Inception AI

Proprietary

163

149◄─►173

gemini-1.5-flash-002

1310

±4

34,909

Google

Proprietary

164

154◄─►174

grok-2-mini-2024-08-13

1308

±4

52,574

xAI

Proprietary

165

154◄─►174

deepseek-v2.5

1307

±5

24,574

DeepSeek

DeepSeek

166

154◄─►174

athene-70b-0725

1306

±6

19,622

NexusFlow

CC-BY-NC-4.0

167

154◄─►174

magistral-medium-2506

1305

±6

12,061

Mistral

Proprietary

168

160◄─►174

mistral-large-2411

1305

±4

28,081

Mistral

MRL

169

152◄─►177

olmo-3-32b-think

1305

±8

5,891

Allen AI

Apache 2.0

170

161◄─►174

mistral-small-3.1-24b-instruct-2503

1305

±4

34,132

Mistral

Apache 2.0

171

154◄─►181

gemma-3-4b-it

1303

±9

4,177

Google

Gemma

172

161◄─►174

qwen2.5-72b-instruct

1303

±4

39,409

Alibaba

Qwen

173

161◄─►184

llama-3.1-nemotron-70b-instruct

1299

±8

7,136

Nvidia

Llama 3.1

174

162◄─►185

hunyuan-large-vision

1296

±9

5,601

Tencent

Proprietary

175

170◄─►185

llama-3.1-70b-instruct

1294

±4

55,234

Meta

Llama 3.1 Community

176

172◄─►186

amazon-nova-pro-v1.0

1290

±5

24,753

Amazon

Proprietary

177

172◄─►189

jamba-1.5-large

1289

±7

8,659

AI21 Labs

Jamba Open

178

171◄─►192

ibm-granite-h-small

1288

±8

5,675

IBM

Apache 2.0

179

173◄─►187

gemma-2-27b-it

1288

±3

75,764

Google

Gemma license

180

172◄─►189

reka-core-20240904

1288

±7

7,309

Reka AI

Proprietary

181

173◄─►189

gpt-4-0314

1288

±5

54,167

OpenAI

Proprietary

182

170◄─►195

llama-3.1-nemotron-51b-instruct

1287

±10

3,749

Nvidia

Llama 3.1

183

170◄─►195

llama-3.1-tulu-3-70b

1287

±10

2,846

Ai2

Llama 3.1

184

174◄─►189

gemini-1.5-flash-001

1286

±4

62,823

Google

Proprietary

185

173◄─►195

olmo-3.1-32b-think

1285

±7

8,498

Allen AI

Apache 2.0

186

177◄─►195

claude-3-sonnet-20240229

1282

±4

109,289

Anthropic

Proprietary

187

176◄─►195

gemma-2-9b-it-simpo

1280

±7

10,069

Princeton

MIT

188

178◄─►195

nemotron-4-340b-instruct

1278

±5

19,661

Nvidia

NVIDIA Open Model

189

178◄─►197

command-r-plus-08-2024

1277

±7

9,869

Cohere

CC-BY-NC-4.0

190

182◄─►195

llama-3-70b-instruct

1277

±4

156,880

Meta

Llama 3 Community

191

182◄─►196

gpt-4-0613

1276

±4

88,721

OpenAI

Proprietary

192

183◄─►198

mistral-small-24b-instruct-2501

1274

±6

14,677

Mistral

Apache 2.0

193

182◄─►200

glm-4-0520

1274

±7

9,788

Zhipu AI

Proprietary

194

183◄─►201

reka-flash-20240904

1273

±7

7,537

Reka AI

Proprietary

195

183◄─►204

qwen2.5-coder-32b-instruct

1271

±8

5,430

Alibaba

Apache 2.0

196

190◄─►204

c4ai-aya-expanse-32b

1267

±5

27,123

Cohere

CC-BY-NC-4.0

197

192◄─►204

gemma-2-9b-it

1266

±4

54,615

Google

Gemma license

198

191◄─►205

deepseek-coder-v2

1265

±6

15,147

DeepSeek

DeepSeek License

199

193◄─►205

command-r-plus

1263

±4

77,556

Cohere

CC-BY-NC-4.0

200

193◄─►205

qwen2-72b-instruct

1263

±5

37,325

Alibaba

Qianwen LICENSE

201

195◄─►205

claude-3-haiku-20240307

1262

±4

117,705

Anthropic

Proprietary

202

194◄─►206

amazon-nova-lite-v1.0

1261

±5

19,376

Amazon

Proprietary

203

195◄─►206

gemini-1.5-flash-8b-001

1259

±4

35,556

Google

Proprietary

204

198◄─►206

phi-4

1256

±5

24,126

Microsoft

MIT

205

195◄─►212

olmo-2-0325-32b-instruct

1253

±11

3,335

Allen AI

Apache-2.0

206

202◄─►211

command-r-08-2024

1251

±7

10,141

Cohere

CC-BY-NC-4.0

207

205◄─►215

mistral-large-2402

1243

±5

62,437

Mistral

Proprietary

208

205◄─►215

amazon-nova-micro-v1.0

1241

±5

19,355

Amazon

Proprietary

209

205◄─►219

jamba-1.5-mini

1239

±7

8,854

AI21 Labs

Jamba Open

210

205◄─►223

ministral-8b-2410

1237

±9

4,780

Mistral

MRL

211

206◄─►223

gemini-pro-dev-api

1236

±7

18,352

Google

Proprietary

212

207◄─►223

qwen1.5-110b-chat

1235

±6

26,191

Alibaba

Qianwen LICENSE

213

207◄─►224

reka-flash-21b-20240226-online

1234

±7

15,451

Reka AI

Proprietary

214

207◄─►223

qwen1.5-72b-chat

1234

±5

39,296

Alibaba

Qianwen LICENSE

215

205◄─►225

hunyuan-standard-256k

1234

±12

2,729

Tencent

Proprietary

216

209◄─►224

mixtral-8x22b-instruct-v0.1

1230

±5

51,417

Mistral

Apache 2.0

217

209◄─►225

command-r

1228

±5

54,038

Cohere

CC-BY-NC-4.0

218

209◄─►225

reka-flash-21b-20240226

1227

±6

24,806

Reka AI

Proprietary

219

210◄─►226

gpt-3.5-turbo-0125

1225

±5

66,191

OpenAI

Proprietary

220

210◄─►227

mistral-medium

1224

±5

34,552

Mistral

Proprietary

221

214◄─►226

llama-3-8b-instruct

1224

±4

104,636

Meta

Llama 3 Community

222

210◄─►227

c4ai-aya-expanse-8b

1223

±7

9,827

Cohere

CC-BY-NC-4.0

223

209◄─►230

gemini-pro

1223

±12

6,390

Google

Proprietary

224

210◄─►230

llama-3.1-tulu-3-8b

1221

±11

2,895

Ai2

Llama 3.1

225

221◄─►230

yi-1.5-34b-chat

1214

±5

24,142

01 AI

Apache-2.0

226

216◄─►231

zephyr-orpo-141b-A35b-v0.1

1214

±11

4,653

HuggingFace

Apache 2.0

227

223◄─►230

llama-3.1-8b-instruct

1212

±4

49,605

Meta

Llama 3.1 Community

228

219◄─►236

granite-3.1-8b-instruct

1209

±11

3,092

IBM

Apache 2.0

229

223◄─►236

qwen1.5-32b-chat

1205

±6

21,744

Alibaba

Qianwen LICENSE

230

223◄─►238

gpt-3.5-turbo-1106

1204

±9

16,616

OpenAI

Proprietary

231

227◄─►238

phi-3-medium-4k-instruct

1199

±5

25,055

Microsoft

MIT

232

228◄─►238

gemma-2-2b-it

1199

±4

46,618

Google

Gemma license

233

228◄─►238

mixtral-8x7b-instruct-v0.1

1198

±4

73,505

Mistral

Apache 2.0

234

228◄─►243

dbrx-instruct-preview

1196

±6

32,196

Databricks

DBRX LICENSE

235

228◄─►247

internlm2_5-20b-chat

1192

±7

9,902

InternLM

Other

236

228◄─►247

qwen1.5-14b-chat

1192

±7

17,841

Alibaba

Qianwen LICENSE

237

230◄─►253

wizardlm-70b

1186

±9

8,214

Microsoft

Llama 2 Community

238

230◄─►254

deepseek-llm-67b-chat

1185

±12

4,933

DeepSeek

DeepSeek License

239

234◄─►250

yi-34b-chat

1185

±7

15,483

01 AI

Yi License

240

234◄─►253

openchat-3.5-0106

1183

±8

12,636

OpenChat

Apache-2.0

241

234◄─►254

openchat-3.5

1183

±10

7,967

OpenChat

Apache-2.0

242

234◄─►254

granite-3.0-8b-instruct

1183

±9

6,643

IBM

Apache 2.0

243

235◄─►254

gemma-1.1-7b-it

1181

±6

23,893

Google

Gemma license

244

235◄─►254

snowflake-arctic-instruct

1180

±6

32,836

Snowflake

Apache 2.0

245

234◄─►256

granite-3.1-2b-instruct

1180

±11

3,191

IBM

Apache 2.0

246

235◄─►254

tulu-2-dpo-70b

1179

±10

6,534

AllenAI/UW

AI2 ImpACT Low-risk

247

235◄─►258

openhermes-2.5-mistral-7b

1176

±10

5,006

NousResearch

Apache-2.0

248

237◄─►257

vicuna-33b

1174

±6

22,479

LMSYS

Non-commercial

249

237◄─►260

starling-lm-7b-beta

1172

±7

16,057

Nexusflow

Apache-2.0

250

237◄─►258

phi-3-small-8k-instruct

1172

±6

17,763

Microsoft

MIT

251

238◄─►258

llama-2-70b-chat

1172

±5

38,491

Meta

Llama 2 Community

252

238◄─►261

starling-lm-7b-alpha

1168

±8

10,224

UC Berkeley

CC-BY-NC-4.0

253

240◄─►261

llama-3.2-3b-instruct

1167

±8

7,936

Meta

Llama 3.2

254

238◄─►264

nous-hermes-2-mixtral-8x7b-dpo

1166

±12

3,776

NousResearch

Apache-2.0

255

246◄─►270

qwq-32b-preview

1158

±12

3,233

Alibaba

Apache 2.0

256

251◄─►268

granite-3.0-2b-instruct

1157

±8

6,837

IBM

Apache 2.0

257

246◄─►271

llama2-70b-steerlm-chat

1156

±13

3,584

Nvidia

Llama 2 Community

258

248◄─►274

solar-10.7b-instruct-v1.0

1153

±13

4,155

Upstage AI

CC-BY-NC-4.0

259

247◄─►276

dolphin-2.2.1-mistral-7b

1153

±15

1,679

Cognitive Computations

Apache-2.0

260

252◄─►274

mpt-30b-chat

1151

±12

2,571

MosaicML

CC-BY-NC-SA-4.0

261

254◄─►271

mistral-7b-instruct-v0.2

1151

±7

19,402

Mistral

Apache-2.0

262

254◄─►273

wizardlm-13b

1150

±9

7,046

Microsoft

Llama 2 Community

263

251◄─►278

falcon-180b-chat

1148

±17

1,295

TII

Falcon-180B TII License

264

254◄─►277

qwen1.5-7b-chat

1145

±10

4,735

Alibaba

Qianwen LICENSE

265

255◄─►276

phi-3-mini-4k-instruct-june-2024

1144

±6

12,296

Microsoft

MIT

266

255◄─►277

llama-2-13b-chat

1142

±7

19,171

Meta

Llama 2 Community

267

255◄─►277

vicuna-13b

1142

±7

19,366

LMSYS

Llama 2 Community

268

255◄─►279

qwen-14b-chat

1140

±11

4,964

Alibaba

Qianwen LICENSE

269

256◄─►279

palm-2

1138

±9

8,554

Google

Proprietary

270

256◄─►279

codellama-34b-instruct

1138

±9

7,363

Meta

Llama 2 Community

271

257◄─►279

gemma-7b-it

1137

±10

8,925

Google

Gemma license

272

259◄─►280

zephyr-7b-beta

1132

±9

11,116

HuggingFace

MIT

273

262◄─►280

phi-3-mini-128k-instruct

1130

±7

20,691

Microsoft

MIT

274

264◄─►280

phi-3-mini-4k-instruct

1129

±6

20,115

Microsoft

MIT

275

260◄─►283

guanaco-33b

1128

±12

2,921

UW

Non-commercial

276

259◄─►284

zephyr-7b-alpha

1128

±16

1,785

HuggingFace

MIT

277

267◄─►284

stripedhyena-nous-7b

1122

±11

5,184

Together AI

Apache 2.0

278

262◄─►285

codellama-70b-instruct

1120

±18

1,143

Meta

Llama 2 Community

279

272◄─►284

vicuna-7b

1116

±9

6,923

LMSYS

Llama 2 Community

280

268◄─►285

smollm2-1.7b-instruct

1115

±14

2,201

HuggingFace

Apache 2.0

281

275◄─►284

gemma-1.1-2b-it

1115

±8

10,853

Google

Gemma license

282

275◄─►284

llama-3.2-1b-instruct

1112

±8

8,045

Meta

Llama 3.2

283

275◄─►285

mistral-7b-instruct

1111

±9

8,977

Mistral

Apache 2.0

284

276◄─►285

llama-2-7b-chat

1109

±7

14,148

Meta

Llama 2 Community

285

281◄─►289

gemma-2b-it

1092

±12

4,779

Google

Gemma license

286

285◄─►288

qwen1.5-4b-chat

1091

±9

7,598

Alibaba

Qianwen LICENSE

287

285◄─►292

olmo-7b-instruct

1075

±11

6,329

Allen AI

Apache-2.0

288

286◄─►292

koala-13b

1071

±10

6,964

UC Berkeley

Non-commercial

289

287◄─►292

alpaca-13b

1068

±12

5,745

Stanford

Non-commercial

290

285◄─►293

gpt4all-13b-snoozy

1067

±15

1,743

Nomic AI

Non-commercial

291

287◄─►293

mpt-7b-chat

1063

±12

3,925

MosaicML

CC-BY-NC-SA-4.0

292

287◄─►293

chatglm3-6b

1057

±12

4,658

Tsinghua

Apache-2.0

293

290◄─►295

RWKV-4-Raven-14B

1042

±11

4,845

RWKV

Apache 2.0

294

293◄─►295

chatglm2-6b

1025

±14

2,657

Tsinghua

Apache-2.0

295

293◄─►295

oasst-pythia-12b

1023

±11

6,311

OpenAssistant

Apache 2.0

296

296◄─►299

chatglm-6b

997

±13

4,914

Tsinghua

Non-commercial

297

296◄─►299

fastchat-t5-3b

992

±12

4,203

LMSYS

Apache 2.0

298

296◄─►299

dolly-v2-12b

981

±14

3,412

Databricks

MIT

299

296◄─►300

llama-13b

973

±16

2,391

Meta

Non-commercial

300

299◄─►300

stablelm-tuned-alpha-7b

954

±13

3,287

Stability AI

CC-BY-NC-SA-4.0

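Notes

  • Rank (UB): the rank computed from the Bradley-Terry model. It reflects the model's overall performance in the arena and provides an upper-bound estimate of its Elo score, which helps gauge the model's potential competitiveness.

  • Model: the name of the large language model (LLM). Some model names may have links embedded.

  • Score: the Elo rating the model earned from user votes in the arena. Elo is a relative rating system; a higher score indicates better performance.

  • 95% CI (±): the 95% confidence interval of the model's Elo rating (for example, ±6). The narrower the interval, the more stable and reliable the rating.

  • Votes: the total number of votes the model has received in the arena. More votes generally mean a statistically more reliable rating.

  • Organization: the organization or company that provides the model.

  • License: the type of license the model is released under, such as Proprietary, Apache 2.0, or MIT.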
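
To make the Score and 95% CI columns more concrete, here is a minimal Python sketch. It assumes the scores are on the conventional Elo scale (logistic curve, base 10, scale 400), which this page does not state explicitly. The function names `expected_win_probability` and `intervals_overlap` are hypothetical helpers introduced only for illustration and are not part of the leaderboard pipeline.

```python
def expected_win_probability(score_a: float, score_b: float, scale: float = 400.0) -> float:
    """Probability that model A is preferred over model B.

    Assumes the conventional Elo logistic curve (base 10, scale 400);
    the leaderboard itself may use different constants.
    """
    return 1.0 / (1.0 + 10.0 ** ((score_b - score_a) / scale))


def intervals_overlap(score_a: float, margin_a: float,
                      score_b: float, margin_b: float) -> bool:
    """True if the two score ± margin intervals share at least one point."""
    return (score_a - margin_a) <= (score_b + margin_b) and \
           (score_b - margin_b) <= (score_a + margin_a)


# Rows 1 and 2 of the table: 1487 ±5 versus 1475 ±5.
p = expected_win_probability(1487, 1475)
print(f"{p:.3f}")                             # about 0.517
print(intervals_overlap(1487, 5, 1475, 5))    # False: [1482, 1492] vs [1470, 1480]

# Rows 2 and 3 of the table: 1475 ±5 versus 1471 ±5.
print(intervals_overlap(1475, 5, 1471, 5))    # True: [1470, 1480] vs [1466, 1476]
```

Under these assumptions, a 12-point gap corresponds to only a slight edge in head-to-head preference, and overlapping intervals suggest that two models cannot be cleanly separated by score alone.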

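Data Source and Update Frequency

The data for this leaderboard is fetched by an automated script directly from the official Chatbot Arena (lmarena.ai) website. The leaderboard is updated automatically every day by GitHub Actions.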

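Disclaimer

This report is for reference only. The leaderboard data changes over time and is based on users' preference votes on Chatbot Arena during a specific period. The completeness and accuracy of the data depend on the upstream data source. Different models may be released under different licenses, so always consult the model provider's official documentation before use.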
