模型榜单

这是一个基于 Chatbot Arena (lmarena.ai) 数据的排行榜,通过自动化流程生成。

数据更新时间: 2025-12-22 08:09:23 UTC / 2025-12-22 16:09:23 CST (北京时间)

排行榜

Rank
Rank Spread
模型
分数
95% 置信区间 (±)
票数
组织/公司
许可证

1

1◄─►2

gemini-3-pro

1490

±6

19,199

Google

Proprietary

2

2◄─►6

grok-4.1-thinking

1477

±6

20,074

xAI

Proprietary

3

1◄─►7

gemini-3-flash

1475Preliminary

±10

4,431

Google

Proprietary

4

2◄─►8

Anthropicclaude-opus-4-5-20251101-thinking-32k

1469

±6

12,829

Anthropic

Proprietary

5

2◄─►8

Anthropicclaude-opus-4-5-20251101

1465

±6

13,654

Anthropic

Proprietary

6

3◄─►8

grok-4.1

1465

±6

20,553

xAI

Proprietary

7

2◄─►11

gemini-3-flash (thinking-minimal)

1463Preliminary

±9

5,605

Google

Proprietary

8

4◄─►11

gpt-5.1-high

1457

±6

17,084

OpenAI

Proprietary

9

7◄─►15

gemini-2.5-pro

1451

±3

79,900

Google

Proprietary

10

7◄─►15

Anthropicclaude-sonnet-4-5-20250929-thinking-32k

1450

±4

31,225

Anthropic

Proprietary

11

9◄─►15

Anthropicclaude-opus-4-1-20250805-thinking-16k

1448

±4

46,958

Anthropic

Proprietary

12

9◄─►18

Anthropicclaude-sonnet-4-5-20250929

1446

±5

26,513

Anthropic

Proprietary

13

9◄─►22

gpt-4.5-preview-2025-02-27

1442

±6

14,644

OpenAI

Proprietary

14

7◄─►27

gpt-5.2

1442Preliminary

±12

2,422

OpenAI

Proprietary

15

12◄─►22

Anthropicclaude-opus-4-1-20250805

1440

±4

60,044

Anthropic

Proprietary

16

12◄─►22

chatgpt-4o-latest-20250326

1440

±3

66,721

OpenAI

Proprietary

17

9◄─►24

gpt-5.2-high

1440Preliminary

±9

6,192

OpenAI

Proprietary

18

12◄─►24

gpt-5.1

1437

±6

18,423

OpenAI

Proprietary

19

13◄─►24

gpt-5-high

1436

±5

32,779

OpenAI

Proprietary

20

13◄─►25

o3-2025-04-16

1434

±4

61,385

OpenAI

Proprietary

21

13◄─►29

qwen3-max-preview

1433

±5

28,039

Alibaba

Proprietary

22

13◄─►33

grok-4-1-fast-reasoning

1431

±6

12,526

xAI

Proprietary

23

16◄─►38

MoonshotAIkimi-k2-thinking-turbo

1428

±5

18,724

Moonshot

Modified MIT

24

16◄─►43

ernie-5.0-preview-1103

1427Preliminary

±8

6,699

Baidu

Proprietary

25

20◄─►43

glm-4.6

1425

±5

27,300

Z.ai

MIT

26

21◄─►43

gpt-5-chat

1425

±4

32,045

OpenAI

Proprietary

27

19◄─►43

qwen3-max-2025-09-23

1424

±6

9,227

Alibaba

Proprietary

28

20◄─►43

deepseek-v3.2-exp

1423

±7

11,929

DeepSeek AI

MIT

29

22◄─►43

Anthropicclaude-opus-4-20250514-thinking-16k

1423

±4

37,714

Anthropic

Proprietary

30

21◄─►46

deepseek-v3.2-thinking

1422

±7

8,916

DeepSeek AI

MIT

31

23◄─►43

qwen3-235b-a22b-instruct-2507

1422

±4

54,369

Alibaba

Apache 2.0

32

22◄─►46

deepseek-v3.2-exp-thinking

1421

±7

9,198

DeepSeek AI

MIT

33

22◄─►49

grok-4-fast-chat

1420

±8

7,040

xAI

Proprietary

34

22◄─►49

mistral-large-3

1419Preliminary

±7

9,518

Mistral

Apache 2.0

35

23◄─►49

MoonshotAIkimi-k2-0905-preview

1418

±7

11,845

Moonshot

Modified MIT

36

23◄─►49

deepseek-r1-0528

1418

±6

19,162

DeepSeek

MIT

37

24◄─►50

deepseek-v3.1

1416

±6

15,213

DeepSeek

MIT

38

24◄─►49

MoonshotAIkimi-k2-0711-preview

1416

±5

28,532

Moonshot

Modified MIT

39

24◄─►51

deepseek-v3.1-thinking

1416

±7

11,947

DeepSeek

MIT

40

24◄─►51

deepseek-v3.2

1415

±7

10,253

DeepSeek AI

MIT

41

23◄─►54

deepseek-v3.1-terminus

1415

±10

3,737

DeepSeek AI

MIT

42

24◄─►51

qwen3-vl-235b-a22b-instruct

1414

±7

9,011

Alibaba

Apache 2.0

43

23◄─►57

deepseek-v3.1-terminus-thinking

1414

±10

3,511

DeepSeek AI

MIT

44

31◄─►51

gpt-4.1-2025-04-14

1412

±4

52,398

OpenAI

Proprietary

45

31◄─►51

Anthropicclaude-opus-4-20250514

1412

±4

45,477

Anthropic

Proprietary

46

31◄─►51

mistral-medium-2508

1411

±4

48,471

Mistral

Proprietary

47

33◄─►54

grok-3-preview-02-24

1410

±4

34,039

xAI

Proprietary

48

33◄─►56

grok-4-0709

1409

±4

42,410

xAI

Proprietary

49

33◄─►57

glm-4.5

1408

±5

24,736

Z.ai

MIT

50

38◄─►56

gemini-2.5-flash

1408

±3

79,419

Google

Proprietary

51

39◄─►60

gemini-2.5-flash-preview-09-2025

1405

±4

32,599

Google

Proprietary

52

45◄─►62

Anthropicclaude-haiku-4-5-20251001

1402

±4

29,594

Anthropic

Proprietary

53

45◄─►63

grok-4-fast-reasoning

1402

±5

18,821

xAI

Proprietary

54

47◄─►63

o1-2024-12-17

1400

±4

28,039

OpenAI

Proprietary

55

47◄─►65

qwen3-next-80b-a3b-instruct

1400

±5

23,035

Alibaba

Apache 2.0

56

45◄─►66

longcat-flash-chat

1400

±6

11,470

Meituan

MIT

57

49◄─►65

qwen3-235b-a22b-no-thinking

1399

±4

39,246

Alibaba

Apache 2.0

58

51◄─►66

Anthropicclaude-sonnet-4-20250514-thinking-32k

1399

±4

36,070

Anthropic

Proprietary

59

51◄─►71

qwen3-235b-a22b-thinking-2507

1397

±6

9,311

Alibaba

Apache 2.0

60

52◄─►71

deepseek-r1

1396

±5

18,718

DeepSeek

MIT

61

52◄─►72

qwen3-vl-235b-a22b-thinking

1394

±7

7,965

Alibaba

Apache 2.0

62

53◄─►72

gpt-5-mini-high

1392

±5

27,344

OpenAI

Proprietary

63

55◄─►71

deepseek-v3-0324

1392

±4

46,624

DeepSeek

MIT

64

51◄─►79

Tencenthunyuan-vision-1.5-thinking

1391

±12

2,210

Tencent

Proprietary

65

57◄─►72

o4-mini-2025-04-16

1391

±4

46,703

OpenAI

Proprietary

66

55◄─►74

mai-1-preview

1390

±5

18,116

Microsoft AI

Proprietary

67

59◄─►75

Anthropicclaude-sonnet-4-20250514

1389

±4

41,493

Anthropic

Proprietary

68

59◄─►75

o1-preview

1387

±5

31,505

OpenAI

Proprietary

69

59◄─►75

Anthropicclaude-3-7-sonnet-20250219-thinking-32k

1387

±4

39,817

Anthropic

Proprietary

70

59◄─►77

qwen3-coder-480b-a35b-instruct

1386

±5

23,591

Alibaba

Apache 2.0

71

59◄─►80

Tencenthunyuan-t1-20250711

1385

±9

4,803

Tencent

Proprietary

72

62◄─►79

mistral-medium-2505

1383

±5

34,401

Mistral

Proprietary

73

65◄─►79

qwen3-30b-a3b-instruct-2507

1382

±5

24,120

Alibaba

Apache 2.0

74

66◄─►80

gpt-4.1-mini-2025-04-14

1380

±4

40,370

OpenAI

Proprietary

75

65◄─►83

Tencenthunyuan-turbos-20250416

1380

±6

11,094

Tencent

Proprietary

76

69◄─►83

gemini-2.5-flash-lite-preview-09-2025-no-thinking

1378

±4

32,630

Google

Proprietary

77

70◄─►84

gemini-2.5-flash-lite-preview-06-17-thinking

1375

±4

33,818

Google

Proprietary

78

70◄─►85

qwen3-235b-a22b

1374

±5

27,077

Alibaba

Apache 2.0

79

73◄─►85

qwen2.5-max

1373

±4

33,489

Alibaba

Proprietary

80

75◄─►85

Anthropicclaude-3-5-sonnet-20241022

1372

±3

89,727

Anthropic

Proprietary

81

75◄─►88

Anthropicclaude-3-7-sonnet-20250219

1371

±4

44,473

Anthropic

Proprietary

82

69◄─►94

amazon-nova-experimental-chat-11-10

1370Preliminary

±12

2,743

Amazon

Proprietary

83

75◄─►88

glm-4.5-air

1370

±4

31,551

Z.ai

MIT

84

77◄─►91

qwen3-next-80b-a3b-thinking

1367

±6

13,779

Alibaba

Apache 2.0

85

78◄─►90

Minimaxminimax-m1

1366

±4

36,732

MiniMax

Apache 2.0

86

81◄─►91

gemma-3-27b-it

1364

±4

49,181

Google

Gemma

87

81◄─►96

o3-mini-high

1362

±5

18,735

OpenAI

Proprietary

88

81◄─►96

grok-3-mini-high

1362

±5

17,505

xAI

Proprietary

89

83◄─►98

gemini-2.0-flash-001

1360

±4

45,015

Google

Proprietary

90

83◄─►107

deepseek-v3

1357

±5

21,994

DeepSeek

DeepSeek

91

84◄─►107

grok-3-mini-beta

1357

±5

23,701

xAI

Proprietary

92

86◄─►112

mistral-small-2506

1355

±5

18,254

Mistral

Apache 2.0

93

89◄─►113

gpt-oss-120b

1352

±4

31,145

OpenAI

Apache 2.0

94

89◄─►112

gemini-2.0-flash-lite-preview-02-05

1352

±4

25,215

Google

Proprietary

95

86◄─►115

glm-4.5v

1352

±8

4,971

Z.ai

MIT

96

90◄─►112

Coherecommand-a-03-2025

1352

±3

57,648

Cohere

CC-BY-NC-4.0

97

90◄─►113

gemini-1.5-pro-002

1351

±3

56,012

Google

Proprietary

98

86◄─►116

intellect-3

1351

±9

4,753

Prime Intellect

MIT

99

90◄─►116

amazon-nova-experimental-chat-10-20

1349

±6

10,971

Amazon

Proprietary

100

92◄─►115

o3-mini

1348

±3

58,693

OpenAI

Proprietary

101

87◄─►126

Tencenthunyuan-turbos-20250226

1346

±12

2,250

Tencent

Proprietary

102

90◄─►119

ling-flash-2.0

1346

±7

7,126

Ant Group

MIT

103

90◄─►122

Minimaxminimax-m2

1346

±8

7,093

MiniMax

Apache 2.0

104

90◄─►121

Stepfunstep-3

1346

±7

6,620

StepFun

Apache 2.0

105

87◄─►127

Nvidiallama-3.1-nemotron-ultra-253b-v1

1346

±12

2,573

Nvidia

Nvidia Open Model

106

89◄─►126

amazon-nova-experimental-chat-10-09

1345

±11

2,878

Amazon

Proprietary

107

94◄─►116

gpt-4o-2024-05-13

1345

±3

113,568

OpenAI

Proprietary

108

90◄─►126

qwen3-32b

1345

±9

3,943

Alibaba

Apache 2.0

109

90◄─►126

qwen-plus-0125

1344

±8

5,861

Alibaba

Proprietary

110

92◄─►126

glm-4-plus-0111

1343

±8

5,806

Zhipu

Proprietary

111

97◄─►119

Anthropicclaude-3-5-sonnet-20240620

1342

±3

82,864

Anthropic

Proprietary

112

92◄─►129

gemma-3-12b-it

1340

±9

3,866

Google

Gemma

113

92◄─►131

Nvidianvidia-llama-3.3-nemotron-super-49b-v1.5

1340

±10

3,469

Nvidia

Nvidia Open

114

92◄─►133

Tencenthunyuan-turbo-0110

1339

±11

2,322

Tencent

Proprietary

115

97◄─►128

gpt-5-nano-high

1338

±7

8,369

OpenAI

Proprietary

116

99◄─►131

nova-2-lite

1336

±7

8,997

Amazon

Proprietary

117

102◄─►128

o1-mini

1335

±4

52,301

OpenAI

Proprietary

118

104◄─►128

Metallama-3.1-405b-instruct-bf16

1335

±4

41,932

Meta

Llama 3.1 Community

119

104◄─►131

gpt-4o-2024-08-06

1334

±4

45,787

OpenAI

Proprietary

120

106◄─►129

grok-2-2024-08-13

1334

±4

63,725

xAI

Proprietary

121

105◄─►131

qwq-32b

1334

±4

26,196

Alibaba

Apache 2.0

122

102◄─►131

gemini-advanced-0514

1334

±5

50,654

Google

Proprietary

123

106◄─►131

Metallama-3.1-405b-instruct-fp8

1333

±3

60,272

Meta

Llama 3.1 Community

124

102◄─►141

Stepfunstep-2-16k-exp-202412

1332

±9

4,895

StepFun

Proprietary

125

112◄─►142

01.AIyi-lightning

1328

±5

27,624

01 AI

Proprietary

126

115◄─►143

Metallama-4-maverick-17b-128e-instruct

1327

±4

41,093

Meta

Llama 4

127

106◄─►153

Nvidianvidia-nemotron-3-nano-30b-a3b-bf16

1326Preliminary

±10

4,247

Nvidia

NVIDIA Open Model

128

117◄─►145

qwen3-30b-a3b

1326

±5

27,405

Alibaba

Apache 2.0

129

106◄─►154

Nvidiallama-3.3-nemotron-49b-super-v1

1325

±12

2,243

Nvidia

Nvidia

130

111◄─►153

Tencenthunyuan-large-2025-02-10

1325

±10

3,760

Tencent

Proprietary

131

123◄─►146

gpt-4-turbo-2024-04-09

1324

±4

98,965

OpenAI

Proprietary

132

124◄─►148

Anthropicclaude-3-5-haiku-20241022

1322

±3

71,241

Anthropic

Proprietary

133

124◄─►148

Metallama-4-scout-17b-16e-instruct

1322

±5

31,074

Meta

Llama

134

117◄─►154

deepseek-v2.5-1210

1322

±8

6,877

DeepSeek

DeepSeek

135

124◄─►148

gemini-1.5-pro-001

1322

±4

79,769

Google

Proprietary

136

124◄─►148

Anthropicclaude-3-opus-20240229

1322

±3

196,368

Anthropic

Proprietary

137

123◄─►154

gpt-4.1-nano-2025-04-14

1321

±8

6,143

OpenAI

Proprietary

138

124◄─►154

ring-flash-2.0

1320

±7

7,250

Ant Group

MIT

139

124◄─►154

Stepfunstep-1o-turbo-202506

1319

±7

9,635

StepFun

Proprietary

140

127◄─►153

Metallama-3.3-70b-instruct

1319

±3

55,961

Meta

Llama-3.3

141

125◄─►154

gemma-3n-e4b-it

1318

±5

23,421

Google

Gemma

142

126◄─►154

glm-4-plus

1318

±5

26,342

Zhipu AI

Proprietary

143

124◄─►155

gpt-oss-20b

1318

±6

10,828

OpenAI

Apache 2.0

144

127◄─►155

qwen-max-0919

1317

±6

16,598

Alibaba

Qwen

145

129◄─►154

gpt-4o-mini-2024-07-18

1316

±3

69,290

OpenAI

Proprietary

146

128◄─►161

qwen2.5-plus-1127

1314

±6

10,252

Alibaba

Proprietary

147

133◄─►159

gpt-4-1106-preview

1313

±4

101,117

OpenAI

Proprietary

148

133◄─►159

mistral-large-2407

1313

±4

45,968

Mistral

Mistral Research

149

133◄─►159

gpt-4-0125-preview

1313

±4

94,534

OpenAI

Proprietary

150

133◄─►160

athene-v2-chat

1313

±4

24,880

NexusFlow

NexusFlow

151

124◄─►164

mercury

1311

±14

1,961

Inception AI

Proprietary

152

129◄─►164

Tencenthunyuan-standard-2025-02-10

1310

±10

3,920

Tencent

Proprietary

153

136◄─►161

gemini-1.5-flash-002

1310

±4

35,180

Google

Proprietary

154

133◄─►164

olmo-3-32b-think

1308

±9

5,307

Allen AI

Apache 2.0

155

146◄─►163

grok-2-mini-2024-08-13

1307

±4

52,789

xAI

Proprietary

156

146◄─►164

deepseek-v2.5

1306

±5

24,839

DeepSeek

DeepSeek

157

146◄─►164

athene-70b-0725

1305

±6

19,796

NexusFlow

CC-BY-NC-4.0

158

149◄─►164

mistral-large-2411

1304

±4

28,455

Mistral

MRL

159

146◄─►164

magistral-medium-2506

1304

±6

11,957

Mistral

Proprietary

160

150◄─►164

mistral-small-3.1-24b-instruct-2503

1303

±4

34,030

Mistral

Apache 2.0

161

144◄─►170

gemma-3-4b-it

1302

±9

4,195

Google

Gemma

162

152◄─►164

qwen2.5-72b-instruct

1302

±4

39,632

Alibaba

Qwen

163

152◄─►173

Nvidiallama-3.1-nemotron-70b-instruct

1297

±8

7,216

Nvidia

Llama 3.1

164

153◄─►175

Tencenthunyuan-large-vision

1294

±9

5,575

Tencent

Proprietary

165

162◄─►173

Metallama-3.1-70b-instruct

1293

±4

56,003

Meta

Llama 3.1 Community

166

162◄─►178

jamba-1.5-large

1288

±7

8,730

AI21 Labs

Jamba Open

167

163◄─►176

amazon-nova-pro-v1.0

1288

±4

25,218

Amazon

Proprietary

168

162◄─►183

ibm-granite-h-small

1287

±8

5,690

IBM

Apache 2.0

169

163◄─►176

gemma-2-27b-it

1287

±3

76,195

Google

Gemma license

170

162◄─►178

reka-core-20240904

1287

±7

7,380

Reka AI

Proprietary

171

163◄─►178

gpt-4-0314

1286

±5

54,754

OpenAI

Proprietary

172

162◄─►184

Nvidiallama-3.1-nemotron-51b-instruct

1286

±10

3,777

Nvidia

Llama 3.1

173

162◄─►184

llama-3.1-tulu-3-70b

1286

±10

2,881

Ai2

Llama 3.1

174

165◄─►178

gemini-1.5-flash-001

1284

±4

63,418

Google

Proprietary

175

166◄─►183

Anthropicclaude-3-sonnet-20240229

1281

±4

110,173

Anthropic

Proprietary

176

165◄─►184

gemma-2-9b-it-simpo

1278

±7

10,108

Princeton

MIT

177

168◄─►184

Nvidianemotron-4-340b-instruct

1277

±5

19,913

Nvidia

NVIDIA Open Model

178

168◄─►185

Coherecommand-r-plus-08-2024

1277

±7

9,931

Cohere

CC-BY-NC-4.0

179

172◄─►184

Metallama-3-70b-instruct

1276

±3

158,908

Meta

Llama 3 Community

180

172◄─►185

gpt-4-0613

1275

±4

89,612

OpenAI

Proprietary

181

172◄─►187

mistral-small-24b-instruct-2501

1273

±6

14,830

Mistral

Apache 2.0

182

172◄─►189

glm-4-0520

1273

±7

9,857

Zhipu AI

Proprietary

183

172◄─►189

reka-flash-20240904

1272

±7

7,583

Reka AI

Proprietary

184

174◄─►193

qwen2.5-coder-32b-instruct

1269

±8

5,452

Alibaba

Apache 2.0

185

179◄─►193

Coherec4ai-aya-expanse-32b

1266

±5

27,362

Cohere

CC-BY-NC-4.0

186

181◄─►193

gemma-2-9b-it

1264

±4

54,954

Google

Gemma license

187

181◄─►195

deepseek-coder-v2

1264

±6

15,242

DeepSeek AI

DeepSeek License

188

182◄─►194

Coherecommand-r-plus

1263

±4

78,401

Cohere

CC-BY-NC-4.0

189

182◄─►195

qwen2-72b-instruct

1262

±5

37,688

Alibaba

Qianwen LICENSE

190

184◄─►195

Anthropicclaude-3-haiku-20240307

1261

±4

118,626

Anthropic

Proprietary

191

184◄─►195

amazon-nova-lite-v1.0

1259

±5

19,760

Amazon

Proprietary

192

184◄─►195

gemini-1.5-flash-8b-001

1259

±4

35,914

Google

Proprietary

193

187◄─►195

Azurephi-4

1255

±4

24,354

Microsoft

MIT

194

184◄─►201

olmo-2-0325-32b-instruct

1252

±11

3,377

Allen AI

Apache-2.0

195

188◄─►200

Coherecommand-r-08-2024

1251

±7

10,229

Cohere

CC-BY-NC-4.0

196

194◄─►204

mistral-large-2402

1242

±5

63,404

Mistral

Proprietary

197

194◄─►204

amazon-nova-micro-v1.0

1241

±5

19,774

Amazon

Proprietary

198

194◄─►209

jamba-1.5-mini

1239

±7

8,918

AI21 Labs

Jamba Open

199

194◄─►212

ministral-8b-2410

1237

±9

4,833

Mistral

MRL

200

195◄─►212

gemini-pro-dev-api

1234

±7

18,454

Google

Proprietary

201

196◄─►212

qwen1.5-110b-chat

1234

±5

26,679

Alibaba

Qianwen LICENSE

202

196◄─►212

qwen1.5-72b-chat

1234

±5

39,689

Alibaba

Qianwen LICENSE

203

196◄─►213

reka-flash-21b-20240226-online

1233

±7

15,606

Reka AI

Proprietary

204

194◄─►214

Tencenthunyuan-standard-256k

1233

±12

2,761

Tencent

Proprietary

205

198◄─►213

mixtral-8x22b-instruct-v0.1

1230

±4

52,214

Mistral

Apache 2.0

206

198◄─►214

Coherecommand-r

1228

±5

54,710

Cohere

CC-BY-NC-4.0

207

198◄─►214

reka-flash-21b-20240226

1227

±6

25,026

Reka AI

Proprietary

208

199◄─►215

gpt-3.5-turbo-0125

1224

±5

67,214

OpenAI

Proprietary

209

199◄─►216

mistral-medium

1223

±5

34,893

Mistral

Proprietary

210

199◄─►216

Coherec4ai-aya-expanse-8b

1223

±7

9,922

Cohere

CC-BY-NC-4.0

211

203◄─►215

Metallama-3-8b-instruct

1223

±4

106,055

Meta

Llama 3 Community

212

198◄─►219

gemini-pro

1222

±12

6,418

Google

Proprietary

213

198◄─►218

llama-3.1-tulu-3-8b

1221

±11

2,943

Ai2

Llama 3.1

214

205◄─►220

HuggingFacezephyr-orpo-141b-A35b-v0.1

1214

±11

4,712

HuggingFace

Apache 2.0

215

210◄─►219

01.AIyi-1.5-34b-chat

1213

±5

24,417

01 AI

Apache-2.0

216

212◄─►219

Metallama-3.1-8b-instruct

1211

±4

50,234

Meta

Llama 3.1 Community

217

208◄─►225

granite-3.1-8b-instruct

1210

±11

3,142

IBM

Apache 2.0

218

212◄─►224

qwen1.5-32b-chat

1205

±6

22,068

Alibaba

Qianwen LICENSE

219

213◄─►227

gpt-3.5-turbo-1106

1202

±9

16,760

OpenAI

Proprietary

220

216◄─►227

Azurephi-3-medium-4k-instruct

1198

±5

25,301

Microsoft

MIT

221

217◄─►227

gemma-2-2b-it

1198

±4

46,901

Google

Gemma license

222

217◄─►227

mixtral-8x7b-instruct-v0.1

1198

±4

74,303

Mistral

Apache 2.0

223

217◄─►232

dbrx-instruct-preview

1196

±6

32,760

Databricks

DBRX LICENSE

224

217◄─►236

qwen1.5-14b-chat

1193

±7

18,066

Alibaba

Qianwen LICENSE

225

218◄─►236

InternLMinternlm2_5-20b-chat

1192

±7

10,038

InternLM

Other

226

219◄─►242

Azurewizardlm-70b

1185

±9

8,270

Microsoft

Llama 2 Community

227

219◄─►243

deepseek-llm-67b-chat

1184

±12

4,950

DeepSeek AI

DeepSeek License

228

223◄─►240

01.AIyi-34b-chat

1184

±7

15,624

01 AI

Yi License

229

223◄─►242

granite-3.0-8b-instruct

1183

±9

6,727

IBM

Apache 2.0

230

223◄─►242

OpenChatopenchat-3.5-0106

1183

±8

12,712

OpenChat

Apache-2.0

231

223◄─►243

OpenChatopenchat-3.5

1182

±10

8,009

OpenChat

Apache-2.0

232

223◄─►244

granite-3.1-2b-instruct

1181

±11

3,235

IBM

Apache 2.0

233

224◄─►242

Snowflakesnowflake-arctic-instruct

1180

±6

33,272

Snowflake

Apache 2.0

234

224◄─►243

gemma-1.1-7b-it

1180

±6

24,327

Google

Gemma license

235

224◄─►245

tulu-2-dpo-70b

1179

±10

6,579

AllenAI/UW

AI2 ImpACT Low-risk

236

224◄─►247

openhermes-2.5-mistral-7b

1176

±10

5,026

NousResearch

Apache-2.0

237

226◄─►246

vicuna-33b

1173

±6

22,613

LMSYS

Non-commercial

238

226◄─►247

starling-lm-7b-beta

1173

±7

16,190

Nexusflow

Apache-2.0

239

226◄─►247

Azurephi-3-small-8k-instruct

1172

±6

17,983

Microsoft

MIT

240

227◄─►247

Metallama-2-70b-chat

1171

±5

38,767

Meta

Llama 2 Community

241

227◄─►250

starling-lm-7b-alpha

1168

±8

10,267

UC Berkeley

CC-BY-NC-4.0

242

231◄─►251

Metallama-3.2-3b-instruct

1167

±8

8,043

Meta

Llama 3.2

243

226◄─►252

nous-hermes-2-mixtral-8x7b-dpo

1166

±12

3,792

NousResearch

Apache-2.0

244

234◄─►257

qwq-32b-preview

1159

±11

3,256

Alibaba

Apache 2.0

245

235◄─►260

Nvidiallama2-70b-steerlm-chat

1157

±13

3,605

Nvidia

Llama 2 Community

246

241◄─►257

granite-3.0-2b-instruct

1157

±8

6,922

IBM

Apache 2.0

247

237◄─►261

solar-10.7b-instruct-v1.0

1154

±13

4,187

Upstage AI

CC-BY-NC-4.0

248

236◄─►265

dolphin-2.2.1-mistral-7b

1152

±15

1,685

Cognitive Computations

Apache-2.0

249

241◄─►263

mpt-30b-chat

1151

±12

2,606

MosaicML

CC-BY-NC-SA-4.0

250

243◄─►261

mistral-7b-instruct-v0.2

1151

±7

19,603

Mistral

Apache-2.0

251

242◄─►261

Azurewizardlm-13b

1150

±9

7,122

Microsoft

Llama 2 Community

252

241◄─►268

falcon-180b-chat

1147

±17

1,312

TII

Falcon-180B TII License

253

244◄─►266

qwen1.5-7b-chat

1144

±10

4,782

Alibaba

Qianwen LICENSE

254

244◄─►265

Azurephi-3-mini-4k-instruct-june-2024

1143

±6

12,415

Microsoft

MIT

255

244◄─►265

Metallama-2-13b-chat

1143

±7

19,357

Meta

Llama 2 Community

256

244◄─►266

vicuna-13b

1142

±7

19,539

LMSYS

Llama 2 Community

257

244◄─►268

qwen-14b-chat

1139

±11

5,004

Alibaba

Qianwen LICENSE

258

246◄─►268

palm-2

1137

±9

8,634

Google

Proprietary

259

246◄─►268

Metacodellama-34b-instruct

1137

±9

7,417

Meta

Llama 2 Community

260

246◄─►268

gemma-7b-it

1135

±9

9,034

Google

Gemma license

261

250◄─►269

HuggingFacezephyr-7b-beta

1132

±9

11,220

HuggingFace

MIT

262

251◄─►269

Azurephi-3-mini-128k-instruct

1131

±7

21,024

Microsoft

MIT

263

254◄─►269

Azurephi-3-mini-4k-instruct

1129

±6

20,539

Microsoft

MIT

264

247◄─►273

HuggingFacezephyr-7b-alpha

1128

±16

1,803

HuggingFace

MIT

265

250◄─►272

guanaco-33b

1128

±12

2,955

UW

Non-commercial

266

256◄─►273

stripedhyena-nous-7b

1121

±11

5,214

Together AI

Apache 2.0

267

251◄─►274

Metacodellama-70b-instruct

1119

±18

1,151

Meta

Llama 2 Community

268

256◄─►273

HuggingFacesmollm2-1.7b-instruct

1119

±14

2,244

HuggingFace

Apache 2.0

269

261◄─►273

vicuna-7b

1115

±9

6,972

LMSYS

Llama 2 Community

270

264◄─►273

gemma-1.1-2b-it

1114

±8

11,035

Google

Gemma license

271

264◄─►273

Metallama-3.2-1b-instruct

1113

±8

8,166

Meta

Llama 3.2

272

264◄─►274

mistral-7b-instruct

1111

±9

9,042

Mistral

Apache 2.0

273

265◄─►274

Metallama-2-7b-chat

1109

±7

14,272

Meta

Llama 2 Community

274

271◄─►278

gemma-2b-it

1091

±12

4,817

Google

Gemma license

275

274◄─►276

qwen1.5-4b-chat

1091

±9

7,662

Alibaba

Qianwen LICENSE

276

274◄─►281

olmo-7b-instruct

1074

±11

6,412

Allen AI

Apache-2.0

277

275◄─►281

koala-13b

1071

±10

6,998

UC Berkeley

Non-commercial

278

276◄─►281

alpaca-13b

1067

±11

5,828

Stanford

Non-commercial

279

275◄─►282

gpt4all-13b-snoozy

1066

±15

1,773

Nomic AI

Non-commercial

280

276◄─►282

mpt-7b-chat

1062

±12

3,977

MosaicML

CC-BY-NC-SA-4.0

281

276◄─►282

chatglm3-6b

1057

±12

4,692

Tsinghua

Apache-2.0

282

279◄─►284

RWKVRWKV-4-Raven-14B

1042

±11

4,898

RWKV

Apache 2.0

283

282◄─►284

chatglm2-6b

1026

±14

2,683

Tsinghua

Apache-2.0

284

282◄─►284

oasst-pythia-12b

1023

±11

6,343

OpenAssistant

Apache 2.0

285

285◄─►288

chatglm-6b

996

±13

4,968

Tsinghua

Non-commercial

286

285◄─►288

fastchat-t5-3b

992

±12

4,270

LMSYS

Apache 2.0

287

285◄─►288

dolly-v2-12b

980

±14

3,471

Databricks

MIT

288

285◄─►289

Metallama-13b

972

±16

2,441

Meta

Non-commercial

289

288◄─►289

Stabilitystablelm-tuned-alpha-7b

953

±13

3,325

Stability AI

CC-BY-NC-SA-4.0

说明

  • 排名 (UB):基于 Bradley-Terry 模型计算的排名。此排名反映了模型在竞技场中的综合表现,并提供了其 Elo 分数的 上界 估计,帮助理解模型的潜在竞争力。

  • 模型:大型语言模型 (LLM) 的名称。部分模型名称可能已嵌入相关链接。

  • 分数:模型在竞技场中通过用户投票获得的 Elo 评分。Elo 评分是一种相对排名系统,分数越高表示模型表现越好。

  • 95% 置信区间 (±):模型 Elo 评分的95%置信区间(例如:±6)。这个区间越小,表示模型的评分越稳定和可靠。

  • 票数:该模型在竞技场中收到的总投票数量。投票数越多,通常意味着其评分的统计可靠性越高。

  • 组织/公司:提供该模型的组织或公司。

  • 许可证:模型的许可协议类型,例如专有 (Proprietary)、Apache 2.0、MIT 等。

数据来源与更新频率

本排行榜数据由自动化脚本直接从 1 2 官方网站获取。此排行榜由 GitHub Actions 每天自动更新。

免责声明

本报告仅供参考。排行榜数据是动态变化的,并基于特定时间段内用户在 Chatbot Arena 上的偏好投票。数据的完整性和准确性取决于上游数据源。不同模型可能采用不同的许可协议,使用时请务必参考模型提供商的官方说明。

最后更新于

这有帮助吗?