模型榜单

这是一个基于 Chatbot Arena (lmarena.ai) 数据的排行榜,通过自动化流程生成。

数据更新时间: 2025-11-22 08:07:09 UTC / 2025-11-22 16:07:09 CST (北京时间)

排行榜

Rank
Rank Spread(Upper-Lower)
模型
分数
95% 置信区间 (±)
票数
组织/公司
许可证

1

1◄─►2

gemini-3-pro

1495Preliminary

±9

5,471

Google

Proprietary

2

1◄─►2

grok-4.1-thinking

1481Preliminary

±9

5,822

xAI

Proprietary

3

3◄─►6

grok-4.1

1462Preliminary

±9

5,825

xAI

Proprietary

4

3◄─►9

gpt-5.1-high

1454

±9

4,980

OpenAI

Proprietary

5

3◄─►9

gemini-2.5-pro

1451

±4

67,956

Google

Proprietary

6

3◄─►11

Anthropicclaude-sonnet-4-5-20250929-thinking-32k

1449

±5

19,073

Anthropic

Proprietary

7

4◄─►9

Anthropicclaude-opus-4-1-20250805-thinking-16k

1449

±4

34,691

Anthropic

Proprietary

8

4◄─►13

Anthropicclaude-sonnet-4-5-20250929

1444

±6

13,765

Anthropic

Proprietary

9

4◄─►16

gpt-4.5-preview-2025-02-27

1442

±6

14,644

OpenAI

Proprietary

10

7◄─►16

Anthropicclaude-opus-4-1-20250805

1440

±4

47,504

Anthropic

Proprietary

11

8◄─►16

chatgpt-4o-latest-20250326

1439

±4

53,924

OpenAI

Proprietary

12

8◄─►16

gpt-5-high

1437

±5

32,910

OpenAI

Proprietary

13

7◄─►23

gpt-5.1

1435

±9

5,135

OpenAI

Proprietary

14

9◄─►17

o3-2025-04-16

1434

±4

61,635

OpenAI

Proprietary

15

9◄─►19

qwen3-max-preview

1433

±5

28,154

Alibaba

Proprietary

16

9◄─►30

MoonshotAIkimi-k2-thinking

1429

±7

8,126

Moonshot

Modified MIT

17

13◄─►32

glm-4.6

1426

±5

16,268

Z.ai

MIT

18

14◄─►31

gpt-5-chat

1425

±4

32,171

OpenAI

Proprietary

19

14◄─►33

qwen3-max-2025-09-23

1423

±6

9,266

Alibaba

Proprietary

20

15◄─►33

Anthropicclaude-opus-4-20250514-thinking-16k

1423

±4

37,905

Anthropic

Proprietary

21

16◄─►33

qwen3-235b-a22b-instruct-2507

1421

±4

42,468

Alibaba

Apache 2.0

22

15◄─►35

deepseek-v3.2-exp-thinking

1421

±7

9,228

DeepSeek AI

MIT

23

15◄─►39

grok-4-fast

1420

±8

7,067

xAI

Proprietary

24

15◄─►41

ernie-5.0-preview-1022

1418Preliminary

±9

4,714

Baidu

Proprietary

25

16◄─►39

deepseek-r1-0528

1418

±6

19,246

DeepSeek

MIT

26

16◄─►41

MoonshotAIkimi-k2-0905-preview

1416

±7

10,652

Moonshot

Modified MIT

27

16◄─►41

deepseek-v3.1

1416

±6

15,261

DeepSeek

MIT

28

16◄─►41

deepseek-v3.1-thinking

1416

±7

11,997

DeepSeek

MIT

29

18◄─►40

MoonshotAIkimi-k2-0711-preview

1416

±5

28,204

Moonshot

Modified MIT

30

16◄─►44

deepseek-v3.1-terminus

1415

±10

3,751

DeepSeek AI

MIT

31

17◄─►41

qwen3-vl-235b-a22b-instruct

1415

±7

8,539

Alibaba

Apache 2.0

32

16◄─►46

deepseek-v3.1-terminus-thinking

1414

±10

3,525

DeepSeek AI

MIT

33

19◄─►43

deepseek-v3.2-exp

1412

±6

11,053

DeepSeek AI

MIT

34

22◄─►42

Anthropicclaude-opus-4-20250514

1412

±4

45,701

Anthropic

Proprietary

35

22◄─►42

gpt-4.1-2025-04-14

1412

±4

52,608

OpenAI

Proprietary

36

23◄─►43

mistral-medium-2508

1410

±4

36,634

Mistral

Proprietary

37

23◄─►44

grok-3-preview-02-24

1410

±4

34,147

xAI

Proprietary

38

23◄─►46

grok-4-0709

1409

±4

41,963

xAI

Proprietary

39

23◄─►46

glm-4.5

1408

±5

24,834

Z.ai

MIT

40

25◄─►46

gemini-2.5-flash

1407

±4

67,181

Google

Proprietary

41

26◄─►50

gemini-2.5-flash-preview-09-2025

1405

±5

20,319

Google

Proprietary

42

31◄─►52

grok-4-fast-reasoning

1403

±5

18,235

xAI

Proprietary

43

33◄─►53

Anthropicclaude-haiku-4-5-20251001

1402

±5

16,895

Anthropic

Proprietary

44

37◄─►53

o1-2024-12-17

1400

±4

28,039

OpenAI

Proprietary

45

37◄─►55

qwen3-next-80b-a3b-instruct

1400

±5

23,138

Alibaba

Apache 2.0

46

34◄─►58

longcat-flash-chat

1399

±6

11,509

Meituan

MIT

47

40◄─►56

Anthropicclaude-sonnet-4-20250514-thinking-32k

1399

±4

36,237

Anthropic

Proprietary

48

41◄─►56

qwen3-235b-a22b-no-thinking

1399

±5

39,395

Alibaba

Apache 2.0

49

41◄─►60

qwen3-235b-a22b-thinking-2507

1397

±6

9,349

Alibaba

Apache 2.0

50

42◄─►60

deepseek-r1

1395

±5

18,718

DeepSeek

MIT

51

42◄─►63

qwen3-vl-235b-a22b-thinking

1393

±7

7,991

Alibaba

Apache 2.0

52

43◄─►61

gpt-5-mini-high

1393

±5

27,464

OpenAI

Proprietary

53

45◄─►61

deepseek-v3-0324

1391

±4

46,819

DeepSeek

MIT

54

46◄─►62

o4-mini-2025-04-16

1391

±4

46,870

OpenAI

Proprietary

55

41◄─►67

Tencenthunyuan-vision-1.5-thinking

1391

±12

2,217

Tencent

Proprietary

56

45◄─►65

mai-1-preview

1390

±5

18,203

Microsoft AI

Proprietary

57

48◄─►65

Anthropicclaude-sonnet-4-20250514

1389

±4

41,674

Anthropic

Proprietary

58

49◄─►66

o1-preview

1387

±5

31,505

OpenAI

Proprietary

59

49◄─►66

Anthropicclaude-3-7-sonnet-20250219-thinking-32k

1387

±4

39,928

Anthropic

Proprietary

60

51◄─►66

qwen3-coder-480b-a35b-instruct

1385

±5

23,162

Alibaba

Apache 2.0

61

48◄─►68

Tencenthunyuan-t1-20250711

1385

±9

4,821

Tencent

Proprietary

62

53◄─►68

mistral-medium-2505

1383

±5

34,534

Mistral

Proprietary

63

54◄─►68

qwen3-30b-a3b-instruct-2507

1382

±5

24,219

Alibaba

Apache 2.0

64

57◄─►69

gpt-4.1-mini-2025-04-14

1380

±4

40,507

OpenAI

Proprietary

65

55◄─►72

Tencenthunyuan-turbos-20250416

1380

±6

11,138

Tencent

Proprietary

66

55◄─►69

gemini-2.5-flash-lite-preview-09-2025-no-thinking

1380

±5

20,164

Google

Proprietary

67

60◄─►73

gemini-2.5-flash-lite-preview-06-17-thinking

1375

±5

33,996

Google

Proprietary

68

61◄─►74

qwen3-235b-a22b

1374

±5

27,179

Alibaba

Apache 2.0

69

64◄─►74

qwen2.5-max

1372

±4

33,551

Alibaba

Proprietary

70

66◄─►74

Anthropicclaude-3-5-sonnet-20241022

1372

±3

89,858

Anthropic

Proprietary

71

66◄─►77

Anthropicclaude-3-7-sonnet-20250219

1371

±4

44,569

Anthropic

Proprietary

72

66◄─►77

glm-4.5-air

1370

±4

31,688

Z.ai

MIT

73

67◄─►80

qwen3-next-80b-a3b-thinking

1367

±6

13,838

Alibaba

Apache 2.0

74

68◄─►80

Minimaxminimax-m1

1365

±4

36,890

MiniMax

Apache 2.0

75

71◄─►80

gemma-3-27b-it

1364

±4

49,341

Google

Gemma

76

71◄─►84

o3-mini-high

1362

±5

18,735

OpenAI

Proprietary

77

71◄─►84

grok-3-mini-high

1362

±5

17,595

xAI

Proprietary

78

73◄─►87

gemini-2.0-flash-001

1360

±4

45,120

Google

Proprietary

79

73◄─►94

deepseek-v3

1357

±5

21,994

DeepSeek

DeepSeek

80

73◄─►95

grok-3-mini-beta

1357

±5

23,812

xAI

Proprietary

81

76◄─►100

mistral-small-2506

1354

±5

18,337

Mistral

Apache 2.0

82

78◄─►101

gpt-oss-120b

1352

±4

31,298

OpenAI

Apache 2.0

83

78◄─►101

gemini-2.0-flash-lite-preview-02-05

1352

±4

25,215

Google

Proprietary

84

79◄─►100

Coherecommand-a-03-2025

1352

±3

57,871

Cohere

CC-BY-NC-4.0

85

76◄─►103

glm-4.5v

1352

±8

4,984

Z.ai

MIT

86

79◄─►101

gemini-1.5-pro-002

1351

±3

56,012

Google

Proprietary

87

76◄─►105

amazon-nova-experimental-chat-10-20

1349Preliminary

±10

3,952

Amazon

Proprietary

88

81◄─►103

o3-mini

1348

±3

58,828

OpenAI

Proprietary

89

79◄─►108

Minimaxminimax-m2

1346

±8

7,132

MiniMax

Apache 2.0

90

79◄─►107

ling-flash-2.0

1346

±7

7,160

Ant Group

MIT

91

76◄─►112

Tencenthunyuan-turbos-20250226

1346

±12

2,250

Tencent

Proprietary

92

78◄─►113

Nvidiallama-3.1-nemotron-ultra-253b-v1

1345

±12

2,573

Nvidia

Nvidia Open Model

93

79◄─►108

Stepfunstep-3

1345

±7

6,652

StepFun

Apache 2.0

94

82◄─►103

gpt-4o-2024-05-13

1345

±3

113,568

OpenAI

Proprietary

95

79◄─►113

amazon-nova-experimental-chat-10-09

1345

±11

2,895

Amazon

Proprietary

96

79◄─►112

qwen3-32b

1344

±9

3,943

Alibaba

Apache 2.0

97

80◄─►112

qwen-plus-0125

1344

±8

5,861

Alibaba

Proprietary

98

81◄─►112

glm-4-plus-0111

1343

±8

5,806

Zhipu

Proprietary

99

86◄─►104

Anthropicclaude-3-5-sonnet-20240620

1343

±3

82,864

Anthropic

Proprietary

100

81◄─►115

gemma-3-12b-it

1340

±9

3,866

Google

Gemma

101

81◄─►117

Nvidianvidia-llama-3.3-nemotron-super-49b-v1.5

1340

±10

3,496

Nvidia

Nvidia Open

102

86◄─►115

gpt-5-nano-high

1338

±7

8,396

OpenAI

Proprietary

103

81◄─►120

Tencenthunyuan-turbo-0110

1338

±11

2,322

Tencent

Proprietary

104

91◄─►114

Metallama-3.1-405b-instruct-bf16

1335

±4

41,932

Meta

Llama 3.1 Community

105

91◄─►114

o1-mini

1335

±4

52,301

OpenAI

Proprietary

106

92◄─►116

gpt-4o-2024-08-06

1334

±4

45,787

OpenAI

Proprietary

107

90◄─►117

gemini-advanced-0514

1334

±5

50,654

Google

Proprietary

108

94◄─►116

grok-2-2024-08-13

1334

±4

63,725

xAI

Proprietary

109

94◄─►116

Metallama-3.1-405b-instruct-fp8

1334

±3

60,272

Meta

Llama 3.1 Community

110

94◄─►117

qwq-32b

1333

±4

26,277

Alibaba

Apache 2.0

111

89◄─►127

Stepfunstep-2-16k-exp-202412

1332

±9

4,895

StepFun

Proprietary

112

100◄─►128

01.AIyi-lightning

1328

±5

27,624

01 AI

Proprietary

113

102◄─►128

Metallama-4-maverick-17b-128e-instruct

1327

±4

41,227

Meta

Llama 4

114

104◄─►131

qwen3-30b-a3b

1326

±5

27,509

Alibaba

Apache 2.0

115

94◄─►139

Nvidiallama-3.3-nemotron-49b-super-v1

1325

±12

2,243

Nvidia

Nvidia

116

98◄─►138

Tencenthunyuan-large-2025-02-10

1324

±10

3,760

Tencent

Proprietary

117

110◄─►131

gpt-4-turbo-2024-04-09

1324

±4

98,965

OpenAI

Proprietary

118

111◄─►133

Anthropicclaude-3-opus-20240229

1322

±3

196,368

Anthropic

Proprietary

119

110◄─►136

Metallama-4-scout-17b-16e-instruct

1322

±5

31,219

Meta

Llama

120

111◄─►134

Anthropicclaude-3-5-haiku-20241022

1322

±3

71,407

Anthropic

Propretary

121

111◄─►134

gemini-1.5-pro-001

1322

±4

79,769

Google

Proprietary

122

107◄─►139

deepseek-v2.5-1210

1322

±8

6,877

DeepSeek

DeepSeek

123

110◄─►139

gpt-4.1-nano-2025-04-14

1320

±8

6,143

OpenAI

Proprietary

124

111◄─►139

ring-flash-2.0

1319

±7

7,286

Ant Group

MIT

125

111◄─►139

Stepfunstep-1o-turbo-202506

1319

±7

9,671

StepFun

Proprietary

126

114◄─►138

Metallama-3.3-70b-instruct

1319

±3

56,021

Meta

Llama-3.3

127

114◄─►139

glm-4-plus

1318

±5

26,342

Zhipu AI

Proprietary

128

112◄─►139

gemma-3n-e4b-it

1318

±5

23,483

Google

Gemma

129

111◄─►140

gpt-oss-20b

1318

±6

10,858

OpenAI

Apache 2.0

130

114◄─►140

qwen-max-0919

1317

±6

16,598

Alibaba

Qwen

131

116◄─►139

gpt-4o-mini-2024-07-18

1316

±3

69,291

OpenAI

Proprietary

132

119◄─►144

gpt-4-1106-preview

1314

±4

101,117

OpenAI

Proprietary

133

119◄─►144

gpt-4-0125-preview

1314

±4

94,534

OpenAI

Proprietary

134

116◄─►145

qwen2.5-plus-1127

1314

±6

10,252

Alibaba

Proprietary

135

120◄─►144

mistral-large-2407

1313

±4

45,968

Mistral

Mistral Research

136

120◄─►145

athene-v2-chat

1313

±4

24,880

NexusFlow

NexusFlow

137

111◄─►149

mercury

1312

±14

1,974

Inception AI

Proprietary

138

122◄─►146

gemini-1.5-flash-002

1310

±4

35,180

Google

Proprietary

139

117◄─►149

Tencenthunyuan-standard-2025-02-10

1310

±10

3,920

Tencent

Proprietary

140

132◄─►149

grok-2-mini-2024-08-13

1307

±4

52,789

xAI

Proprietary

141

132◄─►149

deepseek-v2.5

1306

±5

24,839

DeepSeek

DeepSeek

142

132◄─►149

athene-70b-0725

1305

±6

19,796

NexusFlow

CC-BY-NC-4.0

143

132◄─►149

magistral-medium-2506

1305

±6

12,001

Mistral

Proprietary

144

135◄─►149

mistral-large-2411

1304

±4

28,455

Mistral

MRL

145

137◄─►149

mistral-small-3.1-24b-instruct-2503

1303

±4

34,153

Mistral

Apache 2.0

146

130◄─►154

gemma-3-4b-it

1302

±9

4,195

Google

Gemma

147

138◄─►149

qwen2.5-72b-instruct

1302

±4

39,632

Alibaba

Qwen

148

138◄─►157

Nvidiallama-3.1-nemotron-70b-instruct

1297

±8

7,216

Nvidia

Llama 3.1

149

138◄─►159

Tencenthunyuan-large-vision

1295

±9

5,600

Tencent

Proprietary

150

147◄─►157

Metallama-3.1-70b-instruct

1293

±4

56,003

Meta

Llama 3.1 Community

151

147◄─►162

jamba-1.5-large

1288

±7

8,730

AI21 Labs

Jamba Open

152

148◄─►161

amazon-nova-pro-v1.0

1288

±4

25,218

Amazon

Proprietary

153

148◄─►162

gpt-4-0314

1287

±5

54,754

OpenAI

Proprietary

154

147◄─►163

reka-core-20240904

1287

±7

7,380

Reka AI

Proprietary

155

148◄─►160

gemma-2-27b-it

1287

±3

76,195

Google

Gemma license

156

147◄─►168

Nvidiallama-3.1-nemotron-51b-instruct

1286

±10

3,777

Nvidia

Llama 3.1

157

147◄─►168

llama-3.1-tulu-3-70b

1286

±10

2,881

Ai2

Llama 3.1

158

150◄─►163

gemini-1.5-flash-001

1285

±4

63,418

Google

Proprietary

159

150◄─►167

Anthropicclaude-3-sonnet-20240229

1282

±4

110,173

Anthropic

Proprietary

160

150◄─►168

gemma-2-9b-it-simpo

1279

±7

10,108

Princeton

MIT

161

153◄─►168

Nvidianemotron-4-340b-instruct

1278

±5

19,913

Nvidia

NVIDIA Open Model

162

152◄─►169

Coherecommand-r-plus-08-2024

1277

±6

9,931

Cohere

CC-BY-NC-4.0

163

157◄─►168

Metallama-3-70b-instruct

1276

±3

158,908

Meta

Llama 3 Community

164

157◄─►168

gpt-4-0613

1276

±4

89,612

OpenAI

Proprietary

165

155◄─►173

glm-4-0520

1274

±7

9,857

Zhipu AI

Proprietary

166

157◄─►172

mistral-small-24b-instruct-2501

1273

±6

14,830

Mistral

Apache 2.0

167

157◄─►173

reka-flash-20240904

1273

±7

7,583

Reka AI

Proprietary

168

158◄─►177

qwen2.5-coder-32b-instruct

1269

±8

5,452

Alibaba

Apache 2.0

169

164◄─►177

Coherec4ai-aya-expanse-32b

1267

±5

27,362

Cohere

CC-BY-NC-4.0

170

165◄─►177

gemma-2-9b-it

1264

±4

54,954

Google

Gemma license

171

165◄─►179

deepseek-coder-v2

1264

±6

15,242

DeepSeek AI

DeepSeek License

172

165◄─►178

Coherecommand-r-plus

1264

±4

78,401

Cohere

CC-BY-NC-4.0

173

166◄─►179

qwen2-72b-instruct

1262

±5

37,688

Alibaba

Qianwen LICENSE

174

168◄─►178

Anthropicclaude-3-haiku-20240307

1262

±4

118,626

Anthropic

Proprietary

175

168◄─►179

amazon-nova-lite-v1.0

1259

±5

19,760

Amazon

Proprietary

176

168◄─►179

gemini-1.5-flash-8b-001

1259

±4

35,914

Google

Proprietary

177

171◄─►179

Azurephi-4

1255

±4

24,354

Microsoft

MIT

178

168◄─►186

olmo-2-0325-32b-instruct

1252

±11

3,377

Allen AI

Apache-2.0

179

172◄─►183

Coherecommand-r-08-2024

1252

±7

10,229

Cohere

CC-BY-NC-4.0

180

178◄─►188

mistral-large-2402

1243

±5

63,404

Mistral

Proprietary

181

178◄─►188

amazon-nova-micro-v1.0

1241

±5

19,774

Amazon

Proprietary

182

178◄─►193

jamba-1.5-mini

1239

±7

8,918

AI21 Labs

Jamba Open

183

178◄─►196

ministral-8b-2410

1237

±9

4,833

Mistral

MRL

184

179◄─►196

gemini-pro-dev-api

1235

±7

18,454

Google

Proprietary

185

180◄─►195

qwen1.5-110b-chat

1235

±5

26,679

Alibaba

Qianwen LICENSE

186

180◄─►196

qwen1.5-72b-chat

1235

±5

39,689

Alibaba

Qianwen LICENSE

187

179◄─►197

reka-flash-21b-20240226-online

1234

±7

15,606

Reka AI

Proprietary

188

179◄─►198

Tencenthunyuan-standard-256k

1233

±12

2,761

Tencent

Proprietary

189

182◄─►197

mixtral-8x22b-instruct-v0.1

1230

±4

52,214

Mistral

Apache 2.0

190

182◄─►198

Coherecommand-r

1229

±5

54,710

Cohere

CC-BY-NC-4.0

191

182◄─►198

reka-flash-21b-20240226

1228

±6

25,026

Reka AI

Proprietary

192

184◄─►199

gpt-3.5-turbo-0125

1225

±5

67,214

OpenAI

Proprietary

193

183◄─►199

mistral-medium

1225

±5

34,893

Mistral

Proprietary

194

187◄─►199

Metallama-3-8b-instruct

1224

±4

106,055

Meta

Llama 3 Community

195

183◄─►200

Coherec4ai-aya-expanse-8b

1223

±7

9,922

Cohere

CC-BY-NC-4.0

196

182◄─►203

gemini-pro

1223

±12

6,418

Google

Proprietary

197

182◄─►203

llama-3.1-tulu-3-8b

1221

±11

2,943

Ai2

Llama 3.1

198

189◄─►204

HuggingFacezephyr-orpo-141b-A35b-v0.1

1215

±11

4,712

HuggingFace

Apache 2.0

199

195◄─►203

01.AIyi-1.5-34b-chat

1214

±5

24,417

01 AI

Apache-2.0

200

196◄─►203

Metallama-3.1-8b-instruct

1211

±4

50,234

Meta

Llama 3.1 Community

201

192◄─►209

granite-3.1-8b-instruct

1210

±11

3,142

IBM

Apache 2.0

202

196◄─►208

qwen1.5-32b-chat

1206

±6

22,068

Alibaba

Qianwen LICENSE

203

196◄─►211

gpt-3.5-turbo-1106

1203

±9

16,760

OpenAI

Proprietary

204

200◄─►211

Azurephi-3-medium-4k-instruct

1199

±5

25,301

Microsoft

MIT

205

201◄─►211

mixtral-8x7b-instruct-v0.1

1199

±4

74,303

Mistral

Apache 2.0

206

201◄─►211

gemma-2-2b-it

1198

±4

46,901

Google

Gemma license

207

201◄─►216

dbrx-instruct-preview

1197

±6

32,760

Databricks

DBRX LICENSE

208

201◄─►219

qwen1.5-14b-chat

1194

±7

18,066

Alibaba

Qianwen LICENSE

209

202◄─►220

InternLMinternlm2_5-20b-chat

1193

±7

10,038

InternLM

Other

210

203◄─►226

Azurewizardlm-70b

1186

±9

8,270

Microsoft

Llama 2 Community

211

203◄─►227

deepseek-llm-67b-chat

1185

±11

4,950

DeepSeek AI

DeepSeek License

212

207◄─►224

01.AIyi-34b-chat

1185

±7

15,624

01 AI

Yi License

213

207◄─►226

OpenChatopenchat-3.5-0106

1184

±8

12,712

OpenChat

Apache-2.0

214

207◄─►226

granite-3.0-8b-instruct

1184

±9

6,727

IBM

Apache 2.0

215

207◄─►227

OpenChatopenchat-3.5

1183

±10

8,009

OpenChat

Apache-2.0

216

208◄─►226

Snowflakesnowflake-arctic-instruct

1181

±6

33,272

Snowflake

Apache 2.0

217

207◄─►229

granite-3.1-2b-instruct

1181

±11

3,235

IBM

Apache 2.0

218

209◄─►227

gemma-1.1-7b-it

1180

±6

24,327

Google

Gemma license

219

208◄─►229

tulu-2-dpo-70b

1180

±10

6,579

AllenAI/UW

AI2 ImpACT Low-risk

220

208◄─►231

openhermes-2.5-mistral-7b

1177

±10

5,026

NousResearch

Apache-2.0

221

210◄─►230

vicuna-33b

1175

±6

22,613

LMSYS

Non-commercial

222

210◄─►231

starling-lm-7b-beta

1174

±7

16,190

Nexusflow

Apache-2.0

223

210◄─►231

Azurephi-3-small-8k-instruct

1173

±6

17,983

Microsoft

MIT

224

211◄─►231

Metallama-2-70b-chat

1173

±5

38,767

Meta

Llama 2 Community

225

211◄─►234

starling-lm-7b-alpha

1169

±8

10,267

UC Berkeley

CC-BY-NC-4.0

226

215◄─►235

Metallama-3.2-3b-instruct

1167

±8

8,043

Meta

Llama 3.2

227

210◄─►236

nous-hermes-2-mixtral-8x7b-dpo

1167

±12

3,792

NousResearch

Apache-2.0

228

218◄─►241

qwq-32b-preview

1159

±11

3,256

Alibaba

Apache 2.0

229

218◄─►244

Nvidiallama2-70b-steerlm-chat

1158

±13

3,605

Nvidia

Llama 2 Community

230

225◄─►241

granite-3.0-2b-instruct

1157

±8

6,922

IBM

Apache 2.0

231

221◄─►246

solar-10.7b-instruct-v1.0

1155

±13

4,187

Upstage AI

CC-BY-NC-4.0

232

220◄─►249

dolphin-2.2.1-mistral-7b

1153

±15

1,685

Cognitive Computations

Apache-2.0

233

225◄─►247

mpt-30b-chat

1153

±12

2,606

MosaicML

CC-BY-NC-SA-4.0

234

226◄─►245

Azurewizardlm-13b

1152

±9

7,122

Microsoft

Llama 2 Community

235

227◄─►244

mistral-7b-instruct-v0.2

1152

±7

19,603

Mistral

Apache-2.0

236

225◄─►252

falcon-180b-chat

1149

±17

1,312

TII

Falcon-180B TII License

237

228◄─►250

qwen1.5-7b-chat

1145

±10

4,782

Alibaba

Qianwen LICENSE

238

228◄─►249

Metallama-2-13b-chat

1144

±7

19,357

Meta

Llama 2 Community

239

228◄─►249

Azurephi-3-mini-4k-instruct-june-2024

1144

±6

12,415

Microsoft

MIT

240

228◄─►249

vicuna-13b

1143

±7

19,539

LMSYS

Llama 2 Community

241

228◄─►252

qwen-14b-chat

1140

±11

5,004

Alibaba

Qianwen LICENSE

242

230◄─►252

palm-2

1138

±9

8,634

Google

Proprietary

243

230◄─►252

Metacodellama-34b-instruct

1138

±9

7,417

Meta

Llama 2 Community

244

232◄─►252

gemma-7b-it

1136

±9

9,034

Google

Gemma license

245

234◄─►253

HuggingFacezephyr-7b-beta

1133

±9

11,220

HuggingFace

MIT

246

235◄─►253

Azurephi-3-mini-128k-instruct

1132

±7

21,024

Microsoft

MIT

247

233◄─►256

guanaco-33b

1130

±12

2,955

UW

Non-commercial

248

239◄─►253

Azurephi-3-mini-4k-instruct

1130

±6

20,539

Microsoft

MIT

249

230◄─►257

HuggingFacezephyr-7b-alpha

1130

±16

1,803

HuggingFace

MIT

250

240◄─►257

stripedhyena-nous-7b

1122

±11

5,214

Together AI

Apache 2.0

251

235◄─►258

Metacodellama-70b-instruct

1120

±18

1,151

Meta

Llama 2 Community

252

240◄─►257

HuggingFacesmollm2-1.7b-instruct

1119

±14

2,244

HuggingFace

Apache 2.0

253

245◄─►257

vicuna-7b

1117

±9

6,972

LMSYS

Llama 2 Community

254

248◄─►257

gemma-1.1-2b-it

1114

±8

11,035

Google

Gemma license

255

248◄─►257

Metallama-3.2-1b-instruct

1113

±8

8,166

Meta

Llama 3.2

256

248◄─►258

mistral-7b-instruct

1112

±9

9,042

Mistral

Apache 2.0

257

249◄─►258

Metallama-2-7b-chat

1110

±7

14,272

Meta

Llama 2 Community

258

258◄─►260

qwen1.5-4b-chat

1092

±9

7,662

Alibaba

Qianwen LICENSE

259

255◄─►262

gemma-2b-it

1092

±12

4,817

Google

Gemma license

260

258◄─►265

olmo-7b-instruct

1076

±11

6,412

Allen AI

Apache-2.0

261

259◄─►265

koala-13b

1073

±10

6,998

UC Berkeley

Non-commercial

262

260◄─►265

alpaca-13b

1068

±11

5,828

Stanford

Non-commercial

263

259◄─►266

gpt4all-13b-snoozy

1068

±15

1,773

Nomic AI

Non-commercial

264

260◄─►266

mpt-7b-chat

1064

±12

3,977

MosaicML

CC-BY-NC-SA-4.0

265

260◄─►266

chatglm3-6b

1058

±12

4,692

Tsinghua

Apache-2.0

266

263◄─►268

RWKVRWKV-4-Raven-14B

1044

±11

4,898

RWKV

Apache 2.0

267

266◄─►268

chatglm2-6b

1027

±14

2,683

Tsinghua

Apache-2.0

268

266◄─►268

oasst-pythia-12b

1024

±11

6,343

OpenAssistant

Apache 2.0

269

269◄─►272

chatglm-6b

998

±13

4,968

Tsinghua

Non-commercial

270

269◄─►272

fastchat-t5-3b

994

±12

4,270

LMSYS

Apache 2.0

271

269◄─►273

dolly-v2-12b

981

±14

3,471

Databricks

MIT

272

269◄─►273

Metallama-13b

973

±16

2,441

Meta

Non-commercial

273

271◄─►273

Stabilitystablelm-tuned-alpha-7b

955

±13

3,325

Stability AI

CC-BY-NC-SA-4.0

说明

  • 排名 (UB):基于 Bradley-Terry 模型计算的排名。此排名反映了模型在竞技场中的综合表现,并提供了其 Elo 分数的 上界 估计,帮助理解模型的潜在竞争力。

  • 模型:大型语言模型 (LLM) 的名称。部分模型名称可能已嵌入相关链接。

  • 分数:模型在竞技场中通过用户投票获得的 Elo 评分。Elo 评分是一种相对排名系统,分数越高表示模型表现越好。

  • 95% 置信区间 (±):模型 Elo 评分的95%置信区间(例如:±6)。这个区间越小,表示模型的评分越稳定和可靠。

  • 票数:该模型在竞技场中收到的总投票数量。投票数越多,通常意味着其评分的统计可靠性越高。

  • 组织/公司:提供该模型的组织或公司。

  • 许可证:模型的许可协议类型,例如专有 (Proprietary)、Apache 2.0、MIT 等。

数据来源与更新频率

本排行榜数据由自动化脚本直接从 1 2 官方网站获取。此排行榜由 GitHub Actions 每天自动更新。

免责声明

本报告仅供参考。排行榜数据是动态变化的,并基于特定时间段内用户在 Chatbot Arena 上的偏好投票。数据的完整性和准确性取决于上游数据源。不同模型可能采用不同的许可协议,使用时请务必参考模型提供商的官方说明。

最后更新于

这有帮助吗?