Model Leaderboard

This leaderboard is based on Chatbot Arena (lmarena.ai) data and is generated by an automated pipeline.

Data updated: 2026-01-12 08:09:21 UTC / 2026-01-12 16:09:21 CST (Beijing time)

Leaderboard

| Rank | Rank Spread | Model | Score | 95% CI (±) | Votes | Organization | License |
|---|---|---|---|---|---|---|---|
| 1 | 1◄─►1 | gemini-3-pro | 1490 | ±5 | 25,178 | Google | Proprietary |
| 2 | 2◄─►5 | grok-4.1-thinking | 1477 | ±5 | 25,440 | xAI | Proprietary |
| 3 | 2◄─►7 | gemini-3-flash | 1471 | ±7 | 10,204 | Google | Proprietary |
| 4 | 2◄─►7 | claude-opus-4-5-20251101-thinking-32k | 1469 | ±6 | 17,570 | Anthropic | Proprietary |
| 5 | 3◄─►8 | grok-4.1 | 1466 | ±5 | 29,539 | xAI | Proprietary |
| 6 | 3◄─►8 | claude-opus-4-5-20251101 | 1465 | ±6 | 18,754 | Anthropic | Proprietary |
| 7 | 2◄─►8 | gemini-3-flash (thinking-minimal) | 1464 | ±9 | 5,624 | Google | Proprietary |
| 8 | 5◄─►11 | gpt-5.1-high | 1457 | ±5 | 22,116 | OpenAI | Proprietary |
| 9 | 8◄─►15 | gemini-2.5-pro | 1450 | ±3 | 84,916 | Google | Proprietary |
| 10 | 8◄─►16 | claude-sonnet-4-5-20250929-thinking-32k | 1450 | ±4 | 36,318 | Anthropic | Proprietary |
| 11 | 9◄─►17 | claude-opus-4-1-20250805-thinking-16k | 1448 | ±4 | 50,339 | Anthropic | Proprietary |
| 12 | 9◄─►17 | claude-sonnet-4-5-20250929 | 1448 | ±4 | 32,372 | Anthropic | Proprietary |
| 13 | 8◄─►21 | ernie-5.0-preview-1203 | 1446 | ±8 | 7,250 | Baidu | Proprietary |
| 14 | 9◄─►21 | gpt-4.5-preview-2025-02-27 | 1443 | ±6 | 14,646 | OpenAI | Proprietary |
| 15 | 10◄─►20 | claude-opus-4-1-20250805 | 1443 | ±3 | 65,088 | Anthropic | Proprietary |
| 16 | 9◄─►24 | glm-4.7 | 1443 (Preliminary) | ±8 | 7,157 | Z.ai | MIT |
| 17 | 11◄─►20 | chatgpt-4o-latest-20250326 | 1442 | ±3 | 72,793 | OpenAI | Proprietary |
| 18 | 13◄─►26 | gpt-5-high | 1436 | ±5 | 32,723 | OpenAI | Proprietary |
| 19 | 13◄─►27 | gpt-5.1 | 1435 | ±5 | 23,716 | OpenAI | Proprietary |
| 20 | 13◄─►33 | gpt-5.2-high | 1434 | ±7 | 9,657 | OpenAI | Proprietary |
| 21 | 15◄─►29 | qwen3-max-preview | 1434 | ±5 | 28,055 | Alibaba | Proprietary |
| 22 | 17◄─►28 | o3-2025-04-16 | 1433 | ±4 | 61,321 | OpenAI | Proprietary |
| 23 | 17◄─►33 | grok-4-1-fast-reasoning | 1431 | ±5 | 18,610 | xAI | Proprietary |
| 24 | 18◄─►37 | kimi-k2-thinking-turbo | 1430 | ±5 | 23,702 | Moonshot | Modified MIT |
| 25 | 17◄─►42 | ernie-5.0-preview-1103 | 1429 | ±7 | 9,049 | Baidu | Proprietary |
| 26 | 18◄─►44 | gpt-5.2 | 1425 | ±6 | 10,415 | OpenAI | Proprietary |
| 27 | 21◄─►44 | gpt-5-chat | 1425 | ±4 | 32,024 | OpenAI | Proprietary |
| 28 | 19◄─►45 | qwen3-max-2025-09-23 | 1424 | ±6 | 9,211 | Alibaba | Proprietary |
| 29 | 22◄─►44 | glm-4.6 | 1424 | ±4 | 31,660 | Z.ai | MIT |
| 30 | 20◄─►45 | deepseek-v3.2-exp | 1423 | ±6 | 11,910 | DeepSeek AI | MIT |
| 31 | 22◄─►44 | claude-opus-4-20250514-thinking-16k | 1423 | ±4 | 37,671 | Anthropic | Proprietary |
| 32 | 24◄─►44 | qwen3-235b-a22b-instruct-2507 | 1422 | ±3 | 59,848 | Alibaba | Apache 2.0 |
| 33 | 22◄─►48 | deepseek-v3.2-exp-thinking | 1422 | ±7 | 9,179 | DeepSeek AI | MIT |
| 34 | 22◄─►51 | grok-4-fast-chat | 1420 | ±8 | 7,013 | xAI | Proprietary |
| 35 | 24◄─►50 | deepseek-v3.2 | 1420 | ±6 | 18,091 | DeepSeek AI | MIT |
| 36 | 24◄─►50 | deepseek-v3.2-thinking | 1419 | ±6 | 13,621 | DeepSeek AI | MIT |
| 37 | 25◄─►50 | deepseek-r1-0528 | 1419 | ±6 | 19,149 | DeepSeek | MIT |
| 38 | 25◄─►51 | kimi-k2-0905-preview | 1418 | ±6 | 12,061 | Moonshot | Modified MIT |
| 39 | 25◄─►52 | deepseek-v3.1 | 1417 | ±6 | 15,189 | DeepSeek | MIT |
| 40 | 26◄─►51 | kimi-k2-0711-preview | 1416 | ±5 | 28,491 | Moonshot | Modified MIT |
| 41 | 25◄─►52 | deepseek-v3.1-thinking | 1416 | ±7 | 11,932 | DeepSeek | MIT |
| 42 | 24◄─►57 | deepseek-v3.1-terminus | 1415 | ±10 | 3,731 | DeepSeek AI | MIT |
| 43 | 26◄─►53 | qwen3-vl-235b-a22b-instruct | 1414 | ±7 | 11,676 | Alibaba | Apache 2.0 |
| 44 | 25◄─►61 | deepseek-v3.1-terminus-thinking | 1414 | ±10 | 3,506 | DeepSeek AI | MIT |
| 45 | 31◄─►54 | mistral-large-3 | 1413 | ±6 | 14,472 | Mistral | Apache 2.0 |
| 46 | 33◄─►53 | gpt-4.1-2025-04-14 | 1412 | ±4 | 52,315 | OpenAI | Proprietary |
| 47 | 33◄─►53 | claude-opus-4-20250514 | 1412 | ±4 | 45,407 | Anthropic | Proprietary |
| 48 | 33◄─►53 | mistral-medium-2508 | 1412 | ±4 | 53,609 | Mistral | Proprietary |
| 49 | 34◄─►56 | grok-3-preview-02-24 | 1411 | ±4 | 34,031 | xAI | Proprietary |
| 50 | 36◄─►59 | grok-4-0709 | 1409 | ±4 | 42,343 | xAI | Proprietary |
| 51 | 34◄─►61 | glm-4.5 | 1409 | ±5 | 24,694 | Z.ai | MIT |
| 52 | 40◄─►59 | gemini-2.5-flash | 1408 | ±3 | 84,420 | Google | Proprietary |
| 53 | 42◄─►66 | gemini-2.5-flash-preview-09-2025 | 1405 | ±4 | 33,365 | Google | Proprietary |
| 54 | 47◄─►66 | claude-haiku-4-5-20251001 | 1403 | ±4 | 34,960 | Anthropic | Proprietary |
| 55 | 46◄─►66 | grok-4-fast-reasoning | 1402 | ±5 | 18,781 | xAI | Proprietary |
| 56 | 49◄─►68 | o1-2024-12-17 | 1401 | ±4 | 28,039 | OpenAI | Proprietary |
| 57 | 49◄─►69 | qwen3-next-80b-a3b-instruct | 1401 | ±5 | 22,974 | Alibaba | Apache 2.0 |
| 58 | 47◄─►70 | longcat-flash-chat | 1400 | ±6 | 11,455 | Meituan | MIT |
| 59 | 51◄─►69 | qwen3-235b-a22b-no-thinking | 1400 | ±4 | 39,205 | Alibaba | Apache 2.0 |
| 60 | 53◄─►70 | claude-sonnet-4-20250514-thinking-32k | 1399 | ±4 | 36,020 | Anthropic | Proprietary |
| 61 | 53◄─►75 | qwen3-235b-a22b-thinking-2507 | 1398 | ±6 | 9,299 | Alibaba | Apache 2.0 |
| 62 | 51◄─►76 | mimo-v2-flash (non-thinking) | 1397 | ±8 | 6,623 | Xiaomi | MIT |
| 63 | 53◄─►75 | deepseek-r1 | 1396 | ±5 | 18,722 | DeepSeek | MIT |
| 64 | 48◄─►79 | amazon-nova-experimental-chat-12-10 | 1396 | ±10 | 3,217 | Amazon | Proprietary |
| 65 | 53◄─►77 | qwen3-vl-235b-a22b-thinking | 1394 | ±7 | 7,951 | Alibaba | Apache 2.0 |
| 66 | 56◄─►76 | deepseek-v3-0324 | 1393 | ±4 | 46,580 | DeepSeek | MIT |
| 67 | 56◄─►77 | gpt-5-mini-high | 1392 | ±5 | 27,302 | OpenAI | Proprietary |
| 68 | 53◄─►82 | hunyuan-vision-1.5-thinking | 1392 | ±12 | 2,202 | Tencent | Proprietary |
| 69 | 59◄─►79 | o4-mini-2025-04-16 | 1391 | ±4 | 46,663 | OpenAI | Proprietary |
| 70 | 57◄─►79 | mai-1-preview | 1391 | ±5 | 18,092 | Microsoft AI | Proprietary |
| 71 | 61◄─►80 | claude-sonnet-4-20250514 | 1389 | ±4 | 41,446 | Anthropic | Proprietary |
| 72 | 61◄─►80 | o1-preview | 1388 | ±5 | 31,505 | OpenAI | Proprietary |
| 73 | 63◄─►80 | claude-3-7-sonnet-20250219-thinking-32k | 1387 | ±4 | 39,795 | Anthropic | Proprietary |
| 74 | 61◄─►81 | qwen3-coder-480b-a35b-instruct | 1387 | ±5 | 26,656 | Alibaba | Apache 2.0 |
| 75 | 61◄─►84 | hunyuan-t1-20250711 | 1385 | ±9 | 4,800 | Tencent | Proprietary |
| 76 | 61◄─►84 | minimax-m2.1-preview | 1384 (Preliminary) | ±8 | 6,148 | MiniMax | MIT |
| 77 | 65◄─►83 | mistral-medium-2505 | 1384 | ±5 | 34,374 | Mistral | Proprietary |
| 78 | 67◄─►83 | qwen3-30b-a3b-instruct-2507 | 1382 | ±5 | 24,077 | Alibaba | Apache 2.0 |
| 79 | 70◄─►84 | gpt-4.1-mini-2025-04-14 | 1381 | ±4 | 40,335 | OpenAI | Proprietary |
| 80 | 67◄─►87 | hunyuan-turbos-20250416 | 1381 | ±6 | 11,082 | Tencent | Proprietary |
| 81 | 73◄─►87 | gemini-2.5-flash-lite-preview-09-2025-no-thinking | 1379 | ±4 | 37,839 | Google | Proprietary |
| 82 | 74◄─►89 | gemini-2.5-flash-lite-preview-06-17-thinking | 1375 | ±4 | 33,772 | Google | Proprietary |
| 83 | 75◄─►89 | qwen3-235b-a22b | 1374 | ±5 | 27,047 | Alibaba | Apache 2.0 |
| 84 | 77◄─►89 | qwen2.5-max | 1373 | ±4 | 33,487 | Alibaba | Proprietary |
| 85 | 80◄─►89 | claude-3-5-sonnet-20241022 | 1372 | ±3 | 89,708 | Anthropic | Proprietary |
| 86 | 80◄─►93 | claude-3-7-sonnet-20250219 | 1371 | ±4 | 44,455 | Anthropic | Proprietary |
| 87 | 80◄─►93 | glm-4.5-air | 1371 | ±4 | 31,472 | Z.ai | MIT |
| 88 | 82◄─►96 | qwen3-next-80b-a3b-thinking | 1368 | ±6 | 13,750 | Alibaba | Apache 2.0 |
| 89 | 82◄─►95 | minimax-m1 | 1367 | ±4 | 36,679 | MiniMax | Apache 2.0 |
| 90 | 86◄─►98 | gemma-3-27b-it | 1365 | ±4 | 49,137 | Google | Gemma |
| 91 | 86◄─►102 | o3-mini-high | 1363 | ±5 | 18,725 | OpenAI | Proprietary |
| 92 | 86◄─►103 | grok-3-mini-high | 1362 | ±5 | 17,475 | xAI | Proprietary |
| 93 | 86◄─►108 | amazon-nova-experimental-chat-11-10 | 1361 | ±7 | 7,391 | Amazon | Proprietary |
| 94 | 88◄─►104 | gemini-2.0-flash-001 | 1360 | ±4 | 45,001 | Google | Proprietary |
| 95 | 88◄─►112 | deepseek-v3 | 1358 | ±5 | 21,995 | DeepSeek | DeepSeek |
| 96 | 90◄─►112 | grok-3-mini-beta | 1357 | ±5 | 23,670 | xAI | Proprietary |
| 97 | 91◄─►117 | mistral-small-2506 | 1355 | ±5 | 18,225 | Mistral | Apache 2.0 |
| 98 | 88◄─►118 | intellect-3 | 1354 | ±8 | 5,424 | Prime Intellect | MIT |
| 99 | 90◄─►120 | glm-4.5v | 1353 | ±8 | 4,961 | Z.ai | MIT |
| 100 | 92◄─►118 | gpt-oss-120b | 1353 | ±4 | 31,079 | OpenAI | Apache 2.0 |
| 101 | 93◄─►118 | gemini-2.0-flash-lite-preview-02-05 | 1353 | ±4 | 25,222 | Google | Proprietary |
| 102 | 94◄─►117 | command-a-03-2025 | 1352 | ±3 | 57,583 | Cohere | CC-BY-NC-4.0 |
| 103 | 94◄─►118 | gemini-1.5-pro-002 | 1351 | ±3 | 56,014 | Google | Proprietary |
| 104 | 97◄─►120 | o3-mini | 1348 | ±3 | 58,663 | OpenAI | Proprietary |
| 105 | 95◄─►122 | amazon-nova-experimental-chat-10-20 | 1347 | ±6 | 11,615 | Amazon | Proprietary |
| 106 | 91◄─►130 | hunyuan-turbos-20250226 | 1347 | ±12 | 2,254 | Tencent | Proprietary |
| 107 | 91◄─►130 | amazon-nova-experimental-chat-10-09 | 1347 | ±11 | 2,876 | Amazon | Proprietary |
| 108 | 95◄─►123 | ling-flash-2.0 | 1347 | ±7 | 7,111 | Ant Group | MIT |
| 109 | 94◄─►127 | minimax-m2 | 1346 | ±8 | 7,073 | MiniMax | Apache 2.0 |
| 110 | 95◄─►126 | step-3 | 1346 | ±7 | 6,600 | StepFun | Apache 2.0 |
| 111 | 91◄─►131 | llama-3.1-nemotron-ultra-253b-v1 | 1346 | ±12 | 2,573 | Nvidia | Nvidia Open Model |
| 112 | 99◄─►121 | gpt-4o-2024-05-13 | 1345 | ±3 | 113,568 | OpenAI | Proprietary |
| 113 | 94◄─►130 | qwen3-32b | 1345 | ±9 | 3,944 | Alibaba | Apache 2.0 |
| 114 | 95◄─►130 | qwen-plus-0125 | 1345 | ±8 | 5,862 | Alibaba | Proprietary |
| 115 | 97◄─►131 | glm-4-plus-0111 | 1343 | ±8 | 5,808 | Zhipu | Proprietary |
| 116 | 102◄─►124 | claude-3-5-sonnet-20240620 | 1342 | ±3 | 82,864 | Anthropic | Proprietary |
| 117 | 97◄─►135 | nvidia-llama-3.3-nemotron-super-49b-v1.5 | 1341 | ±10 | 3,465 | Nvidia | Nvidia Open |
| 118 | 97◄─►133 | gemma-3-12b-it | 1341 | ±9 | 3,866 | Google | Gemma |
| 119 | 97◄─►137 | hunyuan-turbo-0110 | 1339 | ±11 | 2,324 | Tencent | Proprietary |
| 120 | 103◄─►133 | gpt-5-nano-high | 1338 | ±7 | 8,348 | OpenAI | Proprietary |
| 121 | 105◄─►135 | nova-2-lite | 1337 | ±6 | 12,362 | Amazon | Proprietary |
| 122 | 107◄─►132 | o1-mini | 1336 | ±4 | 52,303 | OpenAI | Proprietary |
| 123 | 109◄─►132 | llama-3.1-405b-instruct-bf16 | 1335 | ±4 | 41,948 | Meta | Llama 3.1 Community |
| 124 | 110◄─►135 | gpt-4o-2024-08-06 | 1335 | ±4 | 45,787 | OpenAI | Proprietary |
| 125 | 109◄─►135 | qwq-32b | 1335 | ±4 | 26,177 | Alibaba | Apache 2.0 |
| 126 | 111◄─►135 | grok-2-2024-08-13 | 1334 | ±4 | 63,725 | xAI | Proprietary |
| 127 | 108◄─►135 | gemini-advanced-0514 | 1334 | ±5 | 50,652 | Google | Proprietary |
| 128 | 111◄─►135 | llama-3.1-405b-instruct-fp8 | 1333 | ±3 | 60,273 | Meta | Llama 3.1 Community |
| 129 | 106◄─►146 | step-2-16k-exp-202412 | 1333 | ±9 | 4,896 | StepFun | Proprietary |
| 130 | 117◄─►147 | yi-lightning | 1328 | ±5 | 27,624 | 01 AI | Proprietary |
| 131 | 119◄─►148 | llama-4-maverick-17b-128e-instruct | 1327 | ±4 | 41,039 | Meta | Llama 4 |
| 132 | 121◄─►150 | qwen3-30b-a3b | 1326 | ±5 | 27,383 | Alibaba | Apache 2.0 |
| 133 | 111◄─►158 | llama-3.3-nemotron-49b-super-v1 | 1326 | ±12 | 2,246 | Nvidia | Nvidia |
| 134 | 115◄─►157 | hunyuan-large-2025-02-10 | 1325 | ±10 | 3,760 | Tencent | Proprietary |
| 135 | 128◄─►153 | gpt-4-turbo-2024-04-09 | 1324 | ±4 | 98,965 | OpenAI | Proprietary |
| 136 | 129◄─►153 | claude-3-5-haiku-20241022 | 1323 | ±3 | 71,210 | Anthropic | Proprietary |
| 137 | 121◄─►157 | deepseek-v2.5-1210 | 1323 | ±8 | 6,877 | DeepSeek | DeepSeek |
| 138 | 129◄─►154 | llama-4-scout-17b-16e-instruct | 1322 | ±5 | 31,044 | Meta | Llama |
| 139 | 129◄─►153 | gemini-1.5-pro-001 | 1322 | ±4 | 79,769 | Google | Proprietary |
| 140 | 129◄─►153 | claude-3-opus-20240229 | 1322 | ±3 | 196,365 | Anthropic | Proprietary |
| 141 | 128◄─►158 | gpt-4.1-nano-2025-04-14 | 1321 | ±8 | 6,146 | OpenAI | Proprietary |
| 142 | 129◄─►158 | step-1o-turbo-202506 | 1320 | ±7 | 9,632 | StepFun | Proprietary |
| 143 | 129◄─►159 | ring-flash-2.0 | 1320 | ±7 | 7,243 | Ant Group | MIT |
| 144 | 132◄─►157 | llama-3.3-70b-instruct | 1319 | ±3 | 55,942 | Meta | Llama-3.3 |
| 145 | 131◄─►158 | glm-4-plus | 1319 | ±5 | 26,343 | Zhipu AI | Proprietary |
| 146 | 130◄─►158 | gemma-3n-e4b-it | 1319 | ±5 | 23,401 | Google | Gemma |
| 147 | 129◄─►159 | gpt-oss-20b | 1318 | ±6 | 10,820 | OpenAI | Apache 2.0 |
| 148 | 132◄─►159 | qwen-max-0919 | 1317 | ±6 | 16,599 | Alibaba | Qwen |
| 149 | 129◄─►164 | nvidia-nemotron-3-nano-30b-a3b-bf16 | 1317 | ±7 | 9,342 | Nvidia | NVIDIA Open Model |
| 150 | 133◄─►158 | gpt-4o-mini-2024-07-18 | 1317 | ±3 | 69,291 | OpenAI | Proprietary |
| 151 | 133◄─►165 | qwen2.5-plus-1127 | 1314 | ±6 | 10,254 | Alibaba | Proprietary |
| 152 | 138◄─►164 | mistral-large-2407 | 1314 | ±4 | 45,968 | Mistral | Mistral Research |
| 153 | 137◄─►165 | athene-v2-chat | 1313 | ±4 | 24,880 | NexusFlow | NexusFlow |
| 154 | 138◄─►164 | gpt-4-0125-preview | 1313 | ±4 | 94,534 | OpenAI | Proprietary |
| 155 | 138◄─►164 | gpt-4-1106-preview | 1313 | ±4 | 101,116 | OpenAI | Proprietary |
| 156 | 129◄─►169 | mercury | 1312 | ±14 | 1,960 | Inception AI | Proprietary |
| 157 | 133◄─►169 | hunyuan-standard-2025-02-10 | 1311 | ±10 | 3,923 | Tencent | Proprietary |
| 158 | 141◄─►167 | gemini-1.5-flash-002 | 1310 | ±4 | 35,180 | Google | Proprietary |
| 159 | 150◄─►168 | grok-2-mini-2024-08-13 | 1307 | ±4 | 52,792 | xAI | Proprietary |
| 160 | 150◄─►169 | deepseek-v2.5 | 1306 | ±5 | 24,839 | DeepSeek | DeepSeek |
| 161 | 150◄─►169 | athene-70b-0725 | 1305 | ±6 | 19,795 | NexusFlow | CC-BY-NC-4.0 |
| 162 | 154◄─►169 | mistral-large-2411 | 1305 | ±4 | 28,454 | Mistral | MRL |
| 163 | 147◄─►170 | olmo-3-32b-think | 1305 | ±8 | 5,977 | Allen AI | Apache 2.0 |
| 164 | 150◄─►169 | magistral-medium-2506 | 1305 | ±6 | 11,952 | Mistral | Proprietary |
| 165 | 156◄─►169 | mistral-small-3.1-24b-instruct-2503 | 1304 | ±4 | 33,996 | Mistral | Apache 2.0 |
| 166 | 150◄─►175 | gemma-3-4b-it | 1303 | ±9 | 4,196 | Google | Gemma |
| 167 | 156◄─►169 | qwen2.5-72b-instruct | 1302 | ±4 | 39,634 | Alibaba | Qwen |
| 168 | 157◄─►178 | llama-3.1-nemotron-70b-instruct | 1298 | ±8 | 7,216 | Nvidia | Llama 3.1 |
| 169 | 158◄─►180 | hunyuan-large-vision | 1295 | ±9 | 5,573 | Tencent | Proprietary |
| 170 | 166◄─►178 | llama-3.1-70b-instruct | 1293 | ±4 | 56,002 | Meta | Llama 3.1 Community |
| 171 | 167◄─►183 | jamba-1.5-large | 1289 | ±7 | 8,730 | AI21 Labs | Jamba Open |
| 172 | 168◄─►181 | amazon-nova-pro-v1.0 | 1289 | ±4 | 25,218 | Amazon | Proprietary |
| 173 | 167◄─►188 | ibm-granite-h-small | 1288 | ±8 | 5,681 | IBM | Apache 2.0 |
| 174 | 168◄─►181 | gemma-2-27b-it | 1288 | ±3 | 76,196 | Google | Gemma license |
| 175 | 167◄─►183 | reka-core-20240904 | 1287 | ±7 | 7,380 | Reka AI | Proprietary |
| 176 | 168◄─►183 | gpt-4-0314 | 1287 | ±5 | 54,754 | OpenAI | Proprietary |
| 177 | 167◄─►189 | llama-3.1-nemotron-51b-instruct | 1286 | ±10 | 3,777 | Nvidia | Llama 3.1 |
| 178 | 167◄─►189 | llama-3.1-tulu-3-70b | 1286 | ±10 | 2,882 | Ai2 | Llama 3.1 |
| 179 | 170◄─►183 | gemini-1.5-flash-001 | 1285 | ±4 | 63,418 | Google | Proprietary |
| 180 | 171◄─►189 | claude-3-sonnet-20240229 | 1281 | ±4 | 110,173 | Anthropic | Proprietary |
| 181 | 170◄─►189 | gemma-2-9b-it-simpo | 1279 | ±7 | 10,108 | Princeton | MIT |
| 182 | 173◄─►189 | nemotron-4-340b-instruct | 1277 | ±5 | 19,913 | Nvidia | NVIDIA Open Model |
| 183 | 173◄─►190 | command-r-plus-08-2024 | 1277 | ±7 | 9,932 | Cohere | CC-BY-NC-4.0 |
| 184 | 177◄─►189 | llama-3-70b-instruct | 1276 | ±4 | 158,907 | Meta | Llama 3 Community |
| 185 | 177◄─►190 | gpt-4-0613 | 1275 | ±4 | 89,612 | OpenAI | Proprietary |
| 186 | 177◄─►192 | mistral-small-24b-instruct-2501 | 1274 | ±6 | 14,831 | Mistral | Apache 2.0 |
| 187 | 177◄─►194 | glm-4-0520 | 1273 | ±7 | 9,857 | Zhipu AI | Proprietary |
| 188 | 177◄─►194 | reka-flash-20240904 | 1273 | ±7 | 7,585 | Reka AI | Proprietary |
| 189 | 178◄─►198 | qwen2.5-coder-32b-instruct | 1270 | ±8 | 5,452 | Alibaba | Apache 2.0 |
| 190 | 184◄─►198 | c4ai-aya-expanse-32b | 1267 | ±5 | 27,362 | Cohere | CC-BY-NC-4.0 |
| 191 | 186◄─►198 | gemma-2-9b-it | 1265 | ±4 | 54,957 | Google | Gemma license |
| 192 | 186◄─►200 | deepseek-coder-v2 | 1264 | ±6 | 15,242 | DeepSeek AI | DeepSeek License |
| 193 | 187◄─►199 | command-r-plus | 1263 | ±4 | 78,400 | Cohere | CC-BY-NC-4.0 |
| 194 | 187◄─►200 | qwen2-72b-instruct | 1262 | ±5 | 37,688 | Alibaba | Qianwen LICENSE |
| 195 | 189◄─►200 | claude-3-haiku-20240307 | 1261 | ±4 | 118,626 | Anthropic | Proprietary |
| 196 | 189◄─►200 | amazon-nova-lite-v1.0 | 1260 | ±5 | 19,765 | Amazon | Proprietary |
| 197 | 189◄─►200 | gemini-1.5-flash-8b-001 | 1259 | ±4 | 35,914 | Google | Proprietary |
| 198 | 192◄─►200 | phi-4 | 1256 | ±5 | 24,355 | Microsoft | MIT |
| 199 | 189◄─►206 | olmo-2-0325-32b-instruct | 1252 | ±11 | 3,379 | Allen AI | Apache-2.0 |
| 200 | 193◄─►205 | command-r-08-2024 | 1251 | ±7 | 10,229 | Cohere | CC-BY-NC-4.0 |
| 201 | 199◄─►209 | mistral-large-2402 | 1242 | ±5 | 63,404 | Mistral | Proprietary |
| 202 | 199◄─►209 | amazon-nova-micro-v1.0 | 1241 | ±5 | 19,776 | Amazon | Proprietary |
| 203 | 199◄─►214 | jamba-1.5-mini | 1239 | ±7 | 8,918 | AI21 Labs | Jamba Open |
| 204 | 199◄─►217 | ministral-8b-2410 | 1237 | ±9 | 4,833 | Mistral | MRL |
| 205 | 200◄─►217 | gemini-pro-dev-api | 1235 | ±7 | 18,454 | Google | Proprietary |
| 206 | 201◄─►216 | qwen1.5-110b-chat | 1234 | ±5 | 26,679 | Alibaba | Qianwen LICENSE |
| 207 | 201◄─►217 | qwen1.5-72b-chat | 1234 | ±5 | 39,689 | Alibaba | Qianwen LICENSE |
| 208 | 199◄─►219 | hunyuan-standard-256k | 1234 | ±12 | 2,761 | Tencent | Proprietary |
| 209 | 201◄─►218 | reka-flash-21b-20240226-online | 1233 | ±7 | 15,606 | Reka AI | Proprietary |
| 210 | 203◄─►218 | mixtral-8x22b-instruct-v0.1 | 1230 | ±4 | 52,214 | Mistral | Apache 2.0 |
| 211 | 203◄─►219 | command-r | 1228 | ±5 | 54,710 | Cohere | CC-BY-NC-4.0 |
| 212 | 203◄─►219 | reka-flash-21b-20240226 | 1227 | ±6 | 25,026 | Reka AI | Proprietary |
| 213 | 204◄─►220 | gpt-3.5-turbo-0125 | 1224 | ±5 | 67,214 | OpenAI | Proprietary |
| 214 | 204◄─►221 | c4ai-aya-expanse-8b | 1224 | ±7 | 9,922 | Cohere | CC-BY-NC-4.0 |
| 215 | 205◄─►221 | mistral-medium | 1223 | ±5 | 34,893 | Mistral | Proprietary |
| 216 | 208◄─►220 | llama-3-8b-instruct | 1223 | ±4 | 106,055 | Meta | Llama 3 Community |
| 217 | 203◄─►224 | gemini-pro | 1222 | ±12 | 6,418 | Google | Proprietary |
| 218 | 203◄─►223 | llama-3.1-tulu-3-8b | 1222 | ±11 | 2,943 | Ai2 | Llama 3.1 |
| 219 | 210◄─►225 | zephyr-orpo-141b-A35b-v0.1 | 1214 | ±11 | 4,712 | HuggingFace | Apache 2.0 |
| 220 | 215◄─►224 | yi-1.5-34b-chat | 1213 | ±5 | 24,417 | 01 AI | Apache-2.0 |
| 221 | 217◄─►224 | llama-3.1-8b-instruct | 1211 | ±4 | 50,235 | Meta | Llama 3.1 Community |
| 222 | 213◄─►230 | granite-3.1-8b-instruct | 1210 | ±11 | 3,142 | IBM | Apache 2.0 |
| 223 | 217◄─►229 | qwen1.5-32b-chat | 1205 | ±6 | 22,068 | Alibaba | Qianwen LICENSE |
| 224 | 218◄─►232 | gpt-3.5-turbo-1106 | 1202 | ±9 | 16,760 | OpenAI | Proprietary |
| 225 | 222◄─►232 | gemma-2-2b-it | 1199 | ±4 | 46,900 | Google | Gemma license |
| 226 | 221◄─►232 | phi-3-medium-4k-instruct | 1198 | ±5 | 25,301 | Microsoft | MIT |
| 227 | 222◄─►232 | mixtral-8x7b-instruct-v0.1 | 1198 | ±4 | 74,303 | Mistral | Apache 2.0 |
| 228 | 222◄─►237 | dbrx-instruct-preview | 1196 | ±6 | 32,760 | Databricks | DBRX LICENSE |
| 229 | 222◄─►241 | qwen1.5-14b-chat | 1193 | ±7 | 18,066 | Alibaba | Qianwen LICENSE |
| 230 | 223◄─►241 | internlm2_5-20b-chat | 1192 | ±7 | 10,038 | InternLM | Other |
| 231 | 224◄─►247 | wizardlm-70b | 1185 | ±9 | 8,270 | Microsoft | Llama 2 Community |
| 232 | 224◄─►248 | deepseek-llm-67b-chat | 1184 | ±12 | 4,950 | DeepSeek AI | DeepSeek License |
| 233 | 228◄─►245 | yi-34b-chat | 1184 | ±7 | 15,624 | 01 AI | Yi License |
| 234 | 228◄─►247 | granite-3.0-8b-instruct | 1184 | ±9 | 6,727 | IBM | Apache 2.0 |
| 235 | 228◄─►247 | openchat-3.5-0106 | 1183 | ±8 | 12,712 | OpenChat | Apache-2.0 |
| 236 | 228◄─►248 | openchat-3.5 | 1182 | ±10 | 8,009 | OpenChat | Apache-2.0 |
| 237 | 228◄─►249 | granite-3.1-2b-instruct | 1181 | ±11 | 3,236 | IBM | Apache 2.0 |
| 238 | 229◄─►248 | gemma-1.1-7b-it | 1180 | ±6 | 24,327 | Google | Gemma license |
| 239 | 229◄─►247 | snowflake-arctic-instruct | 1180 | ±6 | 33,272 | Snowflake | Apache 2.0 |
| 240 | 229◄─►250 | tulu-2-dpo-70b | 1179 | ±10 | 6,579 | AllenAI/UW | AI2 ImpACT Low-risk |
| 241 | 229◄─►252 | openhermes-2.5-mistral-7b | 1176 | ±10 | 5,026 | NousResearch | Apache-2.0 |
| 242 | 231◄─►251 | vicuna-33b | 1174 | ±6 | 22,613 | LMSYS | Non-commercial |
| 243 | 231◄─►252 | starling-lm-7b-beta | 1173 | ±7 | 16,190 | Nexusflow | Apache-2.0 |
| 244 | 231◄─►252 | phi-3-small-8k-instruct | 1172 | ±6 | 17,983 | Microsoft | MIT |
| 245 | 232◄─►252 | llama-2-70b-chat | 1172 | ±5 | 38,767 | Meta | Llama 2 Community |
| 246 | 232◄─►255 | starling-lm-7b-alpha | 1168 | ±8 | 10,267 | UC Berkeley | CC-BY-NC-4.0 |
| 247 | 236◄─►256 | llama-3.2-3b-instruct | 1167 | ±8 | 8,043 | Meta | Llama 3.2 |
| 248 | 231◄─►257 | nous-hermes-2-mixtral-8x7b-dpo | 1166 | ±12 | 3,792 | NousResearch | Apache-2.0 |
| 249 | 239◄─►262 | qwq-32b-preview | 1159 | ±11 | 3,256 | Alibaba | Apache 2.0 |
| 250 | 240◄─►266 | llama2-70b-steerlm-chat | 1157 | ±13 | 3,605 | Nvidia | Llama 2 Community |
| 251 | 246◄─►262 | granite-3.0-2b-instruct | 1157 | ±8 | 6,922 | IBM | Apache 2.0 |
| 252 | 242◄─►266 | solar-10.7b-instruct-v1.0 | 1154 | ±13 | 4,187 | Upstage AI | CC-BY-NC-4.0 |
| 253 | 241◄─►270 | dolphin-2.2.1-mistral-7b | 1152 | ±15 | 1,685 | Cognitive Computations | Apache-2.0 |
| 254 | 246◄─►268 | mpt-30b-chat | 1151 | ±12 | 2,606 | MosaicML | CC-BY-NC-SA-4.0 |
| 255 | 248◄─►266 | mistral-7b-instruct-v0.2 | 1151 | ±7 | 19,603 | Mistral | Apache-2.0 |
| 256 | 247◄─►266 | wizardlm-13b | 1150 | ±9 | 7,122 | Microsoft | Llama 2 Community |
| 257 | 246◄─►273 | falcon-180b-chat | 1148 | ±17 | 1,312 | TII | Falcon-180B TII License |
| 258 | 249◄─►271 | qwen1.5-7b-chat | 1144 | ±10 | 4,782 | Alibaba | Qianwen LICENSE |
| 259 | 249◄─►270 | phi-3-mini-4k-instruct-june-2024 | 1143 | ±6 | 12,414 | Microsoft | MIT |
| 260 | 249◄─►270 | llama-2-13b-chat | 1143 | ±7 | 19,357 | Meta | Llama 2 Community |
| 261 | 249◄─►271 | vicuna-13b | 1142 | ±7 | 19,539 | LMSYS | Llama 2 Community |
| 262 | 249◄─►273 | qwen-14b-chat | 1139 | ±11 | 5,004 | Alibaba | Qianwen LICENSE |
| 263 | 251◄─►273 | palm-2 | 1137 | ±9 | 8,634 | Google | Proprietary |
| 264 | 251◄─►273 | codellama-34b-instruct | 1137 | ±9 | 7,417 | Meta | Llama 2 Community |
| 265 | 251◄─►273 | gemma-7b-it | 1136 | ±9 | 9,034 | Google | Gemma license |
| 266 | 255◄─►274 | zephyr-7b-beta | 1132 | ±9 | 11,220 | HuggingFace | MIT |
| 267 | 256◄─►274 | phi-3-mini-128k-instruct | 1131 | ±7 | 21,024 | Microsoft | MIT |
| 268 | 259◄─►274 | phi-3-mini-4k-instruct | 1129 | ±6 | 20,539 | Microsoft | MIT |
| 269 | 251◄─►278 | zephyr-7b-alpha | 1129 | ±16 | 1,803 | HuggingFace | MIT |
| 270 | 255◄─►277 | guanaco-33b | 1128 | ±12 | 2,955 | UW | Non-commercial |
| 271 | 261◄─►278 | stripedhyena-nous-7b | 1121 | ±11 | 5,214 | Together AI | Apache 2.0 |
| 272 | 256◄─►279 | codellama-70b-instruct | 1119 | ±18 | 1,151 | Meta | Llama 2 Community |
| 273 | 261◄─►278 | smollm2-1.7b-instruct | 1119 | ±14 | 2,244 | HuggingFace | Apache 2.0 |
| 274 | 266◄─►278 | vicuna-7b | 1116 | ±9 | 6,972 | LMSYS | Llama 2 Community |
| 275 | 269◄─►278 | gemma-1.1-2b-it | 1115 | ±8 | 11,035 | Google | Gemma license |
| 276 | 269◄─►278 | llama-3.2-1b-instruct | 1113 | ±8 | 8,166 | Meta | Llama 3.2 |
| 277 | 269◄─►279 | mistral-7b-instruct | 1111 | ±9 | 9,042 | Mistral | Apache 2.0 |
| 278 | 270◄─►279 | llama-2-7b-chat | 1109 | ±7 | 14,272 | Meta | Llama 2 Community |
| 279 | 276◄─►283 | gemma-2b-it | 1092 | ±12 | 4,817 | Google | Gemma license |
| 280 | 279◄─►281 | qwen1.5-4b-chat | 1091 | ±9 | 7,662 | Alibaba | Qianwen LICENSE |
| 281 | 279◄─►286 | olmo-7b-instruct | 1074 | ±11 | 6,412 | Allen AI | Apache-2.0 |
| 282 | 280◄─►286 | koala-13b | 1071 | ±10 | 6,998 | UC Berkeley | Non-commercial |
| 283 | 281◄─►286 | alpaca-13b | 1067 | ±11 | 5,828 | Stanford | Non-commercial |
| 284 | 280◄─►287 | gpt4all-13b-snoozy | 1066 | ±15 | 1,773 | Nomic AI | Non-commercial |
| 285 | 281◄─►287 | mpt-7b-chat | 1062 | ±12 | 3,977 | MosaicML | CC-BY-NC-SA-4.0 |
| 286 | 281◄─►287 | chatglm3-6b | 1057 | ±12 | 4,692 | Tsinghua | Apache-2.0 |
| 287 | 284◄─►289 | RWKV-4-Raven-14B | 1042 | ±11 | 4,898 | RWKV | Apache 2.0 |
| 288 | 287◄─►289 | chatglm2-6b | 1026 | ±14 | 2,683 | Tsinghua | Apache-2.0 |
| 289 | 287◄─►289 | oasst-pythia-12b | 1023 | ±11 | 6,343 | OpenAssistant | Apache 2.0 |
| 290 | 290◄─►293 | chatglm-6b | 996 | ±13 | 4,968 | Tsinghua | Non-commercial |
| 291 | 290◄─►293 | fastchat-t5-3b | 992 | ±12 | 4,270 | LMSYS | Apache 2.0 |
| 292 | 290◄─►293 | dolly-v2-12b | 980 | ±14 | 3,471 | Databricks | MIT |
| 293 | 290◄─►294 | llama-13b | 972 | ±16 | 2,441 | Meta | Non-commercial |
| 294 | 293◄─►294 | stablelm-tuned-alpha-7b | 953 | ±13 | 3,325 | Stability AI | CC-BY-NC-SA-4.0 |

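Notes

  • Rank: the ranking computed with the Bradley-Terry model, reflecting the model's overall performance in the arena. The Rank Spread column gives the best and worst rank consistent with the uncertainty in the score (for example, 2◄─►5), which helps gauge how competitive a model may be.

  • Model: the name of the large language model (LLM). Some model names may link to related pages.

  • Score: the Elo-style rating the model earned from user votes in the arena. It is a relative rating, so only differences between scores are meaningful; a higher score indicates better performance.

  • 95% CI (±): the half-width of the 95% confidence interval around the score (for example, ±6 means the interval extends 6 points on either side of the score). A narrower interval indicates a more stable and reliable rating.

  • Votes: the total number of votes the model has received in the arena. More votes generally mean a statistically more reliable score.

  • Organization: the organization or company that provides the model.

  • License: the type of license the model is released under, such as Proprietary, Apache 2.0, or MIT.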
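To make the Score and 95% CI columns concrete, here is a minimal Python sketch of how two rows might be compared. It assumes the conventional Elo logistic curve (base 10, divisor 400), which this page does not state explicitly, and it uses a simple interval-overlap check as a rough proxy for "not clearly separated". The function names `win_probability` and `intervals_overlap` are illustrative and not part of any Chatbot Arena tooling.

```python
def win_probability(score_a: float, score_b: float) -> float:
    """Expected probability that model A is preferred over model B.

    Assumption: the conventional Elo curve with base 10 and divisor 400.
    """
    return 1.0 / (1.0 + 10 ** ((score_b - score_a) / 400.0))


def intervals_overlap(score_a: float, ci_a: float,
                      score_b: float, ci_b: float) -> bool:
    """True when [score - ci, score + ci] of the two models intersect."""
    return abs(score_a - score_b) <= ci_a + ci_b


# Rows taken from the table: gemini-3-pro (1490 ±5),
# grok-4.1-thinking (1477 ±5), gemini-3-flash (1471 ±7).
p = win_probability(1490, 1477)
print(f"gemini-3-pro vs grok-4.1-thinking: {p:.3f}")
print("rank 1 and rank 2 intervals overlap:", intervals_overlap(1490, 5, 1477, 5))
print("rank 2 and rank 3 intervals overlap:", intervals_overlap(1477, 5, 1471, 7))
```

Under these assumptions, a 13-point gap corresponds to an expected preference rate of only about 52%, which is why small score differences between neighbouring rows should be read together with their confidence intervals.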

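Data Source and Update Frequency

The leaderboard data is fetched directly from the official Chatbot Arena (lmarena.ai) website by an automated script. The leaderboard is updated automatically every day by GitHub Actions.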
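As an illustration of how a daily job could stamp each run, the sketch below renders one instant in both UTC and Beijing time, in the same layout as the update timestamp at the top of this page. It is an assumption about how such a pipeline might work, not the project's actual script; `format_update_line` is a hypothetical helper, and a fixed UTC+8 offset is used because Beijing time has no daylight saving.

```python
from datetime import datetime, timedelta, timezone

# Fixed UTC+8 offset for Beijing time (assumption: no DST handling needed).
BEIJING = timezone(timedelta(hours=8), name="CST")


def format_update_line(moment: datetime) -> str:
    """Render a timezone-aware datetime as '<UTC time> UTC / <Beijing time> CST'."""
    fmt = "%Y-%m-%d %H:%M:%S"
    utc = moment.astimezone(timezone.utc)
    local = moment.astimezone(BEIJING)
    return f"{utc.strftime(fmt)} UTC / {local.strftime(fmt)} CST"


# Example with the instant shown on this page.
stamp = format_update_line(datetime(2026, 1, 12, 8, 9, 21, tzinfo=timezone.utc))
print(stamp)
```

In a scheduled run, the input would typically be `datetime.now(timezone.utc)` rather than a fixed value.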

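Disclaimer

This report is for reference only. Leaderboard data changes over time and is based on users' preference votes on Chatbot Arena during a specific period. The completeness and accuracy of the data depend on the upstream data source. Different models may be released under different license terms; always consult the model provider's official documentation before use.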
