模型榜单

这是一个基于 Chatbot Arena (lmarena.ai) 数据的排行榜,通过自动化流程生成。

数据更新时间: 2026-01-16 08:08:37 UTC / 2026-01-16 16:08:37 CST (北京时间)

排行榜

Rank
Rank Spread
模型
分数
95% 置信区间 (±)
票数
组织/公司
许可证

1

1◄─►1

gemini-3-pro

1489

±5

26,385

Google

Proprietary

2

2◄─►6

grok-4.1-thinking

1477

±5

26,505

xAI

Proprietary

3

2◄─►9

gemini-3-flash

1471

±6

11,599

Google

Proprietary

4

2◄─►9

Anthropicclaude-opus-4-5-20251101-thinking-32k

1468

±5

18,518

Anthropic

Proprietary

5

2◄─►9

Anthropicclaude-opus-4-5-20251101

1467

±5

19,770

Anthropic

Proprietary

6

3◄─►9

grok-4.1

1466

±5

30,490

xAI

Proprietary

7

2◄─►10

gemini-3-flash (thinking-minimal)

1464

±9

5,530

Google

Proprietary

8

3◄─►15

ernie-5.0-0110

1460Preliminary

±9

4,813

Baidu

Proprietary

9

3◄─►10

gpt-5.1-high

1460

±5

23,068

OpenAI

Proprietary

10

7◄─►19

Anthropicclaude-sonnet-4-5-20250929-thinking-32k

1452

±4

37,043

Anthropic

Proprietary

11

9◄─►19

gemini-2.5-pro

1450

±3

86,296

Google

Proprietary

12

9◄─►19

Anthropicclaude-sonnet-4-5-20250929

1450

±4

33,416

Anthropic

Proprietary

13

9◄─►19

Anthropicclaude-opus-4-1-20250805-thinking-16k

1449

±4

50,088

Anthropic

Proprietary

14

9◄─►21

ernie-5.0-preview-1203

1447

±7

8,397

Baidu

Proprietary

15

10◄─►20

Anthropicclaude-opus-4-1-20250805

1445

±3

66,158

Anthropic

Proprietary

16

10◄─►22

gpt-4.5-preview-2025-02-27

1444

±6

14,549

OpenAI

Proprietary

17

9◄─►25

gpt-5.2-high

1444

±8

7,230

OpenAI

Proprietary

18

10◄─►24

glm-4.7

1443Preliminary

±7

8,258

Z.ai

MIT

19

14◄─►22

chatgpt-4o-latest-20250326

1442

±3

73,467

OpenAI

Proprietary

20

10◄─►29

gpt-5.2

1441

±10

3,723

OpenAI

Proprietary

21

15◄─►27

gpt-5.1

1436

±5

24,788

OpenAI

Proprietary

22

16◄─►29

gpt-5-high

1435

±5

32,700

OpenAI

Proprietary

23

18◄─►32

qwen3-max-preview

1434

±5

27,923

Alibaba

Proprietary

24

18◄─►32

o3-2025-04-16

1433

±4

61,465

OpenAI

Proprietary

25

19◄─►38

grok-4-1-fast-reasoning

1430

±5

19,706

xAI

Proprietary

26

20◄─►40

MoonshotAIkimi-k2-thinking-turbo

1429

±5

24,742

Moonshot

Modified MIT

27

20◄─►46

ernie-5.0-preview-1103

1428

±7

9,066

Baidu

Proprietary

28

21◄─►45

gpt-5-chat

1426

±4

31,898

OpenAI

Proprietary

29

21◄─►47

qwen3-max-2025-09-23

1424

±6

9,234

Alibaba

Proprietary

30

25◄─►46

glm-4.6

1424

±4

32,342

Z.ai

MIT

31

25◄─►46

Anthropicclaude-opus-4-20250514-thinking-16k

1424

±4

38,036

Anthropic

Proprietary

32

23◄─►49

deepseek-v3.2-exp

1423

±7

11,826

DeepSeek

MIT

33

23◄─►49

deepseek-v3.2-exp-thinking

1423

±7

9,015

DeepSeek

MIT

34

25◄─►46

qwen3-235b-a22b-instruct-2507

1423

±3

60,768

Alibaba

Apache 2.0

35

23◄─►51

grok-4-fast-chat

1422

±8

7,002

xAI

Proprietary

36

25◄─►51

deepseek-v3.2-thinking

1420

±6

14,504

DeepSeek

MIT

37

27◄─►53

deepseek-r1-0528

1418

±6

19,309

DeepSeek

MIT

38

26◄─►54

MoonshotAIkimi-k2-0905-preview

1418

±6

11,985

Moonshot

Modified MIT

39

27◄─►54

deepseek-v3.2

1418

±6

19,082

DeepSeek

MIT

40

27◄─►53

MoonshotAIkimi-k2-0711-preview

1417

±5

28,682

Moonshot

Modified MIT

41

25◄─►56

ernie-5.0-preview-1022

1417

±9

4,647

Baidu

Proprietary

42

27◄─►54

deepseek-v3.1

1417

±6

15,298

DeepSeek

MIT

43

27◄─►54

deepseek-v3.1-thinking

1417

±7

11,990

DeepSeek

MIT

44

25◄─►59

deepseek-v3.1-terminus-thinking

1416

±10

3,559

DeepSeek

MIT

45

26◄─►59

deepseek-v3.1-terminus

1416

±10

3,768

DeepSeek

MIT

46

28◄─►56

qwen3-vl-235b-a22b-instruct

1415

±6

11,704

Alibaba

Apache 2.0

47

33◄─►54

Anthropicclaude-opus-4-20250514

1414

±4

45,618

Anthropic

Proprietary

48

33◄─►54

gpt-4.1-2025-04-14

1413

±4

52,289

OpenAI

Proprietary

49

32◄─►56

mistral-large-3

1413

±6

15,465

Mistral

Apache 2.0

50

35◄─►56

mistral-medium-2508

1412

±4

54,571

Mistral

Proprietary

51

35◄─►58

grok-3-preview-02-24

1411

±4

34,000

xAI

Proprietary

52

37◄─►59

grok-4-0709

1410

±4

42,260

xAI

Proprietary

53

37◄─►64

glm-4.5

1409

±5

24,816

Z.ai

MIT

54

39◄─►59

gemini-2.5-flash

1409

±3

85,645

Google

Proprietary

55

45◄─►68

gemini-2.5-flash-preview-09-2025

1404

±4

33,008

Google

Proprietary

56

45◄─►68

grok-4-fast-reasoning

1404

±5

18,716

xAI

Proprietary

57

49◄─►68

Anthropicclaude-haiku-4-5-20251001

1403

±4

35,608

Anthropic

Proprietary

58

49◄─►68

o1-2024-12-17

1402

±4

27,822

OpenAI

Proprietary

59

54◄─►70

qwen3-235b-a22b-no-thinking

1401

±4

39,590

Alibaba

Apache 2.0

60

54◄─►70

qwen3-next-80b-a3b-instruct

1401

±5

22,912

Alibaba

Apache 2.0

61

54◄─►70

Anthropicclaude-sonnet-4-20250514-thinking-32k

1401

±4

36,273

Anthropic

Proprietary

62

54◄─►76

longcat-flash-chat

1399

±6

11,577

Meituan

MIT

63

55◄─►77

qwen3-235b-a22b-thinking-2507

1398

±6

9,262

Alibaba

Apache 2.0

64

55◄─►76

deepseek-r1

1398

±5

18,537

DeepSeek

MIT

65

50◄─►82

amazon-nova-experimental-chat-12-10

1396

±10

3,298

Amazon

Proprietary

66

55◄─►80

qwen3-vl-235b-a22b-thinking

1395

±7

7,981

Alibaba

Apache 2.0

67

55◄─►81

mimo-v2-flash (non-thinking)

1395

±7

7,821

Xiaomi

MIT

68

59◄─►78

deepseek-v3-0324

1394

±4

46,744

DeepSeek

MIT

69

54◄─►83

Tencenthunyuan-vision-1.5-thinking

1393

±12

2,225

Tencent

Proprietary

70

59◄─►82

mai-1-preview

1391

±5

18,174

Microsoft AI

Proprietary

71

62◄─►81

o4-mini-2025-04-16

1391

±4

46,744

OpenAI

Proprietary

72

62◄─►82

gpt-5-mini-high

1391

±5

27,240

OpenAI

Proprietary

73

62◄─►82

Anthropicclaude-sonnet-4-20250514

1390

±4

41,697

Anthropic

Proprietary

74

62◄─►82

Anthropicclaude-3-7-sonnet-20250219-thinking-32k

1389

±4

39,925

Anthropic

Proprietary

75

62◄─►82

o1-preview

1389

±5

31,120

OpenAI

Proprietary

76

62◄─►85

Tencenthunyuan-t1-20250711

1387

±9

4,806

Tencent

Proprietary

77

64◄─►83

qwen3-coder-480b-a35b-instruct

1386

±5

26,667

Alibaba

Apache 2.0

78

66◄─►83

mistral-medium-2505

1384

±5

34,594

Mistral

Proprietary

79

65◄─►88

Minimaxminimax-m2.1-preview

1383Preliminary

±7

7,332

MiniMax

MIT

80

67◄─►85

qwen3-30b-a3b-instruct-2507

1383

±5

24,125

Alibaba

Apache 2.0

81

66◄─►88

Tencenthunyuan-turbos-20250416

1382

±6

11,063

Tencent

Proprietary

82

69◄─►85

gpt-4.1-mini-2025-04-14

1382

±4

40,576

OpenAI

Proprietary

83

75◄─►89

gemini-2.5-flash-lite-preview-09-2025-no-thinking

1379

±4

38,666

Google

Proprietary

84

78◄─►91

gemini-2.5-flash-lite-preview-06-17-thinking

1375

±4

33,970

Google

Proprietary

85

78◄─►92

qwen3-235b-a22b

1374

±5

27,185

Alibaba

Apache 2.0

86

81◄─►92

qwen2.5-max

1374

±4

33,301

Alibaba

Proprietary

87

81◄─►91

Anthropicclaude-3-5-sonnet-20241022

1373

±3

89,495

Anthropic

Proprietary

88

81◄─►94

Anthropicclaude-3-7-sonnet-20250219

1372

±4

44,478

Anthropic

Proprietary

89

83◄─►95

glm-4.5-air

1371

±4

31,476

Z.ai

MIT

90

84◄─►98

qwen3-next-80b-a3b-thinking

1369

±6

13,872

Alibaba

Apache 2.0

91

84◄─►98

Minimaxminimax-m1

1367

±4

36,915

MiniMax

Apache 2.0

92

88◄─►98

gemma-3-27b-it

1365

±4

48,740

Google

Gemma

93

88◄─►102

o3-mini-high

1364

±5

18,584

OpenAI

Proprietary

94

86◄─►108

amazon-nova-experimental-chat-11-10

1363

±7

8,278

Amazon

Proprietary

95

89◄─►106

grok-3-mini-high

1362

±5

17,555

xAI

Proprietary

96

90◄─►106

gemini-2.0-flash-001

1361

±4

44,838

Google

Proprietary

97

90◄─►111

deepseek-v3

1359

±5

21,788

DeepSeek

DeepSeek

98

93◄─►117

grok-3-mini-beta

1356

±5

23,801

xAI

Proprietary

99

93◄─►118

mistral-small-2506

1356

±5

18,377

Mistral

Apache 2.0

100

90◄─►120

intellect-3

1355

±8

5,341

Prime Intellect

MIT

101

94◄─►120

gpt-oss-120b

1354

±4

31,035

OpenAI

Apache 2.0

102

94◄─►120

gemini-2.0-flash-lite-preview-02-05

1353

±4

24,951

Google

Proprietary

103

93◄─►121

glm-4.5v

1353

±8

4,984

Z.ai

MIT

104

96◄─►120

Coherecommand-a-03-2025

1353

±3

57,523

Cohere

CC-BY-NC-4.0

105

97◄─►120

gemini-1.5-pro-002

1352

±3

55,607

Google

Proprietary

106

93◄─►132

Tencenthunyuan-turbos-20250226

1349

±12

2,226

Tencent

Proprietary

107

98◄─►121

o3-mini

1348

±3

58,670

OpenAI

Proprietary

108

94◄─►133

Nvidiallama-3.1-nemotron-ultra-253b-v1

1347

±12

2,546

Nvidia

Nvidia Open Model

109

94◄─►133

amazon-nova-experimental-chat-10-09

1347

±11

2,904

Amazon

Proprietary

110

98◄─►124

amazon-nova-experimental-chat-10-20

1347

±6

11,471

Amazon

Proprietary

111

96◄─►132

qwen3-32b

1346

±9

3,932

Alibaba

Apache 2.0

112

97◄─►131

Minimaxminimax-m2

1346

±8

6,775

MiniMax

Apache 2.0

113

98◄─►129

ling-flash-2.0

1346

±7

7,054

Ant Group

MIT

114

98◄─►130

Stepfunstep-3

1346

±7

6,603

StepFun

Apache 2.0

115

100◄─►122

gpt-4o-2024-05-13

1346

±3

112,863

OpenAI

Proprietary

116

97◄─►132

qwen-plus-0125

1346

±8

5,823

Alibaba

Proprietary

117

105◄─►127

Anthropicclaude-3-5-sonnet-20240620

1343

±3

82,417

Anthropic

Proprietary

118

98◄─►133

glm-4-plus-0111

1343

±8

5,760

Zhipu

Proprietary

119

99◄─►136

gemma-3-12b-it

1342

±10

3,829

Google

Gemma

120

98◄─►138

Tencenthunyuan-turbo-0110

1340

±12

2,295

Tencent

Proprietary

121

100◄─►137

Nvidianvidia-llama-3.3-nemotron-super-49b-v1.5

1340

±10

3,430

Nvidia

Nvidia Open

122

107◄─►137

gpt-5-nano-high

1338

±7

8,401

OpenAI

Proprietary

123

109◄─►134

o1-mini

1337

±4

51,986

OpenAI

Proprietary

124

110◄─►134

Metallama-3.1-405b-instruct-bf16

1336

±4

41,392

Meta

Llama 3.1 Community

125

108◄─►137

nova-2-lite

1336

±6

12,303

Amazon

Proprietary

126

109◄─►137

qwq-32b

1336

±4

26,138

Alibaba

Apache 2.0

127

110◄─►137

gpt-4o-2024-08-06

1336

±4

45,498

OpenAI

Proprietary

128

109◄─►137

gemini-advanced-0514

1335

±5

50,142

Google

Proprietary

129

112◄─►136

grok-2-2024-08-13

1335

±4

63,495

xAI

Proprietary

130

113◄─►137

Metallama-3.1-405b-instruct-fp8

1334

±4

59,655

Meta

Llama 3.1 Community

131

108◄─►145

Stepfunstep-2-16k-exp-202412

1334

±9

4,829

StepFun

Proprietary

132

119◄─►149

01.AIyi-lightning

1329

±5

27,340

01 AI

Proprietary

133

121◄─►149

Metallama-4-maverick-17b-128e-instruct

1328

±4

41,190

Meta

Llama 4

134

121◄─►152

qwen3-30b-a3b

1328

±5

27,458

Alibaba

Apache 2.0

135

111◄─►159

Nvidiallama-3.3-nemotron-49b-super-v1

1327

±12

2,230

Nvidia

Nvidia

136

116◄─►159

Tencenthunyuan-large-2025-02-10

1327

±10

3,738

Tencent

Proprietary

137

131◄─►155

gpt-4-turbo-2024-04-09

1325

±4

98,130

OpenAI

Proprietary

138

131◄─►154

Anthropicclaude-3-5-haiku-20241022

1324

±3

71,227

Anthropic

Proprietary

139

131◄─►155

gemini-1.5-pro-001

1323

±4

79,132

Google

Proprietary

140

123◄─►159

deepseek-v2.5-1210

1323

±8

6,793

DeepSeek

DeepSeek

141

131◄─►157

Metallama-4-scout-17b-16e-instruct

1323

±5

31,288

Meta

Llama

142

131◄─►155

Anthropicclaude-3-opus-20240229

1323

±3

194,904

Anthropic

Proprietary

143

130◄─►160

gpt-4.1-nano-2025-04-14

1322

±8

6,107

OpenAI

Proprietary

144

131◄─►160

Stepfunstep-1o-turbo-202506

1321

±7

9,707

StepFun

Proprietary

145

131◄─►160

ring-flash-2.0

1320

±7

7,209

Ant Group

MIT

146

134◄─►159

Metallama-3.3-70b-instruct

1320

±3

55,602

Meta

Llama-3.3

147

132◄─►159

glm-4-plus

1319

±5

26,134

Zhipu AI

Proprietary

148

132◄─►160

gemma-3n-e4b-it

1319

±5

23,379

Google

Gemma

149

132◄─►161

qwen-max-0919

1318

±6

16,479

Alibaba

Qwen

150

136◄─►160

gpt-4o-mini-2024-07-18

1318

±3

68,801

OpenAI

Proprietary

151

132◄─►166

gpt-oss-20b

1318

±6

10,816

OpenAI

Apache 2.0

152

134◄─►166

Nvidianvidia-nemotron-3-nano-30b-a3b-bf16

1317

±7

10,362

Nvidia

NVIDIA Open Model

153

135◄─►168

qwen2.5-plus-1127

1315

±6

10,179

Alibaba

Proprietary

154

139◄─►166

mistral-large-2407

1315

±4

45,460

Mistral

Mistral Research

155

139◄─►166

athene-v2-chat

1314

±4

24,746

NexusFlow

NexusFlow

156

140◄─►166

gpt-4-1106-preview

1314

±4

100,107

OpenAI

Proprietary

157

140◄─►166

gpt-4-0125-preview

1314

±4

93,439

OpenAI

Proprietary

158

135◄─►171

Tencenthunyuan-standard-2025-02-10

1312

±10

3,905

Tencent

Proprietary

159

145◄─►170

gemini-1.5-flash-002

1310

±4

34,909

Google

Proprietary

160

134◄─►176

mercury

1309

±14

1,929

Inception AI

Proprietary

161

151◄─►171

grok-2-mini-2024-08-13

1308

±4

52,574

xAI

Proprietary

162

151◄─►171

deepseek-v2.5

1307

±5

24,574

DeepSeek

DeepSeek

163

151◄─►171

athene-70b-0725

1306

±6

19,622

NexusFlow

CC-BY-NC-4.0

164

151◄─►171

magistral-medium-2506

1306

±6

12,083

Mistral

Proprietary

165

157◄─►171

mistral-large-2411

1305

±4

28,081

Mistral

MRL

166

150◄─►176

olmo-3-32b-think

1305

±8

5,919

Allen AI

Apache 2.0

167

157◄─►171

mistral-small-3.1-24b-instruct-2503

1305

±4

34,159

Mistral

Apache 2.0

168

151◄─►178

gemma-3-4b-it

1303

±9

4,177

Google

Gemma

169

158◄─►171

qwen2.5-72b-instruct

1303

±4

39,409

Alibaba

Qwen

170

158◄─►180

Nvidiallama-3.1-nemotron-70b-instruct

1299

±8

7,136

Nvidia

Llama 3.1

171

159◄─►181

Tencenthunyuan-large-vision

1296

±9

5,608

Tencent

Proprietary

172

167◄─►181

Metallama-3.1-70b-instruct

1294

±4

55,234

Meta

Llama 3.1 Community

173

169◄─►183

amazon-nova-pro-v1.0

1290

±5

24,753

Amazon

Proprietary

174

167◄─►185

jamba-1.5-large

1289

±7

8,659

AI21 Labs

Jamba Open

175

170◄─►183

gemma-2-27b-it

1289

±3

75,764

Google

Gemma license

176

167◄─►188

ibm-granite-h-small

1289

±8

5,688

IBM

Apache 2.0

177

169◄─►185

reka-core-20240904

1288

±7

7,309

Reka AI

Proprietary

178

170◄─►185

gpt-4-0314

1288

±5

54,167

OpenAI

Proprietary

179

167◄─►191

Nvidiallama-3.1-nemotron-51b-instruct

1287

±10

3,749

Nvidia

Llama 3.1

180

167◄─►191

llama-3.1-tulu-3-70b

1287

±10

2,846

Ai2

Llama 3.1

181

171◄─►185

gemini-1.5-flash-001

1286

±4

62,823

Google

Proprietary

182

173◄─►191

Anthropicclaude-3-sonnet-20240229

1282

±4

109,289

Anthropic

Proprietary

183

173◄─►191

gemma-2-9b-it-simpo

1280

±7

10,069

Princeton

MIT

184

175◄─►191

Nvidianemotron-4-340b-instruct

1278

±5

19,661

Nvidia

NVIDIA Open Model

185

175◄─►193

Coherecommand-r-plus-08-2024

1277

±7

9,869

Cohere

CC-BY-NC-4.0

186

179◄─►191

Metallama-3-70b-instruct

1277

±4

156,880

Meta

Llama 3 Community

187

179◄─►191

gpt-4-0613

1276

±4

88,721

OpenAI

Proprietary

188

180◄─►194

mistral-small-24b-instruct-2501

1274

±6

14,677

Mistral

Apache 2.0

189

179◄─►196

glm-4-0520

1274

±7

9,788

Zhipu AI

Proprietary

190

180◄─►196

reka-flash-20240904

1273

±7

7,537

Reka AI

Proprietary

191

180◄─►200

qwen2.5-coder-32b-instruct

1271

±8

5,430

Alibaba

Apache 2.0

192

186◄─►200

Coherec4ai-aya-expanse-32b

1268

±5

27,123

Cohere

CC-BY-NC-4.0

193

188◄─►200

gemma-2-9b-it

1266

±4

54,615

Google

Gemma license

194

187◄─►201

deepseek-coder-v2

1265

±6

15,147

DeepSeek

DeepSeek License

195

189◄─►201

Coherecommand-r-plus

1263

±4

77,556

Cohere

CC-BY-NC-4.0

196

189◄─►201

qwen2-72b-instruct

1263

±5

37,325

Alibaba

Qianwen LICENSE

197

191◄─►201

Anthropicclaude-3-haiku-20240307

1262

±4

117,705

Anthropic

Proprietary

198

191◄─►202

amazon-nova-lite-v1.0

1261

±5

19,376

Amazon

Proprietary

199

191◄─►202

gemini-1.5-flash-8b-001

1259

±4

35,556

Google

Proprietary

200

194◄─►202

Azurephi-4

1256

±5

24,126

Microsoft

MIT

201

191◄─►208

olmo-2-0325-32b-instruct

1253

±11

3,335

Allen AI

Apache-2.0

202

198◄─►207

Coherecommand-r-08-2024

1251

±7

10,141

Cohere

CC-BY-NC-4.0

203

201◄─►211

mistral-large-2402

1244

±5

62,437

Mistral

Proprietary

204

201◄─►211

amazon-nova-micro-v1.0

1241

±5

19,355

Amazon

Proprietary

205

201◄─►215

jamba-1.5-mini

1239

±7

8,854

AI21 Labs

Jamba Open

206

201◄─►219

ministral-8b-2410

1237

±9

4,780

Mistral

MRL

207

202◄─►219

gemini-pro-dev-api

1236

±7

18,352

Google

Proprietary

208

203◄─►219

qwen1.5-110b-chat

1235

±6

26,191

Alibaba

Qianwen LICENSE

209

203◄─►220

reka-flash-21b-20240226-online

1234

±7

15,451

Reka AI

Proprietary

210

203◄─►219

qwen1.5-72b-chat

1234

±5

39,296

Alibaba

Qianwen LICENSE

211

201◄─►221

Tencenthunyuan-standard-256k

1233

±12

2,729

Tencent

Proprietary

212

205◄─►220

mixtral-8x22b-instruct-v0.1

1231

±5

51,417

Mistral

Apache 2.0

213

205◄─►221

Coherecommand-r

1228

±5

54,038

Cohere

CC-BY-NC-4.0

214

205◄─►221

reka-flash-21b-20240226

1228

±6

24,806

Reka AI

Proprietary

215

206◄─►221

gpt-3.5-turbo-0125

1225

±5

66,191

OpenAI

Proprietary

216

206◄─►223

mistral-medium

1224

±5

34,552

Mistral

Proprietary

217

210◄─►222

Metallama-3-8b-instruct

1224

±4

104,636

Meta

Llama 3 Community

218

206◄─►223

Coherec4ai-aya-expanse-8b

1223

±7

9,827

Cohere

CC-BY-NC-4.0

219

205◄─►226

gemini-pro

1223

±12

6,390

Google

Proprietary

220

206◄─►226

llama-3.1-tulu-3-8b

1221

±11

2,895

Ai2

Llama 3.1

221

217◄─►226

01.AIyi-1.5-34b-chat

1214

±5

24,142

01 AI

Apache-2.0

222

212◄─►227

HuggingFacezephyr-orpo-141b-A35b-v0.1

1214

±11

4,653

HuggingFace

Apache 2.0

223

219◄─►226

Metallama-3.1-8b-instruct

1212

±4

49,605

Meta

Llama 3.1 Community

224

215◄─►232

granite-3.1-8b-instruct

1210

±11

3,092

IBM

Apache 2.0

225

219◄─►232

qwen1.5-32b-chat

1205

±6

21,744

Alibaba

Qianwen LICENSE

226

219◄─►234

gpt-3.5-turbo-1106

1204

±9

16,616

OpenAI

Proprietary

227

223◄─►234

Azurephi-3-medium-4k-instruct

1199

±5

25,055

Microsoft

MIT

228

224◄─►234

gemma-2-2b-it

1199

±4

46,618

Google

Gemma license

229

224◄─►234

mixtral-8x7b-instruct-v0.1

1198

±4

73,505

Mistral

Apache 2.0

230

224◄─►239

dbrx-instruct-preview

1196

±6

32,196

Databricks

DBRX LICENSE

231

224◄─►243

qwen1.5-14b-chat

1192

±7

17,841

Alibaba

Qianwen LICENSE

232

224◄─►243

InternLMinternlm2_5-20b-chat

1192

±7

9,902

InternLM

Other

233

226◄─►249

Azurewizardlm-70b

1186

±9

8,214

Microsoft

Llama 2 Community

234

226◄─►250

deepseek-llm-67b-chat

1186

±12

4,933

DeepSeek

DeepSeek License

235

230◄─►246

01.AIyi-34b-chat

1185

±7

15,483

01 AI

Yi License

236

230◄─►250

OpenChatopenchat-3.5

1184

±10

7,967

OpenChat

Apache-2.0

237

230◄─►249

OpenChatopenchat-3.5-0106

1184

±8

12,636

OpenChat

Apache-2.0

238

230◄─►250

granite-3.0-8b-instruct

1183

±9

6,643

IBM

Apache 2.0

239

231◄─►250

gemma-1.1-7b-it

1181

±6

23,893

Google

Gemma license

240

231◄─►250

Snowflakesnowflake-arctic-instruct

1181

±6

32,836

Snowflake

Apache 2.0

241

230◄─►252

granite-3.1-2b-instruct

1180

±11

3,191

IBM

Apache 2.0

242

231◄─►250

tulu-2-dpo-70b

1179

±10

6,534

AllenAI/UW

AI2 ImpACT Low-risk

243

231◄─►254

openhermes-2.5-mistral-7b

1176

±10

5,006

NousResearch

Apache-2.0

244

233◄─►253

vicuna-33b

1174

±6

22,479

LMSYS

Non-commercial

245

233◄─►256

starling-lm-7b-beta

1172

±7

16,057

Nexusflow

Apache-2.0

246

233◄─►254

Azurephi-3-small-8k-instruct

1172

±6

17,763

Microsoft

MIT

247

234◄─►254

Metallama-2-70b-chat

1172

±5

38,491

Meta

Llama 2 Community

248

234◄─►257

starling-lm-7b-alpha

1169

±8

10,224

UC Berkeley

CC-BY-NC-4.0

249

236◄─►257

Metallama-3.2-3b-instruct

1168

±8

7,936

Meta

Llama 3.2

250

234◄─►260

nous-hermes-2-mixtral-8x7b-dpo

1166

±12

3,776

NousResearch

Apache-2.0

251

242◄─►267

qwq-32b-preview

1158

±12

3,233

Alibaba

Apache 2.0

252

247◄─►264

granite-3.0-2b-instruct

1157

±8

6,837

IBM

Apache 2.0

253

242◄─►267

Nvidiallama2-70b-steerlm-chat

1157

±13

3,584

Nvidia

Llama 2 Community

254

244◄─►270

solar-10.7b-instruct-v1.0

1154

±13

4,155

Upstage AI

CC-BY-NC-4.0

255

243◄─►272

dolphin-2.2.1-mistral-7b

1153

±15

1,679

Cognitive Computations

Apache-2.0

256

248◄─►270

mpt-30b-chat

1151

±12

2,571

MosaicML

CC-BY-NC-SA-4.0

257

250◄─►267

mistral-7b-instruct-v0.2

1151

±7

19,402

Mistral

Apache-2.0

258

250◄─►269

Azurewizardlm-13b

1150

±9

7,046

Microsoft

Llama 2 Community

259

247◄─►274

falcon-180b-chat

1148

±17

1,295

TII

Falcon-180B TII License

260

250◄─►273

qwen1.5-7b-chat

1145

±10

4,735

Alibaba

Qianwen LICENSE

261

251◄─►272

Azurephi-3-mini-4k-instruct-june-2024

1144

±6

12,296

Microsoft

MIT

262

251◄─►273

Metallama-2-13b-chat

1142

±7

19,171

Meta

Llama 2 Community

263

251◄─►273

vicuna-13b

1142

±7

19,366

LMSYS

Llama 2 Community

264

251◄─►275

qwen-14b-chat

1140

±11

4,964

Alibaba

Qianwen LICENSE

265

252◄─►275

palm-2

1139

±9

8,554

Google

Proprietary

266

252◄─►275

Metacodellama-34b-instruct

1138

±9

7,363

Meta

Llama 2 Community

267

252◄─►275

gemma-7b-it

1137

±10

8,925

Google

Gemma license

268

255◄─►276

HuggingFacezephyr-7b-beta

1132

±9

11,116

HuggingFace

MIT

269

258◄─►276

Azurephi-3-mini-128k-instruct

1130

±7

20,691

Microsoft

MIT

270

260◄─►276

Azurephi-3-mini-4k-instruct

1129

±6

20,115

Microsoft

MIT

271

256◄─►279

guanaco-33b

1129

±12

2,921

UW

Non-commercial

272

255◄─►280

HuggingFacezephyr-7b-alpha

1128

±16

1,785

HuggingFace

MIT

273

263◄─►280

stripedhyena-nous-7b

1122

±11

5,184

Together AI

Apache 2.0

274

258◄─►281

Metacodellama-70b-instruct

1120

±18

1,143

Meta

Llama 2 Community

275

268◄─►280

vicuna-7b

1116

±9

6,923

LMSYS

Llama 2 Community

276

264◄─►281

HuggingFacesmollm2-1.7b-instruct

1115

±14

2,201

HuggingFace

Apache 2.0

277

271◄─►280

gemma-1.1-2b-it

1115

±8

10,853

Google

Gemma license

278

271◄─►280

Metallama-3.2-1b-instruct

1112

±8

8,045

Meta

Llama 3.2

279

271◄─►281

mistral-7b-instruct

1111

±9

8,977

Mistral

Apache 2.0

280

272◄─►281

Metallama-2-7b-chat

1109

±7

14,148

Meta

Llama 2 Community

281

277◄─►285

gemma-2b-it

1093

±12

4,779

Google

Gemma license

282

281◄─►284

qwen1.5-4b-chat

1092

±9

7,598

Alibaba

Qianwen LICENSE

283

281◄─►288

olmo-7b-instruct

1076

±11

6,329

Allen AI

Apache-2.0

284

282◄─►288

koala-13b

1072

±10

6,964

UC Berkeley

Non-commercial

285

283◄─►288

alpaca-13b

1069

±12

5,745

Stanford

Non-commercial

286

281◄─►289

gpt4all-13b-snoozy

1067

±15

1,743

Nomic AI

Non-commercial

287

283◄─►289

mpt-7b-chat

1063

±12

3,925

MosaicML

CC-BY-NC-SA-4.0

288

283◄─►289

chatglm3-6b

1057

±12

4,658

Tsinghua

Apache-2.0

289

286◄─►291

RWKVRWKV-4-Raven-14B

1043

±11

4,845

RWKV

Apache 2.0

290

289◄─►291

chatglm2-6b

1026

±14

2,657

Tsinghua

Apache-2.0

291

289◄─►291

oasst-pythia-12b

1024

±11

6,311

OpenAssistant

Apache 2.0

292

292◄─►295

chatglm-6b

997

±13

4,914

Tsinghua

Non-commercial

293

292◄─►295

fastchat-t5-3b

993

±12

4,203

LMSYS

Apache 2.0

294

292◄─►295

dolly-v2-12b

982

±14

3,412

Databricks

MIT

295

292◄─►296

Metallama-13b

974

±16

2,391

Meta

Non-commercial

296

295◄─►296

Stabilitystablelm-tuned-alpha-7b

954

±13

3,287

Stability AI

CC-BY-NC-SA-4.0

说明

  • 排名 (UB):基于 Bradley-Terry 模型计算的排名。此排名反映了模型在竞技场中的综合表现,并提供了其 Elo 分数的 上界 估计,帮助理解模型的潜在竞争力。

  • 模型:大型语言模型 (LLM) 的名称。部分模型名称可能已嵌入相关链接。

  • 分数:模型在竞技场中通过用户投票获得的 Elo 评分。Elo 评分是一种相对排名系统,分数越高表示模型表现越好。

  • 95% 置信区间 (±):模型 Elo 评分的95%置信区间(例如:±6)。这个区间越小,表示模型的评分越稳定和可靠。

  • 票数:该模型在竞技场中收到的总投票数量。投票数越多,通常意味着其评分的统计可靠性越高。

  • 组织/公司:提供该模型的组织或公司。

  • 许可证:模型的许可协议类型,例如专有 (Proprietary)、Apache 2.0、MIT 等。

数据来源与更新频率

本排行榜数据由自动化脚本直接从 1 2arrow-up-right 官方网站获取。此排行榜由 GitHub Actions 每天自动更新。

免责声明

本报告仅供参考。排行榜数据是动态变化的,并基于特定时间段内用户在 Chatbot Arena 上的偏好投票。数据的完整性和准确性取决于上游数据源。不同模型可能采用不同的许可协议,使用时请务必参考模型提供商的官方说明。

最后更新于

这有帮助吗?