Model Rankings

This is a leaderboard based on Chatbot Arena (lmarena.ai) data, generated through an automated process.

Data update time: 2026-02-06 08:15:50 UTC / 2026-02-06 16:15:50 CST (Beijing time)

Leaderboard

Rank
Rank Spread
Model
Score
95% Confidence Interval (±)
Votes
Organization/Company
License

1

1◄─►1

gemini-3-pro

1487

±5

32,205

Google

Proprietary

2

2◄─►5

grok-4.1-thinking

1475

±5

32,194

xAI

Proprietary

3

2◄─►7

gemini-3-flash

1471

±5

22,683

Google

Proprietary

4

2◄─►8

Anthropicclaude-opus-4-5-20251101-thinking-32k

1468

±5

23,990

Anthropic

Proprietary

5

2◄─►8

Anthropicclaude-opus-4-5-20251101

1466

±5

28,752

Anthropic

Proprietary

6

3◄─►8

grok-4.1

1466

±4

36,363

xAI

Proprietary

7

3◄─►10

gemini-3-flash (thinking-minimal)

1463

±6

13,989

Google

Proprietary

8

4◄─►12

gpt-5.1-high

1459

±5

28,296

OpenAI

Proprietary

9

7◄─►21

ernie-5.0-0110

1453Preliminary

±7

8,274

Baidu

Proprietary

10

8◄─►19

Anthropicclaude-sonnet-4-5-20250929-thinking-32k

1450

±4

42,538

Anthropic

Proprietary

11

9◄─►21

Anthropicclaude-sonnet-4-5-20250929

1450

±4

40,121

Anthropic

Proprietary

12

7◄─►21

MoonshotAIkimi-k2.5-thinking

1450

±9

4,901

Moonshot

Modified MIT

13

9◄─►19

gemini-2.5-pro

1450

±3

91,734

Google

Proprietary

14

8◄─►21

ernie-5.0-preview-1203

1449

±7

9,786

Baidu

Proprietary

15

9◄─►21

Anthropicclaude-opus-4-1-20250805-thinking-16k

1448

±4

49,975

Anthropic

Proprietary

16

9◄─►21

Anthropicclaude-opus-4-1-20250805

1445

±3

71,681

Anthropic

Proprietary

17

9◄─►23

gpt-4.5-preview-2025-02-27

1444

±6

14,549

OpenAI

Proprietary

18

11◄─►22

chatgpt-4o-latest-20250326

1443

±3

78,991

OpenAI

Proprietary

19

9◄─►26

glm-4.7

1440

±6

12,027

Z.ai

MIT

20

9◄─►27

gpt-5.2

1440

±7

9,387

OpenAI

Proprietary

21

11◄─►27

gpt-5.2-high

1440

±7

12,869

OpenAI

Proprietary

22

17◄─►28

gpt-5.1

1435

±5

30,355

OpenAI

Proprietary

23

18◄─►29

gpt-5-high

1435

±5

32,651

OpenAI

Proprietary

24

19◄─►32

qwen3-max-preview

1434

±5

27,857

Alibaba

Proprietary

25

19◄─►32

o3-2025-04-16

1433

±4

61,365

OpenAI

Proprietary

26

19◄─►37

grok-4-1-fast-reasoning

1430

±5

25,264

xAI

Proprietary

27

20◄─►39

MoonshotAIkimi-k2-thinking-turbo

1429

±4

30,011

Moonshot

Modified MIT

28

23◄─►45

gpt-5-chat

1426

±4

31,849

OpenAI

Proprietary

29

22◄─►47

qwen3-max-2025-09-23

1425

±6

9,223

Alibaba

Proprietary

30

26◄─►46

glm-4.6

1425

±4

35,373

Z.ai

MIT

31

26◄─►46

Anthropicclaude-opus-4-20250514-thinking-16k

1424

±4

37,990

Anthropic

Proprietary

32

24◄─►49

deepseek-v3.2-exp

1423

±6

11,774

DeepSeek

MIT

33

24◄─►49

deepseek-v3.2-exp-thinking

1423

±7

9,000

DeepSeek

MIT

34

27◄─►47

qwen3-235b-a22b-instruct-2507

1422

±3

66,045

Alibaba

Apache 2.0

35

24◄─►51

grok-4-fast-chat

1422

±8

6,992

xAI

Proprietary

36

27◄─►51

deepseek-v3.2-thinking

1420

±5

19,713

DeepSeek

MIT

37

28◄─►51

deepseek-v3.2

1420

±5

24,571

DeepSeek

MIT

38

28◄─►53

deepseek-r1-0528

1418

±6

19,289

DeepSeek

MIT

39

28◄─►54

MoonshotAIkimi-k2-0905-preview

1418

±6

11,977

Moonshot

Modified MIT

40

26◄─►56

ernie-5.0-preview-1022

1418

±9

4,630

Baidu

Proprietary

41

28◄─►54

MoonshotAIkimi-k2-0711-preview

1417

±5

28,666

Moonshot

Modified MIT

42

28◄─►54

deepseek-v3.1

1417

±6

15,304

DeepSeek

MIT

43

28◄─►54

deepseek-v3.1-thinking

1417

±7

11,986

DeepSeek

MIT

44

26◄─►58

deepseek-v3.1-terminus

1416

±10

3,762

DeepSeek

MIT

45

26◄─►59

deepseek-v3.1-terminus-thinking

1416

±10

3,547

DeepSeek

MIT

46

29◄─►56

qwen3-vl-235b-a22b-instruct

1415

±6

11,684

Alibaba

Apache 2.0

47

31◄─►54

mistral-large-3

1414

±5

20,796

Mistral

Apache 2.0

48

33◄─►54

Anthropicclaude-opus-4-20250514

1414

±4

45,572

Anthropic

Proprietary

49

33◄─►54

gpt-4.1-2025-04-14

1413

±4

52,220

OpenAI

Proprietary

50

35◄─►56

mistral-medium-2508

1412

±3

59,880

Mistral

Proprietary

51

35◄─►58

grok-3-preview-02-24

1411

±4

33,980

xAI

Proprietary

52

38◄─►59

grok-4-0709

1410

±4

42,174

xAI

Proprietary

53

38◄─►62

glm-4.5

1410

±5

24,803

Z.ai

MIT

54

39◄─►58

gemini-2.5-flash

1409

±3

90,980

Google

Proprietary

55

46◄─►67

gemini-2.5-flash-preview-09-2025

1405

±4

32,903

Google

Proprietary

56

46◄─►68

grok-4-fast-reasoning

1404

±5

18,646

xAI

Proprietary

57

49◄─►68

Anthropicclaude-haiku-4-5-20251001

1403

±4

41,166

Anthropic

Proprietary

58

49◄─►68

o1-2024-12-17

1402

±4

27,822

OpenAI

Proprietary

59

54◄─►70

qwen3-235b-a22b-no-thinking

1401

±4

39,554

Alibaba

Apache 2.0

60

52◄─►70

qwen3-next-80b-a3b-instruct

1401

±5

22,874

Alibaba

Apache 2.0

61

55◄─►70

Anthropicclaude-sonnet-4-20250514-thinking-32k

1400

±4

36,212

Anthropic

Proprietary

62

54◄─►76

longcat-flash-chat

1399

±6

11,554

Meituan

MIT

63

55◄─►76

qwen3-235b-a22b-thinking-2507

1398

±6

9,248

Alibaba

Apache 2.0

64

55◄─►76

deepseek-r1

1398

±5

18,537

DeepSeek

MIT

65

55◄─►83

amazon-nova-experimental-chat-12-10

1395

±10

3,722

Amazon

Proprietary

66

55◄─►82

qwen3-vl-235b-a22b-thinking

1395

±7

7,974

Alibaba

Apache 2.0

67

56◄─►80

mimo-v2-flash (non-thinking)

1395

±6

13,410

Xiaomi

MIT

68

59◄─►77

deepseek-v3-0324

1394

±4

46,699

DeepSeek

MIT

69

54◄─►84

Tencenthunyuan-vision-1.5-thinking

1393

±12

2,222

Tencent

Proprietary

70

59◄─►83

mai-1-preview

1391

±5

18,137

Microsoft AI

Proprietary

71

62◄─►82

o4-mini-2025-04-16

1391

±4

46,681

OpenAI

Proprietary

72

62◄─►83

gpt-5-mini-high

1390

±5

27,174

OpenAI

Proprietary

73

62◄─►83

Anthropicclaude-sonnet-4-20250514

1390

±4

41,643

Anthropic

Proprietary

74

62◄─►83

Anthropicclaude-3-7-sonnet-20250219-thinking-32k

1389

±4

39,890

Anthropic

Proprietary

75

62◄─►83

o1-preview

1388

±5

31,120

OpenAI

Proprietary

76

62◄─►86

Tencenthunyuan-t1-20250711

1387

±9

4,801

Tencent

Proprietary

77

65◄─►84

qwen3-coder-480b-a35b-instruct

1387

±5

26,648

Alibaba

Apache 2.0

78

66◄─►84

mistral-medium-2505

1384

±5

34,547

Mistral

Proprietary

79

67◄─►86

qwen3-30b-a3b-instruct-2507

1383

±5

24,093

Alibaba

Apache 2.0

80

67◄─►87

Minimaxminimax-m2.1-preview

1383

±6

12,907

MiniMax

MIT

81

66◄─►89

Tencenthunyuan-turbos-20250416

1382

±6

11,050

Tencent

Proprietary

82

69◄─►87

gpt-4.1-mini-2025-04-14

1382

±4

40,533

OpenAI

Proprietary

83

75◄─►89

gemini-2.5-flash-lite-preview-09-2025-no-thinking

1379

±4

44,231

Google

Proprietary

84

66◄─►97

glm-4.6v

1378

±11

2,822

Z.ai

MIT

85

78◄─►94

gemini-2.5-flash-lite-preview-06-17-thinking

1375

±4

33,931

Google

Proprietary

86

78◄─►94

qwen3-235b-a22b

1374

±5

27,156

Alibaba

Apache 2.0

87

80◄─►94

qwen2.5-max

1374

±4

33,288

Alibaba

Proprietary

88

82◄─►94

Anthropicclaude-3-5-sonnet-20241022

1373

±3

89,457

Anthropic

Proprietary

89

82◄─►96

Anthropicclaude-3-7-sonnet-20250219

1372

±4

44,434

Anthropic

Proprietary

90

84◄─►97

glm-4.5-air

1371

±4

31,396

Z.ai

MIT

91

84◄─►100

qwen3-next-80b-a3b-thinking

1369

±6

13,871

Alibaba

Apache 2.0

92

84◄─►100

Minimaxminimax-m1

1367

±4

36,843

MiniMax

Apache 2.0

93

84◄─►103

amazon-nova-experimental-chat-11-10

1367

±6

13,391

Amazon

Proprietary

94

88◄─►100

gemma-3-27b-it

1365

±4

48,686

Google

Gemma

95

88◄─►104

o3-mini-high

1364

±5

18,584

OpenAI

Proprietary

96

89◄─►108

grok-3-mini-high

1362

±5

17,525

xAI

Proprietary

97

84◄─►116

glm-4.7-flash

1362

±9

4,090

Z.ai

MIT

98

91◄─►108

gemini-2.0-flash-001

1361

±4

44,815

Google

Proprietary

99

91◄─►114

deepseek-v3

1359

±5

21,788

DeepSeek

DeepSeek

100

94◄─►119

grok-3-mini-beta

1357

±5

23,763

xAI

Proprietary

101

94◄─►120

mistral-small-2506

1356

±5

18,340

Mistral

Apache 2.0

102

91◄─►122

intellect-3

1356

±8

5,325

Prime Intellect

MIT

103

96◄─►121

gpt-oss-120b

1354

±4

30,977

OpenAI

Apache 2.0

104

96◄─►122

gemini-2.0-flash-lite-preview-02-05

1353

±4

24,951

Google

Proprietary

105

94◄─►124

glm-4.5v

1353

±8

4,977

Z.ai

MIT

106

98◄─►122

Coherecommand-a-03-2025

1353

±3

57,447

Cohere

CC-BY-NC-4.0

107

98◄─►122

gemini-1.5-pro-002

1352

±3

55,607

Google

Proprietary

108

95◄─►134

Tencenthunyuan-turbos-20250226

1349

±12

2,226

Tencent

Proprietary

109

100◄─►124

o3-mini

1348

±3

58,645

OpenAI

Proprietary

110

98◄─►126

amazon-nova-experimental-chat-10-20

1348

±6

11,439

Amazon

Proprietary

111

96◄─►135

Nvidiallama-3.1-nemotron-ultra-253b-v1

1347

±12

2,546

Nvidia

Nvidia Open Model

112

96◄─►134

amazon-nova-experimental-chat-10-09

1347

±11

2,892

Amazon

Proprietary

113

98◄─►134

qwen3-32b

1347

±9

3,932

Alibaba

Apache 2.0

114

98◄─►132

Minimaxminimax-m2

1347

±8

6,749

MiniMax

Apache 2.0

115

99◄─►131

ling-flash-2.0

1346

±7

7,044

Ant Group

MIT

116

99◄─►132

Stepfunstep-3

1346

±7

6,595

StepFun

Apache 2.0

117

98◄─►133

qwen-plus-0125

1346

±8

5,823

Alibaba

Proprietary

118

103◄─►124

gpt-4o-2024-05-13

1346

±3

112,863

OpenAI

Proprietary

119

100◄─►135

glm-4-plus-0111

1343

±8

5,760

Zhipu

Proprietary

120

107◄─►129

Anthropicclaude-3-5-sonnet-20240620

1343

±3

82,417

Anthropic

Proprietary

121

101◄─►138

gemma-3-12b-it

1342

±10

3,829

Google

Gemma

122

102◄─►140

Nvidianvidia-llama-3.3-nemotron-super-49b-v1.5

1341

±10

3,430

Nvidia

Nvidia Open

123

100◄─►141

Tencenthunyuan-turbo-0110

1340

±12

2,295

Tencent

Proprietary

124

107◄─►139

gpt-5-nano-high

1338

±7

8,402

OpenAI

Proprietary

125

111◄─►136

o1-mini

1337

±4

51,986

OpenAI

Proprietary

126

110◄─►140

nova-2-lite

1336

±6

12,274

Amazon

Proprietary

127

112◄─►137

Metallama-3.1-405b-instruct-bf16

1336

±4

41,392

Meta

Llama 3.1 Community

128

111◄─►139

qwq-32b

1336

±4

26,123

Alibaba

Apache 2.0

129

112◄─►139

gpt-4o-2024-08-06

1335

±4

45,498

OpenAI

Proprietary

130

111◄─►140

gemini-advanced-0514

1335

±5

50,142

Google

Proprietary

131

114◄─►139

grok-2-2024-08-13

1335

±4

63,495

xAI

Proprietary

132

116◄─►140

Metallama-3.1-405b-instruct-fp8

1334

±4

59,655

Meta

Llama 3.1 Community

133

110◄─►148

Stepfunstep-2-16k-exp-202412

1334

±9

4,829

StepFun

Proprietary

134

121◄─►153

01.AIyi-lightning

1329

±5

27,340

01 AI

Proprietary

135

122◄─►154

qwen3-30b-a3b

1328

±5

27,428

Alibaba

Apache 2.0

136

123◄─►153

Metallama-4-maverick-17b-128e-instruct

1328

±4

41,136

Meta

Llama 4

137

113◄─►162

Nvidiallama-3.3-nemotron-49b-super-v1

1327

±12

2,230

Nvidia

Nvidia

138

118◄─►162

Tencenthunyuan-large-2025-02-10

1327

±10

3,738

Tencent

Proprietary

139

133◄─►158

gpt-4-turbo-2024-04-09

1325

±4

98,130

OpenAI

Proprietary

140

133◄─►158

Anthropicclaude-3-5-haiku-20241022

1324

±3

71,179

Anthropic

Proprietary

141

128◄─►162

olmo-3.1-32b-instruct

1324

±7

8,310

Allen AI

Apache 2.0

142

124◄─►162

deepseek-v2.5-1210

1323

±8

6,793

DeepSeek

DeepSeek

143

133◄─►158

gemini-1.5-pro-001

1323

±4

79,132

Google

Proprietary

144

133◄─►160

Metallama-4-scout-17b-16e-instruct

1323

±5

31,231

Meta

Llama

145

133◄─►158

Anthropicclaude-3-opus-20240229

1323

±3

194,904

Anthropic

Proprietary

146

132◄─►163

gpt-4.1-nano-2025-04-14

1322

±8

6,107

OpenAI

Proprietary

147

133◄─►162

Stepfunstep-1o-turbo-202506

1321

±7

9,687

StepFun

Proprietary

148

133◄─►164

ring-flash-2.0

1320

±7

7,199

Ant Group

MIT

149

136◄─►162

Metallama-3.3-70b-instruct

1320

±3

55,561

Meta

Llama-3.3

150

134◄─►162

glm-4-plus

1319

±5

26,134

Zhipu AI

Proprietary

151

134◄─►163

gemma-3n-e4b-it

1319

±5

23,347

Google

Gemma

152

134◄─►164

qwen-max-0919

1318

±6

16,479

Alibaba

Qwen

153

137◄─►162

gpt-4o-mini-2024-07-18

1318

±3

68,801

OpenAI

Proprietary

154

134◄─►169

gpt-oss-20b

1318

±6

10,807

OpenAI

Apache 2.0

155

137◄─►169

Nvidianvidia-nemotron-3-nano-30b-a3b-bf16

1317

±6

15,510

Nvidia

NVIDIA Open Model

156

137◄─►170

qwen2.5-plus-1127

1315

±6

10,179

Alibaba

Proprietary

157

141◄─►169

athene-v2-chat

1314

±4

24,746

NexusFlow

NexusFlow

158

141◄─►169

mistral-large-2407

1314

±4

45,460

Mistral

Mistral Research

159

142◄─►169

gpt-4-1106-preview

1314

±4

100,107

OpenAI

Proprietary

160

142◄─►169

gpt-4-0125-preview

1314

±4

93,439

OpenAI

Proprietary

161

137◄─►174

Tencenthunyuan-standard-2025-02-10

1312

±10

3,905

Tencent

Proprietary

162

134◄─►178

mercury

1310

±14

1,921

Inception AI

Proprietary

163

149◄─►173

gemini-1.5-flash-002

1310

±4

34,909

Google

Proprietary

164

154◄─►174

grok-2-mini-2024-08-13

1308

±4

52,574

xAI

Proprietary

165

154◄─►174

deepseek-v2.5

1307

±5

24,574

DeepSeek

DeepSeek

166

154◄─►174

athene-70b-0725

1306

±6

19,622

NexusFlow

CC-BY-NC-4.0

167

154◄─►174

magistral-medium-2506

1305

±6

12,061

Mistral

Proprietary

168

160◄─►174

mistral-large-2411

1305

±4

28,081

Mistral

MRL

169

152◄─►177

olmo-3-32b-think

1305

±8

5,891

Allen AI

Apache 2.0

170

161◄─►174

mistral-small-3.1-24b-instruct-2503

1305

±4

34,132

Mistral

Apache 2.0

171

154◄─►181

gemma-3-4b-it

1303

±9

4,177

Google

Gemma

172

161◄─►174

qwen2.5-72b-instruct

1303

±4

39,409

Alibaba

Qwen

173

161◄─►184

Nvidiallama-3.1-nemotron-70b-instruct

1299

±8

7,136

Nvidia

Llama 3.1

174

162◄─►185

Tencenthunyuan-large-vision

1296

±9

5,601

Tencent

Proprietary

175

170◄─►185

Metallama-3.1-70b-instruct

1294

±4

55,234

Meta

Llama 3.1 Community

176

172◄─►186

amazon-nova-pro-v1.0

1290

±5

24,753

Amazon

Proprietary

177

172◄─►189

jamba-1.5-large

1289

±7

8,659

AI21 Labs

Jamba Open

178

171◄─►192

ibm-granite-h-small

1288

±8

5,675

IBM

Apache 2.0

179

173◄─►187

gemma-2-27b-it

1288

±3

75,764

Google

Gemma license

180

172◄─►189

reka-core-20240904

1288

±7

7,309

Reka AI

Proprietary

181

173◄─►189

gpt-4-0314

1288

±5

54,167

OpenAI

Proprietary

182

170◄─►195

Nvidiallama-3.1-nemotron-51b-instruct

1287

±10

3,749

Nvidia

Llama 3.1

183

170◄─►195

llama-3.1-tulu-3-70b

1287

±10

2,846

Ai2

Llama 3.1

184

174◄─►189

gemini-1.5-flash-001

1286

±4

62,823

Google

Proprietary

185

173◄─►195

olmo-3.1-32b-think

1285

±7

8,498

Allen AI

Apache 2.0

186

177◄─►195

Anthropicclaude-3-sonnet-20240229

1282

±4

109,289

Anthropic

Proprietary

187

176◄─►195

gemma-2-9b-it-simpo

1280

±7

10,069

Princeton

MIT

188

178◄─►195

Nvidianemotron-4-340b-instruct

1278

±5

19,661

Nvidia

NVIDIA Open Model

189

178◄─►197

Coherecommand-r-plus-08-2024

1277

±7

9,869

Cohere

CC-BY-NC-4.0

190

182◄─►195

Metallama-3-70b-instruct

1277

±4

156,880

Meta

Llama 3 Community

191

182◄─►196

gpt-4-0613

1276

±4

88,721

OpenAI

Proprietary

192

183◄─►198

mistral-small-24b-instruct-2501

1274

±6

14,677

Mistral

Apache 2.0

193

182◄─►200

glm-4-0520

1274

±7

9,788

Zhipu AI

Proprietary

194

183◄─►201

reka-flash-20240904

1273

±7

7,537

Reka AI

Proprietary

195

183◄─►204

qwen2.5-coder-32b-instruct

1271

±8

5,430

Alibaba

Apache 2.0

196

190◄─►204

Coherec4ai-aya-expanse-32b

1267

±5

27,123

Cohere

CC-BY-NC-4.0

197

192◄─►204

gemma-2-9b-it

1266

±4

54,615

Google

Gemma license

198

191◄─►205

deepseek-coder-v2

1265

±6

15,147

DeepSeek

DeepSeek License

199

193◄─►205

Coherecommand-r-plus

1263

±4

77,556

Cohere

CC-BY-NC-4.0

200

193◄─►205

qwen2-72b-instruct

1263

±5

37,325

Alibaba

Qianwen LICENSE

201

195◄─►205

Anthropicclaude-3-haiku-20240307

1262

±4

117,705

Anthropic

Proprietary

202

194◄─►206

amazon-nova-lite-v1.0

1261

±5

19,376

Amazon

Proprietary

203

195◄─►206

gemini-1.5-flash-8b-001

1259

±4

35,556

Google

Proprietary

204

198◄─►206

Azurephi-4

1256

±5

24,126

Microsoft

MIT

205

195◄─►212

olmo-2-0325-32b-instruct

1253

±11

3,335

Allen AI

Apache-2.0

206

202◄─►211

Coherecommand-r-08-2024

1251

±7

10,141

Cohere

CC-BY-NC-4.0

207

205◄─►215

mistral-large-2402

1243

±5

62,437

Mistral

Proprietary

208

205◄─►215

amazon-nova-micro-v1.0

1241

±5

19,355

Amazon

Proprietary

209

205◄─►219

jamba-1.5-mini

1239

±7

8,854

AI21 Labs

Jamba Open

210

205◄─►223

ministral-8b-2410

1237

±9

4,780

Mistral

MRL

211

206◄─►223

gemini-pro-dev-api

1236

±7

18,352

Google

Proprietary

212

207◄─►223

qwen1.5-110b-chat

1235

±6

26,191

Alibaba

Qianwen LICENSE

213

207◄─►224

reka-flash-21b-20240226-online

1234

±7

15,451

Reka AI

Proprietary

214

207◄─►223

qwen1.5-72b-chat

1234

±5

39,296

Alibaba

Qianwen LICENSE

215

205◄─►225

Tencenthunyuan-standard-256k

1234

±12

2,729

Tencent

Proprietary

216

209◄─►224

mixtral-8x22b-instruct-v0.1

1230

±5

51,417

Mistral

Apache 2.0

217

209◄─►225

Coherecommand-r

1228

±5

54,038

Cohere

CC-BY-NC-4.0

218

209◄─►225

reka-flash-21b-20240226

1227

±6

24,806

Reka AI

Proprietary

219

210◄─►226

gpt-3.5-turbo-0125

1225

±5

66,191

OpenAI

Proprietary

220

210◄─►227

mistral-medium

1224

±5

34,552

Mistral

Proprietary

221

214◄─►226

Metallama-3-8b-instruct

1224

±4

104,636

Meta

Llama 3 Community

222

210◄─►227

Coherec4ai-aya-expanse-8b

1223

±7

9,827

Cohere

CC-BY-NC-4.0

223

209◄─►230

gemini-pro

1223

±12

6,390

Google

Proprietary

224

210◄─►230

llama-3.1-tulu-3-8b

1221

±11

2,895

Ai2

Llama 3.1

225

221◄─►230

01.AIyi-1.5-34b-chat

1214

±5

24,142

01 AI

Apache-2.0

226

216◄─►231

HuggingFacezephyr-orpo-141b-A35b-v0.1

1214

±11

4,653

HuggingFace

Apache 2.0

227

223◄─►230

Metallama-3.1-8b-instruct

1212

±4

49,605

Meta

Llama 3.1 Community

228

219◄─►236

granite-3.1-8b-instruct

1209

±11

3,092

IBM

Apache 2.0

229

223◄─►236

qwen1.5-32b-chat

1205

±6

21,744

Alibaba

Qianwen LICENSE

230

223◄─►238

gpt-3.5-turbo-1106

1204

±9

16,616

OpenAI

Proprietary

231

227◄─►238

Azurephi-3-medium-4k-instruct

1199

±5

25,055

Microsoft

MIT

232

228◄─►238

gemma-2-2b-it

1199

±4

46,618

Google

Gemma license

233

228◄─►238

mixtral-8x7b-instruct-v0.1

1198

±4

73,505

Mistral

Apache 2.0

234

228◄─►243

dbrx-instruct-preview

1196

±6

32,196

Databricks

DBRX LICENSE

235

228◄─►247

InternLMinternlm2_5-20b-chat

1192

±7

9,902

InternLM

Other

236

228◄─►247

qwen1.5-14b-chat

1192

±7

17,841

Alibaba

Qianwen LICENSE

237

230◄─►253

Azurewizardlm-70b

1186

±9

8,214

Microsoft

Llama 2 Community

238

230◄─►254

deepseek-llm-67b-chat

1185

±12

4,933

DeepSeek

DeepSeek License

239

234◄─►250

01.AIyi-34b-chat

1185

±7

15,483

01 AI

Yi License

240

234◄─►253

OpenChatopenchat-3.5-0106

1183

±8

12,636

OpenChat

Apache-2.0

241

234◄─►254

OpenChatopenchat-3.5

1183

±10

7,967

OpenChat

Apache-2.0

242

234◄─►254

granite-3.0-8b-instruct

1183

±9

6,643

IBM

Apache 2.0

243

235◄─►254

gemma-1.1-7b-it

1181

±6

23,893

Google

Gemma license

244

235◄─►254

Snowflakesnowflake-arctic-instruct

1180

±6

32,836

Snowflake

Apache 2.0

245

234◄─►256

granite-3.1-2b-instruct

1180

±11

3,191

IBM

Apache 2.0

246

235◄─►254

tulu-2-dpo-70b

1179

±10

6,534

AllenAI/UW

AI2 ImpACT Low-risk

247

235◄─►258

openhermes-2.5-mistral-7b

1176

±10

5,006

NousResearch

Apache-2.0

248

237◄─►257

vicuna-33b

1174

±6

22,479

LMSYS

Non-commercial

249

237◄─►260

starling-lm-7b-beta

1172

±7

16,057

Nexusflow

Apache-2.0

250

237◄─►258

Azurephi-3-small-8k-instruct

1172

±6

17,763

Microsoft

MIT

251

238◄─►258

Metallama-2-70b-chat

1172

±5

38,491

Meta

Llama 2 Community

252

238◄─►261

starling-lm-7b-alpha

1168

±8

10,224

UC Berkeley

CC-BY-NC-4.0

253

240◄─►261

Metallama-3.2-3b-instruct

1167

±8

7,936

Meta

Llama 3.2

254

238◄─►264

nous-hermes-2-mixtral-8x7b-dpo

1166

±12

3,776

NousResearch

Apache-2.0

255

246◄─►270

qwq-32b-preview

1158

±12

3,233

Alibaba

Apache 2.0

256

251◄─►268

granite-3.0-2b-instruct

1157

±8

6,837

IBM

Apache 2.0

257

246◄─►271

Nvidiallama2-70b-steerlm-chat

1156

±13

3,584

Nvidia

Llama 2 Community

258

248◄─►274

solar-10.7b-instruct-v1.0

1153

±13

4,155

Upstage AI

CC-BY-NC-4.0

259

247◄─►276

dolphin-2.2.1-mistral-7b

1153

±15

1,679

Cognitive Computations

Apache-2.0

260

252◄─►274

mpt-30b-chat

1151

±12

2,571

MosaicML

CC-BY-NC-SA-4.0

261

254◄─►271

mistral-7b-instruct-v0.2

1151

±7

19,402

Mistral

Apache-2.0

262

254◄─►273

Azurewizardlm-13b

1150

±9

7,046

Microsoft

Llama 2 Community

263

251◄─►278

falcon-180b-chat

1148

±17

1,295

TII

Falcon-180B TII License

264

254◄─►277

qwen1.5-7b-chat

1145

±10

4,735

Alibaba

Qianwen LICENSE

265

255◄─►276

Azurephi-3-mini-4k-instruct-june-2024

1144

±6

12,296

Microsoft

MIT

266

255◄─►277

Metallama-2-13b-chat

1142

±7

19,171

Meta

Llama 2 Community

267

255◄─►277

vicuna-13b

1142

±7

19,366

LMSYS

Llama 2 Community

268

255◄─►279

qwen-14b-chat

1140

±11

4,964

Alibaba

Qianwen LICENSE

269

256◄─►279

palm-2

1138

±9

8,554

Google

Proprietary

270

256◄─►279

Metacodellama-34b-instruct

1138

±9

7,363

Meta

Llama 2 Community

271

257◄─►279

gemma-7b-it

1137

±10

8,925

Google

Gemma license

272

259◄─►280

HuggingFacezephyr-7b-beta

1132

±9

11,116

HuggingFace

MIT

273

262◄─►280

Azurephi-3-mini-128k-instruct

1130

±7

20,691

Microsoft

MIT

274

264◄─►280

Azurephi-3-mini-4k-instruct

1129

±6

20,115

Microsoft

MIT

275

260◄─►283

guanaco-33b

1128

±12

2,921

UW

Non-commercial

276

259◄─►284

HuggingFacezephyr-7b-alpha

1128

±16

1,785

HuggingFace

MIT

277

267◄─►284

stripedhyena-nous-7b

1122

±11

5,184

Together AI

Apache 2.0

278

262◄─►285

Metacodellama-70b-instruct

1120

±18

1,143

Meta

Llama 2 Community

279

272◄─►284

vicuna-7b

1116

±9

6,923

LMSYS

Llama 2 Community

280

268◄─►285

HuggingFacesmollm2-1.7b-instruct

1115

±14

2,201

HuggingFace

Apache 2.0

281

275◄─►284

gemma-1.1-2b-it

1115

±8

10,853

Google

Gemma license

282

275◄─►284

Metallama-3.2-1b-instruct

1112

±8

8,045

Meta

Llama 3.2

283

275◄─►285

mistral-7b-instruct

1111

±9

8,977

Mistral

Apache 2.0

284

276◄─►285

Metallama-2-7b-chat

1109

±7

14,148

Meta

Llama 2 Community

285

281◄─►289

gemma-2b-it

1092

±12

4,779

Google

Gemma license

286

285◄─►288

qwen1.5-4b-chat

1091

±9

7,598

Alibaba

Qianwen LICENSE

287

285◄─►292

olmo-7b-instruct

1075

±11

6,329

Allen AI

Apache-2.0

288

286◄─►292

koala-13b

1071

±10

6,964

UC Berkeley

Non-commercial

289

287◄─►292

alpaca-13b

1068

±12

5,745

Stanford

Non-commercial

290

285◄─►293

gpt4all-13b-snoozy

1067

±15

1,743

Nomic AI

Non-commercial

291

287◄─►293

mpt-7b-chat

1063

±12

3,925

MosaicML

CC-BY-NC-SA-4.0

292

287◄─►293

chatglm3-6b

1057

±12

4,658

Tsinghua

Apache-2.0

293

290◄─►295

RWKVRWKV-4-Raven-14B

1042

±11

4,845

RWKV

Apache 2.0

294

293◄─►295

chatglm2-6b

1025

±14

2,657

Tsinghua

Apache-2.0

295

293◄─►295

oasst-pythia-12b

1023

±11

6,311

OpenAssistant

Apache 2.0

296

296◄─►299

chatglm-6b

997

±13

4,914

Tsinghua

Non-commercial

297

296◄─►299

fastchat-t5-3b

992

±12

4,203

LMSYS

Apache 2.0

298

296◄─►299

dolly-v2-12b

981

±14

3,412

Databricks

MIT

299

296◄─►300

Metallama-13b

973

±16

2,391

Meta

Non-commercial

300

299◄─►300

Stabilitystablelm-tuned-alpha-7b

954

±13

3,287

Stability AI

CC-BY-NC-SA-4.0

Description

  • Ranking (UB): Ranking computed based on the Bradley-Terry model. This ranking reflects the models' overall performance in the arena and provides their Elo scores' upper bound estimate, helping to understand the models' potential competitiveness.

  • Model: The name of the large language model (LLM). Some model names may include embedded links.

  • Score: The Elo score the model received in the arena through user votes. The Elo score is a relative ranking system; higher scores indicate better performance.

  • 95% Confidence Interval (±): The 95% confidence interval for the model's Elo score (for example:±6). The smaller this interval, the more stable and reliable the model's score.

  • Votes: The total number of votes the model received in the arena. More votes generally mean greater statistical reliability of its score.

  • Organization/Company: The organization or company providing the model.

  • License: The model's license type, such as Proprietary, Apache 2.0, MIT, etc.

Data sources and update frequency

The data for this ranking is obtained automatically by scripts directly from the 1 2arrow-up-right official website. This ranking is automatically updated daily by GitHub Actions.

Disclaimer

This report is for reference only. Ranking data is dynamic and based on user preference votes on the Chatbot Arena during a specific time period. The integrity and accuracy of the data depend on upstream data sources. Different models may use different licensing terms; please consult the official documentation from the model providers before use.

Last updated

Was this helpful?