Model Rankings

This is a leaderboard based on Chatbot Arena (lmarena.ai) data, generated through an automated process.

Data update time: 2025-12-22 08:09:23 UTC / 2025-12-22 16:09:23 CST (Beijing Time)

Leaderboard

Rank
Rank Spread
Model
Score
95% Confidence Interval (±)
Votes
Organization/Company
License

1

1◄─►2

gemini-3-pro

1490

±6

19,199

Google

Proprietary

2

2◄─►6

grok-4.1-thinking

1477

±6

20,074

xAI

Proprietary

3

1◄─►7

gemini-3-flash

1475Preliminary

±10

4,431

Google

Proprietary

4

2◄─►8

Anthropicclaude-opus-4-5-20251101-thinking-32k

1469

±6

12,829

Anthropic

Proprietary

5

2◄─►8

Anthropicclaude-opus-4-5-20251101

1465

±6

13,654

Anthropic

Proprietary

6

3◄─►8

grok-4.1

1465

±6

20,553

xAI

Proprietary

7

2◄─►11

gemini-3-flash (thinking-minimal)

1463Preliminary

±9

5,605

Google

Proprietary

8

4◄─►11

gpt-5.1-high

1457

±6

17,084

OpenAI

Proprietary

9

7◄─►15

gemini-2.5-pro

1451

±3

79,900

Google

Proprietary

10

7◄─►15

Anthropicclaude-sonnet-4-5-20250929-thinking-32k

1450

±4

31,225

Anthropic

Proprietary

11

9◄─►15

Anthropicclaude-opus-4-1-20250805-thinking-16k

1448

±4

46,958

Anthropic

Proprietary

12

9◄─►18

Anthropicclaude-sonnet-4-5-20250929

1446

±5

26,513

Anthropic

Proprietary

13

9◄─►22

gpt-4.5-preview-2025-02-27

1442

±6

14,644

OpenAI

Proprietary

14

7◄─►27

gpt-5.2

1442Preliminary

±12

2,422

OpenAI

Proprietary

15

12◄─►22

Anthropicclaude-opus-4-1-20250805

1440

±4

60,044

Anthropic

Proprietary

16

12◄─►22

chatgpt-4o-latest-20250326

1440

±3

66,721

OpenAI

Proprietary

17

9◄─►24

gpt-5.2-high

1440Preliminary

±9

6,192

OpenAI

Proprietary

18

12◄─►24

gpt-5.1

1437

±6

18,423

OpenAI

Proprietary

19

13◄─►24

gpt-5-high

1436

±5

32,779

OpenAI

Proprietary

20

13◄─►25

o3-2025-04-16

1434

±4

61,385

OpenAI

Proprietary

21

13◄─►29

qwen3-max-preview

1433

±5

28,039

Alibaba

Proprietary

22

13◄─►33

grok-4-1-fast-reasoning

1431

±6

12,526

xAI

Proprietary

23

16◄─►38

MoonshotAIkimi-k2-thinking-turbo

1428

±5

18,724

Moonshot

Modified MIT

24

16◄─►43

ernie-5.0-preview-1103

1427Preliminary

±8

6,699

Baidu

Proprietary

25

20◄─►43

glm-4.6

1425

±5

27,300

Z.ai

MIT

26

21◄─►43

gpt-5-chat

1425

±4

32,045

OpenAI

Proprietary

27

19◄─►43

qwen3-max-2025-09-23

1424

±6

9,227

Alibaba

Proprietary

28

20◄─►43

deepseek-v3.2-exp

1423

±7

11,929

DeepSeek AI

MIT

29

22◄─►43

Anthropicclaude-opus-4-20250514-thinking-16k

1423

±4

37,714

Anthropic

Proprietary

30

21◄─►46

deepseek-v3.2-thinking

1422

±7

8,916

DeepSeek AI

MIT

31

23◄─►43

qwen3-235b-a22b-instruct-2507

1422

±4

54,369

Alibaba

Apache 2.0

32

22◄─►46

deepseek-v3.2-exp-thinking

1421

±7

9,198

DeepSeek AI

MIT

33

22◄─►49

grok-4-fast-chat

1420

±8

7,040

xAI

Proprietary

34

22◄─►49

mistral-large-3

1419Preliminary

±7

9,518

Mistral

Apache 2.0

35

23◄─►49

MoonshotAIkimi-k2-0905-preview

1418

±7

11,845

Moonshot

Modified MIT

36

23◄─►49

deepseek-r1-0528

1418

±6

19,162

DeepSeek

MIT

37

24◄─►50

deepseek-v3.1

1416

±6

15,213

DeepSeek

MIT

38

24◄─►49

MoonshotAIkimi-k2-0711-preview

1416

±5

28,532

Moonshot

Modified MIT

39

24◄─►51

deepseek-v3.1-thinking

1416

±7

11,947

DeepSeek

MIT

40

24◄─►51

deepseek-v3.2

1415

±7

10,253

DeepSeek AI

MIT

41

23◄─►54

deepseek-v3.1-terminus

1415

±10

3,737

DeepSeek AI

MIT

42

24◄─►51

qwen3-vl-235b-a22b-instruct

1414

±7

9,011

Alibaba

Apache 2.0

43

23◄─►57

deepseek-v3.1-terminus-thinking

1414

±10

3,511

DeepSeek AI

MIT

44

31◄─►51

gpt-4.1-2025-04-14

1412

±4

52,398

OpenAI

Proprietary

45

31◄─►51

Anthropicclaude-opus-4-20250514

1412

±4

45,477

Anthropic

Proprietary

46

31◄─►51

mistral-medium-2508

1411

±4

48,471

Mistral

Proprietary

47

33◄─►54

grok-3-preview-02-24

1410

±4

34,039

xAI

Proprietary

48

33◄─►56

grok-4-0709

1409

±4

42,410

xAI

Proprietary

49

33◄─►57

glm-4.5

1408

±5

24,736

Z.ai

MIT

50

38◄─►56

gemini-2.5-flash

1408

±3

79,419

Google

Proprietary

51

39◄─►60

gemini-2.5-flash-preview-09-2025

1405

±4

32,599

Google

Proprietary

52

45◄─►62

Anthropicclaude-haiku-4-5-20251001

1402

±4

29,594

Anthropic

Proprietary

53

45◄─►63

grok-4-fast-reasoning

1402

±5

18,821

xAI

Proprietary

54

47◄─►63

o1-2024-12-17

1400

±4

28,039

OpenAI

Proprietary

55

47◄─►65

qwen3-next-80b-a3b-instruct

1400

±5

23,035

Alibaba

Apache 2.0

56

45◄─►66

longcat-flash-chat

1400

±6

11,470

Meituan

MIT

57

49◄─►65

qwen3-235b-a22b-no-thinking

1399

±4

39,246

Alibaba

Apache 2.0

58

51◄─►66

Anthropicclaude-sonnet-4-20250514-thinking-32k

1399

±4

36,070

Anthropic

Proprietary

59

51◄─►71

qwen3-235b-a22b-thinking-2507

1397

±6

9,311

Alibaba

Apache 2.0

60

52◄─►71

deepseek-r1

1396

±5

18,718

DeepSeek

MIT

61

52◄─►72

qwen3-vl-235b-a22b-thinking

1394

±7

7,965

Alibaba

Apache 2.0

62

53◄─►72

gpt-5-mini-high

1392

±5

27,344

OpenAI

Proprietary

63

55◄─►71

deepseek-v3-0324

1392

±4

46,624

DeepSeek

MIT

64

51◄─►79

Tencenthunyuan-vision-1.5-thinking

1391

±12

2,210

Tencent

Proprietary

65

57◄─►72

o4-mini-2025-04-16

1391

±4

46,703

OpenAI

Proprietary

66

55◄─►74

mai-1-preview

1390

±5

18,116

Microsoft AI

Proprietary

67

59◄─►75

Anthropicclaude-sonnet-4-20250514

1389

±4

41,493

Anthropic

Proprietary

68

59◄─►75

o1-preview

1387

±5

31,505

OpenAI

Proprietary

69

59◄─►75

Anthropicclaude-3-7-sonnet-20250219-thinking-32k

1387

±4

39,817

Anthropic

Proprietary

70

59◄─►77

qwen3-coder-480b-a35b-instruct

1386

±5

23,591

Alibaba

Apache 2.0

71

59◄─►80

Tencenthunyuan-t1-20250711

1385

±9

4,803

Tencent

Proprietary

72

62◄─►79

mistral-medium-2505

1383

±5

34,401

Mistral

Proprietary

73

65◄─►79

qwen3-30b-a3b-instruct-2507

1382

±5

24,120

Alibaba

Apache 2.0

74

66◄─►80

gpt-4.1-mini-2025-04-14

1380

±4

40,370

OpenAI

Proprietary

75

65◄─►83

Tencenthunyuan-turbos-20250416

1380

±6

11,094

Tencent

Proprietary

76

69◄─►83

gemini-2.5-flash-lite-preview-09-2025-no-thinking

1378

±4

32,630

Google

Proprietary

77

70◄─►84

gemini-2.5-flash-lite-preview-06-17-thinking

1375

±4

33,818

Google

Proprietary

78

70◄─►85

qwen3-235b-a22b

1374

±5

27,077

Alibaba

Apache 2.0

79

73◄─►85

qwen2.5-max

1373

±4

33,489

Alibaba

Proprietary

80

75◄─►85

Anthropicclaude-3-5-sonnet-20241022

1372

±3

89,727

Anthropic

Proprietary

81

75◄─►88

Anthropicclaude-3-7-sonnet-20250219

1371

±4

44,473

Anthropic

Proprietary

82

69◄─►94

amazon-nova-experimental-chat-11-10

1370Preliminary

±12

2,743

Amazon

Proprietary

83

75◄─►88

glm-4.5-air

1370

±4

31,551

Z.ai

MIT

84

77◄─►91

qwen3-next-80b-a3b-thinking

1367

±6

13,779

Alibaba

Apache 2.0

85

78◄─►90

Minimaxminimax-m1

1366

±4

36,732

MiniMax

Apache 2.0

86

81◄─►91

gemma-3-27b-it

1364

±4

49,181

Google

Gemma

87

81◄─►96

o3-mini-high

1362

±5

18,735

OpenAI

Proprietary

88

81◄─►96

grok-3-mini-high

1362

±5

17,505

xAI

Proprietary

89

83◄─►98

gemini-2.0-flash-001

1360

±4

45,015

Google

Proprietary

90

83◄─►107

deepseek-v3

1357

±5

21,994

DeepSeek

DeepSeek

91

84◄─►107

grok-3-mini-beta

1357

±5

23,701

xAI

Proprietary

92

86◄─►112

mistral-small-2506

1355

±5

18,254

Mistral

Apache 2.0

93

89◄─►113

gpt-oss-120b

1352

±4

31,145

OpenAI

Apache 2.0

94

89◄─►112

gemini-2.0-flash-lite-preview-02-05

1352

±4

25,215

Google

Proprietary

95

86◄─►115

glm-4.5v

1352

±8

4,971

Z.ai

MIT

96

90◄─►112

Coherecommand-a-03-2025

1352

±3

57,648

Cohere

CC-BY-NC-4.0

97

90◄─►113

gemini-1.5-pro-002

1351

±3

56,012

Google

Proprietary

98

86◄─►116

intellect-3

1351

±9

4,753

Prime Intellect

MIT

99

90◄─►116

amazon-nova-experimental-chat-10-20

1349

±6

10,971

Amazon

Proprietary

100

92◄─►115

o3-mini

1348

±3

58,693

OpenAI

Proprietary

101

87◄─►126

Tencenthunyuan-turbos-20250226

1346

±12

2,250

Tencent

Proprietary

102

90◄─►119

ling-flash-2.0

1346

±7

7,126

Ant Group

MIT

103

90◄─►122

Minimaxminimax-m2

1346

±8

7,093

MiniMax

Apache 2.0

104

90◄─►121

Stepfunstep-3

1346

±7

6,620

StepFun

Apache 2.0

105

87◄─►127

Nvidiallama-3.1-nemotron-ultra-253b-v1

1346

±12

2,573

Nvidia

Nvidia Open Model

106

89◄─►126

amazon-nova-experimental-chat-10-09

1345

±11

2,878

Amazon

Proprietary

107

94◄─►116

gpt-4o-2024-05-13

1345

±3

113,568

OpenAI

Proprietary

108

90◄─►126

qwen3-32b

1345

±9

3,943

Alibaba

Apache 2.0

109

90◄─►126

qwen-plus-0125

1344

±8

5,861

Alibaba

Proprietary

110

92◄─►126

glm-4-plus-0111

1343

±8

5,806

Zhipu

Proprietary

111

97◄─►119

Anthropicclaude-3-5-sonnet-20240620

1342

±3

82,864

Anthropic

Proprietary

112

92◄─►129

gemma-3-12b-it

1340

±9

3,866

Google

Gemma

113

92◄─►131

Nvidianvidia-llama-3.3-nemotron-super-49b-v1.5

1340

±10

3,469

Nvidia

Nvidia Open

114

92◄─►133

Tencenthunyuan-turbo-0110

1339

±11

2,322

Tencent

Proprietary

115

97◄─►128

gpt-5-nano-high

1338

±7

8,369

OpenAI

Proprietary

116

99◄─►131

nova-2-lite

1336

±7

8,997

Amazon

Proprietary

117

102◄─►128

o1-mini

1335

±4

52,301

OpenAI

Proprietary

118

104◄─►128

Metallama-3.1-405b-instruct-bf16

1335

±4

41,932

Meta

Llama 3.1 Community

119

104◄─►131

gpt-4o-2024-08-06

1334

±4

45,787

OpenAI

Proprietary

120

106◄─►129

grok-2-2024-08-13

1334

±4

63,725

xAI

Proprietary

121

105◄─►131

qwq-32b

1334

±4

26,196

Alibaba

Apache 2.0

122

102◄─►131

gemini-advanced-0514

1334

±5

50,654

Google

Proprietary

123

106◄─►131

Metallama-3.1-405b-instruct-fp8

1333

±3

60,272

Meta

Llama 3.1 Community

124

102◄─►141

Stepfunstep-2-16k-exp-202412

1332

±9

4,895

StepFun

Proprietary

125

112◄─►142

01.AIyi-lightning

1328

±5

27,624

01 AI

Proprietary

126

115◄─►143

Metallama-4-maverick-17b-128e-instruct

1327

±4

41,093

Meta

Llama 4

127

106◄─►153

Nvidianvidia-nemotron-3-nano-30b-a3b-bf16

1326Preliminary

±10

4,247

Nvidia

NVIDIA Open Model

128

117◄─►145

qwen3-30b-a3b

1326

±5

27,405

Alibaba

Apache 2.0

129

106◄─►154

Nvidiallama-3.3-nemotron-49b-super-v1

1325

±12

2,243

Nvidia

Nvidia

130

111◄─►153

Tencenthunyuan-large-2025-02-10

1325

±10

3,760

Tencent

Proprietary

131

123◄─►146

gpt-4-turbo-2024-04-09

1324

±4

98,965

OpenAI

Proprietary

132

124◄─►148

Anthropicclaude-3-5-haiku-20241022

1322

±3

71,241

Anthropic

Proprietary

133

124◄─►148

Metallama-4-scout-17b-16e-instruct

1322

±5

31,074

Meta

Llama

134

117◄─►154

deepseek-v2.5-1210

1322

±8

6,877

DeepSeek

DeepSeek

135

124◄─►148

gemini-1.5-pro-001

1322

±4

79,769

Google

Proprietary

136

124◄─►148

Anthropicclaude-3-opus-20240229

1322

±3

196,368

Anthropic

Proprietary

137

123◄─►154

gpt-4.1-nano-2025-04-14

1321

±8

6,143

OpenAI

Proprietary

138

124◄─►154

ring-flash-2.0

1320

±7

7,250

Ant Group

MIT

139

124◄─►154

Stepfunstep-1o-turbo-202506

1319

±7

9,635

StepFun

Proprietary

140

127◄─►153

Metallama-3.3-70b-instruct

1319

±3

55,961

Meta

Llama-3.3

141

125◄─►154

gemma-3n-e4b-it

1318

±5

23,421

Google

Gemma

142

126◄─►154

glm-4-plus

1318

±5

26,342

Zhipu AI

Proprietary

143

124◄─►155

gpt-oss-20b

1318

±6

10,828

OpenAI

Apache 2.0

144

127◄─►155

qwen-max-0919

1317

±6

16,598

Alibaba

Qwen

145

129◄─►154

gpt-4o-mini-2024-07-18

1316

±3

69,290

OpenAI

Proprietary

146

128◄─►161

qwen2.5-plus-1127

1314

±6

10,252

Alibaba

Proprietary

147

133◄─►159

gpt-4-1106-preview

1313

±4

101,117

OpenAI

Proprietary

148

133◄─►159

mistral-large-2407

1313

±4

45,968

Mistral

Mistral Research

149

133◄─►159

gpt-4-0125-preview

1313

±4

94,534

OpenAI

Proprietary

150

133◄─►160

athene-v2-chat

1313

±4

24,880

NexusFlow

NexusFlow

151

124◄─►164

mercury

1311

±14

1,961

Inception AI

Proprietary

152

129◄─►164

Tencenthunyuan-standard-2025-02-10

1310

±10

3,920

Tencent

Proprietary

153

136◄─►161

gemini-1.5-flash-002

1310

±4

35,180

Google

Proprietary

154

133◄─►164

olmo-3-32b-think

1308

±9

5,307

Allen AI

Apache 2.0

155

146◄─►163

grok-2-mini-2024-08-13

1307

±4

52,789

xAI

Proprietary

156

146◄─►164

deepseek-v2.5

1306

±5

24,839

DeepSeek

DeepSeek

157

146◄─►164

athene-70b-0725

1305

±6

19,796

NexusFlow

CC-BY-NC-4.0

158

149◄─►164

mistral-large-2411

1304

±4

28,455

Mistral

MRL

159

146◄─►164

magistral-medium-2506

1304

±6

11,957

Mistral

Proprietary

160

150◄─►164

mistral-small-3.1-24b-instruct-2503

1303

±4

34,030

Mistral

Apache 2.0

161

144◄─►170

gemma-3-4b-it

1302

±9

4,195

Google

Gemma

162

152◄─►164

qwen2.5-72b-instruct

1302

±4

39,632

Alibaba

Qwen

163

152◄─►173

Nvidiallama-3.1-nemotron-70b-instruct

1297

±8

7,216

Nvidia

Llama 3.1

164

153◄─►175

Tencenthunyuan-large-vision

1294

±9

5,575

Tencent

Proprietary

165

162◄─►173

Metallama-3.1-70b-instruct

1293

±4

56,003

Meta

Llama 3.1 Community

166

162◄─►178

jamba-1.5-large

1288

±7

8,730

AI21 Labs

Jamba Open

167

163◄─►176

amazon-nova-pro-v1.0

1288

±4

25,218

Amazon

Proprietary

168

162◄─►183

ibm-granite-h-small

1287

±8

5,690

IBM

Apache 2.0

169

163◄─►176

gemma-2-27b-it

1287

±3

76,195

Google

Gemma license

170

162◄─►178

reka-core-20240904

1287

±7

7,380

Reka AI

Proprietary

171

163◄─►178

gpt-4-0314

1286

±5

54,754

OpenAI

Proprietary

172

162◄─►184

Nvidiallama-3.1-nemotron-51b-instruct

1286

±10

3,777

Nvidia

Llama 3.1

173

162◄─►184

llama-3.1-tulu-3-70b

1286

±10

2,881

Ai2

Llama 3.1

174

165◄─►178

gemini-1.5-flash-001

1284

±4

63,418

Google

Proprietary

175

166◄─►183

Anthropicclaude-3-sonnet-20240229

1281

±4

110,173

Anthropic

Proprietary

176

165◄─►184

gemma-2-9b-it-simpo

1278

±7

10,108

Princeton

MIT

177

168◄─►184

Nvidianemotron-4-340b-instruct

1277

±5

19,913

Nvidia

NVIDIA Open Model

178

168◄─►185

Coherecommand-r-plus-08-2024

1277

±7

9,931

Cohere

CC-BY-NC-4.0

179

172◄─►184

Metallama-3-70b-instruct

1276

±3

158,908

Meta

Llama 3 Community

180

172◄─►185

gpt-4-0613

1275

±4

89,612

OpenAI

Proprietary

181

172◄─►187

mistral-small-24b-instruct-2501

1273

±6

14,830

Mistral

Apache 2.0

182

172◄─►189

glm-4-0520

1273

±7

9,857

Zhipu AI

Proprietary

183

172◄─►189

reka-flash-20240904

1272

±7

7,583

Reka AI

Proprietary

184

174◄─►193

qwen2.5-coder-32b-instruct

1269

±8

5,452

Alibaba

Apache 2.0

185

179◄─►193

Coherec4ai-aya-expanse-32b

1266

±5

27,362

Cohere

CC-BY-NC-4.0

186

181◄─►193

gemma-2-9b-it

1264

±4

54,954

Google

Gemma license

187

181◄─►195

deepseek-coder-v2

1264

±6

15,242

DeepSeek AI

DeepSeek License

188

182◄─►194

Coherecommand-r-plus

1263

±4

78,401

Cohere

CC-BY-NC-4.0

189

182◄─►195

qwen2-72b-instruct

1262

±5

37,688

Alibaba

Qianwen LICENSE

190

184◄─►195

Anthropicclaude-3-haiku-20240307

1261

±4

118,626

Anthropic

Proprietary

191

184◄─►195

amazon-nova-lite-v1.0

1259

±5

19,760

Amazon

Proprietary

192

184◄─►195

gemini-1.5-flash-8b-001

1259

±4

35,914

Google

Proprietary

193

187◄─►195

Azurephi-4

1255

±4

24,354

Microsoft

MIT

194

184◄─►201

olmo-2-0325-32b-instruct

1252

±11

3,377

Allen AI

Apache-2.0

195

188◄─►200

Coherecommand-r-08-2024

1251

±7

10,229

Cohere

CC-BY-NC-4.0

196

194◄─►204

mistral-large-2402

1242

±5

63,404

Mistral

Proprietary

197

194◄─►204

amazon-nova-micro-v1.0

1241

±5

19,774

Amazon

Proprietary

198

194◄─►209

jamba-1.5-mini

1239

±7

8,918

AI21 Labs

Jamba Open

199

194◄─►212

ministral-8b-2410

1237

±9

4,833

Mistral

MRL

200

195◄─►212

gemini-pro-dev-api

1234

±7

18,454

Google

Proprietary

201

196◄─►212

qwen1.5-110b-chat

1234

±5

26,679

Alibaba

Qianwen LICENSE

202

196◄─►212

qwen1.5-72b-chat

1234

±5

39,689

Alibaba

Qianwen LICENSE

203

196◄─►213

reka-flash-21b-20240226-online

1233

±7

15,606

Reka AI

Proprietary

204

194◄─►214

Tencenthunyuan-standard-256k

1233

±12

2,761

Tencent

Proprietary

205

198◄─►213

mixtral-8x22b-instruct-v0.1

1230

±4

52,214

Mistral

Apache 2.0

206

198◄─►214

Coherecommand-r

1228

±5

54,710

Cohere

CC-BY-NC-4.0

207

198◄─►214

reka-flash-21b-20240226

1227

±6

25,026

Reka AI

Proprietary

208

199◄─►215

gpt-3.5-turbo-0125

1224

±5

67,214

OpenAI

Proprietary

209

199◄─►216

mistral-medium

1223

±5

34,893

Mistral

Proprietary

210

199◄─►216

Coherec4ai-aya-expanse-8b

1223

±7

9,922

Cohere

CC-BY-NC-4.0

211

203◄─►215

Metallama-3-8b-instruct

1223

±4

106,055

Meta

Llama 3 Community

212

198◄─►219

gemini-pro

1222

±12

6,418

Google

Proprietary

213

198◄─►218

llama-3.1-tulu-3-8b

1221

±11

2,943

Ai2

Llama 3.1

214

205◄─►220

HuggingFacezephyr-orpo-141b-A35b-v0.1

1214

±11

4,712

HuggingFace

Apache 2.0

215

210◄─►219

01.AIyi-1.5-34b-chat

1213

±5

24,417

01 AI

Apache-2.0

216

212◄─►219

Metallama-3.1-8b-instruct

1211

±4

50,234

Meta

Llama 3.1 Community

217

208◄─►225

granite-3.1-8b-instruct

1210

±11

3,142

IBM

Apache 2.0

218

212◄─►224

qwen1.5-32b-chat

1205

±6

22,068

Alibaba

Qianwen LICENSE

219

213◄─►227

gpt-3.5-turbo-1106

1202

±9

16,760

OpenAI

Proprietary

220

216◄─►227

Azurephi-3-medium-4k-instruct

1198

±5

25,301

Microsoft

MIT

221

217◄─►227

gemma-2-2b-it

1198

±4

46,901

Google

Gemma license

222

217◄─►227

mixtral-8x7b-instruct-v0.1

1198

±4

74,303

Mistral

Apache 2.0

223

217◄─►232

dbrx-instruct-preview

1196

±6

32,760

Databricks

DBRX LICENSE

224

217◄─►236

qwen1.5-14b-chat

1193

±7

18,066

Alibaba

Qianwen LICENSE

225

218◄─►236

InternLMinternlm2_5-20b-chat

1192

±7

10,038

InternLM

Other

226

219◄─►242

Azurewizardlm-70b

1185

±9

8,270

Microsoft

Llama 2 Community

227

219◄─►243

deepseek-llm-67b-chat

1184

±12

4,950

DeepSeek AI

DeepSeek License

228

223◄─►240

01.AIyi-34b-chat

1184

±7

15,624

01 AI

Yi License

229

223◄─►242

granite-3.0-8b-instruct

1183

±9

6,727

IBM

Apache 2.0

230

223◄─►242

OpenChatopenchat-3.5-0106

1183

±8

12,712

OpenChat

Apache-2.0

231

223◄─►243

OpenChatopenchat-3.5

1182

±10

8,009

OpenChat

Apache-2.0

232

223◄─►244

granite-3.1-2b-instruct

1181

±11

3,235

IBM

Apache 2.0

233

224◄─►242

Snowflakesnowflake-arctic-instruct

1180

±6

33,272

Snowflake

Apache 2.0

234

224◄─►243

gemma-1.1-7b-it

1180

±6

24,327

Google

Gemma license

235

224◄─►245

tulu-2-dpo-70b

1179

±10

6,579

AllenAI/UW

AI2 ImpACT Low-risk

236

224◄─►247

openhermes-2.5-mistral-7b

1176

±10

5,026

NousResearch

Apache-2.0

237

226◄─►246

vicuna-33b

1173

±6

22,613

LMSYS

Non-commercial

238

226◄─►247

starling-lm-7b-beta

1173

±7

16,190

Nexusflow

Apache-2.0

239

226◄─►247

Azurephi-3-small-8k-instruct

1172

±6

17,983

Microsoft

MIT

240

227◄─►247

Metallama-2-70b-chat

1171

±5

38,767

Meta

Llama 2 Community

241

227◄─►250

starling-lm-7b-alpha

1168

±8

10,267

UC Berkeley

CC-BY-NC-4.0

242

231◄─►251

Metallama-3.2-3b-instruct

1167

±8

8,043

Meta

Llama 3.2

243

226◄─►252

nous-hermes-2-mixtral-8x7b-dpo

1166

±12

3,792

NousResearch

Apache-2.0

244

234◄─►257

qwq-32b-preview

1159

±11

3,256

Alibaba

Apache 2.0

245

235◄─►260

Nvidiallama2-70b-steerlm-chat

1157

±13

3,605

Nvidia

Llama 2 Community

246

241◄─►257

granite-3.0-2b-instruct

1157

±8

6,922

IBM

Apache 2.0

247

237◄─►261

solar-10.7b-instruct-v1.0

1154

±13

4,187

Upstage AI

CC-BY-NC-4.0

248

236◄─►265

dolphin-2.2.1-mistral-7b

1152

±15

1,685

Cognitive Computations

Apache-2.0

249

241◄─►263

mpt-30b-chat

1151

±12

2,606

MosaicML

CC-BY-NC-SA-4.0

250

243◄─►261

mistral-7b-instruct-v0.2

1151

±7

19,603

Mistral

Apache-2.0

251

242◄─►261

Azurewizardlm-13b

1150

±9

7,122

Microsoft

Llama 2 Community

252

241◄─►268

falcon-180b-chat

1147

±17

1,312

TII

Falcon-180B TII License

253

244◄─►266

qwen1.5-7b-chat

1144

±10

4,782

Alibaba

Qianwen LICENSE

254

244◄─►265

Azurephi-3-mini-4k-instruct-june-2024

1143

±6

12,415

Microsoft

MIT

255

244◄─►265

Metallama-2-13b-chat

1143

±7

19,357

Meta

Llama 2 Community

256

244◄─►266

vicuna-13b

1142

±7

19,539

LMSYS

Llama 2 Community

257

244◄─►268

qwen-14b-chat

1139

±11

5,004

Alibaba

Qianwen LICENSE

258

246◄─►268

palm-2

1137

±9

8,634

Google

Proprietary

259

246◄─►268

Metacodellama-34b-instruct

1137

±9

7,417

Meta

Llama 2 Community

260

246◄─►268

gemma-7b-it

1135

±9

9,034

Google

Gemma license

261

250◄─►269

HuggingFacezephyr-7b-beta

1132

±9

11,220

HuggingFace

MIT

262

251◄─►269

Azurephi-3-mini-128k-instruct

1131

±7

21,024

Microsoft

MIT

263

254◄─►269

Azurephi-3-mini-4k-instruct

1129

±6

20,539

Microsoft

MIT

264

247◄─►273

HuggingFacezephyr-7b-alpha

1128

±16

1,803

HuggingFace

MIT

265

250◄─►272

guanaco-33b

1128

±12

2,955

UW

Non-commercial

266

256◄─►273

stripedhyena-nous-7b

1121

±11

5,214

Together AI

Apache 2.0

267

251◄─►274

Metacodellama-70b-instruct

1119

±18

1,151

Meta

Llama 2 Community

268

256◄─►273

HuggingFacesmollm2-1.7b-instruct

1119

±14

2,244

HuggingFace

Apache 2.0

269

261◄─►273

vicuna-7b

1115

±9

6,972

LMSYS

Llama 2 Community

270

264◄─►273

gemma-1.1-2b-it

1114

±8

11,035

Google

Gemma license

271

264◄─►273

Metallama-3.2-1b-instruct

1113

±8

8,166

Meta

Llama 3.2

272

264◄─►274

mistral-7b-instruct

1111

±9

9,042

Mistral

Apache 2.0

273

265◄─►274

Metallama-2-7b-chat

1109

±7

14,272

Meta

Llama 2 Community

274

271◄─►278

gemma-2b-it

1091

±12

4,817

Google

Gemma license

275

274◄─►276

qwen1.5-4b-chat

1091

±9

7,662

Alibaba

Qianwen LICENSE

276

274◄─►281

olmo-7b-instruct

1074

±11

6,412

Allen AI

Apache-2.0

277

275◄─►281

koala-13b

1071

±10

6,998

UC Berkeley

Non-commercial

278

276◄─►281

alpaca-13b

1067

±11

5,828

Stanford

Non-commercial

279

275◄─►282

gpt4all-13b-snoozy

1066

±15

1,773

Nomic AI

Non-commercial

280

276◄─►282

mpt-7b-chat

1062

±12

3,977

MosaicML

CC-BY-NC-SA-4.0

281

276◄─►282

chatglm3-6b

1057

±12

4,692

Tsinghua

Apache-2.0

282

279◄─►284

RWKVRWKV-4-Raven-14B

1042

±11

4,898

RWKV

Apache 2.0

283

282◄─►284

chatglm2-6b

1026

±14

2,683

Tsinghua

Apache-2.0

284

282◄─►284

oasst-pythia-12b

1023

±11

6,343

OpenAssistant

Apache 2.0

285

285◄─►288

chatglm-6b

996

±13

4,968

Tsinghua

Non-commercial

286

285◄─►288

fastchat-t5-3b

992

±12

4,270

LMSYS

Apache 2.0

287

285◄─►288

dolly-v2-12b

980

±14

3,471

Databricks

MIT

288

285◄─►289

Metallama-13b

972

±16

2,441

Meta

Non-commercial

289

288◄─►289

Stabilitystablelm-tuned-alpha-7b

953

±13

3,325

Stability AI

CC-BY-NC-SA-4.0

Description

  • Rank (UB): Rank computed based on the Bradley-Terry model. This ranking reflects the model's overall performance in the arena and provides its Elo score's upper bound estimate, helping to understand the model's potential competitiveness.

  • Model: The name of the large language model (LLM). Some model names may include embedded links.

  • Score: The model's Elo score in the arena determined by user votes. The higher the Elo score, the better the model performed.

  • 95% Confidence Interval (±): The 95% confidence interval for the model's Elo score (for example:±6). A smaller interval indicates the model's score is more stable and reliable.

  • Votes: The total number of votes the model received in the arena. More votes generally imply greater statistical reliability of its score.

  • Organization/Company: The organization or company that provides the model.

  • License: The model's license type, such as Proprietary, Apache 2.0, MIT, etc.

Data sources and update frequency

The leaderboard data is automatically fetched by scripts directly from the 1 2 official website. This leaderboard is automatically updated daily via GitHub Actions.

Disclaimer

This report is for reference only. Leaderboard data is dynamic and based on users' preference votes on the Chatbot Arena during specific time periods. The integrity and accuracy of the data depend on upstream data sources. Different models may use different licenses; please refer to the model provider's official documentation when using them.

Last updated

Was this helpful?