Model Rankings

This is a leaderboard based on Chatbot Arena (lmarena.ai) data, generated through an automated process.

Data update time: 2026-02-11 08:18:25 UTC / 2026-02-11 16:18:25 CST (Beijing Time)

Leaderboard

| Rank | Rank Spread | Model | Score | Votes |
| --- | --- | --- | --- | --- |
| 1 | 1-2 | claude-opus-4-6-thinking (Anthropic · Proprietary) | 1505±10 | 3,679 |
| 2 | 1-2 | claude-opus-4-6 (Anthropic · Proprietary) | 1503±9 | 4,427 |
| 3 | 3-3 | gemini-3-pro (Google · Proprietary) | 1486±4 | 35,513 |
| 4 | 4-7 | grok-4.1-thinking (xAI · Proprietary) | 1475±4 | 35,214 |
| 5 | 4-9 | gemini-3-flash (Google · Proprietary) | 1472±5 | 26,167 |
| 6 | 4-9 | claude-opus-4-5-20251101-thinking-32k (Anthropic · Proprietary) | 1471±5 | 27,282 |
| 7 | 4-9 | claude-opus-4-5-20251101 (Anthropic · Proprietary) | 1467±4 | 32,179 |
| 8 | 5-10 | grok-4.1 (xAI · Proprietary) | 1464±4 | 39,284 |
| 9 | 5-11 | gemini-3-flash (thinking-minimal) (Google · Proprietary) | 1462±5 | 17,361 |
| 10 | 8-14 | gpt-5.1-high (OpenAI · Proprietary) | 1458±4 | 31,519 |
| 11 | 9-21 | ernie-5.0-0110 (Baidu · Proprietary) | 1452±6 (Preliminary) | 11,080 |
| 12 | 10-21 | claude-sonnet-4-5-20250929 (Anthropic · Proprietary) | 1451±4 | 43,519 |
| 13 | 11-21 | claude-sonnet-4-5-20250929-thinking-32k (Anthropic · Proprietary) | 1450±4 | 45,809 |
| 14 | 10-23 | ernie-5.0-preview-1203 (Baidu · Proprietary) | 1449±7 | 9,773 |
| 15 | 11-21 | gemini-2.5-pro (Google · Proprietary) | 1449±3 | 94,838 |
| 16 | 11-22 | claude-opus-4-1-20250805-thinking-16k (Anthropic · Proprietary) | 1449±4 | 49,947 |
| 17 | 10-25 | kimi-k2.5-thinking (Moonshot · Modified MIT) | 1448±7 | 8,083 |
| 18 | 11-25 | claude-opus-4-1-20250805 (Anthropic · Proprietary) | 1445±3 | 74,911 |
| 19 | 11-27 | gpt-4.5-preview-2025-02-27 (OpenAI · Proprietary) | 1444±6 | 14,549 |
| 20 | 15-25 | chatgpt-4o-latest-20250326 (OpenAI · Proprietary) | 1442±3 | 82,329 |
| 21 | 11-29 | glm-4.7 (Z.ai · MIT) | 1441±6 | 12,013 |
| 22 | 11-36 | kimi-k2.5-instant (Moonshot · Modified MIT) | 1438±10 | 3,830 |
| 23 | 17-29 | gpt-5.1 (OpenAI · Proprietary) | 1438±4 | 33,736 |
| 24 | 16-30 | gpt-5.2 (OpenAI · Proprietary) | 1438±6 | 12,759 |
| 25 | 17-32 | gpt-5.2-high (OpenAI · Proprietary) | 1436±6 | 16,100 |
| 26 | 20-32 | gpt-5-high (OpenAI · Proprietary) | 1434±5 | 32,619 |
| 27 | 20-35 | qwen3-max-preview (Alibaba · Proprietary) | 1434±5 | 27,842 |
| 28 | 21-36 | o3-2025-04-16 (OpenAI · Proprietary) | 1432±4 | 61,351 |
| 29 | 21-38 | grok-4-1-fast-reasoning (xAI · Proprietary) | 1431±4 | 28,142 |
| 30 | 23-43 | kimi-k2-thinking-turbo (Moonshot · Modified MIT) | 1429±4 | 33,118 |
| 31 | 24-47 | gpt-5-chat (OpenAI · Proprietary) | 1426±4 | 31,821 |
| 32 | 27-50 | glm-4.6 (Z.ai · MIT) | 1425±4 | 35,331 |
| 33 | 24-50 | qwen3-max-2025-09-23 (Alibaba · Proprietary) | 1425±6 | 9,223 |
| 34 | 29-50 | claude-opus-4-20250514-thinking-16k (Anthropic · Proprietary) | 1424±4 | 37,973 |
| 35 | 26-52 | deepseek-v3.2-exp (DeepSeek · MIT) | 1423±6 | 11,762 |
| 36 | 26-52 | deepseek-v3.2-exp-thinking (DeepSeek · MIT) | 1423±7 | 8,999 |
| 37 | 30-50 | qwen3-235b-a22b-instruct-2507 (Alibaba · Apache 2.0) | 1422±3 | 69,176 |
| 38 | 26-54 | grok-4-fast-chat (xAI · Proprietary) | 1422±8 | 6,989 |
| 39 | 30-52 | deepseek-v3.2 (DeepSeek · MIT) | 1421±5 | 27,695 |
| 40 | 30-52 | deepseek-v3.2-thinking (DeepSeek · MIT) | 1421±5 | 22,732 |
| 41 | 31-57 | deepseek-r1-0528 (DeepSeek · MIT) | 1418±6 | 19,289 |
| 42 | 29-58 | ernie-5.0-preview-1022 (Baidu · Proprietary) | 1418±9 | 4,616 |
| 43 | 31-57 | deepseek-v3.1 (DeepSeek · MIT) | 1418±6 | 15,298 |
| 44 | 31-57 | kimi-k2-0905-preview (Moonshot · Modified MIT) | 1417±6 | 11,973 |
| 45 | 31-57 | deepseek-v3.1-thinking (DeepSeek · MIT) | 1417±7 | 11,985 |
| 46 | 32-57 | kimi-k2-0711-preview (Moonshot · Modified MIT) | 1417±5 | 28,662 |
| 47 | 32-57 | mistral-large-3 (Mistral · Apache 2.0) | 1416±5 | 24,050 |
| 48 | 30-60 | deepseek-v3.1-terminus (DeepSeek · MIT) | 1416±10 | 3,761 |
| 49 | 30-64 | deepseek-v3.1-terminus-thinking (DeepSeek · MIT) | 1415±10 | 3,548 |
| 50 | 32-59 | qwen3-vl-235b-a22b-instruct (Alibaba · Apache 2.0) | 1415±6 | 11,684 |
| 51 | 36-57 | gpt-4.1-2025-04-14 (OpenAI · Proprietary) | 1413±4 | 52,211 |
| 52 | 36-57 | claude-opus-4-20250514 (Anthropic · Proprietary) | 1413±4 | 45,573 |
| 53 | 40-60 | grok-3-preview-02-24 (xAI · Proprietary) | 1411±4 | 33,970 |
| 54 | 41-60 | mistral-medium-2508 (Mistral · Proprietary) | 1411±3 | 63,013 |
| 55 | 41-60 | gemini-2.5-flash (Google · Proprietary) | 1410±3 | 94,114 |
| 56 | 40-64 | glm-4.5 (Z.ai · MIT) | 1410±5 | 24,792 |
| 57 | 41-63 | grok-4-0709 (xAI · Proprietary) | 1410±4 | 42,143 |
| 58 | 49-70 | gemini-2.5-flash-preview-09-2025 (Google · Proprietary) | 1405±4 | 32,875 |
| 59 | 51-70 | claude-haiku-4-5-20251001 (Anthropic · Proprietary) | 1404±4 | 44,511 |
| 60 | 50-71 | grok-4-fast-reasoning (xAI · Proprietary) | 1403±5 | 18,637 |
| 61 | 55-72 | o1-2024-12-17 (OpenAI · Proprietary) | 1402±4 | 27,822 |
| 62 | 55-72 | qwen3-235b-a22b-no-thinking (Alibaba · Apache 2.0) | 1401±4 | 39,546 |
| 63 | 55-73 | qwen3-next-80b-a3b-instruct (Alibaba · Apache 2.0) | 1401±5 | 22,856 |
| 64 | 58-73 | claude-sonnet-4-20250514-thinking-32k (Anthropic · Proprietary) | 1400±4 | 36,207 |
| 65 | 56-79 | longcat-flash-chat (Meituan · MIT) | 1399±6 | 11,548 |
| 66 | 58-81 | qwen3-235b-a22b-thinking-2507 (Alibaba · Apache 2.0) | 1398±6 | 9,254 |
| 67 | 58-79 | deepseek-r1 (DeepSeek · MIT) | 1398±5 | 18,537 |
| 68 | 58-86 | qwen3-vl-235b-a22b-thinking (Alibaba · Apache 2.0) | 1395±7 | 7,971 |
| 69 | 58-87 | amazon-nova-experimental-chat-12-10 (Amazon · Proprietary) | 1395±10 | 3,718 |
| 70 | 60-84 | mimo-v2-flash (non-thinking) (Xiaomi · MIT) | 1394±5 | 16,651 |
| 71 | 61-82 | deepseek-v3-0324 (DeepSeek · MIT) | 1394±4 | 46,690 |
| 72 | 58-88 | hunyuan-vision-1.5-thinking (Tencent · Proprietary) | 1393±12 | 2,225 |
| 73 | 63-86 | mai-1-preview (Microsoft AI · Proprietary) | 1391±5 | 18,134 |
| 74 | 65-86 | o4-mini-2025-04-16 (OpenAI · Proprietary) | 1391±4 | 46,676 |
| 75 | 65-87 | gpt-5-mini-high (OpenAI · Proprietary) | 1390±5 | 27,145 |
| 76 | 65-87 | claude-sonnet-4-20250514 (Anthropic · Proprietary) | 1390±4 | 41,645 |
| 77 | 67-87 | claude-3-7-sonnet-20250219-thinking-32k (Anthropic · Proprietary) | 1388±4 | 39,889 |
| 78 | 65-88 | o1-preview (OpenAI · Proprietary) | 1388±5 | 31,120 |
| 79 | 67-88 | minimax-m2.1-preview (MiniMax · MIT) | 1387±5 | 16,114 |
| 80 | 65-90 | hunyuan-t1-20250711 (Tencent · Proprietary) | 1387±9 | 4,801 |
| 81 | 68-88 | qwen3-coder-480b-a35b-instruct (Alibaba · Apache 2.0) | 1386±5 | 26,652 |
| 82 | 65-90 | step-3.5-flash (StepFun · Apache 2.0) | 1386±8 | 5,571 |
| 83 | 69-88 | mistral-medium-2505 (Mistral · Proprietary) | 1385±5 | 34,548 |
| 84 | 70-90 | qwen3-30b-a3b-instruct-2507 (Alibaba · Apache 2.0) | 1383±5 | 24,092 |
| 85 | 70-91 | hunyuan-turbos-20250416 (Tencent · Proprietary) | 1382±6 | 11,053 |
| 86 | 73-91 | gpt-4.1-mini-2025-04-14 (OpenAI · Proprietary) | 1382±4 | 40,532 |
| 87 | 77-91 | gemini-2.5-flash-lite-preview-09-2025-no-thinking (Google · Proprietary) | 1380±4 | 47,414 |
| 88 | 69-101 | glm-4.6v (Z.ai · MIT) | 1378±11 | 2,819 |
| 89 | 82-98 | gemini-2.5-flash-lite-preview-06-17-thinking (Google · Proprietary) | 1375±4 | 33,929 |
| 90 | 82-98 | qwen3-235b-a22b (Alibaba · Apache 2.0) | 1375±5 | 27,164 |
| 91 | 85-98 | qwen2.5-max (Alibaba · Proprietary) | 1374±4 | 33,294 |
| 92 | 88-98 | claude-3-5-sonnet-20241022 (Anthropic · Proprietary) | 1373±3 | 89,471 |
| 93 | 88-101 | claude-3-7-sonnet-20250219 (Anthropic · Proprietary) | 1372±4 | 44,434 |
| 94 | 88-101 | glm-4.5-air (Z.ai · MIT) | 1371±4 | 31,382 |
| 95 | 88-104 | qwen3-next-80b-a3b-thinking (Alibaba · Apache 2.0) | 1369±6 | 13,859 |
| 96 | 88-104 | minimax-m1 (MiniMax · Apache 2.0) | 1367±4 | 36,838 |
| 97 | 88-106 | amazon-nova-experimental-chat-11-10 (Amazon · Proprietary) | 1367±5 | 16,393 |
| 98 | 92-105 | gemma-3-27b-it (Google · Gemma) | 1365±4 | 48,694 |
| 99 | 92-109 | o3-mini-high (OpenAI · Proprietary) | 1364±5 | 18,584 |
| 100 | 88-114 | glm-4.7-flash (Z.ai · MIT) | 1363±7 | 7,376 |
| 101 | 92-112 | grok-3-mini-high (xAI · Proprietary) | 1363±5 | 17,523 |
| 102 | 95-112 | gemini-2.0-flash-001 (Google · Proprietary) | 1361±4 | 44,828 |
| 103 | 95-119 | deepseek-v3 (DeepSeek · DeepSeek) | 1358±5 | 21,788 |
| 104 | 98-121 | grok-3-mini-beta (xAI · Proprietary) | 1357±5 | 23,765 |
| 105 | 95-126 | intellect-3 (Prime Intellect · MIT) | 1356±8 | 5,325 |
| 106 | 99-125 | mistral-small-2506 (Mistral · Apache 2.0) | 1356±5 | 18,343 |
| 107 | 100-125 | gpt-oss-120b (OpenAI · Apache 2.0) | 1354±4 | 30,963 |
| 108 | 100-126 | gemini-2.0-flash-lite-preview-02-05 (Google · Proprietary) | 1353±4 | 24,951 |
| 109 | 97-127 | glm-4.5v (Z.ai · MIT) | 1353±8 | 4,973 |
| 110 | 102-125 | command-a-03-2025 (Cohere · CC-BY-NC-4.0) | 1353±3 | 57,439 |
| 111 | 103-126 | gemini-1.5-pro-002 (Google · Proprietary) | 1351±3 | 55,607 |
| 112 | 103-129 | amazon-nova-experimental-chat-10-20 (Amazon · Proprietary) | 1349±6 | 11,439 |
| 113 | 99-139 | hunyuan-turbos-20250226 (Tencent · Proprietary) | 1349±12 | 2,226 |
| 114 | 104-128 | o3-mini (OpenAI · Proprietary) | 1348±3 | 58,638 |
| 115 | 100-139 | amazon-nova-experimental-chat-10-09 (Amazon · Proprietary) | 1347±11 | 2,890 |
| 116 | 99-140 | llama-3.1-nemotron-ultra-253b-v1 (Nvidia · Nvidia Open Model) | 1347±12 | 2,546 |
| 117 | 102-138 | qwen3-32b (Alibaba · Apache 2.0) | 1347±9 | 3,932 |
| 118 | 103-133 | ling-flash-2.0 (Ant Group · MIT) | 1347±7 | 7,038 |
| 119 | 103-136 | minimax-m2 (MiniMax · Apache 2.0) | 1347±8 | 6,742 |
| 120 | 104-136 | step-3 (StepFun · Apache 2.0) | 1346±7 | 6,592 |
| 121 | 103-137 | qwen-plus-0125 (Alibaba · Proprietary) | 1346±8 | 5,823 |
| 122 | 108-130 | gpt-4o-2024-05-13 (OpenAI · Proprietary) | 1346±3 | 112,863 |
| 123 | 105-140 | glm-4-plus-0111 (Zhipu · Proprietary) | 1343±8 | 5,760 |
| 124 | 111-133 | claude-3-5-sonnet-20240620 (Anthropic · Proprietary) | 1343±3 | 82,417 |
| 125 | 105-142 | gemma-3-12b-it (Google · Gemma) | 1342±10 | 3,829 |
| 126 | 105-144 | nvidia-llama-3.3-nemotron-super-49b-v1.5 (Nvidia · Nvidia Open) | 1341±10 | 3,432 |
| 127 | 104-145 | hunyuan-turbo-0110 (Tencent · Proprietary) | 1340±12 | 2,295 |
| 128 | 112-144 | gpt-5-nano-high (OpenAI · Proprietary) | 1338±7 | 8,399 |
| 129 | 113-144 | nova-2-lite (Amazon · Proprietary) | 1337±6 | 12,258 |
| 130 | 115-141 | o1-mini (OpenAI · Proprietary) | 1337±4 | 51,986 |
| 131 | 115-144 | qwq-32b (Alibaba · Apache 2.0) | 1336±4 | 26,118 |
| 132 | 117-143 | llama-3.1-405b-instruct-bf16 (Meta · Llama 3.1 Community) | 1335±4 | 41,392 |
| 133 | 117-144 | gpt-4o-2024-08-06 (OpenAI · Proprietary) | 1335±4 | 45,498 |
| 134 | 119-144 | grok-2-2024-08-13 (xAI · Proprietary) | 1335±4 | 63,495 |
| 135 | 115-144 | gemini-advanced-0514 (Google · Proprietary) | 1335±5 | 50,142 |
| 136 | 114-151 | step-2-16k-exp-202412 (StepFun · Proprietary) | 1334±9 | 4,829 |
| 137 | 121-144 | llama-3.1-405b-instruct-fp8 (Meta · Llama 3.1 Community) | 1334±4 | 59,655 |
| 138 | 120-152 | olmo-3.1-32b-instruct (Allen AI · Apache 2.0) | 1331±6 | 11,634 |
| 139 | 125-157 | yi-lightning (01 AI · Proprietary) | 1329±5 | 27,340 |
| 140 | 126-157 | qwen3-30b-a3b (Alibaba · Apache 2.0) | 1328±5 | 27,431 |
| 141 | 127-157 | llama-4-maverick-17b-128e-instruct (Meta · Llama 4) | 1328±4 | 41,136 |
| 142 | 117-166 | llama-3.3-nemotron-49b-super-v1 (Nvidia · Nvidia) | 1327±12 | 2,230 |
| 143 | 123-166 | hunyuan-large-2025-02-10 (Tencent · Proprietary) | 1326±10 | 3,738 |
| 144 | 137-162 | gpt-4-turbo-2024-04-09 (OpenAI · Proprietary) | 1324±4 | 98,130 |
| 145 | 137-162 | claude-3-5-haiku-20241022 (Anthropic · Proprietary) | 1324±3 | 71,183 |
| 146 | 128-166 | deepseek-v2.5-1210 (DeepSeek · DeepSeek) | 1323±8 | 6,793 |
| 147 | 137-162 | gemini-1.5-pro-001 (Google · Proprietary) | 1323±4 | 79,132 |
| 148 | 137-163 | llama-4-scout-17b-16e-instruct (Meta · Llama) | 1323±5 | 31,235 |
| 149 | 138-162 | claude-3-opus-20240229 (Anthropic · Proprietary) | 1322±3 | 194,904 |
| 150 | 136-167 | gpt-4.1-nano-2025-04-14 (OpenAI · Proprietary) | 1322±8 | 6,107 |
| 151 | 137-166 | step-1o-turbo-202506 (StepFun · Proprietary) | 1321±7 | 9,687 |
| 152 | 137-168 | ring-flash-2.0 (Ant Group · MIT) | 1320±7 | 7,204 |
| 153 | 142-166 | llama-3.3-70b-instruct (Meta · Llama-3.3) | 1320±3 | 55,576 |
| 154 | 139-166 | glm-4-plus (Zhipu AI · Proprietary) | 1319±5 | 26,134 |
| 155 | 139-168 | gemma-3n-e4b-it (Google · Gemma) | 1319±5 | 23,342 |
| 156 | 139-169 | qwen-max-0919 (Alibaba · Qwen) | 1318±6 | 16,479 |
| 157 | 142-166 | gpt-4o-mini-2024-07-18 (OpenAI · Proprietary) | 1318±3 | 68,801 |
| 158 | 139-173 | gpt-oss-20b (OpenAI · Apache 2.0) | 1317±6 | 10,810 |
| 159 | 142-173 | nvidia-nemotron-3-nano-30b-a3b-bf16 (Nvidia · NVIDIA Open Model) | 1317±6 | 15,500 |
| 160 | 142-175 | qwen2.5-plus-1127 (Alibaba · Proprietary) | 1315±6 | 10,179 |
| 161 | 146-173 | athene-v2-chat (NexusFlow · NexusFlow) | 1314±5 | 24,746 |
| 162 | 147-173 | mistral-large-2407 (Mistral · Mistral Research) | 1314±4 | 45,460 |
| 163 | 147-174 | gpt-4-0125-preview (OpenAI · Proprietary) | 1313±4 | 93,439 |
| 164 | 147-173 | gpt-4-1106-preview (OpenAI · Proprietary) | 1313±4 | 100,107 |
| 165 | 142-178 | hunyuan-standard-2025-02-10 (Tencent · Proprietary) | 1312±10 | 3,905 |
| 166 | 139-182 | mercury (Inception AI · Proprietary) | 1310±14 | 1,920 |
| 167 | 154-177 | gemini-1.5-flash-002 (Google · Proprietary) | 1310±4 | 34,909 |
| 168 | 158-178 | grok-2-mini-2024-08-13 (xAI · Proprietary) | 1308±4 | 52,574 |
| 169 | 158-178 | deepseek-v2.5 (DeepSeek · DeepSeek) | 1307±5 | 24,574 |
| 170 | 158-178 | athene-70b-0725 (NexusFlow · CC-BY-NC-4.0) | 1306±6 | 19,622 |
| 171 | 154-178 | olmo-3-32b-think (Allen AI · Apache 2.0) | 1306±8 | 5,892 |
| 172 | 162-178 | mistral-large-2411 (Mistral · MRL) | 1305±4 | 28,081 |
| 173 | 158-178 | magistral-medium-2506 (Mistral · Proprietary) | 1305±6 | 12,065 |
| 174 | 164-178 | mistral-small-3.1-24b-instruct-2503 (Mistral · Apache 2.0) | 1305±4 | 34,130 |
| 175 | 157-185 | gemma-3-4b-it (Google · Gemma) | 1303±9 | 4,177 |
| 176 | 165-178 | qwen2.5-72b-instruct (Alibaba · Qwen) | 1303±4 | 39,409 |
| 177 | 165-188 | llama-3.1-nemotron-70b-instruct (Nvidia · Llama 3.1) | 1299±8 | 7,136 |
| 178 | 166-189 | hunyuan-large-vision (Tencent · Proprietary) | 1296±9 | 5,600 |
| 179 | 175-189 | llama-3.1-70b-instruct (Meta · Llama 3.1 Community) | 1294±4 | 55,234 |
| 180 | 176-190 | amazon-nova-pro-v1.0 (Amazon · Proprietary) | 1290±5 | 24,753 |
| 181 | 176-193 | jamba-1.5-large (AI21 Labs · Jamba Open) | 1289±7 | 8,659 |
| 182 | 177-191 | gemma-2-27b-it (Google · Gemma license) | 1288±3 | 75,764 |
| 183 | 175-196 | ibm-granite-h-small (IBM · Apache 2.0) | 1288±8 | 5,682 |
| 184 | 176-193 | reka-core-20240904 (Reka AI · Proprietary) | 1288±7 | 7,309 |
| 185 | 177-193 | gpt-4-0314 (OpenAI · Proprietary) | 1287±5 | 54,167 |
| 186 | 175-199 | llama-3.1-tulu-3-70b (Ai2 · Llama 3.1) | 1287±10 | 2,846 |
| 187 | 175-199 | llama-3.1-nemotron-51b-instruct (Nvidia · Llama 3.1) | 1287±10 | 3,749 |
| 188 | 177-198 | olmo-3.1-32b-think (Allen AI · Apache 2.0) | 1286±7 | 8,479 |
| 189 | 178-193 | gemini-1.5-flash-001 (Google · Proprietary) | 1286±4 | 62,823 |
| 190 | 181-199 | claude-3-sonnet-20240229 (Anthropic · Proprietary) | 1281±4 | 109,289 |
| 191 | 180-199 | gemma-2-9b-it-simpo (Princeton · MIT) | 1280±7 | 10,069 |
| 192 | 182-199 | nemotron-4-340b-instruct (Nvidia · NVIDIA Open Model) | 1278±5 | 19,661 |
| 193 | 182-201 | command-r-plus-08-2024 (Cohere · CC-BY-NC-4.0) | 1277±7 | 9,869 |
| 194 | 186-199 | llama-3-70b-instruct (Meta · Llama 3 Community) | 1276±4 | 156,880 |
| 195 | 187-200 | gpt-4-0613 (OpenAI · Proprietary) | 1276±4 | 88,721 |
| 196 | 186-202 | mistral-small-24b-instruct-2501 (Mistral · Apache 2.0) | 1274±6 | 14,677 |
| 197 | 186-203 | glm-4-0520 (Zhipu AI · Proprietary) | 1274±7 | 9,788 |
| 198 | 187-205 | reka-flash-20240904 (Reka AI · Proprietary) | 1272±7 | 7,537 |
| 199 | 188-208 | qwen2.5-coder-32b-instruct (Alibaba · Apache 2.0) | 1271±8 | 5,430 |
| 200 | 194-208 | c4ai-aya-expanse-32b (Cohere · CC-BY-NC-4.0) | 1267±5 | 27,123 |
| 201 | 196-208 | gemma-2-9b-it (Google · Gemma license) | 1266±4 | 54,615 |
| 202 | 195-209 | deepseek-coder-v2 (DeepSeek · DeepSeek License) | 1264±6 | 15,147 |
| 203 | 198-209 | command-r-plus (Cohere · CC-BY-NC-4.0) | 1262±4 | 77,556 |
| 204 | 197-209 | qwen2-72b-instruct (Alibaba · Qianwen LICENSE) | 1262±5 | 37,325 |
| 205 | 199-209 | claude-3-haiku-20240307 (Anthropic · Proprietary) | 1261±4 | 117,705 |
| 206 | 198-210 | amazon-nova-lite-v1.0 (Amazon · Proprietary) | 1261±5 | 19,376 |
| 207 | 199-210 | gemini-1.5-flash-8b-001 (Google · Proprietary) | 1259±4 | 35,556 |
| 208 | 202-210 | phi-4 (Microsoft · MIT) | 1256±5 | 24,126 |
| 209 | 199-216 | olmo-2-0325-32b-instruct (Allen AI · Apache-2.0) | 1252±11 | 3,335 |
| 210 | 206-215 | command-r-08-2024 (Cohere · CC-BY-NC-4.0) | 1251±7 | 10,141 |
| 211 | 209-219 | mistral-large-2402 (Mistral · Proprietary) | 1243±5 | 62,437 |
| 212 | 209-219 | amazon-nova-micro-v1.0 (Amazon · Proprietary) | 1241±5 | 19,355 |
| 213 | 209-223 | jamba-1.5-mini (AI21 Labs · Jamba Open) | 1239±7 | 8,854 |
| 214 | 209-227 | ministral-8b-2410 (Mistral · MRL) | 1237±9 | 4,780 |
| 215 | 210-227 | gemini-pro-dev-api (Google · Proprietary) | 1235±7 | 18,352 |
| 216 | 211-227 | qwen1.5-110b-chat (Alibaba · Qianwen LICENSE) | 1234±6 | 26,191 |
| 217 | 211-228 | reka-flash-21b-20240226-online (Reka AI · Proprietary) | 1234±7 | 15,451 |
| 218 | 211-227 | qwen1.5-72b-chat (Alibaba · Qianwen LICENSE) | 1234±5 | 39,296 |
| 219 | 209-229 | hunyuan-standard-256k (Tencent · Proprietary) | 1233±12 | 2,729 |
| 220 | 213-228 | mixtral-8x22b-instruct-v0.1 (Mistral · Apache 2.0) | 1230±5 | 51,417 |
| 221 | 213-229 | command-r (Cohere · CC-BY-NC-4.0) | 1227±5 | 54,038 |
| 222 | 213-229 | reka-flash-21b-20240226 (Reka AI · Proprietary) | 1227±6 | 24,806 |
| 223 | 214-230 | gpt-3.5-turbo-0125 (OpenAI · Proprietary) | 1224±5 | 66,191 |
| 224 | 218-230 | llama-3-8b-instruct (Meta · Llama 3 Community) | 1223±4 | 104,636 |
| 225 | 214-231 | mistral-medium (Mistral · Proprietary) | 1223±6 | 34,552 |
| 226 | 214-231 | c4ai-aya-expanse-8b (Cohere · CC-BY-NC-4.0) | 1223±7 | 9,827 |
| 227 | 213-234 | gemini-pro (Google · Proprietary) | 1222±12 | 6,390 |
| 228 | 214-234 | llama-3.1-tulu-3-8b (Ai2 · Llama 3.1) | 1221±11 | 2,895 |
| 229 | 225-234 | yi-1.5-34b-chat (01 AI · Apache-2.0) | 1214±5 | 24,142 |
| 230 | 220-236 | zephyr-orpo-141b-A35b-v0.1 (HuggingFace · Apache 2.0) | 1213±11 | 4,653 |
| 231 | 227-234 | llama-3.1-8b-instruct (Meta · Llama 3.1 Community) | 1212±4 | 49,605 |
| 232 | 223-240 | granite-3.1-8b-instruct (IBM · Apache 2.0) | 1209±11 | 3,092 |
| 233 | 227-240 | qwen1.5-32b-chat (Alibaba · Qianwen LICENSE) | 1204±6 | 21,744 |
| 234 | 227-242 | gpt-3.5-turbo-1106 (OpenAI · Proprietary) | 1203±9 | 16,616 |
| 235 | 231-241 | gemma-2-2b-it (Google · Gemma license) | 1199±4 | 46,618 |
| 236 | 231-242 | phi-3-medium-4k-instruct (Microsoft · MIT) | 1198±5 | 25,055 |
| 237 | 232-242 | mixtral-8x7b-instruct-v0.1 (Mistral · Apache 2.0) | 1198±4 | 73,505 |
| 238 | 232-247 | dbrx-instruct-preview (Databricks · DBRX LICENSE) | 1195±6 | 32,196 |
| 239 | 232-251 | internlm2_5-20b-chat (InternLM · Other) | 1191±7 | 9,902 |
| 240 | 232-251 | qwen1.5-14b-chat (Alibaba · Qianwen LICENSE) | 1191±7 | 17,841 |
| 241 | 235-257 | wizardlm-70b (Microsoft · Llama 2 Community) | 1185±9 | 8,214 |
| 242 | 234-258 | deepseek-llm-67b-chat (DeepSeek · DeepSeek License) | 1185±12 | 4,933 |
| 243 | 238-254 | yi-34b-chat (01 AI · Yi License) | 1184±7 | 15,483 |
| 244 | 238-257 | openchat-3.5-0106 (OpenChat · Apache-2.0) | 1183±8 | 12,636 |
| 245 | 238-258 | openchat-3.5 (OpenChat · Apache-2.0) | 1183±10 | 7,967 |
| 246 | 238-258 | granite-3.0-8b-instruct (IBM · Apache 2.0) | 1182±9 | 6,643 |
| 247 | 239-258 | gemma-1.1-7b-it (Google · Gemma license) | 1180±6 | 23,893 |
| 248 | 239-258 | snowflake-arctic-instruct (Snowflake · Apache 2.0) | 1180±6 | 32,836 |
| 249 | 238-259 | granite-3.1-2b-instruct (IBM · Apache 2.0) | 1180±11 | 3,191 |
| 250 | 239-259 | tulu-2-dpo-70b (AllenAI/UW · AI2 ImpACT Low-risk) | 1178±10 | 6,534 |
| 251 | 239-262 | openhermes-2.5-mistral-7b (Nous Research · Apache-2.0) | 1176±10 | 5,006 |
| 252 | 241-261 | vicuna-33b (LMSYS · Non-commercial) | 1173±6 | 22,479 |
| 253 | 241-264 | starling-lm-7b-beta (Nexusflow · Apache-2.0) | 1172±7 | 16,057 |
| 254 | 241-262 | phi-3-small-8k-instruct (Microsoft · MIT) | 1172±6 | 17,763 |
| 255 | 242-262 | llama-2-70b-chat (Meta · Llama 2 Community) | 1171±6 | 38,491 |
| 256 | 242-265 | starling-lm-7b-alpha (UC Berkeley · CC-BY-NC-4.0) | 1168±8 | 10,224 |
| 257 | 244-265 | llama-3.2-3b-instruct (Meta · Llama 3.2) | 1167±8 | 7,936 |
| 258 | 242-268 | nous-hermes-2-mixtral-8x7b-dpo (Nous Research · Apache-2.0) | 1165±12 | 3,776 |
| 259 | 249-275 | qwq-32b-preview (Alibaba · Apache 2.0) | 1157±12 | 3,233 |
| 260 | 255-272 | granite-3.0-2b-instruct (IBM · Apache 2.0) | 1156±8 | 6,837 |
| 261 | 251-275 | llama2-70b-steerlm-chat (Nvidia · Llama 2 Community) | 1156±13 | 3,584 |
| 262 | 252-278 | solar-10.7b-instruct-v1.0 (Upstage AI · CC-BY-NC-4.0) | 1153±13 | 4,155 |
| 263 | 251-280 | dolphin-2.2.1-mistral-7b (Cognitive Computations · Apache-2.0) | 1152±15 | 1,679 |
| 264 | 256-278 | mpt-30b-chat (MosaicML · CC-BY-NC-SA-4.0) | 1151±12 | 2,571 |
| 265 | 258-275 | mistral-7b-instruct-v0.2 (Mistral · Apache-2.0) | 1150±7 | 19,402 |
| 266 | 258-277 | wizardlm-13b (Microsoft · Llama 2 Community) | 1150±9 | 7,046 |
| 267 | 255-282 | falcon-180b-chat (TII · Falcon-180B TII License) | 1147±17 | 1,295 |
| 268 | 258-281 | qwen1.5-7b-chat (Alibaba · Qianwen LICENSE) | 1144±10 | 4,735 |
| 269 | 259-280 | phi-3-mini-4k-instruct-june-2024 (Microsoft · MIT) | 1143±6 | 12,296 |
| 270 | 259-281 | llama-2-13b-chat (Meta · Llama 2 Community) | 1142±7 | 19,171 |
| 271 | 259-281 | vicuna-13b (LMSYS · Llama 2 Community) | 1141±7 | 19,366 |
| 272 | 259-283 | qwen-14b-chat (Alibaba · Qianwen LICENSE) | 1139±11 | 4,964 |
| 273 | 260-283 | palm-2 (Google · Proprietary) | 1138±9 | 8,554 |
| 274 | 260-283 | code llama-34b-instruct (Meta · Llama 2 Community) | 1137±9 | 7,363 |
| 275 | 260-283 | gemma-7b-it (Google · Gemma license) | 1136±10 | 8,925 |
| 276 | 263-284 | zephyr-7b-beta (HuggingFace · MIT) | 1131±9 | 11,116 |
| 277 | 266-284 | phi-3-mini-128k-instruct (Microsoft · MIT) | 1130±7 | 20,691 |
| 278 | 268-284 | phi-3-mini-4k-instruct (Microsoft · MIT) | 1129±6 | 20,115 |
| 279 | 264-287 | guanaco-33b (UW · Non-commercial) | 1128±12 | 2,921 |
| 280 | 263-288 | zephyr-7b-alpha (HuggingFace · MIT) | 1127±16 | 1,785 |
| 281 | 271-288 | stripedhyena-nous-7b (Together AI · Apache 2.0) | 1121±11 | 5,184 |
| 282 | 266-289 | code llama-70b-instruct (Meta · Llama 2 Community) | 1119±18 | 1,143 |
| 283 | 276-288 | vicuna-7b (LMSYS · Llama 2 Community) | 1115±9 | 6,923 |
| 284 | 272-289 | smollm2-1.7b-instruct (HuggingFace · Apache 2.0) | 1115±14 | 2,201 |
| 285 | 279-288 | gemma-1.1-2b-it (Google · Gemma license) | 1114±8 | 10,853 |
| 286 | 279-288 | llama-3.2-1b-instruct (Meta · Llama 3.2) | 1112±8 | 8,045 |
| 287 | 279-289 | mistral-7b-instruct (Mistral · Apache 2.0) | 1110±9 | 8,977 |
| 288 | 280-289 | llama-2-7b-chat (Meta · Llama 2 Community) | 1108±7 | 14,148 |
| 289 | 285-293 | gemma-2b-it (Google · Gemma license) | 1092±12 | 4,779 |
| 290 | 289-292 | qwen1.5-4b-chat (Alibaba · Qianwen LICENSE) | 1091±9 | 7,598 |
| 291 | 289-296 | olmo-7b-instruct (Allen AI · Apache-2.0) | 1075±11 | 6,329 |
| 292 | 290-296 | koala-13b (UC Berkeley · Non-commercial) | 1071±10 | 6,964 |
| 293 | 291-296 | alpaca-13b (Stanford · Non-commercial) | 1068±12 | 5,745 |
| 294 | 289-297 | gpt4all-13b-snoozy (Nomic AI · Non-commercial) | 1066±15 | 1,743 |
| 295 | 291-297 | mpt-7b-chat (MosaicML · CC-BY-NC-SA-4.0) | 1062±12 | 3,925 |
| 296 | 291-297 | chatglm3-6b (Tsinghua · Apache-2.0) | 1056±12 | 4,658 |
| 297 | 294-299 | RWKV-4-Raven-14B (RWKV · Apache 2.0) | 1042±11 | 4,845 |
| 298 | 297-299 | chatglm2-6b (Tsinghua · Apache-2.0) | 1025±14 | 2,657 |
| 299 | 297-299 | oasst-pythia-12b (OpenAssistant · Apache 2.0) | 1023±11 | 6,311 |
| 300 | 300-303 | chatglm-6b (Tsinghua · Non-commercial) | 996±13 | 4,914 |
| 301 | 300-303 | fastchat-t5-3b (LMSYS · Apache 2.0) | 992±12 | 4,203 |
| 302 | 300-303 | dolly-v2-12b (Databricks · MIT) | 981±14 | 3,412 |
| 303 | 300-304 | llama-13b (Meta · Non-commercial) | 973±16 | 2,391 |
| 304 | 303-304 | stablelm-tuned-alpha-7b (Stability AI · CC-BY-NC-SA-4.0) | 953±13 | 3,287 |

Notes

  • Rank (UB): The model's rank, computed with the Bradley-Terry model. It reflects the model's overall performance in the arena and is derived from an upper-bound estimate of its Elo score, which indicates how competitive the model could potentially be.

  • Model: Names of the large language models (LLMs). Some model names may contain embedded links.

  • Score: The models' Elo scores in the arena obtained via user votes. Elo is a relative ranking system; higher scores indicate better performance.

  • 95% confidence interval (±): The 95% confidence interval for the model's Elo score (for example, ±6). A narrower interval indicates a more stable and reliable score.

  • Votes: The total number of votes the model received in the arena. More votes generally imply higher statistical reliability of its score.

  • Organization/Company: The organization or company that provides the model.

  • License: The type of license for the model, e.g., Proprietary, Apache 2.0, MIT, etc.
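
To make the Score and ± columns more concrete, here is a minimal Python sketch. It assumes the conventional Elo logistic curve (base 10, scale 400); this page does not state which scale the arena scores use, so treat the resulting probability as illustrative only. The function names are hypothetical and are not part of any leaderboard tooling.

```python
def expected_win_probability(score_a: float, score_b: float, scale: float = 400.0) -> float:
    """Chance that model A is preferred over model B in a single vote.

    Assumes the conventional Elo logistic curve (base 10, scale 400);
    the scale is an assumption, not something stated on this page.
    """
    return 1.0 / (1.0 + 10.0 ** ((score_b - score_a) / scale))


def intervals_overlap(score_a: float, margin_a: float,
                      score_b: float, margin_b: float) -> bool:
    """True when the two 95% confidence intervals (score ± margin) share any value."""
    low_a, high_a = score_a - margin_a, score_a + margin_a
    low_b, high_b = score_b - margin_b, score_b + margin_b
    return low_a <= high_b and low_b <= high_a


if __name__ == "__main__":
    # Scores and margins copied from ranks 1, 2, and 3 of the leaderboard.
    p = expected_win_probability(1505, 1486)
    print(f"rank 1 vs rank 3, expected win probability: {p:.3f}")
    print("rank 1 vs rank 2 intervals overlap:", intervals_overlap(1505, 10, 1503, 9))
    print("rank 1 vs rank 3 intervals overlap:", intervals_overlap(1505, 10, 1486, 4))
```

Under these assumptions, a gap of a few points corresponds to a win probability only slightly above 50%. When two models' intervals overlap, the votes collected so far do not clearly separate them.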

Data sources and update frequency

The data for this leaderboard is fetched automatically by scripts directly from the official Chatbot Arena website (lmarena.ai). The leaderboard is updated automatically every day by GitHub Actions.

Disclaimer

This report is for reference only. The leaderboard data is dynamic and based on user preference votes on Chatbot Arena during specific time periods. The completeness and accuracy of the data depend on upstream sources. Different models may use different license agreements; please consult the model provider's official documentation when using them.
