Model Leaderboard

This is a leaderboard based on Chatbot Arena (lmarena.ai) data, generated through an automated process.

Data updated: 2025-06-25 11:42:53 UTC / 2025-06-25 19:42:53 CST (Beijing Time)

Click on the model name in the leaderboard to navigate to its detailed information or trial page.

Leaderboard

Rank(UB)
Rank(StyleCtrl)
Model Name
Score
Confidence Interval
Votes
Provider
License
Knowledge Cutoff Date

1

1

1477

+5/-5

12,327

Google

Proprietary

No data available

2

2

1446

+4/-6

14,040

Google

Proprietary

No data available

3

3

1428

+5/-3

22,488

OpenAI

Proprietary

No data available

3

2

1428

+5/-4

18,205

OpenAI

Proprietary

No data available

3

6

1424

+5/-6

11,871

DeepSeek

MIT

No data available

3

7

1422

+4/-5

24,316

xAI

Proprietary

No data available

4

6

1420

+5/-6

17,535

Google

Proprietary

No data available

5

4

1415

+5/-5

15,271

OpenAI

Proprietary

No data available

9

8

1400

+4/-5

16,123

Google

Proprietary

No data available

9

12

1388

+7/-5

12,320

Alibaba

Apache 2.0

No data available

10

6

1386

+5/-5

16,362

OpenAI

Proprietary

No data available

10

12

1385

+3/-5

19,091

DeepSeek

MIT

No data available

11

18

1376

+6/-6

7,816

Tencent

Proprietary

No data available

11

12

1373

+9/-8

3,895

MiniMax

Apache 2.0

No data available

13

12

1375

+4/-3

19,430

DeepSeek

MIT

No data available

13

6

1373

+4/-5

18,287

Anthropic

Proprietary

No data available

13

18

1369

+4/-6

16,637

Mistral

Proprietary

No data available

14

12

1367

+4/-3

29,038

OpenAI

Proprietary

No data available

14

22

1365

+5/-5

13,002

Alibaba

Apache 2.0

No data available

15

12

1363

+5/-4

16,112

OpenAI

Proprietary

No data available

15

28

1363

+6/-5

8,715

xAI

Proprietary

No data available

16

25

1364

+3/-3

35,894

Google

Proprietary

No data available

16

24

1363

+3/-3

31,170

Alibaba

Proprietary

No data available

20

27

1357

+4/-3

25,323

Google

Gemma

No data available

24

18

1352

+3/-3

33,177

OpenAI

Proprietary

2023/10

26

12

1345

+5/-4

14,984

Anthropic

Proprietary

No data available

26

25

1342

+4/-4

19,404

OpenAI

Proprietary

No data available

26

21

1339

+5/-4

15,337

OpenAI

Proprietary

No data available

26

33

1338

+10/-9

3,976

Google

Gemma

No data available

27

30

1336

+3/-4

22,841

DeepSeek

DeepSeek

No data available

27

37

1334

+5/-4

17,462

Alibaba

Apache 2.0

No data available

28

36

1328

+8/-8

6,028

Zhipu

Proprietary

No data available

28

51

1328

+8/-9

5,213

Amazon

Proprietary

No data available

28

32

1328

+7/-6

6,055

Alibaba

Proprietary

No data available

29

31

1330

+3/-3

26,104

Google

Proprietary

No data available

31

33

1327

+3/-3

22,851

Cohere

CC-BY-NC-4.0

No data available

31

39

1322

+7/-7

5,126

StepFun

Proprietary

No data available

31

31

1320

+10/-9

2,452

Tencent

Proprietary

No data available

33

32

1323

+3/-2

35,063

OpenAI

Proprietary

No data available

33

39

1321

+2/-3

54,951

OpenAI

Proprietary

2023/10

33

39

1314

+10/-9

2,371

Nvidia

Nvidia

No data available

33

33

1314

+10/-8

2,510

Tencent

Proprietary

No data available

34

32

1320

+2/-2

58,645

Google

Proprietary

No data available

36

16

1317

+3/-4

24,159

Anthropic

Proprietary

No data available

41

21

1309

+3/-4

28,664

Anthropic

Proprietary

No data available

43

44

1305

+2/-2

67,084

xAI

Proprietary

2024/3

43

47

1304

+3/-4

28,968

01 AI

Proprietary

No data available

43

55

1303

+6/-8

5,282

Google

Gemma

No data available

46

35

1302

+2/-2

117,747

OpenAI

Proprietary

2023/10

46

25

1301

+2/-2

75,986

Anthropic

Proprietary

2024/4

46

57

1300

+4/-5

10,715

Alibaba

Proprietary

No data available

47

51

1297

+6/-6

7,243

DeepSeek

DeepSeek

No data available

49

69

1292

+8/-8

4,321

Google

Gemma

No data available

50

61

1293

+3/-3

26,074

NexusFlow

NexusFlow

No data available

50

41

1292

+4/-4

15,906

Meta

Llama 4

No data available

50

56

1291

+4/-2

27,788

Zhipu AI

Proprietary

No data available

50

47

1289

+9/-10

3,856

Tencent

Proprietary

No data available

50

48

1288

+8/-7

6,302

OpenAI

Proprietary

No data available

52

57

1289

+3/-2

72,536

OpenAI

Proprietary

2023/10

52

67

1289

+4/-3

37,021

Google

Proprietary

No data available

52

76

1286

+7/-6

7,577

Nvidia

Llama 3.1

2023/12

53

56

1278

+11/-8

4,014

Tencent

Proprietary

No data available

55

39

1286

+2/-2

43,788

Meta

Llama 3.1 Community

2023/12

56

36

1286

+2/-2

86,159

Anthropic

Proprietary

2024/4

57

40

1285

+2/-2

63,038

Meta

Llama 3.1 Community

2023/12

57

40

1284

+3/-3

52,144

Google

Proprietary

Online

58

71

1284

+2/-2

55,442

xAI

Proprietary

2024/3

58

42

1283

+3/-3

47,973

OpenAI

Proprietary

2023/10

58

59

1281

+4/-4

17,432

Alibaba

Qwen

No data available

62

54

1271

+11/-6

4,998

Meta

Llama

No data available

68

54

1277

+2/-2

82,435

Google

Proprietary

2023/11

68

69

1276

+2/-3

26,344

DeepSeek

DeepSeek

No data available

68

57

1275

+2/-2

46,558

Meta

Llama-3.3

No data available

68

74

1271

+8/-7

4,963

Mistral

Apache 2.0

No data available

69

74

1275

+3/-3

41,519

Alibaba

Qwen

2024/9

69

53

1274

+2/-2

102,133

OpenAI

Proprietary

2023/12

70

56

1263

+11/-11

3,089

Mistral

Proprietary

No data available

74

61

1269

+2/-2

48,217

Mistral

Mistral Research

2024/7

74

73

1268

+4/-4

20,580

NexusFlow

CC-BY-NC-4.0

2024/7

74

80

1262

+9/-9

3,010

Ai2

Llama 3.1

No data available

75

57

1267

+2/-2

103,748

OpenAI

Proprietary

2023/4

75

74

1266

+3/-3

29,633

Mistral

MRL

No data available

75

82

1265

+2/-2

58,637

Meta

Llama 3.1 Community

2023/12

76

54

1265

+1/-2

202,641

Anthropic

Proprietary

2023/8

76

84

1262

+3/-4

26,371

Amazon

Proprietary

No data available

78

61

1262

+2/-2

97,079

OpenAI

Proprietary

2023/12

85

57

1256

+2/-3

47,551

Anthropic

Propretary

No data available

85

82

1253

+5/-6

7,948

Reka AI

Proprietary

No data available

88

86

1244

+2/-2

65,661

Google

Proprietary

2023/11

89

84

1239

+6/-5

9,125

AI21 Labs

Jamba Open

2024/3

90

85

1237

+2/-2

79,538

Google

Gemma license

2024/6

90

93

1235

+7/-8

5,730

Alibaba

Apache 2.0

No data available

90

96

1235

+4/-3

15,321

Mistral

Apache 2.0

No data available

90

101

1234

+3/-4

20,646

Amazon

Proprietary

No data available

90

88

1234

+5/-5

10,548

Princeton

MIT

2024/7

90

88

1233

+7/-5

10,535

Cohere

CC-BY-NC-4.0

2024/8

90

82

1229

+11/-7

3,889

Nvidia

Llama 3.1

2023/12

92

105

1230

+3/-3

37,697

Google

Proprietary

No data available

92

102

1223

+9/-9

3,460

Allen AI

Apache-2.0

No data available

94

99

1227

+3/-3

28,768

Cohere

CC-BY-NC-4.0

No data available

94

91

1227

+4/-4

20,608

Nvidia

NVIDIA Open Model

2023/6

94

96

1224

+5/-5

10,221

Zhipu AI

Proprietary

No data available

94

99

1219

+12/-12

3,478

Tencent

Proprietary

No data available

96

92

1223

+5/-5

8,132

Reka AI

Proprietary

No data available

98

93

1224

+2/-2

163,629

Meta

Llama 3 Community

2023/12

98

106

1223

+3/-2

25,213

Microsoft

MIT

No data available

103

92

1218

+2/-2

113,067

Anthropic

Proprietary

2023/8

104

115

1215

+3/-4

20,654

Amazon

Proprietary

No data available

106

116

1206

+10/-9

2,901

Tencent

Proprietary

No data available

107

103

1209

+3/-2

57,197

Google

Gemma license

2024/6

107

117

1203

+10/-9

3,074

Ai2

Llama 3.1

No data available

108

101

1207

+2/-2

80,846

Cohere

CC-BY-NC-4.0

2024/3

108

102

1205

+3/-3

38,872

Alibaba

Qianwen LICENSE

2024/6

109

115

1200

+7/-8

5,111

Mistral

MRL

No data available

110

89

1204

+2/-2

55,962

OpenAI

Proprietary

2021/9

111

117

1197

+4/-5

10,391

Cohere

CC-BY-NC-4.0

No data available

111

106

1197

+5/-6

10,851

Cohere

CC-BY-NC-4.0

2024/8

113

106

1197

+2/-2

122,309

Anthropic

Proprietary

2023/8

113

99

1196

+4/-4

15,753

DeepSeek AI

DeepSeek License

2024/6

113

116

1193

+5/-6

9,274

AI21 Labs

Jamba Open

2024/3

114

133

1193

+3/-2

52,578

Meta

Llama 3.1 Community

2023/12

122

99

1181

+2/-2

91,614

OpenAI

Proprietary

2021/9

122

117

1179

+3/-3

27,430

Alibaba

Qianwen LICENSE

2024/4

122

152

1170

+9/-8

3,410

Alibaba

Apache 2.0

No data available

123

132

1175

+3/-4

25,135

01 AI

Apache-2.0

2024/5

123

116

1175

+2/-2

64,926

Mistral

Proprietary

No data available

123

117

1173

+5/-5

16,027

Reka AI

Proprietary

Online

125

125

1169

+2/-2

109,056

Meta

Llama 3 Community

2023/3

125

139

1166

+5/-6

10,599

InternLM

Other

2024/8

126

125

1165

+3/-3

25,803

Reka AI

Proprietary

2023/11

127

121

1166

+2/-3

56,398

Cohere

CC-BY-NC-4.0

2024/3

127

125

1165

+3/-3

35,556

Mistral

Proprietary

No data available

127

121

1165

+3/-3

40,658

Alibaba

Qianwen LICENSE

2024/2

127

125

1160

+8/-10

3,289

IBM

Apache 2.0

No data available

128

121

1165

+2/-3

53,751

Mistral

Apache 2.0

2024/4

128

139

1161

+2/-2

48,892

Google

Gemma license

2024/7

136

121

1149

+5/-4

18,800

Google

Proprietary

2023/4

136

129

1145

+8/-9

4,854

HuggingFace

Apache 2.0

2024/4

137

133

1143

+4/-4

22,765

Alibaba

Qianwen LICENSE

2024/2

138

139

1140

+3/-4

26,105

Microsoft

MIT

2023/10

138

141

1137

+7/-8

3,380

IBM

Apache 2.0

No data available

138

150

1136

+5/-4

16,676

Nexusflow

Apache-2.0

2024/3

141

139

1131

+2/-2

76,126

Mistral

Apache 2.0

2023/12

141

144

1129

+5/-3

15,917

01 AI

Yi License

2023/6

141

129

1128

+7/-8

6,557

Google

Proprietary

2023/4

142

142

1126

+4/-4

18,687

Alibaba

Qianwen LICENSE

2024/2

142

142

1124

+6/-6

8,383

Microsoft

Llama 2 Community

2023/8

144

129

1123

+2/-2

68,867

OpenAI

Proprietary

2021/9

144

148

1120

+7/-6

8,390

Meta

Llama 3.2

2023/12

145

139

1121

+3/-4

33,743

Databricks

DBRX LICENSE

2023/12

145

146

1119

+4/-3

18,476

Microsoft

MIT

2023/10

145

147

1116

+7/-7

6,658

AllenAI/UW

AI2 ImpACT Low-risk

2023/11

149

139

1111

+7/-6

7,002

IBM

Apache 2.0

No data available

152

159

1110

+3/-3

39,595

Meta

Llama 2 Community

2023/7

152

144

1109

+4/-4

12,990

OpenChat

Apache-2.0

2024/1

152

152

1108

+4/-4

22,936

LMSYS

Non-commercial

2023/8

152

155

1106

+6/-5

10,415

UC Berkeley

CC-BY-NC-4.0

2023/11

152

161

1102

+8/-9

3,836

NousResearch

Apache-2.0

2024/1

153

144

1107

+3/-3

34,173

Snowflake

Apache 2.0

2024/4

154

159

1098

+7/-8

3,636

Nvidia

Llama 2 Community

2023/11

156

146

1101

+3/-3

25,070

Google

Gemma license

2024/2

159

148

1094

+6/-6

4,988

DeepSeek AI

DeepSeek License

2023/11

159

146

1094

+6/-5

8,106

OpenChat

Apache-2.0

2023/11

159

150

1092

+7/-7

5,088

NousResearch

Apache-2.0

2023/11

160

156

1091

+6/-7

7,191

IBM

Apache 2.0

No data available

160

165

1090

+4/-4

20,067

Mistral

Apache-2.0

2023/12

160

165

1088

+5/-5

12,808

Microsoft

MIT

2023/10

160

163

1087

+7/-7

4,872

Alibaba

Qianwen LICENSE

2024/2

160

161

1080

+16/-14

1,714

Cognitive Computations

Apache-2.0

2023/10

162

141

1085

+5/-3

17,036

OpenAI

Proprietary

2021/9

163

169

1084

+4/-4

21,097

Microsoft

MIT

2023/10

164

165

1080

+7/-7

4,286

Upstage AI

CC-BY-NC-4.0

2023/11

165

169

1081

+4/-5

19,722

Meta

Llama 2 Community

2023/7

167

165

1076

+7/-7

7,176

Microsoft

Llama 2 Community

2023/7

171

176

1071

+7/-5

8,523

Meta

Llama 3.2

2023/12

171

174

1071

+6/-6

11,321

HuggingFace

MIT

2023/10

172

166

1064

+11/-11

2,375

HuggingFace

Apache 2.0

No data available

172

165

1063

+10/-10

2,644

MosaicML

CC-BY-NC-SA-4.0

2023/6

172

174

1059

+16/-16

1,192

Meta

Llama 2 Community

2024/1

174

173

1060

+7/-9

7,509

Meta

Llama 2 Community

2023/7

174

169

1058

+11/-16

1,811

HuggingFace

MIT

2023/10

176

165

1052

+14/-15

1,327

TII

Falcon-180B TII License

2023/9

177

169

1059

+4/-4

19,775

LMSYS

Llama 2 Community

2023/7

177

174

1055

+5/-6

9,176

Google

Gemma license

2024/2

177

174

1054

+4/-4

21,622

Microsoft

MIT

2023/10

177

188

1054

+5/-6

14,532

Meta

Llama 2 Community

2023/7

177

165

1052

+8/-7

5,065

Alibaba

Qianwen LICENSE

2023/8

177

177

1050

+11/-10

2,996

UW

Non-commercial

2023/5

186

178

1038

+5/-5

11,351

Google

Gemma license

2024/2

186

183

1035

+7/-8

5,276

Together AI

Apache 2.0

2023/12

188

196

1033

+6/-6

6,503

Allen AI

Apache-2.0

2024/2

190

188

1025

+5/-6

9,142

Mistral

Apache 2.0

2023/9

190

189

1022

+6/-6

7,017

LMSYS

Llama 2 Community

2023/7

190

178

1021

+7/-6

8,713

Google

Proprietary

2021/6

194

193

1007

+9/-6

4,918

Google

Gemma license

2024/2

195

191

1006

+6/-8

7,816

Alibaba

Qianwen LICENSE

2024/2

197

196

982

+7/-7

7,020

UC Berkeley

Non-commercial

2023/4

197

197

972

+9/-9

4,763

Tsinghua

Apache-2.0

2023/10

198

197

950

+14/-16

1,788

Nomic AI

Non-commercial

2023/3

199

197

946

+10/-11

3,997

MosaicML

CC-BY-NC-SA-4.0

2023/5

199

202

942

+13/-10

2,713

Tsinghua

Apache-2.0

2023/6

199

202

939

+10/-9

4,920

RWKV

Apache 2.0

2023/4

203

197

919

+9/-8

5,864

Stanford

Non-commercial

2023/3

203

203

911

+7/-9

6,368

OpenAssistant

Apache 2.0

2023/4

204

205

897

+10/-8

4,983

Tsinghua

Non-commercial

2023/3

205

205

885

+10/-10

4,288

LMSYS

Apache 2.0

2023/4

207

208

858

+10/-11

3,336

Stability AI

CC-BY-NC-SA-4.0

2023/4

207

205

840

+11/-12

3,480

Databricks

MIT

2023/4

208

207

817

+11/-12

2,446

Meta

Non-commercial

2023/2

Explanation

  • Rank (UB): Ranking calculated based on the Bradley-Terry model. This ranking reflects the overall performance of the model in the arena and provides an upper bound estimate of its Elo score, helping to understand the potential competitiveness of the model.

  • Rank (StyleCtrl): Ranking after controlling for conversation style. This ranking aims to reduce bias in preferences caused by the model's response style (e.g., verbose, concise), providing a more pure evaluation of the model's core capabilities.

  • Model Name: The name of the Large Language Model (LLM). This column embeds relevant model links, which can be clicked to navigate.

  • Score: The Elo rating obtained by the model through user votes in the arena. Elo rating is a relative ranking system, where a higher score indicates better model performance. This score is dynamic and reflects the model's relative strength in the current competitive environment.

  • Confidence Interval: The 95% confidence interval of the model's Elo score (e.g., +6/-6). A smaller interval indicates a more stable and reliable model score; conversely, a larger interval may indicate insufficient data or significant fluctuations in model performance. It provides a quantitative assessment of the score's accuracy.

  • Votes: The total number of votes received by the model in the arena. More votes generally mean higher statistical reliability of its score.

  • Provider: The organization or company that provides the model.

  • License: The license type of the model, such as Proprietary, Apache 2.0, MIT, etc.

  • Knowledge Cutoff Date: The knowledge cutoff date for the model's training data. No data available indicates that the relevant information is not provided or is unknown.

Data Source and Update Frequency

The data for this leaderboard is automatically generated and provided by the fboulnois/llm-leaderboard-csv project, which obtains and processes data from lmarena.ai. This leaderboard is automatically updated daily by GitHub Actions.

Disclaimer

This report is for reference only. The leaderboard data is dynamic and based on user preference votes on Chatbot Arena during a specific period. The completeness and accuracy of the data depend on the updates and processing of the upstream data source and the fboulnois/llm-leaderboard-csv project. Different models may use different license agreements; please refer to the official documentation of the model provider when using them.

最后更新于

这有帮助吗?