Model Leaderboard

LLM Arena Leaderboard (Live Updates)

This is a leaderboard based on data from Chatbot Arena (lmarena.ai), generated through an automated process.

Data Updated: 2025-06-12 11:42:10 UTC / 2025-06-12 19:42:10 CST (Beijing Time)

Click on the Model Name in the leaderboard to go to its detailed information or trial page.

Leaderboard

Rank (UB)
Rank (StyleCtrl)
Model Name
Score
Confidence Interval
Votes
Provider
License
Knowledge Cutoff

1

1

1478

+6/-7

7,343

Google

Proprietary

No data

2

2

1446

+6/-7

12,351

Google

Proprietary

No data

3

2

1425

+4/-5

15,210

OpenAI

Proprietary

No data

3

4

1423

+4/-4

19,762

OpenAI

Proprietary

No data

3

6

1420

+5/-5

12,614

Google

Proprietary

No data

3

8

1417

+4/-4

21,879

xAI

Proprietary

No data

5

4

1411

+4/-5

15,271

OpenAI

Proprietary

No data

8

8

1396

+5/-6

14,148

Google

Proprietary

No data

9

7

1384

+4/-5

13,830

OpenAI

Proprietary

No data

9

11

1382

+4/-4

16,550

DeepSeek

MIT

No data

11

5

1373

+4/-4

13,850

Anthropic

Proprietary

No data

11

16

1372

+6/-7

5,944

Tencent

Proprietary

No data

11

11

1371

+3/-4

19,430

DeepSeek

MIT

No data

11

16

1363

+6/-5

12,003

Mistral

Proprietary

No data

12

44

1361

+8/-6

6,636

xAI

Proprietary

No data

13

11

1363

+4/-3

29,038

OpenAI

Proprietary

No data

13

21

1362

+3/-3

34,240

Google

Proprietary

No data

13

10

1361

+6/-5

13,554

OpenAI

Proprietary

No data

14

28

1360

+4/-5

10,677

Alibaba

Apache 2.0

No data

14

19

1358

+3/-3

29,484

Alibaba

Proprietary

No data

15

21

1355

+4/-5

20,295

Google

Gemma

No data

21

16

1348

+3/-2

33,177

OpenAI

Proprietary

2023/10

21

9

1345

+5/-6

10,740

Anthropic

Proprietary

No data

23

20

1338

+4/-6

19,404

OpenAI

Proprietary

No data

23

16

1336

+5/-5

12,702

OpenAI

Proprietary

No data

23

28

1334

+7/-8

3,976

Google

Gemma

No data

23

38

1330

+11/-11

2,595

Amazon

Proprietary

No data

24

26

1332

+3/-4

22,841

DeepSeek

DeepSeek

No data

24

47

1330

+5/-5

15,930

Alibaba

Apache 2.0

No data

25

28

1324

+8/-7

6,055

Alibaba

Proprietary

No data

26

28

1326

+3/-4

26,104

Google

Proprietary

No data

26

32

1324

+6/-7

6,028

Zhipu

Proprietary

No data

27

28

1323

+4/-3

20,084

Cohere

CC-BY-NC-4.0

No data

27

27

1316

+10/-8

2,452

Tencent

Proprietary

No data

28

35

1318

+7/-6

5,126

StepFun

Proprietary

No data

30

28

1319

+3/-3

32,421

OpenAI

Proprietary

No data

30

35

1310

+11/-8

2,371

Nvidia

Nvidia

No data

30

28

1310

+11/-11

2,510

Tencent

Proprietary

No data

31

35

1317

+2/-2

54,951

OpenAI

Proprietary

2023/10

32

28

1316

+2/-2

58,645

Google

Proprietary

No data

32

18

1313

+4/-4

21,310

Anthropic

Proprietary

No data

38

52

1301

+8/-8

3,913

Google

Gemma

No data

39

17

1306

+3/-3

25,983

Anthropic

Proprietary

No data

40

38

1301

+2/-2

67,084

xAI

Proprietary

2024/3

40

43

1301

+4/-3

28,968

01 AI

Proprietary

No data

42

31

1298

+2/-2

117,747

OpenAI

Proprietary

2023/10

42

53

1296

+5/-6

10,715

Alibaba

Proprietary

No data

43

21

1297

+2/-2

73,327

Anthropic

Proprietary

2024/4

43

46

1293

+7/-5

7,243

DeepSeek

DeepSeek

No data

46

69

1289

+8/-10

4,321

Google

Gemma

No data

47

43

1285

+9/-7

3,856

Tencent

Proprietary

No data

48

55

1289

+3/-3

26,074

NexusFlow

NexusFlow

No data

48

51

1287

+4/-3

27,788

Zhipu AI

Proprietary

No data

48

37

1287

+4/-5

13,750

Meta

Llama 4

No data

48

44

1284

+8/-7

6,302

OpenAI

Proprietary

No data

49

52

1285

+3/-2

72,536

OpenAI

Proprietary

2023/10

49

62

1285

+3/-3

37,021

Google

Proprietary

No data

49

71

1282

+6/-6

7,577

Nvidia

Llama 3.1

2023/12

51

35

1282

+2/-3

43,788

Meta

Llama 3.1 Community

2023/12

52

33

1282

+2/-2

86,159

Anthropic

Proprietary

2024/4

52

51

1274

+10/-9

4,014

Tencent

Proprietary

No data

53

35

1281

+2/-2

63,038

Meta

Llama 3.1 Community

2023/12

53

35

1280

+3/-2

52,144

Google

Proprietary

Online

54

69

1280

+2/-3

55,442

xAI

Proprietary

2024/3

55

37

1279

+3/-3

47,973

OpenAI

Proprietary

2023/10

55

53

1277

+4/-4

17,432

Alibaba

Qwen

No data

65

49

1273

+2/-2

82,435

Google

Proprietary

2023/11

65

64

1272

+2/-5

26,344

DeepSeek

DeepSeek

No data

65

70

1271

+2/-3

41,519

Alibaba

Qwen

2024/9

65

51

1270

+3/-3

44,800

Meta

Llama-3.3

No data

65

69

1263

+10/-11

2,484

Mistral

Apache 2.0

No data

66

47

1270

+2/-2

102,133

OpenAI

Proprietary

2023/12

69

54

1265

+2/-3

48,217

Mistral

Mistral Research

2024/7

69

67

1264

+4/-3

20,580

NexusFlow

CC-BY-NC-4.0

2024/7

71

74

1258

+8/-8

3,010

Ai2

Llama 3.1

No data

72

52

1263

+1/-2

103,748

OpenAI

Proprietary

2023/4

72

70

1262

+2/-2

29,633

Mistral

MRL

No data

72

77

1261

+3/-2

58,637

Meta

Llama 3.1 Community

2023/12

73

49

1261

+1/-2

202,641

Anthropic

Proprietary

2023/8

74

78

1258

+3/-4

26,371

Amazon

Proprietary

No data

75

57

1258

+2/-2

97,079

OpenAI

Proprietary

2023/12

80

52

1251

+3/-3

44,893

Anthropic

Propretary

No data

80

77

1249

+6/-5

7,948

Reka AI

Proprietary

No data

84

80

1240

+2/-2

65,661

Google

Proprietary

2023/11

84

78

1235

+6/-5

9,125

AI21 Labs

Jamba Open

2024/3

84

88

1231

+7/-6

5,730

Alibaba

Apache 2.0

No data

85

80

1233

+2/-2

79,538

Google

Gemma license

2024/6

85

88

1231

+5/-4

15,321

Mistral

Apache 2.0

No data

85

95

1230

+3/-4

20,646

Amazon

Proprietary

No data

85

82

1230

+6/-5

10,548

Princeton

MIT

2024/7

85

83

1229

+4/-6

10,535

Cohere

CC-BY-NC-4.0

2024/8

85

77

1225

+8/-7

3,889

Nvidia

Llama 3.1

2023/12

87

99

1226

+3/-2

37,697

Google

Proprietary

No data

87

96

1219

+9/-10

3,460

Allen AI

Apache-2.0

No data

89

94

1223

+3/-3

28,768

Cohere

CC-BY-NC-4.0

No data

89

86

1223

+4/-5

20,608

Nvidia

NVIDIA Open Model

2023/6

89

91

1220

+5/-6

10,221

Zhipu AI

Proprietary

No data

89

85

1219

+6/-6

8,132

Reka AI

Proprietary

No data

93

88

1220

+2/-2

163,629

Meta

Llama 3 Community

2023/12

93

101

1219

+3/-3

25,213

Microsoft

MIT

No data

97

86

1214

+2/-2

113,067

Anthropic

Proprietary

2023/8

98

109

1211

+4/-3

20,654

Amazon

Proprietary

No data

99

109

1202

+12/-11

2,901

Tencent

Proprietary

No data

103

98

1206

+3/-2

57,197

Google

Gemma license

2024/6

103

96

1203

+2/-2

80,846

Cohere

CC-BY-NC-4.0

2024/3

103

111

1199

+9/-9

3,074

Ai2

Llama 3.1

No data

104

98

1201

+3/-3

38,872

Alibaba

Qianwen LICENSE

2024/6

104

83

1200

+3/-3

55,962

OpenAI

Proprietary

2021/9

104

109

1196

+7/-7

5,111

Mistral

MRL

No data

105

111

1193

+5/-5

10,391

Cohere

CC-BY-NC-4.0

No data

105

100

1193

+7/-4

10,851

Cohere

CC-BY-NC-4.0

2024/8

107

101

1193

+2/-2

122,309

Anthropic

Proprietary

2023/8

107

95

1192

+4/-4

15,753

DeepSeek AI

DeepSeek License

2024/6

107

109

1189

+5/-6

9,274

AI21 Labs

Jamba Open

2024/3

108

126

1189

+2/-3

52,578

Meta

Llama 3.1 Community

2023/12

116

94

1177

+2/-2

91,614

OpenAI

Proprietary

2021/9

116

111

1175

+3/-3

27,430

Alibaba

Qianwen LICENSE

2024/4

116

144

1166

+11/-9

3,410

Alibaba

Apache 2.0

No data

117

126

1171

+4/-3

25,135

01 AI

Apache-2.0

2024/5

117

111

1171

+2/-3

64,926

Mistral

Proprietary

No data

117

111

1169

+4/-4

16,027

Reka AI

Proprietary

Online

119

120

1165

+2/-2

109,056

Meta

Llama 3 Community

2023/3

120

133

1162

+5/-7

10,599

InternLM

Other

2024/8

121

115

1162

+2/-3

56,398

Cohere

CC-BY-NC-4.0

2024/3

121

120

1161

+3/-3

35,556

Mistral

Proprietary

No data

121

114

1161

+2/-2

53,751

Mistral

Apache 2.0

2024/4

121

118

1161

+3/-3

25,803

Reka AI

Proprietary

2023/11

121

115

1161

+3/-3

40,658

Alibaba

Qianwen LICENSE

2024/2

121

121

1156

+8/-9

3,289

IBM

Apache 2.0

No data

122

133

1157

+2/-3

48,892

Google

Gemma license

2024/7

130

115

1145

+4/-4

18,800

Google

Proprietary

2023/4

130

126

1141

+8/-8

4,854

HuggingFace

Apache 2.0

2024/4

131

129

1139

+4/-4

22,765

Alibaba

Qianwen LICENSE

2024/2

131

135

1133

+8/-7

3,380

IBM

Apache 2.0

No data

132

133

1136

+3/-4

26,105

Microsoft

MIT

2023/10

132

143

1132

+4/-4

16,676

Nexusflow

Apache-2.0

2024/3

135

133

1128

+3/-2

76,126

Mistral

Apache 2.0

2023/12

135

138

1125

+4/-4

15,917

01 AI

Yi License

2023/6

135

125

1124

+7/-7

6,557

Google

Proprietary

2023/4

136

136

1122

+5/-3

18,687

Alibaba

Qianwen LICENSE

2024/2

136

135

1120

+6/-4

8,383

Microsoft

Llama 2 Community

2023/8

138

123

1119

+3/-2

68,867

OpenAI

Proprietary

2021/9

138

143

1116

+7/-6

8,390

Meta

Llama 3.2

2023/12

139

133

1117

+3/-3

33,743

Databricks

DBRX LICENSE

2023/12

139

140

1116

+4/-4

18,476

Microsoft

MIT

2023/10

139

143

1113

+7/-5

6,658

AllenAI/UW

AI2 ImpACT Low-risk

2023/11

143

133

1107

+8/-7

7,002

IBM

Apache 2.0

No data

145

138

1105

+5/-4

12,990

OpenChat

Apache-2.0

2024/1

146

152

1106

+3/-3

39,595

Meta

Llama 2 Community

2023/7

146

144

1104

+3/-4

22,936

LMSYS

Non-commercial

2023/8

146

148

1102

+6/-5

10,415

UC Berkeley

CC-BY-NC-4.0

2023/11

147

138

1103

+2/-3

34,173

Snowflake

Apache 2.0

2024/4

147

156

1098

+7/-8

3,836

NousResearch

Apache-2.0

2024/1

147

154

1094

+10/-9

3,636

Nvidia

Llama 2 Community

2023/11

151

140

1097

+3/-4

25,070

Google

Gemma license

2024/2

152

143

1090

+9/-8

4,988

DeepSeek AI

DeepSeek License

2023/11

153

141

1090

+6/-5

8,106

OpenChat

Apache-2.0

2023/11

153

143

1088

+7/-11

5,088

NousResearch

Apache-2.0

2023/11

153

148

1087

+6/-7

7,191

IBM

Apache 2.0

No data

154

159

1083

+8/-10

4,872

Alibaba

Qianwen LICENSE

2024/2

155

159

1086

+4/-5

20,067

Mistral

Apache-2.0

2023/12

155

159

1084

+5/-5

12,808

Microsoft

MIT

2023/10

155

135

1081

+5/-4

17,036

OpenAI

Proprietary

2021/9

155

154

1076

+11/-13

1,714

Cognitive Computations

Apache-2.0

2023/10

157

163

1080

+4/-3

21,097

Microsoft

MIT

2023/10

157

159

1076

+8/-10

4,286

Upstage AI

CC-BY-NC-4.0

2023/11

160

163

1077

+3/-4

19,722

Meta

Llama 2 Community

2023/7

161

159

1072

+7/-7

7,176

Microsoft

Llama 2 Community

2023/7

165

169

1067

+6/-6

8,523

Meta

Llama 3.2

2023/12

166

167

1067

+4/-5

11,321

HuggingFace

MIT

2023/10

166

162

1060

+12/-10

2,375

HuggingFace

Apache 2.0

No data

166

159

1059

+9/-12

2,644

MosaicML

CC-BY-NC-SA-4.0

2023/6

166

168

1056

+9/-6

7,509

Meta

Llama 2 Community

2023/7

166

168

1055

+16/-17

1,192

Meta

Llama 2 Community

2024/1

166

163

1054

+15/-14

1,811

HuggingFace

MIT

2023/10

169

159

1048

+16/-16

1,327

TII

Falcon-180B TII License

2023/9

171

162

1055

+4/-4

19,775

LMSYS

Llama 2 Community

2023/7

171

169

1051

+6/-5

9,176

Google

Gemma license

2024/2

171

168

1050

+3/-4

21,622

Microsoft

MIT

2023/10

171

183

1050

+4/-6

14,532

Meta

Llama 2 Community

2023/7

171

162

1048

+9/-7

5,065

Alibaba

Qianwen LICENSE

2023/8

171

169

1046

+11/-12

2,996

UW

Non-commercial

2023/5

180

174

1034

+5/-4

11,351

Google

Gemma license

2024/2

180

176

1031

+8/-8

5,276

Together AI

Apache 2.0

2023/12

181

189

1029

+6/-8

6,503

Allen AI

Apache-2.0

2024/2

184

181

1021

+7/-6

9,142

Mistral

Apache 2.0

2023/9

184

183

1018

+6/-6

7,017

LMSYS

Llama 2 Community

2023/7

184

172

1017

+7/-6

8,713

Google

Proprietary

2021/6

188

187

1003

+9/-9

4,918

Google

Gemma license

2024/2

189

185

1002

+5/-6

7,816

Alibaba

Qianwen LICENSE

2024/2

191

190

978

+6/-8

7,020

UC Berkeley

Non-commercial

2023/4

191

191

968

+8/-8

4,763

Tsinghua

Apache-2.0

2023/10

193

190

946

+14/-15

1,788

Nomic AI

Non-commercial

2023/3

193

191

942

+9/-9

3,997

MosaicML

CC-BY-NC-SA-4.0

2023/5

193

196

938

+13/-14

2,713

Tsinghua

Apache-2.0

2023/6

193

196

935

+9/-8

4,920

RWKV

Apache 2.0

2023/4

197

191

915

+6/-9

5,864

Stanford

Non-commercial

2023/3

197

196

906

+8/-9

6,368

OpenAssistant

Apache 2.0

2023/4

198

199

892

+9/-10

4,983

Tsinghua

Non-commercial

2023/3

199

199

881

+8/-9

4,288

LMSYS

Apache 2.0

2023/4

201

201

853

+10/-10

3,336

Stability AI

CC-BY-NC-SA-4.0

2023/4

201

199

836

+12/-12

3,480

Databricks

MIT

2023/4

202

200

813

+14/-12

2,446

Meta

Non-commercial

2023/2

Explanation

  • Rank (UB): A ranking calculated based on the Bradley-Terry model. This rank reflects the model's overall performance in the arena and provides an upper bound estimate of its Elo score, helping to understand the model's potential competitiveness.

  • Rank (StyleCtrl): The ranking after applying dialogue style control. This ranking aims to reduce preference bias caused by the model's response style (e.g., verbosity, conciseness) to more purely evaluate its core capabilities.

  • Model Name: The name of the Large Language Model (LLM). This column has embedded links to the models; click to navigate.

  • Score: The Elo rating the model received from user votes in the arena. The Elo rating is a relative ranking system where a higher score indicates better performance. This score is dynamic and reflects the model's relative strength in the current competitive environment.

  • Confidence Interval: The 95% confidence interval for the model's Elo rating (e.g., +6/-6). A smaller interval indicates that the model's rating is more stable and reliable; conversely, a larger interval may suggest insufficient data or significant performance fluctuations. It provides a quantitative assessment of the rating's accuracy.

  • Votes: The total number of votes the model has received in the arena. A higher number of votes generally means higher statistical reliability of its rating.

  • Provider: The organization or company that provides the model.

  • License: The type of license for the model, such as Proprietary, Apache 2.0, MIT, etc.

  • Knowledge Cutoff: The knowledge cutoff date for the model's training data. No data indicates that the relevant information is not provided or is unknown.

Data Source and Update Frequency

The data for this leaderboard is automatically generated and provided by the fboulnois/llm-leaderboard-csv project, which sources and processes data from lmarena.ai. This leaderboard is updated daily via GitHub Actions.

Disclaimer

This report is for reference only. The leaderboard data is dynamic and based on user preference votes on Chatbot Arena over a specific period. The completeness and accuracy of the data depend on the upstream data source and the updates and processing from the fboulnois/llm-leaderboard-csv project. Different models may have different license agreements; please refer to the official documentation from the model provider before use.

最后更新于

这有帮助吗?