Model Rankings

This ranking is based on data from Chatbot Arena (lmarena.ai) and is generated by an automated process.

Data updated: 2025-08-11 11:44:05 UTC / 2025-08-11 19:44:05 CST (Beijing time)
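
The two timestamps above denote the same instant: CST here is China Standard Time (UTC+8), not US Central Time. A minimal Python sketch of the conversion, assuming a fixed UTC+8 offset; the function and constant names are illustrative and do not come from the page.

```python
from datetime import datetime, timedelta, timezone

# China Standard Time is a fixed UTC+8 offset with no daylight saving time.
BEIJING = timezone(timedelta(hours=8), name="CST")


def utc_to_beijing(stamp: str) -> str:
    """Convert a 'YYYY-MM-DD HH:MM:SS' UTC timestamp to Beijing time."""
    utc_time = datetime.strptime(stamp, "%Y-%m-%d %H:%M:%S").replace(tzinfo=timezone.utc)
    return utc_time.astimezone(BEIJING).strftime("%Y-%m-%d %H:%M:%S %Z")


print(utc_to_beijing("2025-08-11 11:44:05"))  # 2025-08-11 19:44:05 CST
```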

Click a model name in the ranking to jump to its details or trial page.

Rankings

Rank (UB)
Rank (StyleCtrl)
Model name
Score
Confidence interval
Votes
Provider
License
Knowledge cutoff

1

1

1470

+5/-5

26,019

Google

Proprietary

nan

2

2

1446

+6/-6

13,715

Google

Proprietary

nan

3

2

1434

+9/-9

4,112

Z.ai

MIT

nan

4

2

1434

+6/-6

13,058

xAI

Proprietary

nan

5

3

1429

+4/-4

30,777

OpenAI

Proprietary

nan

6

3

1428

+4/-4

32,033

OpenAI

Proprietary

nan

7

3

1427

+9/-9

4,154

Alibaba

Apache 2.0

nan

8

3

1427

+5/-5

18,284

DeepSeek

MIT

nan

9

4

1423

+4/-4

31,757

xAI

Proprietary

nan

10

8

1416

+4/-4

26,604

Meta

nan

nan

11

8

1415

+5/-5

15,271

OpenAI

Proprietary

nan

12

7

1413

+9/-9

3,715

Alibaba

Apache 2.0

nan

13

8

1412

+6/-6

13,837

xAI

Proprietary

nan

14

10

1411

+4/-4

31,359

Google

Proprietary

nan

15

15

1397

+4/-4

27,552

Google

Proprietary

nan

16

15

1397

+5/-5

20,120

Google

Proprietary

nan

17

15

1396

+5/-5

18,655

Google

Proprietary

nan

18

15

1393

+9/-9

4,306

Z.ai

MIT

nan

19

15

1391

+5/-5

24,372

Alibaba

Apache 2.0

nan

20

15

1389

+4/-4

23,657

Google

Proprietary

nan

21

15

1389

+4/-4

23,858

OpenAI

Proprietary

nan

22

19

1381

+3/-3

40,509

OpenAI

Proprietary

nan

23

18

1380

+6/-6

11,676

Moonshot

Modified MIT

nan

24

19

1380

+5/-5

24,834

OpenAI

Proprietary

nan

25

16

1380

+12/-12

2,258

Alibaba

Apache 2.0

nan

26

22

1379

+5/-5

17,328

Google

Proprietary

nan

27

22

1378

+5/-5

16,963

Google

Proprietary

nan

28

22

1376

+6/-6

11,657

Tencent

Proprietary

nan

29

22

1376

+4/-4

27,391

DeepSeek

MIT

nan

30

22

1373

+5/-5

17,970

Anthropic

Proprietary

nan

31

23

1372

+4/-4

19,430

DeepSeek

MIT

nan

32

24

1370

+4/-4

22,500

Google

Proprietary

nan

33

24

1370

+5/-5

28,010

Mistral

Proprietary

nan

34

24

1368

+5/-5

17,088

Google

Proprietary

nan

35

28

1366

+3/-3

35,457

Alibaba

Proprietary

nan

36

27

1366

+5/-5

26,141

Anthropic

Proprietary

nan

37

28

1365

+5/-5

20,489

Alibaba

Apache 2.0

nan

38

29

1365

+4/-4

29,038

OpenAI

Proprietary

nan

39

32

1362

+3/-3

41,036

Google

Proprietary

nan

40

32

1362

+5/-5

24,472

OpenAI

Proprietary

nan

41

32

1361

+6/-6

10,194

xAI

Proprietary

nan

42

32

1360

+5/-5

16,845

xAI

Proprietary

nan

43

33

1357

+7/-7

6,012

Alibaba

Apache 2.0

nan

44

36

1357

+4/-4

32,176

Google

Gemma

nan

45

39

1354

+4/-4

48,669

OpenAI

Proprietary

2023/10

46

39

1354

+5/-5

17,524

MiniMax

Apache 2.0

nan

47

40

1353

+4/-4

33,177

OpenAI

Proprietary

2023/10

48

41

1350

+5/-5

17,124

Anthropic

Proprietary

nan

49

44

1341

+9/-9

4,074

Alibaba

Apache 2.0

nan

50

46

1339

+10/-10

3,106

Nvidia

Nvidia Open

nan

51

49

1338

+5/-5

19,404

OpenAI

Proprietary

nan

52

49

1338

+5/-5

22,629

Anthropic

Proprietary

nan

53

49

1336

+5/-5

23,892

OpenAI

Proprietary

nan

54

49

1335

+6/-6

9,147

Mistral

Apache 2.0

nan

55

49

1335

+9/-9

3,976

Google

Gemma

nan

56

49

1334

+3/-3

35,503

OpenAI

Proprietary

2023/10

57

49

1333

+4/-4

22,841

DeepSeek

DeepSeek

nan

58

49

1332

+8/-8

6,028

Zhipu

Proprietary

nan

59

49

1331

+4/-4

22,178

Alibaba

Apache 2.0

nan

60

49

1330

+4/-4

26,104

Google

Proprietary

nan

61

49

1327

+8/-8

6,055

Alibaba

Proprietary

nan

62

54

1326

+4/-4

30,944

Cohere

CC-BY-NC-4.0

nan

63

57

1322

+5/-5

13,900

Amazon

Proprietary

nan

64

55

1320

+8/-8

5,126

StepFun

Proprietary

nan

65

52

1320

+11/-11

2,656

Nvidia

Nvidia Open Model

nan

66

60

1320

+5/-5

20,360

Alibaba

Apache 2.0

nan

67

61

1320

+3/-3

58,645

Google

Proprietary

nan

68

53

1320

+10/-10

2,452

Tencent

Proprietary

nan

69

61

1318

+3/-3

54,951

OpenAI

Proprietary

2023/10

70

61

1318

+3/-3

43,216

OpenAI

Proprietary

nan

71

62

1317

+4/-4

32,592

Anthropic

Proprietary

nan

72

62

1316

+4/-4

32,262

Google

Proprietary

2023/11

73

62

1315

+4/-4

25,988

Google

Proprietary

2023/11

74

60

1313

+10/-10

2,510

Tencent

Proprietary

nan

75

62

1309

+11/-11

2,371

Nvidia

Nvidia

nan

76

73

1305

+3/-3

67,084

xAI

Proprietary

2024/3

77

71

1305

+6/-6

13,182

Google

Gemma

nan

78

74

1303

+4/-4

28,968

01 AI

Proprietary

nan

79

74

1302

+3/-3

117,747

OpenAI

Proprietary

2023/10

80

74

1301

+4/-4

37,645

Anthropic

Proprietary

nan

81

74

1300

+6/-6

10,715

Alibaba

Proprietary

nan

82

76

1299

+2/-2

84,537

Anthropic

Proprietary

2024/4

83

74

1295

+7/-7

7,243

DeepSeek

DeepSeek

nan

84

80

1293

+4/-4

26,074

NexusFlow

NexusFlow

nan

85

76

1293

+9/-9

4,321

Google

Gemma

nan

86

81

1292

+4/-4

24,544

Meta

Llama 4

nan

87

82

1291

+4/-4

27,788

Zhipu AI

Proprietary

nan

88

76

1290

+9/-9

3,856

Tencent

Proprietary

nan

89

83

1289

+3/-3

37,021

Google

Proprietary

nan

90

83

1288

+3/-3

72,427

OpenAI

Proprietary

2023/10

91

82

1287

+7/-7

6,302

OpenAI

Proprietary

nan

92

83

1286

+3/-3

43,788

Meta

Llama 3.1 Community

2023/12

93

84

1285

+3/-3

47,973

OpenAI

Proprietary

2023/10

94

83

1284

+7/-7

7,577

Nvidia

Llama 3.1

2023/12

95

83

1284

+5/-5

17,432

Alibaba

Qwen

nan

96

84

1284

+4/-4

25,409

Google

Proprietary

2023/11

97

85

1284

+3/-3

63,038

Meta

Llama 3.1 Community

2023/12

98

84

1283

+5/-5

17,067

01 AI

Proprietary

nan

99

87

1283

+3/-3

55,442

xAI

Proprietary

2024/3

100

87

1283

+3/-3

86,159

Anthropic

Proprietary

2024/4

101

89

1280

+4/-4

52,144

Google

Proprietary

Online

102

95

1276

+3/-3

82,435

Google

Proprietary

2023/11

103

91

1276

+5/-5

14,153

Meta

Llama

nan

104

88

1276

+9/-9

4,014

Tencent

Proprietary

nan

105

99

1275

+3/-3

51,931

Meta

Llama-3.3

nan

106

100

1274

+3/-3

102,133

OpenAI

Proprietary

2023/12

107

99

1274

+4/-4

26,344

DeepSeek

DeepSeek

nan

108

100

1273

+4/-4

55,569

Google

Proprietary

2023/11

109

101

1272

+3/-3

41,519

Alibaba

Qwen

2024/9

110

99

1270

+8/-8

6,093

Tencent

Proprietary

nan

111

102

1269

+3/-3

48,217

Mistral

Mistral Research

2024/7

112

102

1269

+5/-5

14,091

Mistral

Apache 2.0

nan

113

102

1268

+4/-4

29,633

Mistral

MRL

nan

114

102

1267

+5/-5

20,580

NexusFlow

CC-BY-NC-4.0

2024/7

115

107

1266

+3/-3

103,748

OpenAI

Proprietary

2023/4

116

107

1265

+3/-3

97,079

OpenAI

Proprietary

2023/12

117

109

1265

+2/-2

202,641

Anthropic

Proprietary

2023/8

118

109

1264

+3/-3

58,637

Meta

Llama 3.1 Community

2023/12

119

111

1261

+4/-4

26,371

Amazon

Proprietary

nan

120

107

1259

+10/-10

3,010

Ai2

Llama 3.1

nan

121

114

1258

+4/-4

51,628

01 AI

Proprietary

No data

122

119

1255

+3/-3

55,928

Anthropic

Proprietary

nan

123

119

1252

+7/-7

8,534

Mistral

Proprietary

nan

124

119

1251

+6/-6

7,948

Reka AI

Proprietary

nan

125

120

1249

+6/-6

13,279

Reka AI

Proprietary

nan

126

123

1243

+4/-4

65,661

Google

Proprietary

2023/11

127

123

1242

+5/-5

14,626

Alibaba

Proprietary

nan

128

123

1241

+6/-6

9,125

AI21 Labs

Jamba Open

2024/3

129

125

1239

+5/-5

19,508

DeepSeek AI

DeepSeek

nan

130

125

1238

+5/-5

15,321

Mistral

Apache 2.0

nan

131

125

1236

+6/-6

11,725

DeepSeek

Proprietary

nan

132

126

1235

+6/-6

16,624

01 AI

Proprietary

No data

133

127

1235

+2/-2

79,538

Google

Gemma license

2024/6

134

126

1234

+7/-7

5,730

Alibaba

Apache 2.0

nan

135

126

1233

+6/-6

10,535

Cohere

CC-BY-NC-4.0

2024/8

136

127

1233

+4/-4

20,646

Amazon

Proprietary

nan

137

126

1232

+6/-6

10,548

Princeton

MIT

2024/7

138

126

1231

+9/-9

3,889

Nvidia

Llama 3.1

2023/12

139

128

1230

+3/-3

37,697

Google

Proprietary

nan

140

127

1230

+6/-6

10,221

Zhipu AI

Proprietary

No data

141

130

1229

+4/-4

20,608

Nvidia

NVIDIA Open Model

2023/6

142

130

1228

+4/-4

28,768

Cohere

CC-BY-NC-4.0

nan

143

128

1227

+8/-8

11,833

Google

Proprietary

Online

144

132

1224

+6/-6

8,132

Reka AI

Proprietary

nan

145

135

1224

+3/-3

163,629

Meta

Llama 3 Community

2023/12

146

130

1223

+10/-10

3,460

Allen AI

Apache-2.0

nan

147

138

1222

+3/-3

113,067

Anthropic

Proprietary

2023/8

148

138

1222

+4/-4

25,213

Microsoft

MIT

nan

149

138

1221

+4/-4

25,346

Google

Proprietary

2023/11

150

142

1217

+3/-3

62,555

Reka AI

Proprietary

nan

151

142

1217

+6/-6

13,725

Reka AI

Proprietary

nan

152

144

1214

+4/-4

20,654

Amazon

Proprietary

nan

153

149

1212

+3/-3

57,197

Google

Gemma license

2024/6

154

149

1210

+4/-4

55,962

OpenAI

Proprietary

2021/9

155

151

1208

+3/-3

80,846

Cohere

CC-BY-NC-4.0

2024/3

156

144

1208

+11/-11

2,901

Tencent

Proprietary

nan

157

151

1208

+4/-4

38,872

Alibaba

Qianwen LICENSE

2024/6

158

156

1200

+3/-3

122,309

Anthropic

Proprietary

2023/8

159

153

1199

+10/-10

3,074

Ai2

Llama 3.1

nan

160

156

1199

+5/-5

25,696

Alibaba

Proprietary

nan

161

154

1198

+9/-9

7,579

Zhipu AI

Proprietary

No data

162

154

1198

+8/-8

5,111

Mistral

MRL

nan

163

157

1196

+6/-6

15,753

DeepSeek AI

DeepSeek License

2024/6

164

157

1195

+6/-6

10,851

Cohere

CC-BY-NC-4.0

2024/8

165

157

1193

+6/-6

10,391

Cohere

CC-BY-NC-4.0

nan

166

157

1193

+6/-6

9,274

AI21 Labs

Jamba Open

2024/3

167

159

1193

+3/-3

52,578

Meta

Llama 3.1 Community

2023/12

168

159

1191

+3/-3

91,614

OpenAI

Proprietary

2021/9

169

159

1188

+5/-5

20,427

Reka AI

Proprietary

nan

170

169

1181

+4/-4

64,926

Mistral

Proprietary

nan

171

169

1180

+5/-5

27,430

Alibaba

Qianwen LICENSE

2024/4

172

169

1178

+6/-6

21,149

Anthropic

Proprietary

nan

173

170

1178

+4/-4

25,135

01 AI

Apache-2.0

2024/5

174

169

1176

+7/-7

16,027

Reka AI

Proprietary

Online

175

170

1171

+4/-4

40,658

Alibaba

Qianwen LICENSE

2024/2

176

172

1171

+3/-3

109,056

Meta

Llama 3 Community

2023/3

177

171

1170

+5/-5

35,556

Mistral

Proprietary

nan

178

171

1170

+5/-5

25,803

Reka AI

Proprietary

2023/11

179

170

1170

+10/-10

3,410

Alibaba

Apache 2.0

nan

180

172

1169

+4/-4

56,398

Cohere

CC-BY-NC-4.0

2024/3

181

172

1168

+6/-6

10,599

InternLM

Other

2024/8

182

173

1167

+4/-4

53,751

Mistral

Apache 2.0

2024/4

183

176

1163

+3/-3

48,892

Google

Gemma license

2024/7

184

177

1158

+8/-8

12,763

Anthropic

Proprietary

nan

185

175

1157

+10/-10

3,289

IBM

Apache 2.0

nan

186

181

1154

+7/-7

18,800

Google

Proprietary

2023/4

187

181

1153

+8/-8

12,375

Mistral

Proprietary

nan

188

182

1150

+10/-10

4,854

HuggingFace

Apache 2.0

2024/4

189

184

1146

+6/-6

37,699

Anthropic

Proprietary

nan

190

184

1145

+5/-5

38,955

OpenAI

Proprietary

2021/9

191

184

1144

+5/-5

22,765

Alibaba

Qianwen LICENSE

2024/2

192

185

1143

+4/-4

26,105

Microsoft

MIT

2023/10

193

187

1138

+7/-7

16,676

Nexusflow

Apache-2.0

2024/3

194

188

1138

+3/-3

76,126

Mistral

Apache 2.0

2023/12

195

185

1137

+11/-11

5,640

OpenAI

Proprietary

2021/9

196

185

1137

+11/-11

6,557

Google

Proprietary

2023/4

197

187

1135

+10/-10

3,380

IBM

Apache 2.0

nan

198

188

1135

+7/-7

20,631

Anthropic

Proprietary

nan

199

188

1135

+6/-6

18,687

Alibaba

Qianwen LICENSE

2024/2

200

188

1134

+6/-6

15,917

01 AI

Yi License

2023/6

201

188

1134

+6/-6

15,917

01 AI

Yi License

2023/6

202

193

1131

+4/-4

68,867

OpenAI

Proprietary

2021/9

203

193

1127

+9/-9

6,658

AllenAI/UW

AI2 ImpACT Low-risk

2023/11

204

195

1125

+5/-5

33,743

Databricks

DBRX LICENSE

2023/12

205

193

1125

+9/-9

8,383

Microsoft

Llama 2 Community

2023/8

206

199

1122

+5/-5

39,595

Meta

Llama 2 Community

2023/7

207

195

1119

+11/-11

3,836

NousResearch

Apache-2.0

2024/1

208

202

1117

+7/-7

8,390

Meta

Llama 3.2

2023/12

209

202

1117

+5/-5

18,476

Microsoft

MIT

2023/10

210

202

1114

+7/-7

10,415

UC Berkeley

CC-BY-NC-4.0

2023/11

211

202

1113

+7/-7

12,990

OpenChat

Apache-2.0

2024/1

212

203

1112

+5/-5

22,936

LMSYS

Non-commercial

2023/8

213

202

1110

+11/-11

4,988

DeepSeek AI

DeepSeek License

2023/11

214

206

1108

+5/-5

34,173

Snowflake

Apache 2.0

2024/4

215

206

1107

+8/-8

7,002

IBM

Apache 2.0

nan

216

203

1105

+12/-12

3,636

Nvidia

Llama 2 Community

2023/11

217

206

1103

+9/-9

8,106

OpenChat

Apache-2.0

2023/11

218

208

1102

+5/-5

25,070

Google

Gemma license

2024/2

219

208

1100

+8/-8

17,036

OpenAI

Proprietary

2021/9

220

208

1100

+10/-10

5,088

NousResearch

Apache-2.0

2023/11

221

208

1099

+10/-10

6,898

Perplexity AI

Proprietary

Online

222

212

1097

+6/-6

20,067

Mistral

Apache-2.0

2023/12

223

215

1092

+6/-6

19,722

Meta

Llama 2 Community

2023/7

224

212

1091

+12/-12

4,286

Upstage AI

CC-BY-NC-4.0

2023/11

225

215

1090

+7/-7

7,191

IBM

Apache 2.0

nan

226

213

1090

+9/-9

4,872

Alibaba

Qianwen LICENSE

2024/2

227

212

1088

+15/-15

1,714

Cognitive Computations

Apache-2.0

2023/10

228

216

1087

+6/-6

12,808

Microsoft

MIT

2023/10

229

217

1084

+9/-9

7,176

Microsoft

Llama 2 Community

2023/7

230

222

1081

+5/-5

21,097

Microsoft

MIT

2023/10

231

223

1076

+8/-8

11,321

HuggingFace

MIT

2023/10

232

222

1075

+12/-12

2,644

MosaicML

CC-BY-NC-SA-4.0

2023/6

233

225

1072

+8/-8

7,509

Meta

Llama 2 Community

2023/7

234

229

1067

+7/-7

8,523

Meta

Llama 3.2

2023/12

235

224

1066

+15/-15

1,811

HuggingFace

MIT

2023/10

236

230

1065

+6/-6

19,775

LMSYS

Llama 2 Community

2023/7

237

223

1065

+17/-17

1,192

Meta

Llama 2 Community

2024/1

238

229

1064

+10/-10

6,339

Perplexity AI

Proprietary

Online

239

230

1063

+9/-9

9,176

Google

Gemma license

2024/2

240

228

1062

+13/-13

2,375

HuggingFace

Apache 2.0

nan

241

226

1061

+17/-17

1,327

TII

Falcon-180B TII License

2023/9

242

230

1061

+6/-6

21,622

Microsoft

MIT

2023/10

243

230

1060

+6/-6

14,532

Meta

Llama 2 Community

2023/7

244

230

1060

+12/-12

2,996

UW

Non-commercial

2023/5

245

230

1058

+10/-10

5,065

Alibaba

Qianwen LICENSE

2023/8

246

235

1044

+10/-10

5,276

Together AI

Apache 2.0

2023/12

247

243

1038

+9/-9

7,017

LMSYS

Llama 2 Community

2023/7

248

240

1037

+10/-10

6,503

Allen AI

Apache-2.0

2024/2

249

245

1034

+9/-9

8,713

Google

Proprietary

2021/6

250

245

1032

+7/-7

11,351

Google

Gemma license

2024/2

251

245

1031

+9/-9

9,142

Mistral

Apache 2.0

2023/9

252

251

1011

+11/-11

4,918

Google

Gemma license

2024/2

253

251

1006

+9/-9

7,816

Alibaba

Qianwen LICENSE

2024/2

254

251

996

+9/-9

7,020

UC Berkeley

Non-commercial

2023/4

255

253

978

+11/-11

4,763

Tsinghua

Apache-2.0

2023/10

256

254

963

+11/-11

3,997

MosaicML

CC-BY-NC-SA-4.0

2023/5

257

254

962

+15/-15

1,788

Nomic AI

Non-commercial

2023/3

258

254

956

+11/-11

4,920

RWKV

Apache 2.0

2023/4

259

255

947

+13/-13

2,713

Tsinghua

Apache-2.0

2023/6

260

255

940

+11/-11

5,864

Stanford

Non-commercial

2023/3

261

258

925

+12/-12

4,983

Tsinghua

Non-commercial

2023/3

262

258

923

+10/-10

6,368

OpenAssistant

Apache 2.0

2023/4

263

260

901

+12/-12

4,288

LMSYS

Apache 2.0

2023/4

264

263

873

+12/-12

3,336

Stability AI

CC-BY-NC-SA-4.0

2023/4

265

263

857

+13/-13

3,480

Databricks

MIT

2023/4

266

264

840

+16/-16

2,446

Meta

Non-commercial

2023/2

Column descriptions

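  • Rank (UB): Ranking derived from the Bradley-Terry model. It reflects a model's overall performance in the arena. "UB" marks it as an upper-bound estimate that takes the uncertainty of the Elo score into account, which helps gauge a model's potential competitiveness.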

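  • Rank (StyleCtrl): Ranking after controlling for conversational style. It reduces preference bias caused by response style (such as verbosity or brevity), so the model's core capability is evaluated more directly.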

  • Model name: Name of the large language model (LLM). Each name carries an embedded link; click it to jump to the related page.

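  • Score: Elo rating earned from user votes in the arena. Elo is a relative rating system: a higher score indicates better performance. The score changes over time and reflects a model's relative strength in the current competitive field.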

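  • Confidence interval: 95% confidence interval of the model's Elo score (e.g. +6/-6). A narrower interval indicates a more stable, more reliable score; a wider interval suggests too few votes or more variable performance. It quantifies the precision of the score.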

  • Votes: Total number of votes the model has received in the arena. More votes generally mean higher statistical reliability of the rating.

  • Provider: The organization or company that provides the model.

  • License: The model's license type (e.g. Proprietary, Apache 2.0, MIT).

  • Knowledge cutoff: Cutoff date of the knowledge in the model's training data. "No data" (データなし) means the information was not provided or is unknown.
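
How the score, confidence interval, and rank columns relate can be sketched in Python. Two assumptions here are not stated on this page. First, the win probability uses the conventional Elo logistic curve (base 10, scale 400). Second, the rank rule is "1 + the number of models whose lower bound lies above this model's upper bound", which is how the upstream leaderboard has described Rank (UB) to the best of our knowledge. All names (`Row`, `bounds`, `win_probability`, `rank_ub`) are illustrative.

```python
from dataclasses import dataclass


@dataclass
class Row:
    score: float   # "Score" column
    interval: str  # "Confidence interval" column, e.g. "+6/-6"


def bounds(row: Row) -> tuple[float, float]:
    """Return the (lower, upper) score bounds implied by a '+a/-b' interval."""
    plus, minus = row.interval.split("/")
    return row.score - abs(float(minus)), row.score + float(plus)


def win_probability(score_a: float, score_b: float) -> float:
    """Expected win rate of A over B under the conventional Elo curve (assumed)."""
    return 1.0 / (1.0 + 10 ** ((score_b - score_a) / 400.0))


def rank_ub(rows: list[Row]) -> list[int]:
    """Assumed rule: 1 + count of models whose lower bound exceeds this upper bound."""
    limits = [bounds(r) for r in rows]
    ranks = []
    for _, upper in limits:
        clearly_better = sum(1 for lower, _unused in limits if lower > upper)
        ranks.append(1 + clearly_better)
    return ranks


# Score and confidence interval of the first ten rows of the table.
top10 = [
    Row(1470, "+5/-5"), Row(1446, "+6/-6"), Row(1434, "+9/-9"),
    Row(1434, "+6/-6"), Row(1429, "+4/-4"), Row(1428, "+4/-4"),
    Row(1427, "+9/-9"), Row(1427, "+5/-5"), Row(1423, "+4/-4"),
    Row(1416, "+4/-4"),
]

print(rank_ub(top10))               # [1, 2, 2, 2, 3, 3, 3, 3, 4, 8]
print(win_probability(1470, 1446))  # about 0.53
```

For these ten rows the computed ranks coincide with the second rank column of the table. The example also shows why a 24-point score gap is modest: under the assumed curve it corresponds to an expected win rate of only about 53%.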

Data source and update frequency

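The data for this ranking is generated and provided automatically by the fboulnois/llm-leaderboard-csv project, which fetches and processes data from lmarena.ai. The ranking is updated automatically every day by GitHub Actions.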
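
A minimal sketch of how CSV data of this kind could be loaded and filtered with Python's standard library. The header names and the `model-a`/`model-b`/`model-c` placeholders are hypothetical: the actual column names of the upstream CSV are not given on this page, and the model names did not survive in the table. The numeric cells are copied from the first three rows. Treating `nan` as a missing value is also an assumption.

```python
import csv
import io

# Hypothetical sample: header and model names are placeholders (see above).
SAMPLE = """rank_ub,rank_stylectrl,model,score,ci,votes,provider,license,knowledge_cutoff
1,1,model-a,1470,+5/-5,"26,019",Google,Proprietary,nan
2,2,model-b,1446,+6/-6,"13,715",Google,Proprietary,nan
3,2,model-c,1434,+9/-9,"4,112",Z.ai,MIT,nan
"""


def load_rows(text: str) -> list[dict]:
    """Parse leaderboard CSV text into dicts with numeric fields converted."""
    rows = []
    for raw in csv.DictReader(io.StringIO(text)):
        row = dict(raw)
        row["score"] = int(raw["score"])
        # Vote counts are shown with thousands separators; strip them first.
        row["votes"] = int(raw["votes"].replace(",", ""))
        # Assumption: "nan" marks a missing knowledge cutoff.
        cutoff = raw["knowledge_cutoff"]
        row["knowledge_cutoff"] = None if cutoff == "nan" else cutoff
        rows.append(row)
    return rows


def non_proprietary(rows: list[dict]) -> list[str]:
    """Return the names of models whose license is not 'Proprietary'."""
    return [r["model"] for r in rows if r["license"] != "Proprietary"]


rows = load_rows(SAMPLE)
print(non_proprietary(rows))  # ['model-c']
```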

Disclaimer

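This report is provided for reference only. The ranking data changes over time and is based on users' preference votes on Chatbot Arena during a specific period. The completeness and accuracy of the data depend on the upstream data source and on how the fboulnois/llm-leaderboard-csv project updates and processes it. Different models may use different licenses, so always consult the model provider's official documentation before use.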
