Skip to content

Commit cc0176e

Browse files
author
Yuwei Yan
committed
update leaderboard
1 parent dc2e091 commit cc0176e

File tree

2 files changed

+32
-2
lines changed

2 files changed

+32
-2
lines changed

docs/assets/data/final/behavior_leaderboard.csv

Lines changed: 20 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -85,4 +85,23 @@ AwesomeAgent,2025-02-09(2),0.8400,0.8123,0.8936,0.7812,0.8262
8585
ustc-agent,2025-02-09(1),0.8500,0.8163,0.8978,0.7901,0.8332
8686
WiseAgents,2025-02-09(3),0.8488,0.8256,0.8947,0.7989,0.8372
8787
FollowSen,2025-02-09(3),0.8497,0.8124,0.8935,0.7894,0.8310
88-
ASC,2025-02-09(3),0.2463,0.6194,0.4543,0.4186,0.4329
88+
ASC,2025-02-09(3),0.2463,0.6194,0.4543,0.4186,0.4329
89+
ustc-agent,2025-02-10(1),0.8312,0.8125,0.8773,0.7849,0.8218
90+
ustc-agent,2025-02-10(2),0.8338,0.8131,0.8821,0.7843,0.8235
91+
CRUISE,2025-02-10(2),0.8460,0.8131,0.8818,0.7947,0.8295
92+
何解?和解!,2025-02-10(1),0.7157,0.7532,0.7631,0.7153,0.7344
93+
何解?和解!,2025-02-10(2),0.7173,0.7542,0.7648,0.7164,0.7358
94+
ASC,2025-02-10(1),0.6837,0.7439,0.7562,0.6855,0.7138
95+
SDU-AI,2025-02-10(1),0.8393,0.8170,0.8848,0.7904,0.8282
96+
何解?和解!,2025-02-10(3),0.8487,0.8109,0.8785,0.7973,0.8298
97+
FollowSen,2025-02-10(3),0.8458,0.8136,0.9032,0.7807,0.8297
98+
WiseAgents,2025-02-10(1),0.8393,0.8199,0.8914,0.7884,0.8296
99+
CRUISE,2025-02-10(1),0.8488,0.8097,0.8725,0.8004,0.8293
100+
ASC,2025-02-10(2),0.2463,0.6194,0.4543,0.4186,0.4329
101+
AwesomeAgent,2025-02-10(2),0.8517,0.8088,0.8930,0.7884,0.8302
102+
AwesomeAgent,2025-02-10(1),0.8523,0.8096,0.8913,0.7907,0.8310
103+
伸腿瞪眼丸,2025-02-10(3),0.8452,0.8250,0.8913,0.7976,0.8351
104+
FollowSen,2025-02-10(2),0.8445,0.8193,0.8889,0.7939,0.8319
105+
Santiago,2025-02-10(3),0.8383,0.8039,0.8483,0.8030,0.8211
106+
CRUISE,2025-02-10(3),0.8457,0.8082,0.8758,0.7944,0.8269
107+
伸腿瞪眼丸,2025-02-10(2),0.8467,0.8231,0.8891,0.7988,0.8349

docs/assets/data/final/recommendation_leaderboard.csv

Lines changed: 12 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -82,4 +82,15 @@ baseline666,2025-02-09(1),0.1867,0.3067,0.3933,0.4222,0.2111,0.2956
8282
BrainNotFound404,2025-02-09(2),0.2683,0.4783,0.5883,0.5958,0.3444,0.4450
8383
100%Hit,2025-02-09(1),0.2183,0.4100,0.5350,0.5639,0.2704,0.3878
8484
yoyo_agent,2025-02-09(2),0.2583,0.4233,0.5300,0.6319,0.2519,0.4039
85-
DummyAgent,2025-02-09(2),0.2683,0.4533,0.5800,0.5931,0.3278,0.4339
85+
DummyAgent,2025-02-09(2),0.2683,0.4533,0.5800,0.5931,0.3278,0.4339
86+
UniqueData,2025-02-10(3),0.1283,0.2567,0.3233,0.3514,0.1593,0.2361
87+
baseline666,2025-02-10(3),0.0000,0.0000,0.0000,0.0000,0.0000,0.0000
88+
SSSmith,2025-02-10(2),0.2633,0.4417,0.5717,0.6208,0.2954,0.4256
89+
baseline666,2025-02-10(1),0.1883,0.3033,0.4000,0.4292,0.2093,0.2972
90+
UniqueData,2025-02-10(1),0.0000,0.0000,0.0000,0.0000,0.0000,0.0000
91+
RecHackers,2025-02-10(2),0.2717,0.4900,0.6083,0.6403,0.3343,0.4567
92+
DummyAgent,2025-02-10(3),0.2950,0.4750,0.6167,0.6236,0.3546,0.4622
93+
SSSmith,2025-02-10(1),0.2583,0.4467,0.5650,0.6111,0.2981,0.4233
94+
tsdxgxxb,2025-02-10(1),0.2100,0.4033,0.5383,0.5444,0.2769,0.3839
95+
BrainNotFound404,2025-02-10(1),0.2317,0.4200,0.5383,0.5694,0.2815,0.3967
96+
DummyAgent,2025-02-10(1),0.2517,0.4017,0.5117,0.4528,0.3454,0.3883

0 commit comments

Comments
 (0)