Environment,"openai/baselines' PPO2 (mean episodic return over the last 100 training episodes, then averaged over 3 random seeds)","CleanRL's ppo_atari_envpool_xla_jax_truncation.py (mean over the last 100 ""data points"" (see `README.md`), then averaged over 3 random seeds)",CleanRL's ppo_atari_envpool_xla_jax.py,CleanRL's ppo_atari_envpool_xla_vclip_jax.py
Alien-v5,1705.80 ± 439.74,1755.00 ± 342.54,1736.39 ± 68.65,1849.70 ± 212.53
Amidar-v5,585.99 ± 52.92,843.47 ± 31.70,653.53 ± 44.06,629.55 ± 28.58
Assault-v5,4878.67 ± 815.64,6248.95 ± 261.93,6791.74 ± 420.03,4686.32 ± 165.95
Asterix-v5,3738.50 ± 745.13,4903.44 ± 479.45,4820.33 ± 1091.83,3211.71 ± 491.48
Asteroids-v5,1556.90 ± 151.20,1769.68 ± 200.53,1633.67 ± 247.21,1482.78 ± 92.81
Atlantis-v5,2036749.00 ± 95929.75,3258958.33 ± 553392.40,3778458.33 ± 117680.68,3106283.33 ± 847401.71
BankHeist-v5,1213.47 ± 14.46,1167.31 ± 37.27,1195.44 ± 18.54,1180.35 ± 9.64
BattleZone-v5,19980.00 ± 1355.21,20424.17 ± 1986.70,24283.75 ± 1841.94,24843.33 ± 3396.30
BeamRider-v5,2835.71 ± 387.92,3133.78 ± 293.02,2478.44 ± 336.55,2290.58 ± 169.61
Berzerk-v5,1049.77 ± 144.58,991.35 ± 215.61,992.88 ± 196.90,1120.79 ± 208.29
Bowling-v5,59.66 ± 0.62,44.76 ± 7.63,51.62 ± 13.53,46.05 ± 2.22
Boxing-v5,93.32 ± 0.36,93.64 ± 0.17,92.68 ± 1.41,93.98 ± 0.41
Breakout-v5,405.73 ± 11.47,465.90 ± 14.30,430.09 ± 8.12,402.39 ± 20.21
Centipede-v5,3688.54 ± 412.24,3494.08 ± 58.71,3309.34 ± 325.05,3778.47 ± 304.73
ChopperCommand-v5,816.33 ± 114.14,4422.79 ± 833.81,5642.83 ± 802.34,5189.46 ± 842.38
CrazyClimber-v5,119344.67 ± 4902.83,118370.17 ± 4364.35,118763.04 ± 4915.34,119406.25 ± 1637.43
Defender-v5,50161.67 ± 4477.49,49146.98 ± 4077.41,48558.98 ± 4466.76,44466.88 ± 1972.85
DemonAttack-v5,13788.43 ± 1313.44,23497.63 ± 4008.79,29283.83 ± 7007.31,15175.72 ± 3649.67
DoubleDunk-v5,-12.96 ± 0.31,-6.72 ± 1.02,-6.81 ± 0.24,-5.58 ± 1.85
Enduro-v5,986.69 ± 25.28,1178.37 ± 90.34,1297.23 ± 143.71,1082.41 ± 150.29
FishingDerby-v5,26.23 ± 2.76,26.20 ± 0.49,21.21 ± 6.73,27.16 ± 1.53
Freeway-v5,32.97 ± 0.37,32.77 ± 0.22,33.10 ± 0.31,32.67 ± 0.41
Frostbite-v5,933.60 ± 885.92,296.23 ± 16.84,1137.34 ± 1192.05,287.50 ± 12.68
Gopher-v5,3672.53 ± 1749.20,2493.51 ± 2185.28,6505.29 ± 7655.20,4146.18 ± 2243.14
Gravitar-v5,881.67 ± 33.73,626.52 ± 46.03,1099.33 ± 603.06,558.21 ± 107.62
Hero-v5,24746.88 ± 3530.10,29907.28 ± 2776.20,26429.65 ± 924.74,28615.30 ± 3768.10
IceHockey-v5,-4.12 ± 0.20,-4.51 ± 0.27,-4.33 ± 0.43,-4.29 ± 0.42
PrivateEye-v5,31.83 ± 43.74,57.67 ± 42.24,100.00 ± 0.00,66.67 ± 47.14
Qbert-v5,15228.25 ± 920.95,17318.50 ± 385.80,17246.27 ± 605.40,16292.49 ± 1808.49
Riverraid-v5,9023.57 ± 1386.85,8676.91 ± 725.75,8275.25 ± 256.63,8129.92 ± 101.03
RoadRunner-v5,40125.33 ± 7249.13,45092.50 ± 796.30,33040.38 ± 16488.95,38112.03 ± 6466.17
Robotank-v5,16.45 ± 3.37,11.69 ± 5.49,14.43 ± 4.98,12.08 ± 4.78
Seaquest-v5,1518.33 ± 400.35,1203.30 ± 364.07,1240.30 ± 419.36,1347.85 ± 365.20
Skiing-v5,-22978.48 ± 9894.25,-11533.75 ± 3604.51,-18483.46 ± 8684.71,-24675.58 ± 8127.48
Solaris-v5,2365.33 ± 157.75,2056.02 ± 89.43,2198.36 ± 147.23,2105.15 ± 315.33
SpaceInvaders-v5,1019.75 ± 49.08,1164.26 ± 176.96,1188.82 ± 80.52,1049.47 ± 174.25
StarGunner-v5,44457.67 ± 3031.86,46906.46 ± 2422.07,43519.12 ± 4709.23,41980.45 ± 2504.82
Surround-v5,-4.97 ± 0.99,-5.33 ± 1.73,-2.58 ± 2.31,-4.38 ± 1.29
Tennis-v5,-16.44 ± 1.46,-12.02 ± 7.92,-17.64 ± 4.60,-11.76 ± 6.15
TimePilot-v5,6346.67 ± 663.31,6915.75 ± 1400.00,6476.46 ± 993.30,5829.32 ± 892.68
Tutankham-v5,190.73 ± 12.00,221.88 ± 28.88,249.05 ± 16.56,215.81 ± 31.52
UpNDown-v5,156143.70 ± 70620.88,394292.22 ± 199475.88,487495.41 ± 39751.49,349356.45 ± 101680.24
Venture-v5,109.33 ± 61.57,0.00 ± 0.00,0.00 ± 0.00,53.60 ± 58.92
VideoPinball-v5,53121.26 ± 2580.70,76031.30 ± 17313.17,43133.94 ± 6362.12,61215.02 ± 18221.35
WizardOfWor-v5,5346.33 ± 277.11,6392.12 ± 548.63,6353.58 ± 116.59,6424.12 ± 916.93
YarsRevenge-v5,9394.97 ± 2743.74,63012.63 ± 5800.47,55757.68 ± 7467.49,18712.22 ± 7366.72
Zaxxon-v5,5532.67 ± 2607.65,4767.50 ± 3370.76,3689.67 ± 2477.25,6063.45 ± 823.77
Jamesbond-v5,536.50 ± 82.33,585.02 ± 106.40,496.08 ± 24.60,569.12 ± 31.18
Kangaroo-v5,5325.33 ± 3464.80,4484.29 ± 323.07,6582.12 ± 5395.44,7771.92 ± 2709.64
Krull-v5,8737.10 ± 294.58,9420.27 ± 271.85,9718.09 ± 649.15,9725.10 ± 233.86
KungFuMaster-v5,30451.67 ± 5515.45,29591.29 ± 1533.86,26000.25 ± 1965.22,30428.33 ± 3431.99
MontezumaRevenge-v5,1.00 ± 1.41,4.25 ± 6.01,0.08 ± 0.12,0.00 ± 0.00
MsPacman-v5,2152.83 ± 152.80,2574.25 ± 589.84,2345.67 ± 185.94,2135.23 ± 272.48
NameThisGame-v5,6815.63 ± 1098.95,5939.28 ± 118.37,5750.00 ± 181.32,6044.62 ± 340.07
Phoenix-v5,9517.73 ± 1176.62,14847.46 ± 480.81,14474.11 ± 1794.83,11524.45 ± 490.14
Pitfall-v5,-0.76 ± 0.55,-113.92 ± 98.81,0.00 ± 0.00,-28.11 ± 38.46
Pong-v5,20.45 ± 0.81,20.62 ± 0.18,20.39 ± 0.24,20.54 ± 0.28