Real Sto Return,Running Env Steps,Running Reverse KL,Real Det Return,Reward Loss,Running Update Time,Itration,Running Forward KL
234.91,0,235.2487,272.08,300787.8125,0,0,18.9502
293.8,5000,235.0533,289.15,149800.390625,1,1,18.5398
325.86,10000,228.4397,209.03,315523.40625,2,2,18.9006
362.35,15000,216.6642,334.07,69216.515625,3,3,18.1494
