episode-length-max,max-path-return,log-sigs-std,time-train,log-pi-std,rf_pi-std,last-path-return,epoch,time-eval,div-avg,rf_loss,pool-size,rf_pi-avg,alpha-std,log-pi-mean,return-min,log-sigs-mean,log-pi-max,episode-length-std,avg-path-return,log-sigs-max,episode-length-avg,max-norm,policy-mus-max,qf2-std,time-total,qf1-std,log-pi-min,return-std,episode-length-min,mean-qf-diff,time-sample,qf1-avg,return-max,log-sigs-min,alpha-avg,return-average,coverage,policy-mus-min,policy-mus-std,rf_q-avg,qf2-avg,policy-mus-mean,total-samples,vf-std,mean-sq-bellman-error1,vf-avg,episodes,mean-sq-bellman-error2,rf_q-std
1000,0.0,0.362964,0.08725127764046192,1.10423,0.0967981,0.0,0,0,-0.222402,0.222402,1000,0.543232,0.0897382,-3.10249,0.0,0.103173,-1.28906,0.0,0.0,1.08073,1000.0,42.8484230042,4.83623,0.632103,3.6957188635133207,0.632103,-7.51298,0.0,1000,0.966606,0.4772969619370997,-0.707596,0.0,-0.559285,0.80401,0.0,253,-1.02362,1.25235,0.539759,-0.707596,0.721171,1000,0.810985,0.573192,-0.431343,1,0.0534564,0.100403
