policy-mus-mean,alpha-std,qf1-avg,episode-length-max,return-average,rf_q-std,time-eval,last-path-return,return-min,return-max,epoch,log-pi-max,mean-sq-bellman-error1,alpha-avg,time-sample,log-pi-std,log-sigs-max,qf2-std,div-avg,policy-mus-min,max-path-return,avg-path-return,time-total,log-pi-mean,policy-mus-max,episodes,policy-mus-std,vf-std,log-sigs-mean,time-train,episode-length-avg,log-pi-min,episode-length-std,episode-length-min,pool-size,qf1-std,return-std,rf_loss,rf_q-avg,vf-avg,rf_pi-std,log-sigs-std,log-sigs-min,rf_pi-avg,qf2-avg,mean-qf-diff,mean-sq-bellman-error2,total-samples
-0.341195,0.0588467,-0.133039,1000,0.0,0.100516,0,0.0,0.0,0.0,0,-1.07276,0.318967,0.696687,0.09761516097933054,2.26312,1.96896,0.416835,-0.0456302,-2.65356,0.0,0.0,0.8305913577787578,-6.73254,1.82093,1,0.632261,0.458856,-0.28989,0.13969804998487234,1000.0,-15.6547,0.0,1000,1000,0.416835,0.0,0.0456302,0.692389,-0.0224032,0.101208,0.637919,-3.43785,0.689161,-0.133039,0.757321,0.492228,1000
