Some good settings found by the hyper-parameter search for the Reco domain (`env_name`: `NS_Reco`), listed per algorithm and per speed.

The relevant hyper-parameters are:

   - actor_lr
   - delta
   - entropy_lambda
   - fourier_k
   - importance_clip
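
Each entry below is a raw Python dict dump, so the relevant values are easy to miss among the fixed settings. The following is a minimal sketch of one way to pull them out of a single entry. The helper name `parse_setting` and the shortened `example` line are hypothetical and not part of the original code base; the key names are the ones that appear in the dumps.

```python
import ast

# Keys to extract, spelled as they appear in the dumps below.
RELEVANT = ("actor_lr", "delta", "entropy_lambda", "fourier_k", "importance_clip")


def parse_setting(line):
    """Parse one '- Speed N :  {...}' line into (speed, relevant hyper-params).

    Hypothetical helper: assumes the dict dump starts at the first '{'
    and runs to the end of the line.
    """
    _, brace, dict_text = line.partition("{")
    if not brace:
        raise ValueError("no dict dump found in line")
    # literal_eval only accepts Python literals, so no code is executed.
    config = ast.literal_eval(brace + dict_text.strip())
    relevant = {key: config[key] for key in RELEVANT}
    return config["speed"], relevant


# Shortened version of the ProOLS / Speed 0 entry (most fixed keys omitted).
example = (
    "- Speed 0 :  {'actor_lr': 0.0013967975033711264, 'delta': 1, "
    "'entropy_lambda': 0.009309067356364342, 'fourier_k': 5, "
    "'importance_clip': 15.0, 'speed': 0}"
)
speed, relevant = parse_setting(example)
print(speed, relevant)
```

Running the same helper over every `- Speed N :` line in a section gives a compact per-speed view of the values that differ between runs.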

-----------------


### ProOLS


- Speed 0 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.0013967975033711264, 'algo_name': 'ProOLS', 'base': 0, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 1, 'entropy_lambda': 0.009309067356364342, 'env_name': 'NS_Reco', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '1181_NS_0_-1_NS_Reco_Fourier_1000_1_30_5_15.0_0.009309067356364342_0.0013967975033711264_0.99_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 5, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 1.5, 'gpu': 0, 'hyper': 'Reco0', 'importance_clip': 15.0, 'inc': 35459, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 30, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 29, 'speed': 0, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|5|18:50:26'}

- Speed 1 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.0045712864064011045, 'algo_name': 'ProOLS', 'base': 0, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 3, 'entropy_lambda': 0.04318892012402085, 'env_name': 'NS_Reco', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '1792_NS_1_-1_NS_Reco_Fourier_1000_3_60_5_5.0_0.04318892012402085_0.0045712864064011045_0.99_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 5, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 1.5, 'gpu': 0, 'hyper': 'Reco0', 'importance_clip': 5.0, 'inc': 53788, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 60, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 28, 'speed': 1, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|5|18:34:51'}

- Speed 2 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.002855902814944507, 'algo_name': 'ProOLS', 'base': 0, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 1, 'entropy_lambda': 0.053752942888962034, 'env_name': 'NS_Reco', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '115_NS_2_-1_NS_Reco_Fourier_1000_1_30_7_5.0_0.053752942888962034_0.002855902814944507_0.99_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 7, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 1.5, 'gpu': 0, 'hyper': 'Reco0', 'importance_clip': 5.0, 'inc': 3478, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 30, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 28, 'speed': 2, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|5|18:50:11'}

- Speed 3 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.006528973986480399, 'algo_name': 'ProOLS', 'base': 0, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 1, 'entropy_lambda': 0.03442646767804006, 'env_name': 'NS_Reco', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '1039_NS_3_-1_NS_Reco_Fourier_1000_1_30_7_5.0_0.03442646767804006_0.006528973986480399_0.99_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 7, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 1.5, 'gpu': 0, 'hyper': 'Reco0', 'importance_clip': 5.0, 'inc': 31199, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 30, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 29, 'speed': 3, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|5|18:42:49'}

- Speed 4 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.006330904637554062, 'algo_name': 'ProOLS', 'base': 0, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 1, 'entropy_lambda': 0.05062512794041892, 'env_name': 'NS_Reco', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '570_NS_4_-1_NS_Reco_Fourier_1000_1_20_5_5.0_0.05062512794041892_0.006330904637554062_0.99_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 5, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 1.5, 'gpu': 0, 'hyper': 'Reco0', 'importance_clip': 5.0, 'inc': 17129, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 20, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 29, 'speed': 4, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|5|18:34:53'}

### ProWLS

- Speed 0 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.004415491219348583, 'algo_name': 'ProWLS', 'base': 0, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 1, 'entropy_lambda': 0.0010074417839719013, 'env_name': 'NS_Reco', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '1986_NS_0_-1_NS_Reco_Fourier_1000_1_20_3_5.0_0.0010074417839719013_0.004415491219348583_0.99_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 3, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 1.5, 'gpu': 0, 'hyper': 'Reco1', 'importance_clip': 5.0, 'inc': 59608, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 20, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 28, 'speed': 0, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|5|19:7:34'}

- Speed 1 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.0017966647558361548, 'algo_name': 'ProWLS', 'base': 0, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 1, 'entropy_lambda': 0.006997441955638448, 'env_name': 'NS_Reco', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '177_NS_1_-1_NS_Reco_Fourier_1000_1_30_3_10.0_0.006997441955638448_0.0017966647558361548_0.99_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 3, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 1.5, 'gpu': 0, 'hyper': 'Reco1', 'importance_clip': 10.0, 'inc': 5338, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 30, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 28, 'speed': 1, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|5|20:16:24'}

- Speed 2 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.007650333890460954, 'algo_name': 'ProWLS', 'base': 0, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 5, 'entropy_lambda': 0.03134205440723196, 'env_name': 'NS_Reco', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '1944_NS_2_-1_NS_Reco_Fourier_1000_5_50_3_15.0_0.03134205440723196_0.007650333890460954_0.99_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 3, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 1.5, 'gpu': 0, 'hyper': 'Reco1', 'importance_clip': 15.0, 'inc': 58349, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 50, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 29, 'speed': 2, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|5|18:44:12'}

- Speed 3 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.00450428892499054, 'algo_name': 'ProWLS', 'base': 0, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 3, 'entropy_lambda': 0.07518854390699839, 'env_name': 'NS_Reco', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '1079_NS_3_-1_NS_Reco_Fourier_1000_3_90_5_10.0_0.07518854390699839_0.00450428892499054_0.99_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 5, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 1.5, 'gpu': 0, 'hyper': 'Reco1', 'importance_clip': 10.0, 'inc': 32399, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 90, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 29, 'speed': 3, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|5|20:14:30'}

- Speed 4 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.0037403051736555927, 'algo_name': 'ProWLS', 'base': 0, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 3, 'entropy_lambda': 0.041755411764967935, 'env_name': 'NS_Reco', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '629_NS_4_-1_NS_Reco_Fourier_1000_3_60_7_5.0_0.041755411764967935_0.0037403051736555927_0.99_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 7, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 1.5, 'gpu': 0, 'hyper': 'Reco1', 'importance_clip': 5.0, 'inc': 18898, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 60, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 28, 'speed': 4, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|5|20:0:19'}

### ONPG

- Speed 0 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.008425020536482294, 'algo_name': 'ONPG', 'base': 0, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 3, 'entropy_lambda': 0.005900970803997456, 'env_name': 'NS_Reco', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '344_NS_0_-1_NS_Reco_Fourier_1000_3_10.0_0.005900970803997456_0.008425020536482294_0.99_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 7, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 1.5, 'gpu': 0, 'hyper': 'Reco2', 'importance_clip': 10.0, 'inc': 10349, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 150, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 29, 'speed': 0, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|5|18:27:39'}

- Speed 1 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.00870346493652871, 'algo_name': 'ONPG', 'base': 0, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 3, 'entropy_lambda': 0.08983363054352403, 'env_name': 'NS_Reco', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '1041_NS_1_-1_NS_Reco_Fourier_1000_3_5.0_0.08983363054352403_0.00870346493652871_0.99_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 7, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 1.5, 'gpu': 0, 'hyper': 'Reco2', 'importance_clip': 5.0, 'inc': 31259, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 150, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 29, 'speed': 1, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|5|18:30:24'}

- Speed 2 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.009763971172325225, 'algo_name': 'ONPG', 'base': 0, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 5, 'entropy_lambda': 0.08923394226237882, 'env_name': 'NS_Reco', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '1915_NS_2_-1_NS_Reco_Fourier_1000_5_15.0_0.08923394226237882_0.009763971172325225_0.99_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 7, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 1.5, 'gpu': 0, 'hyper': 'Reco2', 'importance_clip': 15.0, 'inc': 57479, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 150, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 29, 'speed': 2, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|5|18:32:51'}

- Speed 3 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.009453953388433488, 'algo_name': 'ONPG', 'base': 0, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 5, 'entropy_lambda': 0.37270764366702447, 'env_name': 'NS_Reco', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '1994_NS_3_-1_NS_Reco_Fourier_1000_5_5.0_0.37270764366702447_0.009453953388433488_0.99_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 7, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 1.5, 'gpu': 0, 'hyper': 'Reco2', 'importance_clip': 5.0, 'inc': 59848, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 150, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 28, 'speed': 3, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|5|18:32:26'}

- Speed 4 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.008752955454770457, 'algo_name': 'ONPG', 'base': 0, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 5, 'entropy_lambda': 0.12422425451548592, 'env_name': 'NS_Reco', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '617_NS_4_-1_NS_Reco_Fourier_1000_5_10.0_0.12422425451548592_0.008752955454770457_0.99_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 7, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 1.5, 'gpu': 0, 'hyper': 'Reco2', 'importance_clip': 10.0, 'inc': 18538, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 150, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 28, 'speed': 4, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|5|18:29:23'}

### FTRL-PG (OFPG)

- Speed 0 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.0008539732050307968, 'algo_name': 'OFPG', 'base': 0, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 1, 'entropy_lambda': 0.042883020256458786, 'env_name': 'NS_Reco', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '1332_NS_0_-1_NS_Reco_Fourier_1000_1_20_15.0_0.042883020256458786_0.0008539732050307968_0.99_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 7, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 1.5, 'gpu': 0, 'hyper': 'Reco3', 'importance_clip': 15.0, 'inc': 39989, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 20, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 29, 'speed': 0, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|5|18:50:31'}


- Speed 1 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.002921886362032286, 'algo_name': 'OFPG', 'base': 0, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 1, 'entropy_lambda': 0.005132349330550144, 'env_name': 'NS_Reco', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '1284_NS_1_-1_NS_Reco_Fourier_1000_1_20_5.0_0.005132349330550144_0.002921886362032286_0.99_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 7, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 1.5, 'gpu': 0, 'hyper': 'Reco3', 'importance_clip': 5.0, 'inc': 38548, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 20, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 28, 'speed': 1, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|5|18:50:26'}

- Speed 2 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.0021758107191788777, 'algo_name': 'OFPG', 'base': 0, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 5, 'entropy_lambda': 0.0820027845863337, 'env_name': 'NS_Reco', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '981_NS_2_-1_NS_Reco_Fourier_1000_5_100_15.0_0.0820027845863337_0.0021758107191788777_0.99_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 7, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 1.5, 'gpu': 0, 'hyper': 'Reco3', 'importance_clip': 15.0, 'inc': 29459, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 100, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 29, 'speed': 2, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|5|18:50:34'}

- Speed 3 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.002251439950474301, 'algo_name': 'OFPG', 'base': 0, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 1, 'entropy_lambda': 0.11057727559233214, 'env_name': 'NS_Reco', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '1608_NS_3_-1_NS_Reco_Fourier_1000_1_30_10.0_0.11057727559233214_0.002251439950474301_0.99_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 7, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 1.5, 'gpu': 0, 'hyper': 'Reco3', 'importance_clip': 10.0, 'inc': 48268, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 30, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 28, 'speed': 3, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|5|18:53:56'}

- Speed 4 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.005252014538415403, 'algo_name': 'OFPG', 'base': 0, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 1, 'entropy_lambda': 0.08551739420792737, 'env_name': 'NS_Reco', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '1320_NS_4_-1_NS_Reco_Fourier_1000_1_20_10.0_0.08551739420792737_0.005252014538415403_0.99_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 7, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 1.5, 'gpu': 0, 'hyper': 'Reco3', 'importance_clip': 10.0, 'inc': 39629, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 20, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 29, 'speed': 4, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|5|18:50:26'}
