Good settings found by the hyper-parameter search for the diabetes domain (`NS_SimGlucose-v0`), listed per algorithm and per speed.

The relevant hyper-parameters are:

   - actor_lr
   - delta
   - entropy_lambda
   - fourier_k
   - gauss_std
   - importance_clip
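
Since each entry below is a Python dict literal, the relevant hyper-parameters listed above can be pulled out programmatically. The following is only a minimal sketch: the helper name `parse_setting` and the constant `RELEVANT_KEYS` are hypothetical (they are not part of the experiment code), and the example line is a shortened copy of the ProOLS, Speed 0 entry with most keys omitted. It assumes the key is spelled `importance_clip`, as it appears in the dictionaries.

```python
import ast

# Hypothetical constant: the relevant hyper-parameters, spelled as dict keys.
RELEVANT_KEYS = ("actor_lr", "delta", "entropy_lambda",
                 "fourier_k", "gauss_std", "importance_clip")


def parse_setting(line):
    """Parse one '- Speed N :  {...}' line into (speed, relevant_params)."""
    # Everything before the first '{' is the label; the rest is the dict literal.
    label, _, dict_text = line.partition("{")
    # literal_eval only accepts Python literals, so no code is executed.
    config = ast.literal_eval("{" + dict_text.strip())
    # Label looks like '- Speed 0 :', so take the number between 'Speed' and ':'.
    speed = int(label.split("Speed")[1].split(":")[0])
    relevant = {key: config[key] for key in RELEVANT_KEYS}
    return speed, relevant


# Shortened example line; values copied from the ProOLS, Speed 0 entry.
example = ("- Speed 0 :  {'actor_lr': 0.007232964846409767, "
           "'algo_name': 'ProOLS', 'delta': 1, 'entropy_lambda': 0.1, "
           "'fourier_k': 2, 'gauss_std': 1.85938, "
           "'importance_clip': 5.0, 'speed': 0}")

speed, params = parse_setting(example)
print(speed, params)
```

Using `ast.literal_eval` rather than `eval` keeps the parsing restricted to literals, which is enough here because every value in the entries is a string, number, or boolean.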

-----------------


### ProOLS

- Speed 0 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.007232964846409767, 'algo_name': 'ProOLS', 'base': 0, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 1, 'entropy_lambda': 0.1, 'env_name': 'NS_SimGlucose-v0', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '75_NS_0_-1_NS_SimGlucose-v0_Fourier_1000_1_20_2_5.0_0.007232964846409767_0.99_1.85938_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 2, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 1.85938, 'gpu': 0, 'hyper': 0, 'importance_clip': 5.0, 'inc': 750, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 20, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 0, 'speed': 0, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|3|19:53:2'}

- Speed 1 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.0037199417817850235, 'algo_name': 'ProOLS', 'base': 1000, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 5, 'entropy_lambda': 0.1, 'env_name': 'NS_SimGlucose-v0', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '198_NS_1_-1_NS_SimGlucose-v0_Fourier_1000_5_100_4_5.0_0.0037199417817850235_0.99_1.38886_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 4, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 1.38886, 'gpu': 0, 'hyper': 0, 'importance_clip': 5.0, 'inc': 984, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 100, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 4, 'speed': 1, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|4|9:7:41'}

- Speed 2 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.0014243000759653686, 'algo_name': 'ProOLS', 'base': 1000, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 1, 'entropy_lambda': 0.1, 'env_name': 'NS_SimGlucose-v0', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '115_NS_2_-1_NS_SimGlucose-v0_Fourier_1000_1_30_4_5.0_0.0014243000759653686_0.99_1.95574_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 4, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 1.95574, 'gpu': 0, 'hyper': 0, 'importance_clip': 5.0, 'inc': 152, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 30, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 2, 'speed': 2, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|4|9:7:54'}

- Speed 3 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.006874194498027416, 'algo_name': 'ProOLS', 'base': 0, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 5, 'entropy_lambda': 0.1, 'env_name': 'NS_SimGlucose-v0', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '5_NS_3_-1_NS_SimGlucose-v0_Fourier_1000_5_100_4_5.0_0.006874194498027416_0.99_1.47682_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 4, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 1.47682, 'gpu': 0, 'hyper': 0, 'importance_clip': 5.0, 'inc': 53, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 100, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 3, 'speed': 3, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|3|19:52:38'}

- Speed 4 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.004401977234833837, 'algo_name': 'ProOLS', 'base': 0, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 1, 'entropy_lambda': 0.1, 'env_name': 'NS_SimGlucose-v0', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '63_NS_4_-1_NS_SimGlucose-v0_Fourier_1000_1_30_2_10.0_0.004401977234833837_0.99_1.99427_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 2, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 1.99427, 'gpu': 0, 'hyper': 0, 'importance_clip': 10.0, 'inc': 637, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 30, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 7, 'speed': 4, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|3|19:52:53'}



### ProWLS

- Speed 0 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.007456131609137178, 'algo_name': 'ProWLS', 'base': 3000, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 3, 'entropy_lambda': 0.1, 'env_name': 'NS_SimGlucose-v0', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '368_NS_0_-1_NS_SimGlucose-v0_Fourier_1000_3_90_4_5.0_0.007456131609137178_0.99_1.04963_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 4, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 1.04963, 'gpu': 0, 'hyper': 'Diabetes1', 'importance_clip': 5.0, 'inc': 680, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 90, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 0, 'speed': 0, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|6|23:40:25'}

- Speed 1 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.009011684354587917, 'algo_name': 'ProWLS', 'base': 1000, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 1, 'entropy_lambda': 0.1, 'env_name': 'NS_SimGlucose-v0', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '182_NS_1_-1_NS_SimGlucose-v0_Fourier_1000_1_30_4_5.0_0.009011684354587917_0.99_1.04377_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 4, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 1.04377, 'gpu': 0, 'hyper': 1, 'importance_clip': 5.0, 'inc': 820, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 30, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 0, 'speed': 1, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|4|9:7:49'}

- Speed 2 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.005051869880014062, 'algo_name': 'ProWLS', 'base': 1000, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 3, 'entropy_lambda': 0.1, 'env_name': 'NS_SimGlucose-v0', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '294_NS_2_-1_NS_SimGlucose-v0_Fourier_1000_3_60_4_10.0_0.005051869880014062_0.99_0.9251_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 4, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 0.9251, 'gpu': 0, 'hyper': 1, 'importance_clip': 10.0, 'inc': 1949, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 60, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 9, 'speed': 2, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|4|9:7:54'}

- Speed 3 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.0030218852099209276, 'algo_name': 'ProWLS', 'base': 1000, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 3, 'entropy_lambda': 0.1, 'env_name': 'NS_SimGlucose-v0', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '253_NS_3_-1_NS_SimGlucose-v0_Fourier_1000_3_90_3_5.0_0.0030218852099209276_0.99_0.78892_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 3, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 0.78892, 'gpu': 0, 'hyper': 1, 'importance_clip': 5.0, 'inc': 1530, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 90, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 0, 'speed': 3, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|4|9:7:54'}

- Speed 4 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.0074984506903203125, 'algo_name': 'ProWLS', 'base': 3000, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 1, 'entropy_lambda': 0.1, 'env_name': 'NS_SimGlucose-v0', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '376_NS_4_-1_NS_SimGlucose-v0_Fourier_1000_1_30_3_10.0_0.0074984506903203125_0.99_0.89098_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 3, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 0.89098, 'gpu': 0, 'hyper': 'Diabetes1', 'importance_clip': 10.0, 'inc': 766, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 30, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 6, 'speed': 4, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|6|23:40:4'}

### ONPG

- Speed 0 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.006636403756913685, 'algo_name': 'ONPG', 'base': 3000, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 5, 'entropy_lambda': 0.1, 'env_name': 'NS_SimGlucose-v0', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '423_NS_0_-1_NS_SimGlucose-v0_Fourier_1000_5_5.0_0.006636403756913685_0.99_1.30953_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 7, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 1.30953, 'gpu': 0, 'hyper': 'Diabetes2', 'importance_clip': 5.0, 'inc': 1237, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 150, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 7, 'speed': 0, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|7|3:21:39'}

- Speed 1 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.007108089210642577, 'algo_name': 'ONPG', 'base': 3000, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 5, 'entropy_lambda': 0.1, 'env_name': 'NS_SimGlucose-v0', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '475_NS_1_-1_NS_SimGlucose-v0_Fourier_1000_5_10.0_0.007108089210642577_0.99_2.45818_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 7, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 2.45818, 'gpu': 0, 'hyper': 'Diabetes2', 'importance_clip': 10.0, 'inc': 1754, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 150, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 4, 'speed': 1, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|7|3:25:55'}

- Speed 2 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.009136345499705897, 'algo_name': 'ONPG', 'base': 3000, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 5, 'entropy_lambda': 0.1, 'env_name': 'NS_SimGlucose-v0', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '351_NS_2_-1_NS_SimGlucose-v0_Fourier_1000_5_5.0_0.009136345499705897_0.99_1.68407_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 7, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 1.68407, 'gpu': 0, 'hyper': 'Diabetes2', 'importance_clip': 5.0, 'inc': 519, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 150, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 9, 'speed': 2, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|7|3:1:3'}

- Speed 3 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.006458744267200776, 'algo_name': 'ONPG', 'base': 3000, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 5, 'entropy_lambda': 0.1, 'env_name': 'NS_SimGlucose-v0', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '454_NS_3_-1_NS_SimGlucose-v0_Fourier_1000_5_10.0_0.006458744267200776_0.99_2.46875_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 7, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 2.46875, 'gpu': 0, 'hyper': 'Diabetes2', 'importance_clip': 10.0, 'inc': 1549, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 150, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 9, 'speed': 3, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|7|3:24:28'}

- Speed 4 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.009685946972421793, 'algo_name': 'ONPG', 'base': 0, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 1, 'entropy_lambda': 0.1, 'env_name': 'NS_SimGlucose-v0', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '86_NS_4_-1_NS_SimGlucose-v0_Fourier_1000_1_0.009685946972421793_0.99_2.32306_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 3, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 2.32306, 'gpu': 0, 'hyper': 2, 'importance_clip': 10.0, 'inc': 868, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 10, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 8, 'speed': 4, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|3|23:23:44'}

### FTRL-PG (OFPG)

- Speed 0 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.004338616144522951, 'algo_name': 'OFPG', 'base': 3000, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 3, 'entropy_lambda': 0.1, 'env_name': 'NS_SimGlucose-v0', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '357_NS_0_-1_NS_SimGlucose-v0_Fourier_1000_3_60_5.0_0.004338616144522951_0.99_1.87875_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 7, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 1.87875, 'gpu': 0, 'hyper': 'Diabetes3', 'importance_clip': 5.0, 'inc': 579, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 60, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 9, 'speed': 0, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|7|3:41:7'}

- Speed 1 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.004542294448784554, 'algo_name': 'OFPG', 'base': 1000, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 5, 'entropy_lambda': 0.1, 'env_name': 'NS_SimGlucose-v0', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '132_NS_1_-1_NS_SimGlucose-v0_Fourier_1000_5_100_5.0_0.004542294448784554_0.99_2.06337_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 3, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 2.06337, 'gpu': 0, 'hyper': 3, 'importance_clip': 5.0, 'inc': 329, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 100, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 9, 'speed': 1, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|4|12:53:50'}

- Speed 2 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.00532767363403272, 'algo_name': 'OFPG', 'base': 3000, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 5, 'entropy_lambda': 0.1, 'env_name': 'NS_SimGlucose-v0', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '380_NS_2_-1_NS_SimGlucose-v0_Fourier_1000_5_150_10.0_0.00532767363403272_0.99_1.90233_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 7, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 1.90233, 'gpu': 0, 'hyper': 'Diabetes3', 'importance_clip': 10.0, 'inc': 809, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 150, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 9, 'speed': 2, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|7|3:45:5'}

- Speed 3 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.007226159368567259, 'algo_name': 'OFPG', 'base': 3000, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 5, 'entropy_lambda': 0.1, 'env_name': 'NS_SimGlucose-v0', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '454_NS_3_-1_NS_SimGlucose-v0_Fourier_1000_5_100_5.0_0.007226159368567259_0.99_2.01883_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 7, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 2.01883, 'gpu': 0, 'hyper': 'Diabetes3', 'importance_clip': 5.0, 'inc': 1548, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 100, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 8, 'speed': 3, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|7|3:52:15'}

- Speed 4 :  {'NN_basis_dim': '32', 'Policy_basis_dim': '32', 'actor_lr': 0.002971209546693609, 'algo_name': 'OFPG', 'base': 3000, 'batch_size': 1000, 'buffer_size': 1000, 'debug': False, 'delta': 5, 'entropy_lambda': 0.1, 'env_name': 'NS_SimGlucose-v0', 'experiment': 'NS', 'extrapolator_basis': 'Fourier', 'folder_suffix': '400_NS_4_-1_NS_SimGlucose-v0_Fourier_1000_5_100_5.0_0.002971209546693609_0.99_2.17747_True_3_1000_1000_100_rmsprop_True_False_False_False_term_0', 'fourier_coupled': True, 'fourier_k': 7, 'fourier_order': 3, 'gamma': 0.99, 'gauss_std': 2.17747, 'gpu': 0, 'hyper': 'Diabetes3', 'importance_clip': 5.0, 'inc': 1009, 'log_output': 'term', 'max_episodes': 1000, 'max_inner': 100, 'max_steps': 500, 'optim': 'rmsprop', 'oracle': -1, 'raw_basis': True, 'restore': False, 'save_count': 100, 'save_model': False, 'seed': 9, 'speed': 4, 'state_lr': 0.001, 'summary': True, 'swarm': True, 'timestamp': '8|7|3:47:44'}
