Value of using policy of expert 0 in environment 1: -0.835941
Value of using policy of expert 1 in environment 0: -1.354937