The Python environment is {task_obs_code_string}. Write a reward function for the following task: {task_description}.