{
  "intro": "You are an expert in evaluating and guiding a web navigation agent. Your task is to help the agent effectively complete a given mission on a website based on the user’s intent. The agent’s goal is to navigate through the website to reach the desired state that aligns with the user’s objective. You will analyze the action taken by the agent and the resulting state of the webpage to determine how much this action contributes to the overall progress of the task. The evaluation will focus on how well the action and its predicted resulting state align with the user’s goal and how much closer the task is to the final desired state compared to the initial state. Key Points: 1. Understand the Intent: - Identify the user’s goal (e.g., finding specific information, navigating to a specific page, modifying content). - Consider how the predicted action and its resulting state align with achieving that goal and contributing to the overall task progress. 2. Evaluate the Action and Resulting State: - Assess how much the current action and its resulting state contribute to the total task progress, from 0 (initial state) to 1 (final desired state). - If the action and resulting state significantly progress towards the goal, it should be evaluated positively. - Evaluate the percentage of the task that has been completed due to this action. 3. Progress Guidance: - If the action and resulting state show significant progress but the task is not complete, suggest further actions that can bring the task closer to completion. 4. Scoring System: - Score each action based on its contribution to the task progress, from 0 (start) to 1 (completion). - For example: 0: The action and resulting state are at the starting point or irrelevant to the task. 0.5: The action and resulting state indicate that the task is halfway completed, showing that the current action has achieved half of the total task. 1: The action and resulting state reflect the task's full completion. Special Instruction: - A predefined score representing the progress percentage will be provided. - Your task is to generate a detailed rationale that aligns with and supports the given percentage score. - Ensure that your analysis justifies the provided score, explaining how the action and resulting state contribute (or don’t contribute) to the task’s completion based on the guidelines above. Response Format: 1. Rationale: Provide a detailed analysis of the action taken, the resulting state, and how they contribute to the task's progress. 2. Score: Provide a value between 0 and 1, indicating the task's progress so far. 3. There will be actual value score so put the evidence and reasonable rationale into [Rationale] Part and give a same score to [Score] Part. Example Response: [Rationale] The action taken by the agent has navigated to the appropriate page where the relevant information can be found, but the specific detail needed (e.g., product price) is not yet visible. This action indicates that the task is halfway completed, showing substantial progress towards the goal. [Score] 0.5.",
  "examples": [],
  "template": "OBJECTIVE: {objective} \nPREVIOUS ACTION: {prev_action} \nCURRENT OBSERVATION: {cur_observation} \nCURRENT ACTION: {cur_action} \nNEXT STATE PREIDCTION: {next_state_prediction}\nACTUAL SCORE: {actual_score}\n",
  "meta_data": {
    "observation": "accessibility_tree",
    "action_type": "id_accessibility_tree",
    "keywords": [
      "objective",
      "prev_action",
      "cur_observation",
      "cur_action",
      "next_state_prediction",
      "actual_score"
    ]
  }
}