name: combined_eval_prompt
chinese: |-
  # 系统信息
  {env_info}

  # 背景说明
  - 这是一个agent与customer之间的对话场景
  - <summary>中的内容为agent对customer需求的完成情况的总结
  - <final_state>是一个在测试用例运行过程中收集到的状态信息，当为空时表示运行结束后无状态信息产生，此时只参考<summary>
  - <expected_states>是预期对话结束时的系统状态（包含订单信息等，required_orders为必须满足，optional_orders为required_orders的替代可选参考）和对应状态是否满足的判定条件(state_rubrics)
  - <overall_rubrics>是对整体任务完成情况的判定条件

  # 任务
  - 以<final_state>和<summary>为依据，逐个评价<expected_states>中的rubrics和<overall_rubrics>是否被满足。
  - 如果<final_state>为空，仅依据<summary>判断

  # 格式要求
  - 你的回复应为一个JSON数组，每条rubric对应数组中的一个对象，每个对象包含以下字段：
  - `rubrics`：对rubric的复述
  - `reasoning`：对你结果判断的简要解释
  - `meetExpectation`：如果rubrics被满足则为`true`，否则为`false`

  # 示例输入结构：
  <final_state>xxx</final_state>
  <summary>xxx</summary>
  <expected_states>xxx</expected_states>
  <overall_rubrics>xxx</overall_rubrics>

  # 示例回复结构：
  ```json
  [
    {{
        "rubrics": "<复述rubrics>",
        "reasoning": "<reasoning trace>",
        "meetExpectation": <false or true>
    }}
    ...
  ]
  ```

english: |-
  # System Information
  {env_info}

  # Background
  - This is a conversation scenario between an agent and a customer
  - <summary> contains the agent's summary of how well the customer's needs were met
  - <final_state> is the state information collected while the test case ran. If it is empty, no state information was produced by the end of the run; in that case, rely on <summary> alone
  - <expected_states> describes the expected system state at the end of the conversation (order information and so on; required_orders must be satisfied, and optional_orders are acceptable alternatives to required_orders), together with the criteria for deciding whether each state is satisfied (state_rubrics)
  - <overall_rubrics> are the criteria for judging the overall task completion

  # Task
  - Based on <final_state> and <summary>, evaluate whether each rubric in <expected_states> and <overall_rubrics> is satisfied.
  - If <final_state> is empty, base your judgment on <summary> alone

  # Format Requirements
  - Your response should be a JSON array with one object per rubric, each object containing the following fields:
  - `rubrics`: restatement of the rubric
  - `reasoning`: brief explanation of your judgment
  - `meetExpectation`: `true` if the rubric is satisfied, otherwise `false`

  # Example Input Structure:
  <final_state>xxx</final_state>
  <summary>xxx</summary>
  <expected_states>xxx</expected_states>
  <overall_rubrics>xxx</overall_rubrics>

  # Example Response Structure:
  ```json
  [
    {{
        "rubrics": "<restate rubrics>",
        "reasoning": "<reasoning trace>",
        "meetExpectation": <false or true>
    }}
    ...
  ]
  ```
