CR_Answer: |
  Context Reliance (CR)
  Definition:
  Measures the agent’s ability to accurately use and respond to contextually available information, and to avoid generating information that contradicts to the provided context. 
  This includes:
  • facts explicitly or implicitly stated in the prompt (e.g., persona, scenario, dialogue instructions, reply strategy)
  • ongoing dialogue history
  • memory elements

  The agent should integrate this information into its responses appropriately, without hallucinating or contradicting provided context.
  
  Example 1:
  - Persona: A seasoned knight in a medieval fantasy world, tasked with protecting a young prince.
  - Context:
    - User: So you’ve been guarding the prince since he was a child?
    - (Earlier prompt mentions: "The knight has protected Prince Leoric since his early childhood.")
  - Common Mistake:
    - Agent: Actually, I just got assigned to his detail last month. (Contradicts prompt-provided fact)
  - Correct Response:
    - Agent: Yes, since he was barely old enough to walk. I’ve watched him grow into the man he is today—headstrong, but with a good heart.


CR_Question: |
  Context Reliance (CR)
  Definition:
  Design your next utterance (as a question) to **test the other character's ability to understand and rely on contextual information**.
  This includes:
  • the character's persona and background,
  • the current situation or scene,
  • earlier parts of the conversation,
  • memory elements and world events.

  Your question should:
  • Encourage the other character to refer to past events, relationships, or shared knowledge.
  • Avoid direct repetition of earlier lines—use natural conversation flow.
  • Not break character or shift to meta-commentary.

  Example 1:
  - Context:
    - The character you’re speaking to has guarded a prince since childhood.
    - The scene is about planning the prince’s future.
  - Good Question:
    - “Given how long you’ve protected him, do you think he's truly ready to lead?”

  Example 2:
  - Context:
    - Your partner previously mentioned a traumatic war memory.
  - Good Question:
    - “Do nights like this still remind you of what happened at Blackridge?”

  Your goal is to naturally prompt the other character to **draw on contextual knowledge** in their reply.


FR_Answer: |
  Factual Recall (FR)
  Definition:
  Assesses the agent’s ability to recall and apply accurate world knowledge not stated in the prompt but assumed to be part of general pretraining or public domain knowledge, while avoiding hallucinations.
  This includes:
  • facts about public IPs (e.g., Hogwarts houses, lightsaber mechanics),
  • implicit setting details known to fans or readers.
  • basic common sense under the world view (e.g., what people in the modern world look like, people in the fantasy world can use magic)
  
  Example 1:
  - Persona: Harry Potter
  - Context:
    - User: Harry, I still can’t believe you were in Hufflepuff. You always seemed so brave!
  - Common Mistake:
    - Agent: Thanks! I tried to live up to Hufflepuff’s values. (Although it is not in the prompt, it is commonsense for Harry Potter)
  - Correct Response:
    - Agent: Hufflepuff? Nah—I was in Gryffindor. Brave, maybe. But definitely not known for patience.

FR_Question: |
  Factual Recall (FR)
  Definition:
  Design your next utterance (as a question) to **test the other character’s grasp of world facts or commonsense knowledge** that are not explicitly stated in the current dialogue or prompt.
  This includes:
  • well-known facts from public IPs or cultural references,
  • implied details that fans or insiders would know,
  • basic in-universe logic and background knowledge.

  Your question should:
  • Touch on specific facts or background elements expected to be known by the character.
  • Avoid trivia unless relevant to the situation.
  • Stay in-character and natural.

  Example 1:
  - Context:
    - You’re speaking to Harry Potter in the wizarding world.
  - Good Question:
    - “What was it like being in Gryffindor with Hermione and Ron? Did you all sit together during meals?”

  Example 2:
  - Context:
    - You’re in a sci-fi setting; the character is a space engineer.
  - Good Question:
    - “Does the gravity on Mars really mess with your joints after a long stay?”

  Your goal is to invite the other character to recall and confirm key world facts that are part of the shared background or canon.


RR_Answer: |
  Reflective Reasoning (RR)
  Definition:
  Measures the agent’s ability to reason and self‑reflect in a plausibly human way. The agent can:
  • offer concise, coherent explanations for its opinions or actions
  • acknowledge uncertainty or error
  • update its stance when presented with new evidence
  • articulate short “thought processes” or rationales that feel natural and believable to humans (without requiring full chain‑of‑thought disclosure)
  
  Example 1:
  - Persona: AI brainstorming partner
  - Context:
    - User: You suggested targeting Arctic explorers. Isn’t that a bit niche?
  - Common Mistake:
    - Agent: That’s still a strong strategy—it emphasizes extreme conditions. (Overcommits without reflection)
  - Correct Response:
    - Agent: You’re right, it might be too niche. I was thinking about durability in extreme weather, but perhaps focusing on hikers would give us broader reach. Thanks for pointing that out.


RR_Question: |
  Reflective Reasoning (RR)
  Definition:
  Design your next utterance (as a question) to **encourage the other character to reflect on their actions, beliefs, or decisions**.
  This includes:
  • asking for short justifications,
  • prompting reconsideration or new perspective,
  • exploring possible trade-offs or doubts.

  Your question should:
  • Invite natural introspection without demanding over-explaining.
  • Fit smoothly into character and situation.
  • Be open-ended enough to allow a reflective answer.

  Example 1:
  - Context:
    - The character just chose a risky plan.
  - Good Question:
    - “Are you sure this is the only way? What made you so confident it’ll work?”

  Example 2:
  - Context:
    - The character refused to help a friend.
  - Good Question:
    - “Don’t you think they needed you, even if they didn’t ask directly?”

  Your goal is to prompt a plausible, human-like reflection or adjustment in the next response.


CA_Answer: |
  Conversational Ability (CA)
  Definition:
  Evaluates the agent’s overall ability to engage in the whole dynamic and natural dialogue. This includes:
  • maintaining coherent persona behavior and emotional consistency
  • tracking who is speaking to whom in multi-party conversations
  • recognizing when to respond or remain silent
  • advancing stalled dialogue naturally through topic shifts, questions, or prompts

  Example 1:
  - Context: Group chat with User A (emotional), User B (casual), and Agent (Bot).
    - User A: (crying)
    - User B: Hey, Bot, gimme a beer!
    - User A: (crying more)
  - Common Mistake:
    - Agent: Here’s your beer, B! (Fails to prioritize emotional cue from A)
  - Correct Response:
    - Agent: Hey B, one sec—A, are you okay? Want to talk about what’s going on? (Emotion-aware and participant-sensitive)


CA_Question: |
  Conversational Ability (CA)
  Definition:
  Design your next utterance (as a question or statement) to **naturally advance or balance the ongoing multi-turn dialogue**.
  This includes:
  • keeping the dialogue fluid and engaging,
  • encouraging quieter characters to participate,
  • shifting topics or injecting energy when needed.

  Your question should:
  • Be responsive to the emotional and social tone,
  • Show awareness of who has spoken and who hasn’t,
  • Either deepen the current thread or smoothly open a new one.

  Example 1:
  - Context:
    - A group conversation is happening, but one character is quiet.
  - Good Question:
    - “You’ve been quiet, Mira. What do you think about all this?”

  Example 2:
  - Context:
    - The conversation has hit a lull after a heavy moment.
  - Good Question:
    - “Anyway… remember that time we all got locked out of the tavern?”

  Your goal is to demonstrate skillful conversational flow management through your next line.


PA_Answer: |
  Preference Alignment (PA)
  Definition:
  Assesses how well the agent aligns with human conversational preferences and expectations. This includes:
  • avoiding repetition, generic or robotic phrasing(obvious templating), awkward logic
  • producing emotionally resonant, empathetic, or humorous replies when appropriate
  • sound more human-like in tone and word order, making them less AI feeling
  
  Example 1:
  - Persona: Supportive friend
  - Context:
    - User: I finally got that promotion I worked so hard for!
  - Common Mistake:
    - Agent: Congratulations. That is good. (Flat tone, robotic phrasing)
  - Correct Response:
    - Agent: That’s amazing! You totally deserve it—I’m so happy for you! All that effort paid off big time!(Empathetic, natural, socially intuitive)

PA_Question: |
  Preference Alignment (PA)
  Definition:
  Design your next utterance (as a question) to **invite a reply that allows for emotional resonance, empathy, or humor**—in other words, responses that feel naturally human and socially attuned.
  This includes:
  • encouraging the other character to express relatable emotions,
  • creating openings for bonding, banter, or warmth,
  • avoiding robotic or templated structures.

  Your question should:
  • Create an opportunity for a sincere, personal, or witty answer.
  • Reflect the speaker’s tone and emotional intelligence.
  • Feel like something a human would genuinely say in context.

  Example 1:
  - Context:
    - The character just succeeded at something difficult.
  - Good Question:
    - “You must feel incredible right now—what’s going through your head?”

  Example 2:
  - Context:
    - You’re teasing a close companion after a shared ordeal.
  - Good Question:
    - “So, are you finally admitting that I was right all along?”

  Your goal is to open space for natural, emotionally resonant responses that align with human conversational preferences.