developer_message = """\
    You are a helpful assistant to generate yes-no questions about some factors in a scenario. 
    You will be provided with a short description of a scenario and some factors that should be focused on. 
    You should generate **ONE** yes-no question for **EACH** of the factors in the scenario. 

    Detailed requirements for each question:
    1. The question must ask the model to directly OBSERVE the factor in the video, not to infer or guess it from actions, effects, or indirect evidence. 
    2. Use a **clear imperative statement** starting with "Please determine whether...".
    3. Each question must provide **explicit visual cues** that guide the judgment of the factor (e.g., material type, surface texture, cracks, transparency, color, shape). 
       - Focus only on the object’s own visible characteristics.
       - Do not use indirect evidence such as how the object behaves after impact, how other objects respond, or sound effects.
    4. The question must explicitly remind: if the video is unclear, incomplete, or the factor cannot be observed, the correct answer is 'NaN'.
    5. Do not mention or assume other factors. Only focus on the target factor itself.
    6. Questions should be simple, clear, and concrete, but also detailed enough so the model knows it must rely only on visual evidence.
    7. Do NOT include any specific time durations (e.g., 'within 3 seconds', 'for 5 frames').
    8. If the factor’s description explicitly states when to answer TRUE or FALSE, you must directly append that description at the end of the generated question. However, you must modify any explicit numeric time or frame references (e.g., "within 3 seconds", "for 2 seconds", "5 frames") into non-numeric temporal expressions such as "briefly", "quickly after contact", "continuously", or "sustained". This ensures the TRUE/FALSE criteria are fully preserved while avoiding specific numeric durations.


    Examples:

    Factor: "Ball is fragile"  
    Question: "Please determine whether there are clear visual cues that the ball is fragile. For example, you may judge fragility from: (1) the apparent material of the ball — if it looks like glass or thin ceramic it is fragile, while plastic, wood, or metal usually indicates it is not fragile; (2) the presence of visible cracks, chips, or surface lines that suggest breakability; (3) a glossy, brittle, or semi-transparent texture often associated with fragile materials. Answer only based on what is directly visible in the video. If there is no clear evidence to answer this question, or if the view is unclear or incomplete, answer 'NaN'."

    Factor: "Ground is hard"  
    Question: "Please determine whether the ground appears hard based only on its directly visible characteristics. For example, you may look for: (1) the material type, such as concrete, stone, or tile; (2) a rigid and non-flexible surface texture; (3) sharp edges, straight lines, or rough finishes typical of solid ground. Answer only based on what is directly visible in the video. If there is no clear evidence to answer this question, or if the view is unclear or incomplete, answer 'NaN'."

    Follow this style for all factors.
    """
from .prompts_sample import factors_to_str, dedent
from prompt_generate.util import Factor

def get_user_message(scenario:str, factors: list[Factor]) -> str:
    user_message = f"""\
    The scenario is {scenario}.
    
    The factors are as follows, where the description tells the detailed definition of the variables and how to determine the Boolean value from a piece of video and the explanation tells how the variable affects the scenario.
    
    ```
    {factors_to_str(factors)}
    ```
    """
    return dedent(user_message)

