LABELING_PROMPT = """Please act as an impartial judge and evaluate the quality of the responses provided by four AI assistants to the user question displayed below.
You should rank the assistants based on how well each follows the user's instructions and answers the user's question.
Your evaluation should consider factors such as helpfulness, relevance, accuracy, depth, creativity, level of detail, and harmlessness of their responses.
Begin your evaluation by briefly comparing all four responses (A, B, C, and D). Provide a concise explanation highlighting the strengths and weaknesses of each response.
Avoid any positional biases and ensure that the order in which the responses were presented does not influence your decision. 
Be as objective and impartial as possible.

[Format]
Output your final ranking by strictly following this format. Do not modify the tags, and be sure to generate the answer between the tags:
<thought>
Brief explanation of your ranking
</thought>
<ranking_1>
Letter of the best assistant
</ranking_1>
<ranking_2>
Letter of the second best assistant
</ranking_2>
<ranking_3>
Letter of the third best assistant
</ranking_3>
<ranking_4>
Letter of the worst assistant
</ranking_4>

[Conversation between User and Assistant]
{conversation}

[Response from Assistant A]
{assistant_a}

[Response from Assistant B]
{assistant_b}

[Response from Assistant C]
{assistant_c}

[Response from Assistant D]
{assistant_d}
"""

LABELING_PROMPT_RETRY = """Please act as an impartial judge and evaluate the quality of the responses provided by four AI assistants to the user question displayed below.
You should rank the assistants based on how well each follows the user's instructions and answers the user's question.
Your evaluation should consider factors such as helpfulness, relevance, accuracy, depth, creativity, level of detail, and harmlessness of their responses.
Begin your evaluation by briefly comparing all four responses (A, B, C, and D). Provide a concise explanation highlighting the strengths and weaknesses of each response.
Avoid any positional biases and ensure that the order in which the responses were presented does not influence your decision. 
Be as objective and impartial as possible.

IMPORTANT: You must always output the rank in the correct format.

[Format]
Output your final ranking by strictly following this format. Do not modify the tags, and be sure to generate the answer between the tags:
<thought>
Brief explanation of your ranking
</thought>
<ranking_1>
Letter of the best assistant
</ranking_1>
<ranking_2>
Letter of the second best assistant
</ranking_2>
<ranking_3>
Letter of the third best assistant
</ranking_3>
<ranking_4>
Letter of the worst assistant
</ranking_4>

[Conversation between User and Assistant]
{conversation}

[Response from Assistant A]
{assistant_a}

[Response from Assistant B]
{assistant_b}

[Response from Assistant C]
{assistant_c}

[Response from Assistant D]
{assistant_d}
"""

LABELING_PROMPT_TWO = """Please act as an impartial judge and evaluate the quality of the responses provided by four AI assistants to the user question displayed below.
You should rank the assistants based on how well each follows the user's instructions and answers the user's question.
Your evaluation should consider factors such as helpfulness, relevance, accuracy, depth, creativity, level of detail, and harmlessness of their responses.
Begin your evaluation by briefly comparing two responses (A, B). Provide a concise explanation highlighting the strengths and weaknesses of each response.
Avoid any positional biases and ensure that the order in which the responses were presented does not influence your decision. 
Be as objective and impartial as possible.

[Format]
Output your final ranking by strictly following this format. Do not modify the tags, and be sure to generate the answer between the tags:
<thought>
Brief explanation of your ranking
</thought>
<ranking_1>
Letter of the best assistant
</ranking_1>
<ranking_2>
Letter of the second best assistant
</ranking_2>

[Conversation between User and Assistant]
{conversation}

[Response from Assistant A]
{assistant_a}

[Response from Assistant B]
{assistant_b}
"""
