GREEDY_MARGINAL_NUM_GUIDE_PROMPT = """Generate more samples where the value of {v} is in the range [{best}]; """


GREEDY_MARGINAL_CAT_GUIDE_PROMPT = """Generate more samples where {v} is {best}; """


GREEDY_MARGINAL_GUIDE_PROMPT = """Generate more samples where {dim} tendencies are as follows: {desc}; """


GREEDY_COPULA_GUIDE_PROMPT = """For variable combination {var_comb}, generate more samples have a value combination of {comb}"""


GENERATION_COPULA_GUIDE_C_COT_BATCH_PROMPT = """## Output Format:
```json
{{
  "n_plans": n,
  "plan1": {{
    "reason": ..., # Here, you need to explain the realistic meaning of this plan and how it follows the provided Guidance.
    "plan": {real_data_example}, # Numerical variables as ranges (a list of two integers or floats, e.g., [3.0, 5.1]), categorical as single strings.
    "num": n1
  }}
  "plan2": {{
    "reason": ...,
    "plan": ...,
    "num": n2
  }},
  ...
}}
```

## Information:
**Joint Guidance:** `{joint_guide}`
**Marginal Guidance:** `{marginal_guide}`
**Variables:** `{data_desc}`

## Rules:
Create exactly {n_plans} plans totaling {n_samples} samples:
1. Each plan should align with the Joint and Marginal Guidance as closely as possible.
2. The reason for the plan should explain the realistic meaning of this plan and how it follows the provided Guidance one by one.
3. 'min', 'max', 'category' must be replaced by the true value mentioned in **Variables**: Categorical variables must be a single valid candidate string (case-sensitive), and numerical variables must be a list of two integer or float numbers (e.g., [3.0, 5.1]).
4. Do not use 'category' as value of categorical variables, but **use the true categorical value** mentioned in **Variables**.
5. Do not include multiple ranges or categories per variable within a single plan.
6. Structure of generated data must match the **Output Format**, and all variables must be included.
7. The majority of samples must adhere strictly to all guidelines.
8. Only list generation plans; do not generate data or add extra text.
9. Do not return an empty JSON. Do not use escaped characters such as \\r\\n, \\t, or \". Do not include any comments, markdown formatting, or explanatory text. 

---  
Now return a pure, valid, and non-empty JSON in english that can be directly parsed by json.loads() in Python"""


GENERATION_MARGINAL_GUIDE_C_COT_BATCH_PROMPT = """## Output Format:
```json
{{
  "n_plans": n,
  "plan1": {{
    "reason": ..., # Here, you need to explain the realistic meaning of this plan and how it follows the provided Guidance.
    "plan": {real_data_example}, # Numerical variables as ranges (a list of two integers or floats, e.g., [3.0, 5.1]), categorical as single strings.
    "num": n1
  }}
  "plan2": {{
    "reason": ...,
    "plan": ...,
    "num": n2
  }},
  ...
}}
```

## Information:
**Guidance:** `{marginal_guide}`
**Variables:** `{data_desc}`

---

## Rules:
Create exactly {n_plans} plans totaling {n_samples} samples:
1. Each plan should align with the Guidance as closely as possible.
2. The reason for the plan should explain the realistic meaning of this plan and how it follows the provided Guidance one by one.
3. 'min', 'max', 'category' must be replaced by the true value mentioned in **Variables**: Categorical variables must be a single valid candidate string (case-sensitive), and numerical variables must be a list of two integer or float numbers (e.g., [3.0, 5.1]).
4. Do not use 'category' as value of categorical variables, but **use the true value** mentioned in **Variables**.
5. Do not include multiple ranges or categories per variable within a single plan.
6. Structure of generated data must match the **Output Format**, and all variables must be included.
7. The majority of samples must adhere strictly to all guidelines.
8. Only list generation plans; do not generate data or add extra text.
9. Do not return an empty JSON. Do not use escaped characters such as \\r\\n, \\t, or \". Do not include any comments, markdown formatting, or explanatory text. 

---  
Now return a pure, valid, and non-empty JSON in english that can be directly parsed by json.loads() in Python"""


COPULA_INFERENCE_PROMPT = """## Information
**Variables:** ```{data_desc}```

## Output Format:
{{
  "1": ["var1", "var2", ...],
  "2": ["var1", "var2", ...],
  ...
}}

### Example Output JSON:
{{
  "1": ["Temperature", "Humidity"],
  "2": ["Humidity", "WeatherCondition"],
}}

---
Your task is to extract **at most {n_joints}** correlated variable groups based on the **Variables** semantics and present them in JSON format.
1. Ensure each group includes two or more variables.
2. Format the correlated variable groups according to the **Output Format**.
3. Output in JSON format."""