# File Composition
- benchmark: 
    - annotation: Files after human annotation
    - chinese: Chinese version of the benchmark
    - english: English version of the benchmark
    - final_version: Final version of the benchmark

- codes:
    - analyze.py: Analyze the statistics of the benchmark
    - baselines.py: Get the baseline results
    - chateval.py: Implementation of ChatEval
    - refer.py: Implementation of ReFeR
    - round_score.py: Round the scores of the static evaluation metrics to two decimal places
    - transfer_openai.py: Send requests to OpenAI

- prompts: Prompts used for the evaluation process
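
As a rough illustration of the rounding step described for `round_score.py`, the sketch below rounds a dictionary of metric scores to two decimal places. It is not the repository's actual code: the function name `round_scores`, the metric names, and the half-up rounding mode are all assumptions made for this example.

```python
from decimal import Decimal, ROUND_HALF_UP


def round_scores(scores, places=2):
    """Round each score to `places` decimal places, rounding halves up.

    Hypothetical helper for illustration; round_score.py may differ.
    """
    # Quantization step, e.g. 0.01 when places=2.
    quantum = Decimal(1).scaleb(-places)
    return {
        # Going through str() avoids binary floating-point artifacts
        # such as 0.125 being stored slightly below its decimal value.
        name: float(Decimal(str(value)).quantize(quantum, rounding=ROUND_HALF_UP))
        for name, value in scores.items()
    }


# Metric names and values below are made up for the example.
example = {"bleu": 0.23456, "rouge_l": 0.4149, "meteor": 0.125}
print(round_scores(example))
```

`Decimal` is used here instead of the built-in `round` because `round` applies round-half-to-even on the binary float, which can give surprising results for values that sit exactly on a half.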