Used file: ./datasets/m4_daily/data-00000-of-00001.arrow

+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|   Input Length | Output Length   |   Total Channels |   Selected Channels |   Input Tokens |   Generated Tokens |   Conversation Length |
+================+=================+==================+=====================+================+====================+=======================+
|             44 | -               |                1 |                   1 |            485 |                180 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|             53 | -               |                1 |                   1 |            485 |                186 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            418 | -               |                1 |                   1 |            485 |                157 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            455 | -               |                1 |                   1 |            485 |                187 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            350 | -               |                1 |                   1 |            485 |                151 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            234 | -               |                1 |                   1 |            485 |                167 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            214 | -               |                1 |                   1 |            485 |                141 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            445 | -               |                1 |                   1 |            485 |                167 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            177 | -               |                1 |                   1 |            485 |                149 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            287 | -               |                1 |                   1 |            485 |                164 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            334 | -               |                1 |                   1 |            485 |                169 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            423 | -               |                1 |                   1 |            485 |                180 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            241 | -               |                1 |                   1 |            485 |                160 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            279 | -               |                1 |                   1 |            485 |                165 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|             74 | -               |                1 |                   1 |            485 |                198 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            385 | -               |                1 |                   1 |            485 |                161 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|             33 | -               |                1 |                   1 |            485 |                142 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            249 | -               |                1 |                   1 |            485 |                135 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|             37 | -               |                1 |                   1 |            485 |                200 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            331 | -               |                1 |                   1 |            485 |                164 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            286 | -               |                1 |                   1 |            485 |                159 |                     0 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            440 | -               |                1 |                   1 |            485 |                133 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            303 | -               |                1 |                   1 |            485 |                143 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            247 | -               |                1 |                   1 |            485 |                151 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            128 | -               |                1 |                   1 |            485 |                148 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
Model used: gpt-4o-mini
System prompt tokens: 452
Total input tokens: 12125
Total generated tokens: 4057
Total tokens: 16182

Arguments used:
Dataset Directory: ./datasets/m4_daily
Prompt Path: ./system_prompts/benchmarkQA.txt
Number of Samples: 25
Bias: 1.0
Max Length: 1024
API: openai
Task: QA
