Used file: ./datasets/m4_daily/data-00000-of-00001.arrow

+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|   Input Length | Output Length   |   Total Channels |   Selected Channels |   Input Tokens |   Generated Tokens |   Conversation Length |
+================+=================+==================+=====================+================+====================+=======================+
|            377 | -               |                1 |                   1 |            485 |                147 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            237 | -               |                1 |                   1 |            485 |                144 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            300 | -               |                1 |                   1 |            485 |                175 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|             72 | -               |                1 |                   1 |            485 |                143 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            214 | -               |                1 |                   1 |            485 |                169 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            344 | -               |                1 |                   1 |            485 |                153 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            112 | -               |                1 |                   1 |            485 |                147 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            414 | -               |                1 |                   1 |            485 |                146 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            293 | -               |                1 |                   1 |            485 |                148 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            194 | -               |                1 |                   1 |            485 |                151 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            436 | -               |                1 |                   1 |            485 |                153 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            179 | -               |                1 |                   1 |            485 |                193 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            277 | -               |                1 |                   1 |            485 |                139 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|             89 | -               |                1 |                   1 |            485 |                136 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            269 | -               |                1 |                   1 |            485 |                158 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            489 | -               |                1 |                   1 |            485 |                169 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            473 | -               |                1 |                   1 |            485 |                148 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            114 | -               |                1 |                   1 |            485 |                166 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|             27 | -               |                1 |                   1 |            485 |                149 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            273 | -               |                1 |                   1 |            485 |                171 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            316 | -               |                1 |                   1 |            485 |                153 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            272 | -               |                1 |                   1 |            485 |                144 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            253 | -               |                1 |                   1 |            485 |                157 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            143 | -               |                1 |                   1 |            485 |                147 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
|            285 | -               |                1 |                   1 |            485 |                149 |                     2 |
+----------------+-----------------+------------------+---------------------+----------------+--------------------+-----------------------+
Model used: gpt-4o-mini
System prompt tokens: 452
Total input tokens: 12125
Total generated tokens: 3855
Total tokens: 15980
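
The totals above follow from the table: 25 rows at 485 input tokens each give 12125, the Generated Tokens column sums to 3855, and 12125 + 3855 = 15980. A minimal sketch for re-checking these sums from a saved copy of this report is shown below. The helper names `parse_grid_rows` and `token_totals` are hypothetical and are not part of the tool that produced this log; the sketch assumes only the grid-style table layout shown above.

```python
def parse_grid_rows(report_text):
    """Return the table rows (header first) as lists of stripped cell strings."""
    rows = []
    for line in report_text.splitlines():
        line = line.strip()
        if not line.startswith("|"):
            continue  # skip border lines (+---+ / +===+) and non-table text
        cells = [cell.strip() for cell in line.strip("|").split("|")]
        rows.append(cells)
    return rows


def token_totals(report_text):
    """Sum the 'Input Tokens' and 'Generated Tokens' columns of the table.

    Returns (total_input, total_generated, total_tokens).
    """
    rows = parse_grid_rows(report_text)
    header, data = rows[0], rows[1:]
    input_col = header.index("Input Tokens")
    generated_col = header.index("Generated Tokens")
    total_input = sum(int(row[input_col]) for row in data)
    total_generated = sum(int(row[generated_col]) for row in data)
    return total_input, total_generated, total_input + total_generated
```

Calling `token_totals` on the full text of this report should reproduce the three totals listed above. The system prompt token count is reported separately and is not recomputed by this sketch.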

Arguments used:
Dataset Directory: ./datasets/m4_daily
Prompt Path: ./system_prompts/benchmarkQA.txt
Number of Samples: 25
Bias: 1.0
Max Length: 1024
API: openai
Task: QA
