| Model                |   Overall |   Guessing |   Bar |   Dollar |   Goods |   Diner |   Auction |   Battle |   Pirate |
|:---------------------|----------:|-----------:|------:|---------:|--------:|--------:|----------:|---------:|---------:|
| 👑**Gemini-1.5-Pro** |      69.8 |       95.4 |  37.2 |     93.8 |   100   |    35.9 |      26.9 |     81.3 |     87.9 |
| GPT-4o-0806          |      66.7 |       94.3 |  70   |     95.2 |    90.9 |    10.7 |      20.8 |     67.3 |     84.4 |
| LLaMA-3.1-70B        |      65.9 |       84   |  59.7 |     87   |    90.6 |    48.1 |      15.7 |     77.7 |     64   |
| GPT-4t-0125          |      62.4 |       91.6 |  23   |     98.1 |    89.2 |     0.9 |      24.2 |     86.8 |     85.4 |
| Mixtral-8x22B        |      62.4 |       83.6 |  39.3 |     79   |    83.7 |    79.9 |      13.2 |     36   |     84.3 |
| LLaMA-3.1-405B       |      61.8 |       94.3 |  20.5 |     94.9 |    97   |    14.4 |      14.7 |     92.7 |     65.6 |
| Qwen-2-72B           |      56.7 |       93.2 |  17   |     91.9 |    81.3 |     0   |       2.5 |     81.7 |     86.1 |
| LLaMA-3.1-8B         |      56   |       85.5 |  75.7 |     56.4 |    19.6 |    59.3 |      37.1 |     35.9 |     78.3 |
| Gemini-1.0-Pro       |      45.7 |       77.3 |  33.5 |     77.6 |    68.5 |     3.1 |      31.6 |     16.5 |     57.4 |
| GPT-3.5-1106         |      45.1 |       68.5 |  64.3 |     70.3 |    43.5 |     1.4 |       7.6 |     35.7 |     69.5 |
| GPT-3.5-0125         |      44.4 |       63.4 |  68.7 |     68.6 |    38.9 |     2.8 |      13   |     28.6 |     71.6 |
| Mixtral-8x7B         |      43.4 |       91.8 |  66.8 |      1.2 |    27.6 |    76.4 |       3.1 |     12.6 |     67.3 |
| GPT-3.5-0613         |      42.7 |       41.4 |  74.8 |     42.4 |    17.7 |    67   |      10.3 |     19.5 |     68.4 |
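
The table does not say how the Overall column is derived. The figures are consistent with an unweighted mean of the eight per-game scores, so the sketch below checks that reading against the first three rows. The aggregation rule, the helper names (`mean_score`, `check_overall`), and the 0.1 tolerance for one-decimal rounding are assumptions for illustration, not something the source states.

```python
# Rows copied from the table: model -> (reported Overall, eight per-game scores
# in column order: Guessing, Bar, Dollar, Goods, Diner, Auction, Battle, Pirate).
ROWS = {
    "Gemini-1.5-Pro": (69.8, [95.4, 37.2, 93.8, 100.0, 35.9, 26.9, 81.3, 87.9]),
    "GPT-4o-0806": (66.7, [94.3, 70.0, 95.2, 90.9, 10.7, 20.8, 67.3, 84.4]),
    "LLaMA-3.1-70B": (65.9, [84.0, 59.7, 87.0, 90.6, 48.1, 15.7, 77.7, 64.0]),
}

# Assumed slack: the table shows one decimal place, so the mean of rounded
# per-game scores can differ slightly from the reported Overall.
TOLERANCE = 0.1


def mean_score(scores):
    """Unweighted arithmetic mean of the per-game scores."""
    return sum(scores) / len(scores)


def check_overall(rows, tolerance=TOLERANCE):
    """Map each model to True if its reported Overall matches the mean within tolerance."""
    return {
        model: abs(mean_score(scores) - overall) <= tolerance
        for model, (overall, scores) in rows.items()
    }


if __name__ == "__main__":
    for model, ok in check_overall(ROWS).items():
        recomputed = mean_score(ROWS[model][1])
        print(f"{model}: recomputed mean {recomputed:.2f}, matches reported Overall: {ok}")
```

If this reading is right, the ranking weights every game equally, so one very low score (such as a near-zero Diner result) costs a model as much as a comparable drop in any other game.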