{
  "cells": [
    {
      "cell_type": "markdown",
      "metadata": {},
      "source": [
        "# 试卷名：新课标卷Ⅰ 高考真题 【英语】学科\n",
        "\n",
        "## 题目编号：1\n",
        "## 题目标答\n",
        "C\n",
        "B\n",
        "B\n",
        "## WQX 模型输出\n",
        "C\n",
        "B\n",
        "B\n",
        "\n",
        "------\n",
        "## 题目编号：2\n",
        "## 题目标答\n",
        "A\n",
        "C\n",
        "D\n",
        "A\n",
        "## WQX 模型输出\n",
        "A\n",
        "C\n",
        "D\n",
        "A\n",
        "\n",
        "------\n",
        "## 题目编号：3\n",
        "## 题目标答\n",
        "D\n",
        "A\n",
        "A\n",
        "C\n",
        "## WQX 模型输出\n",
        "D\n",
        "A\n",
        "A\n",
        "C\n",
        "\n",
        "------\n",
        "## 题目编号：4\n",
        "## 题目标答\n",
        "B\n",
        "C\n",
        "C\n",
        "D\n",
        "## WQX 模型输出\n",
        "B\n",
        "C\n",
        "C\n",
        "D\n",
        "\n",
        "------\n",
        "## 题目编号：5\n",
        "## 题目标答\n",
        "FBEAD\n",
        "## WQX 模型输出\n",
        "F; B; G; A; D\n",
        "\n",
        "------\n",
        "## 题目编号：6\n",
        "## 题目标答\n",
        "CADCB DCBBA DADBC\n",
        "## WQX 模型输出\n",
        "C; A; D; C; B; D; C; B; B; A; D; A; D; B; C\n",
        "\n",
        "------\n",
        "## 题目编号：7\n",
        "## 题目标答\n",
        "engineering\n",
        "functional\n",
        "to give\n",
        "closed\n",
        "walks\n",
        "the\n",
        "favourites\n",
        "as\n",
        "that\n",
        "richness\n",
        "## WQX 模型输出\n",
        "engineering\n",
        "functional\n",
        "to give\n",
        "closed\n",
        "walks\n",
        "the\n",
        "favourites\n",
        "for\n",
        "that\n",
        "richness\n",
        "\n",
        "------\n",
        "## 题目编号：8\n",
        "## 题目标答\n",
        "略\n",
        "## WQX 模型输出\n",
        "Dear Chris,\n",
        "I'm writing to share with you an art class I had in a park last Friday. The teacher asked us to bring an egg and draw the egg in different angles. After that, we dyed our egg using color pencil. Since I have never drawn a egg in different angles, it is a special experience for me. It not only allows me to observe the egg more carefully, but also helps me appreciate the unique beauty of an egg. It is amazing how much fun we can have in an egg! ;) Hope you can try it yourself and I'm sure you will enjoy it too!\n",
        "\n",
        "------\n",
        "## 题目编号：9\n",
        "## 题目标答\n",
        "略\n",
        "## WQX 模型输出\n",
        "I ran back to Gunter and told him the bad news. He tried to comfort me, \"Don't worry. Let me see what I can do.\" He fished out his little phone and scrolled through his contacts. After a while, he found what he was looking for. \"Great.\" He dialed a number and got through straight away. He had a heated discussion in German, and then he handed the phone to me. A man's voice asked me to tell my bank to unlock my card, which was apparently one of Gunter's contact in the bank. I did as I was told, and sure enough, the machine worked. I got enough cash to get on the bus. I promised Gunter that I would call him as soon as I got to Prague.\n",
        "\n",
        "Four days later, when I was back in Vienna, I called Gunter as promised. I got through to him straight away. \"How are you? \" I asked. \"I made it to Prague.\" I told him I wanted to meet him in person to thank him. But he said that wasn't necessary. \"It's not a big deal. These things happen to all of us,\" he said. \"It was my pleasure to meet you.\" When I hung up the phone, a strong feeling of warmth flooded over me on that cold, wet night.\n"
      ]
    },
    {
      "cell_type": "code",
      "execution_count": 1,
      "metadata": {
        "execution": {
          "iopub.execute_input": "2024-06-11T14:00:49.035885Z",
          "iopub.status.busy": "2024-06-11T14:00:49.035605Z",
          "iopub.status.idle": "2024-06-11T14:01:04.544836Z",
          "shell.execute_reply": "2024-06-11T14:01:04.544147Z"
        }
      },
      "outputs": [
        {
          "data": {
            "application/vnd.jupyter.widget-view+json": {
              "model_id": "6e754faefb3d488188d56c2fe19a038e",
              "version_major": 2,
              "version_minor": 0
            },
            "text/plain": [
              "Loading checkpoint shards:   0%|          | 0/4 [00:00<?, ?it/s]"
            ]
          },
          "metadata": {},
          "output_type": "display_data"
        }
      ],
      "source": [
        "import re\n",
        "import json\n",
        "\n",
        "import torch\n",
        "from transformers import AutoTokenizer, AutoModelForCausalLM\n",
        "\n",
        "device = torch.device(\"cuda\")\n",
        "\n",
        "model_path = \"WQX-20b\"\n",
        "\n",
        "tokenizer = AutoTokenizer.from_pretrained(\n",
        "    model_path, trust_remote_code=True,\n",
        ")\n",
        "model = AutoModelForCausalLM.from_pretrained(\n",
        "    model_path,\n",
        "    trust_remote_code=True,\n",
        "    torch_dtype=torch.float16,\n",
        ").eval().to(device)"
      ]
    },
    {
      "cell_type": "code",
      "execution_count": 2,
      "metadata": {
        "execution": {
          "iopub.execute_input": "2024-06-11T14:01:04.547988Z",
          "iopub.status.busy": "2024-06-11T14:01:04.547314Z",
          "iopub.status.idle": "2024-06-11T14:01:25.194571Z",
          "shell.execute_reply": "2024-06-11T14:01:25.193926Z"
        }
      },
      "outputs": [
        {
          "name": "stdout",
          "output_type": "stream",
          "text": [
            "试卷名：新课标卷Ⅰ 高考真题 【英语】学科\n",
            "***********************************\n",
            "题目编号：1\n",
            "题目标答：C\n",
            "B\n",
            "B\n",
            "WQX 模型输出：C\n",
            "B\n",
            "B\n",
            "***************\n",
            "题目编号：2\n",
            "题目标答：A\n",
            "C\n",
            "D\n",
            "A\n",
            "WQX 模型输出：A\n",
            "C\n",
            "D\n",
            "A\n",
            "***************\n",
            "题目编号：3\n",
            "题目标答：D\n",
            "A\n",
            "A\n",
            "C\n",
            "WQX 模型输出：D\n",
            "A\n",
            "A\n",
            "C\n",
            "***************\n",
            "题目编号：4\n",
            "题目标答：B\n",
            "C\n",
            "C\n",
            "D\n",
            "WQX 模型输出：B\n",
            "C\n",
            "C\n",
            "D\n",
            "***************\n",
            "题目编号：5\n",
            "题目标答：FBEAD\n",
            "WQX 模型输出：F; B; G; A; D\n",
            "***************\n",
            "题目编号：6\n",
            "题目标答：CADCB DCBBA DADBC\n",
            "WQX 模型输出：C; A; D; C; B; D; C; B; B; A; D; A; D; B; C\n",
            "***************\n",
            "题目编号：7\n",
            "题目标答：engineering\n",
            "functional\n",
            "to give\n",
            "closed\n",
            "walks\n",
            "the\n",
            "favourites\n",
            "as\n",
            "that\n",
            "richness\n",
            "WQX 模型输出：engineering\n",
            "functional\n",
            "to give\n",
            "closed\n",
            "walks\n",
            "the\n",
            "favourites\n",
            "for\n",
            "that\n",
            "richness\n",
            "***************\n",
            "题目编号：8\n",
            "题目标答：略\n",
            "WQX 模型输出：Dear Chris,\n",
            "I'm writing to share with you an art class I had in a park last Friday. The teacher asked us to bring an egg and draw the egg in different angles. After that, we dyed our egg using color pencil. Since I have never drawn a egg in different angles, it is a special experience for me. It not only allows me to observe the egg more carefully, but also helps me appreciate the unique beauty of an egg. It is amazing how much fun we can have in an egg! ;) Hope you can try it yourself and I'm sure you will enjoy it too!\n",
            "***************\n",
            "题目编号：9\n",
            "题目标答：略\n",
            "WQX 模型输出：I ran back to Gunter and told him the bad news. He tried to comfort me, \"Don't worry. Let me see what I can do.\" He fished out his little phone and scrolled through his contacts. After a while, he found what he was looking for. \"Great.\" He dialed a number and got through straight away. He had a heated discussion in German, and then he handed the phone to me. A man's voice asked me to tell my bank to unlock my card, which was apparently one of Gunter's contact in the bank. I did as I was told, and sure enough, the machine worked. I got enough cash to get on the bus. I promised Gunter that I would call him as soon as I got to Prague.\n",
            "\n",
            "Four days later, when I was back in Vienna, I called Gunter as promised. I got through to him straight away. \"How are you? \" I asked. \"I made it to Prague.\" I told him I wanted to meet him in person to thank him. But he said that wasn't necessary. \"It's not a big deal. These things happen to all of us,\" he said. \"It was my pleasure to meet you.\" When I hung up the phone, a strong feeling of warmth flooded over me on that cold, wet night.\n"
          ]
        }
      ],
      "source": [
        "subject, paper_type = \"英语\", \"新课标卷Ⅰ\"\n",
        "file_name = f\"../data/{paper_type}/{subject}.jsonl\"\n",
        "\n",
        "questions = []\n",
        "\n",
        "print(f\"试卷名：{paper_type} 高考真题 【{subject}】学科\")\n",
        "\n",
        "with open(file_name, \"r\") as f:\n",
        "    for i, line in enumerate(f):\n",
        "        if i+1 < 8:\n",
        "            gen_kwargs = {\"max_length\": 2048, \"do_sample\": False}\n",
        "        else:\n",
        "            gen_kwargs = {\"max_length\": 2048, \"do_sample\": True, \"temperature\": 1.0, \"top_p\": 0.9}\n",
        "\n",
        "        data = json.loads(line)\n",
        "        has_img, question = False, data['prompt']\n",
        "\n",
        "        if '<img' in question:\n",
        "            has_img = True\n",
        "            question = re.sub(r'<img[^>]*?/>', \"\", question)\n",
        "\n",
        "        inputs = tokenizer(question, return_tensors=\"pt\")\n",
        "        input_ids = inputs[\"input_ids\"].to(device)\n",
        "\n",
        "        outputs = model.generate(input_ids, **gen_kwargs)\n",
        "        outputs = outputs[0].cpu().tolist()[len(inputs[\"input_ids\"][0]) :]\n",
        "\n",
        "        response = tokenizer.decode(outputs, skip_special_tokens=True)\n",
        "        response = response.split(\"<|im_end|>\")[0]\n",
        "\n",
        "        if i == 0:\n",
        "            print(\"*\" * 35)\n",
        "        else:\n",
        "            print(\"*\" * 15)\n",
        "\n",
        "        print(\"题目编号：\" + str(i+1) + (\"（含图片）\" if has_img else \"\"))\n",
        "        print(\"题目标答：\" + data[\"answer\"])\n",
        "        print(\"WQX 模型输出：\" + response)\n",
        "            \n",
        "        questions.append({\n",
        "            \"id\": str(i+1),\n",
        "            \"question\": question,\n",
        "            \"answer\": data[\"answer\"],\n",
        "            \"output\": response,\n",
        "            \"has_img\": has_img\n",
        "        })"
      ]
    }
  ],
  "metadata": {
    "kernelspec": {
      "display_name": "base",
      "language": "python",
      "name": "python3"
    },
    "language_info": {
      "codemirror_mode": {
        "name": "ipython",
        "version": 3
      },
      "file_extension": ".py",
      "mimetype": "text/x-python",
      "name": "python",
      "nbconvert_exporter": "python",
      "pygments_lexer": "ipython3",
      "version": "3.10.12"
    },
    "widgets": {
      "application/vnd.jupyter.widget-state+json": {
        "state": {
          "09cd496b5d664ac2a2bd6e80c62077bf": {
            "model_module": "@jupyter-widgets/base",
            "model_module_version": "2.0.0",
            "model_name": "LayoutModel",
            "state": {
              "_model_module": "@jupyter-widgets/base",
              "_model_module_version": "2.0.0",
              "_model_name": "LayoutModel",
              "_view_count": null,
              "_view_module": "@jupyter-widgets/base",
              "_view_module_version": "2.0.0",
              "_view_name": "LayoutView",
              "align_content": null,
              "align_items": null,
              "align_self": null,
              "border_bottom": null,
              "border_left": null,
              "border_right": null,
              "border_top": null,
              "bottom": null,
              "display": null,
              "flex": null,
              "flex_flow": null,
              "grid_area": null,
              "grid_auto_columns": null,
              "grid_auto_flow": null,
              "grid_auto_rows": null,
              "grid_column": null,
              "grid_gap": null,
              "grid_row": null,
              "grid_template_areas": null,
              "grid_template_columns": null,
              "grid_template_rows": null,
              "height": null,
              "justify_content": null,
              "justify_items": null,
              "left": null,
              "margin": null,
              "max_height": null,
              "max_width": null,
              "min_height": null,
              "min_width": null,
              "object_fit": null,
              "object_position": null,
              "order": null,
              "overflow": null,
              "padding": null,
              "right": null,
              "top": null,
              "visibility": null,
              "width": null
            }
          },
          "1205b867ce3641eb92bd4887d557c100": {
            "model_module": "@jupyter-widgets/controls",
            "model_module_version": "2.0.0",
            "model_name": "FloatProgressModel",
            "state": {
              "_dom_classes": [],
              "_model_module": "@jupyter-widgets/controls",
              "_model_module_version": "2.0.0",
              "_model_name": "FloatProgressModel",
              "_view_count": null,
              "_view_module": "@jupyter-widgets/controls",
              "_view_module_version": "2.0.0",
              "_view_name": "ProgressView",
              "bar_style": "success",
              "description": "",
              "description_allow_html": false,
              "layout": "IPY_MODEL_c3906950803b4506a0734d4931b05d71",
              "max": 4,
              "min": 0,
              "orientation": "horizontal",
              "style": "IPY_MODEL_bd47e087ffee46d7b00d22acd21aaaa8",
              "tabbable": null,
              "tooltip": null,
              "value": 4
            }
          },
          "20d8a3b5f2b14b75aba436eb9cefdebd": {
            "model_module": "@jupyter-widgets/controls",
            "model_module_version": "2.0.0",
            "model_name": "HTMLStyleModel",
            "state": {
              "_model_module": "@jupyter-widgets/controls",
              "_model_module_version": "2.0.0",
              "_model_name": "HTMLStyleModel",
              "_view_count": null,
              "_view_module": "@jupyter-widgets/base",
              "_view_module_version": "2.0.0",
              "_view_name": "StyleView",
              "background": null,
              "description_width": "",
              "font_size": null,
              "text_color": null
            }
          },
          "52bf7787728c4c18842f0219b56d7dab": {
            "model_module": "@jupyter-widgets/base",
            "model_module_version": "2.0.0",
            "model_name": "LayoutModel",
            "state": {
              "_model_module": "@jupyter-widgets/base",
              "_model_module_version": "2.0.0",
              "_model_name": "LayoutModel",
              "_view_count": null,
              "_view_module": "@jupyter-widgets/base",
              "_view_module_version": "2.0.0",
              "_view_name": "LayoutView",
              "align_content": null,
              "align_items": null,
              "align_self": null,
              "border_bottom": null,
              "border_left": null,
              "border_right": null,
              "border_top": null,
              "bottom": null,
              "display": null,
              "flex": null,
              "flex_flow": null,
              "grid_area": null,
              "grid_auto_columns": null,
              "grid_auto_flow": null,
              "grid_auto_rows": null,
              "grid_column": null,
              "grid_gap": null,
              "grid_row": null,
              "grid_template_areas": null,
              "grid_template_columns": null,
              "grid_template_rows": null,
              "height": null,
              "justify_content": null,
              "justify_items": null,
              "left": null,
              "margin": null,
              "max_height": null,
              "max_width": null,
              "min_height": null,
              "min_width": null,
              "object_fit": null,
              "object_position": null,
              "order": null,
              "overflow": null,
              "padding": null,
              "right": null,
              "top": null,
              "visibility": null,
              "width": null
            }
          },
          "922a676d53e9449b950d00d6a174b268": {
            "model_module": "@jupyter-widgets/controls",
            "model_module_version": "2.0.0",
            "model_name": "HBoxModel",
            "state": {
              "_dom_classes": [],
              "_model_module": "@jupyter-widgets/controls",
              "_model_module_version": "2.0.0",
              "_model_name": "HBoxModel",
              "_view_count": null,
              "_view_module": "@jupyter-widgets/controls",
              "_view_module_version": "2.0.0",
              "_view_name": "HBoxView",
              "box_style": "",
              "children": [
                "IPY_MODEL_bdae3690160c48b3a73cf0cca1cef8d0",
                "IPY_MODEL_1205b867ce3641eb92bd4887d557c100",
                "IPY_MODEL_d4028a3a218e413db0896f2693be65ae"
              ],
              "layout": "IPY_MODEL_52bf7787728c4c18842f0219b56d7dab",
              "tabbable": null,
              "tooltip": null
            }
          },
          "93817d6e1e8447039700190f7906273c": {
            "model_module": "@jupyter-widgets/base",
            "model_module_version": "2.0.0",
            "model_name": "LayoutModel",
            "state": {
              "_model_module": "@jupyter-widgets/base",
              "_model_module_version": "2.0.0",
              "_model_name": "LayoutModel",
              "_view_count": null,
              "_view_module": "@jupyter-widgets/base",
              "_view_module_version": "2.0.0",
              "_view_name": "LayoutView",
              "align_content": null,
              "align_items": null,
              "align_self": null,
              "border_bottom": null,
              "border_left": null,
              "border_right": null,
              "border_top": null,
              "bottom": null,
              "display": null,
              "flex": null,
              "flex_flow": null,
              "grid_area": null,
              "grid_auto_columns": null,
              "grid_auto_flow": null,
              "grid_auto_rows": null,
              "grid_column": null,
              "grid_gap": null,
              "grid_row": null,
              "grid_template_areas": null,
              "grid_template_columns": null,
              "grid_template_rows": null,
              "height": null,
              "justify_content": null,
              "justify_items": null,
              "left": null,
              "margin": null,
              "max_height": null,
              "max_width": null,
              "min_height": null,
              "min_width": null,
              "object_fit": null,
              "object_position": null,
              "order": null,
              "overflow": null,
              "padding": null,
              "right": null,
              "top": null,
              "visibility": null,
              "width": null
            }
          },
          "9f175143d7014946a30c3b7cdd6df9b2": {
            "model_module": "@jupyter-widgets/controls",
            "model_module_version": "2.0.0",
            "model_name": "HTMLStyleModel",
            "state": {
              "_model_module": "@jupyter-widgets/controls",
              "_model_module_version": "2.0.0",
              "_model_name": "HTMLStyleModel",
              "_view_count": null,
              "_view_module": "@jupyter-widgets/base",
              "_view_module_version": "2.0.0",
              "_view_name": "StyleView",
              "background": null,
              "description_width": "",
              "font_size": null,
              "text_color": null
            }
          },
          "bd47e087ffee46d7b00d22acd21aaaa8": {
            "model_module": "@jupyter-widgets/controls",
            "model_module_version": "2.0.0",
            "model_name": "ProgressStyleModel",
            "state": {
              "_model_module": "@jupyter-widgets/controls",
              "_model_module_version": "2.0.0",
              "_model_name": "ProgressStyleModel",
              "_view_count": null,
              "_view_module": "@jupyter-widgets/base",
              "_view_module_version": "2.0.0",
              "_view_name": "StyleView",
              "bar_color": null,
              "description_width": ""
            }
          },
          "bdae3690160c48b3a73cf0cca1cef8d0": {
            "model_module": "@jupyter-widgets/controls",
            "model_module_version": "2.0.0",
            "model_name": "HTMLModel",
            "state": {
              "_dom_classes": [],
              "_model_module": "@jupyter-widgets/controls",
              "_model_module_version": "2.0.0",
              "_model_name": "HTMLModel",
              "_view_count": null,
              "_view_module": "@jupyter-widgets/controls",
              "_view_module_version": "2.0.0",
              "_view_name": "HTMLView",
              "description": "",
              "description_allow_html": false,
              "layout": "IPY_MODEL_93817d6e1e8447039700190f7906273c",
              "placeholder": "​",
              "style": "IPY_MODEL_20d8a3b5f2b14b75aba436eb9cefdebd",
              "tabbable": null,
              "tooltip": null,
              "value": "Loading checkpoint shards: 100%"
            }
          },
          "c3906950803b4506a0734d4931b05d71": {
            "model_module": "@jupyter-widgets/base",
            "model_module_version": "2.0.0",
            "model_name": "LayoutModel",
            "state": {
              "_model_module": "@jupyter-widgets/base",
              "_model_module_version": "2.0.0",
              "_model_name": "LayoutModel",
              "_view_count": null,
              "_view_module": "@jupyter-widgets/base",
              "_view_module_version": "2.0.0",
              "_view_name": "LayoutView",
              "align_content": null,
              "align_items": null,
              "align_self": null,
              "border_bottom": null,
              "border_left": null,
              "border_right": null,
              "border_top": null,
              "bottom": null,
              "display": null,
              "flex": null,
              "flex_flow": null,
              "grid_area": null,
              "grid_auto_columns": null,
              "grid_auto_flow": null,
              "grid_auto_rows": null,
              "grid_column": null,
              "grid_gap": null,
              "grid_row": null,
              "grid_template_areas": null,
              "grid_template_columns": null,
              "grid_template_rows": null,
              "height": null,
              "justify_content": null,
              "justify_items": null,
              "left": null,
              "margin": null,
              "max_height": null,
              "max_width": null,
              "min_height": null,
              "min_width": null,
              "object_fit": null,
              "object_position": null,
              "order": null,
              "overflow": null,
              "padding": null,
              "right": null,
              "top": null,
              "visibility": null,
              "width": null
            }
          },
          "d4028a3a218e413db0896f2693be65ae": {
            "model_module": "@jupyter-widgets/controls",
            "model_module_version": "2.0.0",
            "model_name": "HTMLModel",
            "state": {
              "_dom_classes": [],
              "_model_module": "@jupyter-widgets/controls",
              "_model_module_version": "2.0.0",
              "_model_name": "HTMLModel",
              "_view_count": null,
              "_view_module": "@jupyter-widgets/controls",
              "_view_module_version": "2.0.0",
              "_view_name": "HTMLView",
              "description": "",
              "description_allow_html": false,
              "layout": "IPY_MODEL_09cd496b5d664ac2a2bd6e80c62077bf",
              "placeholder": "​",
              "style": "IPY_MODEL_9f175143d7014946a30c3b7cdd6df9b2",
              "tabbable": null,
              "tooltip": null,
              "value": " 4/4 [00:05&lt;00:00,  1.30s/it]"
            }
          }
        },
        "version_major": 2,
        "version_minor": 0
      }
    }
  },
  "nbformat": 4,
  "nbformat_minor": 2
}
