{"Dataset Name": "MMLUPro_physics", "Instruction Name": "numformat_numeric_answer", "Instance": "Given a question about physics and 10 options: A, B, C, D, E, F, G, H, I, J as candidate answers,  print the text associated with the option label that answers the question correctly. Numeric answer values should be printed in two decimal places as long as it contains no other string or units of measurement.  Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: Astrophysical theory suggests that a burned-out star whose mass is at least three solar masses will collapse under its own gravity to form a black hole. If it does, the radius of its event horizon is X * 10^3 m, what is X?\n\nA. 7.7\nB. 14.8\nC. 9.4\nD. 8.9\nE. 11.1\nF. 6.5\nG. 3.6\nH. 10.2\nI. 12.7\nJ. 5.3\n\n", "Ground Truth": "D", "Instruction Output": "8.90"}
{"Dataset Name": "MMLUPro_physics", "Instruction Name": "numformat_numeric_answer", "Instance": "Given a question about physics and 10 options: A, B, C, D, E, F, G, H, I, J as candidate answers,  print the text associated with the option label that answers the question correctly. Numeric answer values should be printed in two decimal places as long as it contains no other string or units of measurement.  Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: A gas cell with an optical path length of 10 cm is placed in one arm of a Michelson interferometer. If the light source for the interferometer is a laser with wavelength 632.2 nm, then 100 fringes are counted as the gas cell is evacuated. What is the index of refraction of the original gas?\n\nA. 1.00032\nB. 1.00063\nC. 1.00047\nD. 1.00052\nE. 1.00001\nF. 0.99957\nG. 1.00016\nH. 0.99968\nI. 0.99974\nJ. 0.99999\n\n", "Ground Truth": "A", "Instruction Output": "1.00"}
{"Dataset Name": "BoolQ", "Instruction Name": "use_incorrect_options_to_create_string", "Instance": "Given a passage and a boolean question, and the possible answer candidates 'A' or 'B',  create a string by concatenating the last character of every option value, excluding the option value corresponding to the correct answer. Use only the text associated with the option labels and not the option labels while sorting. If the last character is a special character (such as period, comma, quotation, etc) use the previous character. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nPassage: The 2001 World Series was the championship series of Major League Baseball's (MLB) 2001 season. The 97th edition of the World Series, it was a best-of-seven playoff between the National League (NL) champion Arizona Diamondbacks and the three-time defending World Series champions and American League (AL) champion New York Yankees. The Diamondbacks defeated the Yankees, four games to three to win the series. Considered one of the greatest World Series of all time, memorable aspects included two extra-inning games and three late-inning comebacks. Diamondbacks pitchers Randy Johnson and Curt Schilling were both named World Series Most Valuable Players.\nQuestion: did the yankees win the world series in 2001\nOptions: \nA. True\nB. False\n\n", "Ground Truth": "B", "Instruction Output": "e"}
{"Dataset Name": "BoolQ", "Instruction Name": "use_incorrect_options_to_create_string", "Instance": "Given a passage and a boolean question, and the possible answer candidates 'A' or 'B',  create a string by concatenating the last character of every option value, excluding the option value corresponding to the correct answer. Use only the text associated with the option labels and not the option labels while sorting. If the last character is a special character (such as period, comma, quotation, etc) use the previous character. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nPassage: In 1969, the college separated from Baylor University and became an independent institution, which allowed it access to federal research funding, changing its name to Baylor College of Medicine. That same year, BCM negotiated with the Texas Legislature to double its class size in order to increase the number of physicians in Texas.\nQuestion: is baylor college of medicine related to baylor university\nOptions: \nA. True\nB. False\n\n", "Ground Truth": "B", "Instruction Output": "e"}
{"Dataset Name": "BoolQ", "Instruction Name": "print_correct_answer_label", "Instance": "Given a passage and a boolean question, and the possible answer candidates 'A' or 'B',  answer the question by selecting the option label corresponding to the correct answer. Do not include the text associated with the option label in the answer. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nPassage: Roxanne Roxanne is a 2017 American drama film written and directed by Michael Larnell. It stars Chanté Adams, Mahershala Ali, Nia Long, Elvis Nolasco, Kevin Phillips and Shenell Edmonds. The film revolves around the life of rapper Roxanne Shanté. It was screened in the U.S. Dramatic Competition section of the 2017 Sundance Film Festival.\nQuestion: is the movie roxanne roxanne a true story\nOptions: \nA. True\nB. False\n\n", "Ground Truth": "A", "Instruction Output": "A"}
{"Dataset Name": "BoolQ", "Instruction Name": "print_correct_answer_label", "Instance": "Given a passage and a boolean question, and the possible answer candidates 'A' or 'B',  answer the question by selecting the option label corresponding to the correct answer. Do not include the text associated with the option label in the answer. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nPassage: The Suns began play as an expansion team in 1968. The franchise owns the NBA's fourth-best all-time winning percentage, winning 55 percent of its games, as of the end of the 2014--15 season. In 47 years of play, they have made the playoffs 29 times, posted 19 seasons of 50 or more wins, made nine trips to the Western Conference Finals, and advanced to the NBA Finals twice, in 1976 and 1993. As a result, the Suns are the team with the highest all-time win-loss percentage to have never won an NBA championship.\nQuestion: have the phoenix suns ever won a championship\nOptions: \nA. True\nB. False\n\n", "Ground Truth": "B", "Instruction Output": "B"}
{"Dataset Name": "MMLUPro_health", "Instruction Name": "use_incorrect_options_to_create_string", "Instance": "Given a question about health and 10 options: A, B, C, D, E, F, G, H, I, J as candidate answers,  create a string by concatenating the last character of every option value, excluding the option value corresponding to the correct answer. Use only the text associated with the option labels and not the option labels while sorting. If the last character is a special character (such as period, comma, quotation, etc) use the previous character. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: The coding sequences of genes are called:\n\nA. extrons.\nB. nucleotides.\nC. promoters.\nD. anticodons.\nE. genomes.\nF. operons.\nG. codons.\nH. alleles.\nI. exons.\nJ. introns.\n\n", "Ground Truth": "I", "Instruction Output": "sssssssss"}
{"Dataset Name": "MMLUPro_health", "Instruction Name": "use_incorrect_options_to_create_string", "Instance": "Given a question about health and 3 options: A, B, C as candidate answers,  create a string by concatenating the last character of every option value, excluding the option value corresponding to the correct answer. Use only the text associated with the option labels and not the option labels while sorting. If the last character is a special character (such as period, comma, quotation, etc) use the previous character. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: Although theoretically any food protein could act as a food allergen, there are eight major food allergens in Europe and North America which account for most of the allergic triggers observed in childhood. Which of the allergens pairings contains an allergen, which is not part of the group of eight major allergens?\n\n\nA. Fish, shellfish\nB. Soy, peanuts\nC. Mustard seeds, lamb,\n\n", "Ground Truth": "C", "Instruction Output": "hs"}
{"Dataset Name": "MMLUPro_physics", "Instruction Name": "increment_incorrect_numeric_answers_by_one", "Instance": "Given a question about physics and 10 options: A, B, C, D, E, F, G, H, I, J as candidate answers,  print the list of incorrect answers all incremented by one if it is a numeric quanity including dollar values and percentages but contains no other string or units of measurement. Do not print the option labels.  Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: The Berkeley synchrocyclotron was designed to accelerate The Berkeley synchrocyclotron was designed to accelerate protons to a kinetic energy of 5.4 × 10 -11 J, which corresponds to a particle speed of about 2 × 10^8 m/s. At this speed the relativistic mass increase is important and must be taken into account in the design of the accelerator. Cal-culate the percentage increase in the proton mass encounter-ed in this instrument.\n\nA. 40 per cent\nB. 34 per cent\nC. 10 per cent\nD. 60 per cent\nE. 5 per cent\nF. 25 per cent\nG. 15 per cent\nH. 20 per cent\nI. 50 per cent\nJ. 45 per cent\n\n", "Ground Truth": "B", "Instruction Output": "['40 per cent', '10 per cent', '60 per cent', '5 per cent', '25 per cent', '15 per cent', '20 per cent', '50 per cent', '45 per cent']"}
{"Dataset Name": "MMLUPro_physics", "Instruction Name": "increment_incorrect_numeric_answers_by_one", "Instance": "Given a question about physics and 10 options: A, B, C, D, E, F, G, H, I, J as candidate answers,  print the list of incorrect answers all incremented by one if it is a numeric quanity including dollar values and percentages but contains no other string or units of measurement. Do not print the option labels.  Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: Find the force acting on the bottom of an aquarium having a base 1 foot by 2 feet and containing water to a depth of one foot.\n\nA. 374.4 lb\nB. 312 lb\nC. 93.6 lb\nD. 124.8 lb\nE. 174.8 lb\nF. 62.4 lb\nG. 248.6 lb\nH. 224.4 lb\nI. 149.6 lb\nJ. 187.2 lb\n\n", "Ground Truth": "D", "Instruction Output": "['374.4 lb', '312 lb', '93.6 lb', '174.8 lb', '62.4 lb', '248.6 lb', '224.4 lb', '149.6 lb', '187.2 lb']"}
{"Dataset Name": "MMLUPro_economics", "Instruction Name": "sort_options_to_create_string", "Instance": "Given a question about economics and 5 options: A, B, C, D, E as candidate answers,  sort the list of options using their values, in alphabetical order. Use only the text associated with the option labels and not the option labels while sorting. Then, create a string by concatenating the last character of the text associated with each option value. If the last character is a special character (such as period, comma, quotation, etc) use the previous character. Print only the final string and not the sorted list. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: In the extreme case, $ 1,000 of new reserves put into a bank will ultimately result in an increase of $ 5,000 of bank de-posits, assuming that the reserve ratio is 20%. But in reality this is rarely the case. What stops the extreme case from being realized?\n\nA. A decrease in the money supply by the central bank\nB. Withdrawals of reserves from the banking system and not adhering to the reserve ratio\nC. Low bank deposits\nD. Excessive regulation of the banking industry\nE. Increase in government borrowing crowding out private lending\n\n", "Ground Truth": "B", "Instruction Output": "kygso"}
{"Dataset Name": "MMLUPro_economics", "Instruction Name": "sort_options_to_create_string", "Instance": "Given a question about economics and 10 options: A, B, C, D, E, F, G, H, I, J as candidate answers,  sort the list of options using their values, in alphabetical order. Use only the text associated with the option labels and not the option labels while sorting. Then, create a string by concatenating the last character of the text associated with each option value. If the last character is a special character (such as period, comma, quotation, etc) use the previous character. Print only the final string and not the sorted list. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: The production of a unit of good Y requires the employment of 3 workers and 7 units of capital. The going wage is $4. The rent on a unit of capital is $1. What should be the marginal physical product of capital in order for the production to be carried out at the least cost and what is this cost if the marginal physical product of labor is $2?\n\nA. MPPK = 0.25, total cost = $18\nB. MPPK = 0.5, total cost = $19\nC. MPPK = 1.5, total cost = $25\nD. MPPK = 0.3, total cost = $17\nE. MPPK = 0.9, total cost = $26\nF. MPPK = 1, total cost = $21\nG. MPPK = 0.4, total cost = $23\nH. MPPK = 0.75, total cost = $22\nI. MPPK = 0.8, total cost = $24\nJ. MPPK = 0.6, total cost = $20\n\n", "Ground Truth": "B", "Instruction Output": "8739024615"}
{"Dataset Name": "Winogrande", "Instruction Name": "use_incorrect_options_to_create_string", "Instance": "Given a sentence with a blank and 2 options namely 'A' and 'B',  create a string by concatenating the last character of every option value, excluding the option value corresponding to the correct answer. Use only the text associated with the option labels and not the option labels while sorting. If the last character is a special character (such as period, comma, quotation, etc) use the previous character. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nSentence: The sink in the bathroom looked good but the toilet was leaking so James kept the _ .\nOptions: \nA. toilet\nB. sink\n\n", "Ground Truth": "B", "Instruction Output": "t"}
{"Dataset Name": "Winogrande", "Instruction Name": "use_incorrect_options_to_create_string", "Instance": "Given a sentence with a blank and 2 options namely 'A' and 'B',  create a string by concatenating the last character of every option value, excluding the option value corresponding to the correct answer. Use only the text associated with the option labels and not the option labels while sorting. If the last character is a special character (such as period, comma, quotation, etc) use the previous character. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nSentence: Adam's blushed a deep red when Jeffrey saw him fall off his horse, but _ tried to pretend it didn't hurt.\nOptions: \nA. Adam\nB. Jeffrey\n\n", "Ground Truth": "A", "Instruction Output": "y"}
{"Dataset Name": "MMLUPro_physics", "Instruction Name": "sort_only_incorrect_answers", "Instance": "Given a question about physics and 10 options: A, B, C, D, E, F, G, H, I, J as candidate answers,  excluding the option that answers the question correctly, print a sorted list (ascending order) of the incorrect options. Do not print the option labels. Use the text associated with the option labels and not the option labels while sorting and printing. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: Considering air to be an ideal gas to a first approximation, calculate the ratio of the specific heats of air, given that at sea level and STP the velocity of sound in air is 334 m\\bullets^-1, and that the molecular weight of air is 28.8 g-mole^-1.\n\nA. 1.32\nB. 1.10\nC. 1.50\nD. 1.38\nE. 1.67\nF. 1.60\nG. 1.415\nH. 1.05\nI. 1.25\nJ. 1.48\n\n", "Ground Truth": "G", "Instruction Output": "['1.05', '1.10', '1.25', '1.32', '1.38', '1.48', '1.50', '1.60', '1.67']"}
{"Dataset Name": "MMLUPro_physics", "Instruction Name": "sort_only_incorrect_answers", "Instance": "Given a question about physics and 10 options: A, B, C, D, E, F, G, H, I, J as candidate answers,  excluding the option that answers the question correctly, print a sorted list (ascending order) of the incorrect options. Do not print the option labels. Use the text associated with the option labels and not the option labels while sorting and printing. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: A narrow band of light centered around a mean wavelength of 520 nm is chopped by ashutterat a frequency of 40 MHz .Determine the bandwidth (in A°) of the resulting light.\n\nA. 6 × 10^-5nm\nB. 5 × 10- 5nm\nC. 1 × 10^-5nm\nD. 1 × 10^-4nm\nE. 2 × 10- 5nm\nF. 8 × 10^-5nm\nG. 9 × 10^-6nm\nH. 3 × 10- 5nm\nI. 4 × 10- 5nm\nJ. 7 × 10^-6nm\n\n", "Ground Truth": "I", "Instruction Output": "['1 × 10^-4nm', '1 × 10^-5nm', '2 × 10- 5nm', '3 × 10- 5nm', '5 × 10- 5nm', '6 × 10^-5nm', '7 × 10^-6nm', '8 × 10^-5nm', '9 × 10^-6nm']"}
{"Dataset Name": "MMLUPro_law", "Instruction Name": "reverse_correct_answer_alternate_case", "Instance": "Given a question about law and 8 options: A, B, C, D, E, F, G, H as candidate answers,  reverse the text associated with the answer label that correctly answers the question. Print this reversed text in alternate case starting with upper case.  Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: A father lived with his son, who was addicted to crack cocaine. Under its influence, the son often becameviolent and physically abused his father. As a result, the father always lived in fear. One night, the fatherheard his son on the front stoop making loud obscene remarks. The father was certain that his son wasunder the influence of crack and was terrified that he would be physically beaten again. In his fear, hebolted the front door and took out a revolver. When the son discovered that the door was bolted, he kickedit down. As the son burst through the front door, his father shot him four times in the chest, killing him. Infact, the son was not under the influence of crack or any drug and did not intend to harm his father. At trial, the father presented the above facts and asked the judge to instruct the jury on self-defense. How should the judge instruct the jury with respect to self-defense?\n\nA. Give the self-defense instruction, because the son forcibly entered the home.\nB. Deny the self-defense instruction, because the son was not actually under the influence of drugs.\nC. Give the self-defense instruction, because it expresses the defense's theory of the case.\nD. Deny the self-defense instruction, because the father had other options to protect himself.\nE. Give the self-defense instruction, because the evidence was sufficient to raise the defense.\nF. Deny the self-defense instruction, because the son was unarmed.\nG. Deny the self-defense instruction, because the father used excessive force.\nH. Deny the self-defense instruction, because the father was not in imminent danger from his son.\n\n", "Ground Truth": "E", "Instruction Output": ".eSnEfEd eHt eSiAr oT TnEiCiFfUs sAw eCnEdIvE EhT EsUaCeB ,nOiTcUrTsNi eSnEfEd-fLeS EhT EvIg"}
{"Dataset Name": "MMLUPro_law", "Instruction Name": "reverse_correct_answer_alternate_case", "Instance": "Given a question about law and 4 options: A, B, C, D as candidate answers,  reverse the text associated with the answer label that correctly answers the question. Print this reversed text in alternate case starting with upper case.  Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: Assume this question takes place in a \"lien theory\" state. A woman and her co-habiting partner share a property as joint tenants with the right of survivorship. He has run up debts that he did not tell her about, and so he uses his interest in the property to secretly get a mortgage from the bank. When he dies, the bank attempts to execute on the mortgage. Can it do so? Apply the majority view on this issue.\n\nA. No, the mortgage lien could not destroy the joint tenancy, and when the partner dies, the property passes by operation of law to the surviving joint tenant.\nB. No, although the mortgage destroyed the joint tenancy, the bank has to get a deed of trust in order to execute on jointly owned property.\nC. Yes, the bank got title to the property by virtue of the mortgage, which destroyed the joint tenancy, and it can execute on its lien.\nD. Yes, the mortgage was the equivalent of a deed of trust and the joint tenancy was severed by the mortgage lien, giving the bank unfettered legal authority to execute on its mortgage.\n\n", "Ground Truth": "A", "Instruction Output": ".tNaNeT TnIoJ GnIvIvRuS EhT Ot wAl fO NoItArEpO Yb sEsSaP YtRePoRp eHt ,SeId rEnTrAp eHt nEhW DnA ,yCnAnEt tNiOj eHt yOrTsEd tOn dLuOc nEiL EgAgTrOm eHt ,On"}
{"Dataset Name": "MMLUPro_physics", "Instruction Name": "use_options_to_create_string", "Instance": "Given a question about physics and 10 options: A, B, C, D, E, F, G, H, I, J as candidate answers,  create a string by concatenating the last character of every option value (not option label). If the last character is a special character (such as period, comma, quotation, etc) use the previous character.  Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: Gas expanding in a gas engine moves a piston in a cylinder and does 2000 ft-lb of work at a constant pressure of 3000 lb/sq ft. How much does the volume change?\n\nA. 1000 ft³\nB. 1 ft³\nC. 3000 ft3\nD. (3/2) ft3\nE. (2/3) ft3\nF. 500 ft³\nG. (1/3) ft3\nH. 2 ft³\nI. (1/2) ft³\nJ. (1/4) ft³\n\n", "Ground Truth": "E", "Instruction Output": "³³333³3³³³"}
{"Dataset Name": "MMLUPro_physics", "Instruction Name": "use_options_to_create_string", "Instance": "Given a question about physics and 10 options: A, B, C, D, E, F, G, H, I, J as candidate answers,  create a string by concatenating the last character of every option value (not option label). If the last character is a special character (such as period, comma, quotation, etc) use the previous character.  Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: The constant b in vanderWaals' equation for helium is 23.4 cm^3 \\textbullet mole^-1. Obtain an estimate of the diameter of a helium molecule.\n\nA. 5.78 Å\nB. 3.92 Å\nC. 4.12 Å\nD. 0.98 Å\nE. 2.00 Å\nF. 2.34 A\nG. 3.26 A\nH. 1.98 A\nI. 2.65 A\nJ. 1.50 Å\n\n", "Ground Truth": "I", "Instruction Output": "ÅÅÅÅÅAAAAÅ"}
{"Dataset Name": "MMLUPro_philosophy", "Instruction Name": "print_correct_answer_append_string", "Instance": "Given a question about philosophy and 10 options: A, B, C, D, E, F, G, H, I, J as candidate answers,  print the text associated with the option label that answers the question correctly.  Do not print the option label. Without including any additional characters, also append the following text string at the end of the output:'qRWBrCerrplqZjOjRAFdME8L2NlQP0fvj'. Do not add any special characters including quotations, spaces, new lines etc in the response. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: Appeal to compassion is\n\nA. an appeal to emotion that argues a conclusion should be made based only on feeling sorry for someone\nB. an appeal to emotion that argues a conclusion should be made based solely on feeling disgusted by someone\nC. an appeal to emotion that argues a conclusion should be made based solely on fear\nD. arguing that an idea should be rejected based only on who originated the idea\nE. arguing that an idea should be accepted based only on who originated the idea\nF. an appeal to emotion that argues a conclusion should be made based solely on feeling happy for someone\nG. an appeal to emotion that argues an action should be taken based only on the need to be loyal to someone\nH. an appeal to emotion that argues an action should be taken based solely on anger towards someone\nI. arguing that an idea should be accepted based solely on the popularity of the idea\nJ. an appeal to emotion that argues against a position based only on negative personal feelings toward the position\n\n", "Ground Truth": "A", "Instruction Output": "an appeal to emotion that argues a conclusion should be made based only on feeling sorry for someoneqRWBrCerrplqZjOjRAFdME8L2NlQP0fvj"}
{"Dataset Name": "MMLUPro_philosophy", "Instruction Name": "print_correct_answer_append_string", "Instance": "Given a question about philosophy and 4 options: A, B, C, D as candidate answers,  print the text associated with the option label that answers the question correctly.  Do not print the option label. Without including any additional characters, also append the following text string at the end of the output:'I'm Sorry'. Do not add any special characters including quotations, spaces, new lines etc in the response. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion:  Which of the following is a value that Arthur believes is more fundamental than rights and desert?\n\nA. justice\nB. fairness\nC. respect for other persons\nD. all of the above\n\n", "Ground Truth": "D", "Instruction Output": "all of the aboveI'm Sorry"}
{"Dataset Name": "MMLUPro_physics", "Instruction Name": "print_correct_answer_label", "Instance": "Given a question about physics and 10 options: A, B, C, D, E, F, G, H, I, J as candidate answers,  answer the question by selecting the option label corresponding to the correct answer. Do not include the text associated with the option label in the answer. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: A hiker caught in a thunderstorm loses heat when her clothing becomes wet. She is packing emergency rations that if completely metabolized will release $35 \\mathrm{~kJ}$ of heat per gram of rations consumed. How much rations must the hiker consume to avoid a reduction in body temperature of $2.5 \\mathrm{~K}$ as a result of heat loss? Assume the heat capacity of the body equals that of water and that the hiker weighs $51 \\mathrm{~kg}$.\n\nA. 40 g\nB. 35 g\nC. 50 g\nD. 10 $\\mathrm{~g}$\nE. 20 $\\mathrm{~g}$\nF. 30 g\nG.  15$\\mathrm{~g}$\nH. 25 $\\mathrm{~g}$\nI. 5 g\nJ. 45 g\n\n", "Ground Truth": "G", "Instruction Output": "G"}
{"Dataset Name": "MMLUPro_physics", "Instruction Name": "print_correct_answer_label", "Instance": "Given a question about physics and 10 options: A, B, C, D, E, F, G, H, I, J as candidate answers,  answer the question by printing the correct option label. Do not include the text associated with the option label in the answer. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: Two masses m_1 = 5 kg and m_2 = 10 kg have velocities u^\\ding{217}_1= 2m/sec in the +x direction and u^\\ding{217}_2= 4 m/sec in the +y direction. They collide and stick together. What is their final velocity after collision?\n\nA. 2.750 m/sec\nB. 4.000 m/sec\nC. 3.500 m/sec\nD. 3.250 m/sec\nE. 2.000 m/sec\nF. 1.800 m/sec\nG. 2.250 m/sec\nH. 2.900 m/sec\nI. 1.500 m/sec\nJ. 3.000 m/sec\n\n", "Ground Truth": "A", "Instruction Output": "A"}
{"Dataset Name": "MMLUPro_economics", "Instruction Name": "capitalize_correct_answer", "Instance": "Given a question about economics and 10 options: A, B, C, D, E, F, G, H, I, J as candidate answers,  answer the question by printing the text associated with the correct option label in uppercase. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: Suppose a bank acquires an additional $ 1 of deposits and no required reserve ratio exists. By how much could this one dollar deposit theoretically expand the money supply?\n\nA. $20\nB. $10\nC. $5\nD. $1,000\nE. Infinitely\nF. Cannot be calculated\nG. $1\nH. $0\nI. $100\nJ. $50\n\n", "Ground Truth": "E", "Instruction Output": "INFINITELY"}
{"Dataset Name": "MMLUPro_economics", "Instruction Name": "capitalize_correct_answer", "Instance": "Given a question about economics and 10 options: A, B, C, D, E, F, G, H, I, J as candidate answers,  answer the question by printing the text associated with the correct option label in uppercase. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: Ricardo's theory of distribution assumed that the supply of land was fixed, while the country's population and cap-ital stock were constantly increasing. On the basis of these assumptions, he predicted eventual stagnation of the economy. How would this happen according to Ricardo?\n\nA. The increasing population results in a surplus of land.\nB. Landlords decrease rent prices to encourage more farming, leading to an economic boom.\nC. The increasing population leads to increased demand for land, which eventually leads to a decline in profits, resulting in economic stagnation and decline.\nD. The fixed supply of land has no impact on the economy, as technological advancements ensure sustained productivity.\nE. Capital stock diminishes over time, leading to a reduction in the workforce and a stagnant economy.\nF. The constant increase in capital stock leads to a technological revolution that offsets any negative effects of a fixed supply of land.\nG. Landlords invest their incomes leading to economic growth.\nH. The fixed supply of land leads to increased innovation in agriculture, which results in perpetual economic expansion.\nI. The fixed supply of land leads to a decrease in demand.\nJ. The increasing population leads to a decrease in wages, which in turn increases the overall savings and investment, resulting in economic growth.\n\n", "Ground Truth": "C", "Instruction Output": "THE INCREASING POPULATION LEADS TO INCREASED DEMAND FOR LAND, WHICH EVENTUALLY LEADS TO A DECLINE IN PROFITS, RESULTING IN ECONOMIC STAGNATION AND DECLINE."}
{"Dataset Name": "MMLUPro_other", "Instruction Name": "print_correct_answer", "Instance": "Given a question about other and 10 options: A, B, C, D, E, F, G, H, I, J as candidate answers,  print the text associated with the option label that answers the question correctly. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: About what percentage of survey respondents from India report having paid a bribe in the last year to access public services (such as education; judiciary; medical and health; police; registry and permit services; utilities; tax revenue and customs; and land service) as of 2017?\n\nA. 69%\nB. 9%\nC. 49%\nD. 59%\nE. 89%\nF. 29%\nG. 39%\nH. 79%\nI. 99%\nJ. 19%\n\n", "Ground Truth": "A", "Instruction Output": "69%"}
{"Dataset Name": "MMLUPro_other", "Instruction Name": "print_correct_answer", "Instance": "Given a question about other and 10 options: A, B, C, D, E, F, G, H, I, J as candidate answers,  answer the question by selecting the value associated with the option label corresponding to the correct answer. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: 'Eyeballing' is a technique that involves:\n\nA. estimating measurements by sight without using tools\nB. keeping your eyes on research participants at all times\nC. reading a textual document quickly to understand the gist of it\nD. observing a person or object closely to identify details\nE. visually inspecting a construction site for safety hazards\nF. looking at a problem and guessing the solution based on experience\nG. gathering a sample from whoever you can see in a public place\nH. quickly reading a research paper to identify the main arguments\nI. looking at a situation and identifying potential risks\nJ. scanning a table to identify general patterns and significant figures\n\n", "Ground Truth": "J", "Instruction Output": "scanning a table to identify general patterns and significant figures"}
{"Dataset Name": "MMLUPro_law", "Instruction Name": "increment_correct_numeric_answer_by_one", "Instance": "Given a question about law and 10 options: A, B, C, D, E, F, G, H, I, J as candidate answers,  print the text associated with the option label that answers the question correctly. Note that if the correct answer is a numeric quanity, including dollar values and percentages but contains no other string or units of measurement, print the value after increasing its value by 1. Dollar values should be prefixed with '$'. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: A well-established paper mill and a logging company signed a written contract in which the mill agreed to buy from the company all the logs the mill would need for one year. The company was unable to keep up with the mill's needs, and its log deliveries fell short by 10% to 15% in each of the four quarters of the year. The mill paid the company on time for all delivered logs. The mill paid an attorney $2,000 for advice concerning its options in enforcing the contract. It paid a broker a reasonable fee of $5,000 to find additional logs to make up for the company's shortfall. The mill also incurred reasonable costs of $25,000 to transport the additional logs to its facility. Despite the mill's efforts to mitigate damages, it sustained $200,000 in losses because of the company's failure to timely deliver enough logs. The mill has sued the company for breach of contract. If the court finds for the mill, how much should it award in damages?\n\nA. $205,000.00\nB. $220,000.00\nC. $225,000.00\nD. $240,000.00\nE. $200,000.00\nF. $230,000.00\nG. $232,000.00\nH. $250,000.00\nI. $235,000.00\nJ. $210,000.00\n\n", "Ground Truth": "F", "Instruction Output": "$230,001.0"}
{"Dataset Name": "BoolQ", "Instruction Name": "sort_options_to_create_string", "Instance": "Given a passage and a boolean question, and the possible answer candidates 'A' or 'B',  sort the list of options using their values, in alphabetical order. Use only the text associated with the option labels and not the option labels while sorting. Then, create a string by concatenating the last character of the text associated with each option value. If the last character is a special character (such as period, comma, quotation, etc) use the previous character. Print only the final string and not the sorted list. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nPassage: The United States Pharmacopeia defines 'isopropyl rubbing alcohol USP' as containing approximately 70 percent by volume of pure isopropyl alcohol and defines 'rubbing alcohol USP' as containing approximately 70 percent by volume of denatured alcohol. In Ireland and the UK, the comparable preparation is surgical spirit B.P., which the British Pharmacopoeia defines as 95% methylated spirit, 2.5% castor oil, 2% diethyl phthalate, and 0.5% methyl salicylate. Under its alternative name of ``wintergreen oil'', methyl salicylate is a common additive to North American rubbing alcohol products. Individual manufacturers are permitted to use their own formulation standards in which the ethanol content for retail bottles of rubbing alcohol is labeled as and ranges from 70-99% v/v.\nQuestion: is surgical spirit and rubbing alcohol the same thing\nOptions: \nA. True\nB. False\n\n", "Ground Truth": "B", "Instruction Output": "ee"}
{"Dataset Name": "BoolQ", "Instruction Name": "sort_options_to_create_string", "Instance": "Given a passage and a boolean question, and the possible answer candidates 'A' or 'B',  sort the list of options using their values, in alphabetical order. Use only the text associated with the option labels and not the option labels while sorting. Then, create a string by concatenating the last character of the text associated with each option value. If the last character is a special character (such as period, comma, quotation, etc) use the previous character. Print only the final string and not the sorted list. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nPassage: The Xbox One gaming console has received updates from Microsoft since its launch in 2013 that enable it to play select games from its two predecessor consoles, Xbox and Xbox 360. On June 15, 2015, backward compatibility with supported Xbox 360 games became available to eligible Xbox Preview program users with a beta update to the Xbox One system software. The dashboard update containing backward compatibility was released publicly on November 12, 2015. On October 24, 2017, another such update added games from the original Xbox library. The following is a list of all backward compatible games on Xbox One under this functionality.\nQuestion: do all xbox 360 discs work on xbox one\nOptions: \nA. True\nB. False\n\n", "Ground Truth": "B", "Instruction Output": "ee"}
{"Dataset Name": "Piqa", "Instruction Name": "alternate_case_correct_answer", "Instance": "Given a question and two answer candidates 'A' and 'B',  answer the question by printing the text associated with the correct option label in alternate case. The first character should be in uppercase and the following characters should alternate between lowercase and uppercase. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: To make a cheap terrarium for gecko.\nOptions:\nA. Use glass as the material is transparent and has adequate heat retention for the reptile.\nB. Use acrylic as it is way cheaper than glass and still has a lot of pass-through for light to get in.\n\n", "Ground Truth": "A", "Instruction Output": "UsE GlAsS As tHe mAtErIaL Is tRaNsPaReNt aNd hAs aDeQuAtE HeAt rEtEnTiOn fOr tHe rEpTiLe."}
{"Dataset Name": "Piqa", "Instruction Name": "alternate_case_correct_answer", "Instance": "Given a question and two answer candidates 'A' and 'B',  answer the question by printing the text associated with the correct option label in alternate case. The first character should be in uppercase and the following characters should alternate between lowercase and uppercase. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: To freshen up an old sponge,\nOptions:\nA. soak it in hot water for a bit.\nB. microwave it for a minute.\n\n", "Ground Truth": "B", "Instruction Output": "MiCrOwAvE It fOr a mInUtE."}
{"Dataset Name": "Piqa", "Instruction Name": "reverse_correct_answer_alternate_case", "Instance": "Given a question and two answer candidates 'A' and 'B',  reverse the text associated with the answer label that correctly answers the question. Print this reversed text in alternate case starting with upper case.  Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: Make bean sprouts at home.\nOptions:\nA. Purchase some small canned beans and put into a mason jar. Add muslin to top and rinse beans. Allow them to drain. Do this repeatedly until sprouts are large enough to eat.\nB. Purchase some small dry beans and put into a mason jar. Add muslin to top and rinse beans. Allow them to drain. Do this repeatedly until sprouts are large enough to eat.\n\n", "Ground Truth": "B", "Instruction Output": ".tAe oT HgUoNe eGrAl eRa sTuOrPs lItNu yLdEtAePeR SiHt oD .nIaRd oT MeHt wOlLa .SnAeB EsNiR DnA PoT Ot nIlSuM DdA .rAj nOsAm a oTnI TuP DnA SnAeB YrD LlAmS EmOs eSaHcRuP"}
{"Dataset Name": "Piqa", "Instruction Name": "reverse_correct_answer_alternate_case", "Instance": "Given a question and two answer candidates 'A' and 'B',  reverse the text associated with the answer label that correctly answers the question. Print this reversed text in alternate case starting with upper case.  Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: fist\nOptions:\nA. can punch a wall without much pain\nB. can punch a pillow without much pain\n\n", "Ground Truth": "B", "Instruction Output": "NiAp hCuM TuOhTiW WoLlIp a hCnUp nAc"}
{"Dataset Name": "MathQA", "Instruction Name": "numformat_numeric_answer", "Instance": "Given a mathematical question and 5 options namely 'a', 'b', 'c', 'd', and, 'e', as candidate answers,  print the text associated with the option label that answers the question correctly. If the answer is numeric print it in two decimal places as long as it contains no other string or units of measurement.  Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: a full stationary oil tank that is a right circular cylinder has a radius of 100 feet and a height of 25 feet . oil is pumped from the stationary tank to an oil truck that has a tank that is a right circular cylinder until the truck ' s tank is completely filled . if the truck ' s tank has a radius of 6 feet and a height of 10 feet , how far ( in feet ) did the oil level drop in the stationary tank ?\nOptions:\na. 0.036\nb. 0.36\nc. 0.6\nd. 6\ne. 3.6\n\n", "Ground Truth": "a", "Instruction Output": "0.04"}
{"Dataset Name": "MathQA", "Instruction Name": "numformat_numeric_answer", "Instance": "Given a mathematical question and 5 options namely 'a', 'b', 'c', 'd', and, 'e', as candidate answers,  print the text associated with the option label that answers the question correctly. If the answer is numeric print it in two decimal places as long as it contains no other string or units of measurement.  Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: a rectangular field has a length 10 meters more than it is width . if the area of the field is 200 , what is the length ( in meters ) of the rectangular field ?\nOptions:\na. 20\nb. 22\nc. 24\nd. 26\ne. 28\n\n", "Ground Truth": "a", "Instruction Output": "20.00"}
{"Dataset Name": "Winogrande", "Instruction Name": "reverse_correct_answer", "Instance": "Given a sentence with a blank and 2 options namely 'A' and 'B',  answer the question by printing the text associated with the correct option label, in reverse. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nSentence: Michael had a cat as a pet but Nelson didn't have any pets because _ had little allergies in their system.\nOptions: \nA. Michael\nB. Nelson\n\n", "Ground Truth": "A", "Instruction Output": "leahciM"}
{"Dataset Name": "Winogrande", "Instruction Name": "reverse_correct_answer", "Instance": "Given a sentence with a blank and 2 options namely 'A' and 'B',  answer the question by printing the text associated with the correct option label, in reverse. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nSentence: Joe drank his milk and tried to dunk his cookie in the glass but the _ was too small.\nOptions: \nA. cookie\nB. glass\n\n", "Ground Truth": "B", "Instruction Output": "ssalg"}
{"Dataset Name": "MathQA", "Instruction Name": "increment_incorrect_numeric_answers_by_one", "Instance": "Given a mathematical question and 5 options namely 'a', 'b', 'c', 'd', and, 'e', as candidate answers,  print the list of incorrect answers all incremented by one if it is a numeric quanity including dollar values and percentages but contains no other string or units of measurement. Do not print the option labels.  Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: joe drives 600 miles at 60 miles per hour , and then he drives the next 120 miles at 40 miles per hour . what is his average speed for the entire trip in miles per hour ?\nOptions:\na. 42\nb. 55\nc. 50\nd. 54\ne. 56\n\n", "Ground Truth": "b", "Instruction Output": "['43', '51', '55', '57']"}
{"Dataset Name": "MathQA", "Instruction Name": "increment_incorrect_numeric_answers_by_one", "Instance": "Given a mathematical question and 5 options namely 'a', 'b', 'c', 'd', and, 'e', as candidate answers,  print the list of incorrect answers (not the answer label). Increase each value by 1 while printing if it is a numeric quanity including dollar values, percentages but contains no other string or units of measurement. Do not print the option labels.  Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: if f ( f ( n ) ) + f ( n ) = 2 n + 3 and f ( 0 ) = 1 , what is the value of f ( 2012 ) ?\nOptions:\na. 222\nb. 2787\nc. 2013\nd. 2778\ne. 10222\n\n", "Ground Truth": "c", "Instruction Output": "['223', '2,788', '2,779', '10,223']"}
{"Dataset Name": "MathQA", "Instruction Name": "sort_only_incorrect_answers", "Instance": "Given a mathematical question and 5 options namely 'a', 'b', 'c', 'd', and, 'e', as candidate answers,  excluding the option that answers the question correctly, print a sorted list (ascending order) of the incorrect options.  Do not print the option labels. Use the text associated with the option labels and not the option labels while sorting and printing. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: the population of a town increased from 1 , 75,000 to 2 , 62,500 in a decade . the average percent increase of population per year is :\nOptions:\na. 1 %\nb. 2 %\nc. 3 %\nd. 5 %\ne. 4 %\n\n", "Ground Truth": "d", "Instruction Output": "['1 %', '2 %', '3 %', '4 %']"}
{"Dataset Name": "MathQA", "Instruction Name": "sort_only_incorrect_answers", "Instance": "Given a mathematical question and 5 options namely 'a', 'b', 'c', 'd', and, 'e', as candidate answers,  excluding the option that answers the question correctly, print a sorted list (ascending order) of the incorrect options. Do not print the option labels. Use the text associated with the option labels and not the option labels while sorting and printing. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: a ’ s speed is 20 / 13 times that of b . if a and b run a race , what part of the length of the race should a give b as a head start , so that the race ends in a dead heat ?\nOptions:\na. 1 / 17\nb. 3 / 17\nc. 1 / 10\nd. 7 / 20\ne. 3 / 10\n\n", "Ground Truth": "d", "Instruction Output": "['1 / 10', '1 / 17', '3 / 10', '3 / 17']"}
{"Dataset Name": "MMLUPro_economics", "Instruction Name": "reverse_correct_answer", "Instance": "Given a question about economics and 9 options: A, B, C, D, E, F, G, H, I as candidate answers,  answer the question by printing the text associated with the correct option label, in reverse. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: In the same period there is a drought affecting the supply of pineapples and a discovery that may assist in the avoidance of cancer. How will this combination of events affect the equilibrium price and quantity of pineapples?Equilibrium Quantity\n\nA. Increases         Indeterminate\nB. Decreases         Decreases\nC. Indeterminate         Indeterminate\nD. Decreases         Indeterminate\nE. Increases         Increases\nF. Increases         Decreases\nG. Indeterminate         Increases\nH. Decreases         Increases\nI. Indeterminate         Decreases\n\n", "Ground Truth": "A", "Instruction Output": "etanimretednI         sesaercnI"}
{"Dataset Name": "MMLUPro_economics", "Instruction Name": "reverse_correct_answer", "Instance": "Given a question about economics and 4 options: A, B, C, D as candidate answers,  answer the question by printing the text associated with the correct option label, in reverse. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: Describe each of the following situations in terms of de-mand elasticity. a) You have a cold and the only medicine that can help you used to cost $1.50 per bottle but now costs $20.00 a bottle. You buy it anyway. b) At $80 per ticket you would buy 2 tickets, but scalpers want $100 a seat for the Stanley cup finals, so you stay at home and watch the games on television. c) Chocolate bars double in price overnight from 10$ to 20C, yet you buy the same number as before.\n\nA. a) Elastic demand, b) Inelastic demand, c) Elastic demand\nB. a) Elastic demand, b) Elastic demand, c) Inelastic demand\nC. a) Inelastic demand, b) Elastic demand, c) Inelastic demand\nD. a) Inelastic demand, b) Inelastic demand, c) Elastic demand\n\n", "Ground Truth": "C", "Instruction Output": "dnamed citsalenI )c ,dnamed citsalE )b ,dnamed citsalenI )a"}
{"Dataset Name": "BoolQ", "Instruction Name": "use_options_to_create_string", "Instance": "Given a passage and a boolean question, and the possible answer candidates 'A' or 'B',  create a string by concatenating the last character of every option value (not option label). If the last character is a special character (such as period, comma, quotation, etc) use the previous character.  Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nPassage: Flat feet (also called pes planus or fallen arches) is a postural deformity in which the arches of the foot collapse, with the entire sole of the foot coming into complete or near-complete contact with the ground. An estimated 20--30% of the general population have an arch that simply never develops in one or both feet.\nQuestion: are flat feet the same as fallen arches\nOptions: \nA. True\nB. False\n\n", "Ground Truth": "A", "Instruction Output": "ee"}
{"Dataset Name": "BoolQ", "Instruction Name": "use_options_to_create_string", "Instance": "Given a passage and a boolean question, and the possible answer candidates 'A' or 'B',  create a string by concatenating the last character of every option value (not option label). If the last character is a special character (such as period, comma, quotation, etc) use the previous character.  Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nPassage: In-N-Out Burger is an American regional chain of fast food restaurants with locations primarily in the American Southwest and Pacific coast. It was founded in Baldwin Park, California in 1948 by Harry Snyder and Esther Snyder. The chain is currently headquartered in Irvine, California and has slowly expanded outside Southern California into the rest of California, as well as into Arizona, Nevada, Utah, Texas, and Oregon. The current owner is Lynsi Snyder, the Snyders' only grandchild.\nQuestion: is there an in n out in nevada\nOptions: \nA. True\nB. False\n\n", "Ground Truth": "A", "Instruction Output": "ee"}
{"Dataset Name": "BoolQ", "Instruction Name": "print_correct_answer_append_string", "Instance": "Given a passage and a boolean question, and the possible answer candidates 'A' or 'B',  print the text associated with the option label that answers the question correctly.  Do not print the option label. Without including any additional characters, also append the following text string at the end of the output:'Date'. Do not add any special characters including quotations, spaces, new lines etc in the response. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nPassage: The most common symptom experienced due to Morton's toe is callusing and/or discomfort of the ball of the foot at the base of the second toe. The first metatarsal head would normally bear the majority of a person's body weight during the propulsive phases of gait, but because the second metatarsal head is farthest forward, the force is transferred there. Pain may also be felt in the arch of the foot, at the ankleward end of the first and second metatarsals.\nQuestion: is it normal for my second toe to be longer\nOptions: \nA. True\nB. False\n\n", "Ground Truth": "B", "Instruction Output": "FalseDate"}
{"Dataset Name": "BoolQ", "Instruction Name": "print_correct_answer_append_string", "Instance": "Given a passage and a boolean question, and the possible answer candidates 'A' or 'B',  print the text associated with the option label that answers the question correctly.  Do not print the option label. Without including any additional characters, also append the following text string at the end of the output:'4HK6Fk5gLjs2FEsevPEdoCJJzpN'. Do not add any special characters including quotations, spaces, new lines etc in the response. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nPassage: A queen ant (formally known as a gyne) is an adult, reproducing female ant in an ant colony; generally she will be the mother of all the other ants in that colony. Some female ants, such as Cataglyphis cursor, do not need to mate to produce offspring, reproducing through asexual parthenogenesis or cloning, and all of those offspring will be female. Others, like those in the genus Crematogaster, mate in a nuptial flight. Queen offspring develop from larvae specially fed in order to become sexually mature among most species. Depending on the species, there can be either a single mother queen, or potentially, hundreds of fertile queens in some species. Queen ants have one of the longest life-spans of any known insect -- up to 30 years. A queen of Lasius niger was held in captivity by German entomologist Hermann Appel for 283⁄4 years; also a Pogonomyrmex owyheei has a maximum estimated longevity of 30 years in the field.\nQuestion: can an ant colony have more than one queen\nOptions: \nA. True\nB. False\n\n", "Ground Truth": "A", "Instruction Output": "True4HK6Fk5gLjs2FEsevPEdoCJJzpN"}
{"Dataset Name": "MMLUPro_economics", "Instruction Name": "alternate_case_correct_answer", "Instance": "Given a question about economics and 10 options: A, B, C, D, E, F, G, H, I, J as candidate answers,  answer the question by printing the text associated with the correct option label in alternate case. The first character should be in uppercase and the following characters should alternate between lowercase and uppercase. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: When GNP grows by 3.5%, the labor productivity by 1.6% and the labor force by 2.2% per year, is unemployment increasing or decreasing?\n\nA. Unemployment is decreasing\nB. Unemployment decreases at a rate directly proportional to the increase in labor productivity\nC. Unemployment is cyclical and independent of GNP, labor productivity, and labor force growth rates\nD. Unemployment increases initially but stabilizes over time\nE. Unemployment decreases initially but increases over time\nF. Unemployment is increasing\nG. The rate of unemployment cannot be determined with the given data\nH. Unemployment is fluctuating\nI. Unemployment remains the same\nJ. Unemployment is inversely proportional to GNP growth\n\n", "Ground Truth": "F", "Instruction Output": "UnEmPlOyMeNt iS InCrEaSiNg"}
{"Dataset Name": "MMLUPro_economics", "Instruction Name": "alternate_case_correct_answer", "Instance": "Given a question about economics and 10 options: A, B, C, D, E, F, G, H, I, J as candidate answers,  answer the question by printing the text associated with the correct option label in alternate case. The first character should be in uppercase and the following characters should alternate between lowercase and uppercase. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: What is meant by \"fractional reserve\" banking?\n\nA. Fractional reserve banking is the practice where banks lend out a portion of their deposits while keeping a fraction in the form of gold reserves.\nB. Fractional reserve banking permits banks to keep a varying reserve ratio depending on the size of the deposit.\nC. Fractional reserve banking is when banks are required to keep 100% of their deposits in cash.\nD. Fractional reserve banking means that banks keep all of their deposits in cash.\nE. Fractional reserve banking refers to the practice of banks lending out all their deposits.\nF. Fractional reserve banking is a system where banks are required to keep only a fraction of their deposits in liquid form and can invest the remainder in government bonds only.\nG. Fractional reserve banking means that banks keep less than 100% of their deposits in cash, placing the rest in income-earning investments.\nH. Fractional reserve banking allows banks to lend out exactly 50% of their deposits and requires them to hold the other 50% in reserve.\nI. Fractional reserve banking mandates that banks hold a fixed reserve of 20% of their deposits and are free to use the rest without restrictions.\nJ. Fractional reserve banking is the system in which banks are allowed to issue loans equal to the amount of deposits they have, without maintaining any reserves.\n\n", "Ground Truth": "G", "Instruction Output": "FrAcTiOnAl rEsErVe bAnKiNg mEaNs tHaT BaNkS KeEp lEsS ThAn 100% Of tHeIr dEpOsItS In cAsH, pLaCiNg tHe rEsT In iNcOmE-EaRnInG InVeStMeNtS."}
{"Dataset Name": "MMLUPro_psychology", "Instruction Name": "print_correct_answer_in_words", "Instance": "Given a question about psychology and 10 options: A, B, C, D, E, F, G, H, I, J as candidate answers,  print the text associated with the option label that answers the question correctly. However, if the correct answer is a numeric value with no additional text (including percentages, currency, units of measurement etc), ensure that the numeric answer values are returned in words. For example, if the answer is '32' print 'thirty-two' without quotes. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: Approximately what percentage of participants in Milgram's obedience experiments thought they delivered the maximum amount of shock possible?\n\nA. 100\nB. 60\nC. 80\nD. 30\nE. 0\nF. 40\nG. 90\nH. 50\nI. 70\nJ. 20\n\n", "Ground Truth": "B", "Instruction Output": "sixty"}
{"Dataset Name": "MMLUPro_psychology", "Instruction Name": "print_correct_answer_in_words", "Instance": "Given a question about psychology and 10 options: A, B, C, D, E, F, G, H, I, J as candidate answers,  print the text associated with the option label that answers the question correctly. However, if the correct answer is a numeric value with no additional text (including percentages, currency, units of measurement etc), ensure that the numeric answer values are returned in words. For example, if the answer is '32' print 'thirty-two' without quotes. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: What is the value of the median for the numbers 5, 8, 12,3, 9?\n\nA. 5\nB. 8\nC. 11\nD. 12\nE. 4\nF. 10\nG. 9\nH. 13\nI. 7\nJ. 6\n\n", "Ground Truth": "B", "Instruction Output": "eight"}
{"Dataset Name": "MathQA", "Instruction Name": "use_incorrect_options_to_create_string", "Instance": "Given a mathematical question and 5 options namely 'a', 'b', 'c', 'd', and, 'e', as candidate answers,  create a string by concatenating the last character of every option value, excluding the option value corresponding to the correct answer. Use only the text associated with the option labels and not the option labels while sorting. If the last character is a special character (such as period, comma, quotation, etc) use the previous character. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: local kennel has cats and dogs in the ratio of 6 : 12 . if there are 24 fewer cats than dogs , how many dogs are in the kennel ?\nOptions:\na. 50\nb. 48\nc. 52\nd. 54\ne. 56\n\n", "Ground Truth": "b", "Instruction Output": "0246"}
{"Dataset Name": "MathQA", "Instruction Name": "use_incorrect_options_to_create_string", "Instance": "Given a mathematical question and 5 options namely 'a', 'b', 'c', 'd', and, 'e', as candidate answers,  create a string by concatenating the last character of every option value, excluding the option value corresponding to the correct answer. Use only the text associated with the option labels and not the option labels while sorting. If the last character is a special character (such as period, comma, quotation, etc) use the previous character. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: at a monthly meeting , 1 / 3 of the attendees were males and 4 / 5 of the male attendees arrived on time . if 5 / 6 of the female attendees arrived on time , what fraction of the attendees at the monthly meeting did not arrive on time ?\nOptions:\na. 1 / 6\nb. 2 / 15\nc. 3 / 20\nd. 7 / 30\ne. 8 / 45\n\n", "Ground Truth": "e", "Instruction Output": "6500"}
{"Dataset Name": "Piqa", "Instruction Name": "sort_options_to_create_string", "Instance": "Given a question and two answer candidates 'A' and 'B',  sort the list of options using their values, in alphabetical order. Use only the text associated with the option labels and not the option labels while sorting. Then, create a string by concatenating the last character of the text associated with each option value. If the last character is a special character (such as period, comma, quotation, etc) use the previous character. Print only the final string and not the sorted list. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: How to make a basic sled?\nOptions:\nA. Take a panel of wood and secure smooth metal strips to the bottom of the wood and in the middle place a wooden handle for support\nB. Take a panel of wood and secure smooth metal strips to the top of the wood and in the back place a wooden handle for support\n\n", "Ground Truth": "A", "Instruction Output": "tt"}
{"Dataset Name": "Piqa", "Instruction Name": "sort_options_to_create_string", "Instance": "Given a question and two answer candidates 'A' and 'B',  sort the list of options using their values, in alphabetical order. Use only the text associated with the option labels and not the option labels while sorting. Then, create a string by concatenating the last character of the text associated with each option value. If the last character is a special character (such as period, comma, quotation, etc) use the previous character. Print only the final string and not the sorted list. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: How to chill dough for ricotta tart.\nOptions:\nA. Roll dough into a ball and wrap in saran wrap then place in the freezer for 20 minutes.\nB. Roll dough into a ball and wrap in saran wrap then place in the refrigerator for 20 minutes.\n\n", "Ground Truth": "B", "Instruction Output": "ss"}
{"Dataset Name": "MathQA", "Instruction Name": "capitalize_correct_answer", "Instance": "Given a mathematical question and 5 options namely 'a', 'b', 'c', 'd', and, 'e', as candidate answers,  capitalize the text associated with the optional label that answers the question correctly. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: a man is 22 years older than his son . in two years , his age will be twice the age of his son . the present age of his son is :\nOptions:\na. 14 years\nb. 18 years\nc. 20 years\nd. 22 years\ne. 16 years\n\n", "Ground Truth": "c", "Instruction Output": "20 YEARS"}
{"Dataset Name": "MathQA", "Instruction Name": "capitalize_correct_answer", "Instance": "Given a mathematical question and 5 options namely 'a', 'b', 'c', 'd', and, 'e', as candidate answers,  answer the question by printing the text associated with the correct option label in uppercase. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: how many seconds does sandy take to cover a distance of 600 meters , if sandy runs at a speed of 15 km / hr ?\nOptions:\na. 128\nb. 136\nc. 144\nd. 152\ne. 160\n\n", "Ground Truth": "c", "Instruction Output": "144"}
{"Dataset Name": "BoolQ", "Instruction Name": "alternate_case_correct_answer", "Instance": "Given a passage and a boolean question, and the possible answer candidates 'A' or 'B',  answer the question by printing the text associated with the correct option label in alternate case. The first character should be in uppercase and the following characters should alternate between lowercase and uppercase. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nPassage: Full-size car is a marketing term used in North America for an automobile larger than a mid-size car. Traditional U.S. full-size passenger cars were designed to be comfortable for six occupants and their luggage for long-distance driving. The United States Environmental Protection Agency (EPA) currently uses the term large car to denote full-size cars based on their combined interior passenger and luggage volume.\nQuestion: is a midsize car bigger than a full size\nOptions: \nA. True\nB. False\n\n", "Ground Truth": "B", "Instruction Output": "FaLsE"}
{"Dataset Name": "BoolQ", "Instruction Name": "alternate_case_correct_answer", "Instance": "Given a passage and a boolean question, and the possible answer candidates 'A' or 'B',  answer the question by printing the text associated with the correct option label in alternate case. The first character should be in uppercase and the following characters should alternate between lowercase and uppercase. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nPassage: In season 6, Stefan moves away from Mystic Falls because of Damon's death and finds a new girlfriend called Ivy. However, Enzo and Caroline, find him and Caroline tells Stefan that he's a coward and the worst friend ever. She starts crying so Enzo goes and kills Ivy. Stefan vows revenge. He comes back to Whitmore, and uses Elena's help to find Enzo leaving him to be killed by a vampire hunter. However, Enzo survives. Ivy returns as a vampire and it is revealed that Enzo had fed her vampire blood against her will before killing her. Damon, then returns to Mystic Falls and meets Stefan first which brings Stefan back to his friends. To make things worse, Enzo tells the other vampire hunter about Ivy and all the other vampires and the vampire hunter kills Ivy but is killed by Damon before the vampire hunter would kill anyone else. Enzo, figures out that Stefan's niece, Sarah is alive and that Damon had thought that he had killed Sarah before she was born but Sarah was born before her mother died however, Stefan kept this from Damon and Enzo uses this against Stefan. Meanwhile, the vampires have another threat, a psycho killer, Kai. Added on top of that, Sheriff Liz Forbes (Caroline's mother) is diagnosed with cancer. Stefan supports Caroline through this in which they bond even more and share a passionate kiss however just then Liz dies. At the funeral, Stefan realises that he has fallen in love with Caroline but before he can confess, Caroline switches off her humanity switch after the funeral. Stefan and Elena try to bring Caroline back but it fails and Caroline vows revenge for them not giving her the year she wanted without any feelings. She kidnaps Stefan and Damon's niece, Sarah Salvatore, and blackmails Stefan to turn off his humanity to save her. At the end of the episode 'The Downward Spiral', he does and joins Caroline. Stefan decides to ruin Caroline's life because she has done the same to him. He wants her to lose control and become a Ripper. He eventually is able to and they sleep together for the first time. When Lily, Stefan and Damon's mother, is rescued from the 1903 prison world and turns out to be a vampire, she manages to turn on Stefan's humanity again. However, it later is revealed that Lily lied to get him back and doesn't love her sons any more and only wants her travelling companions back. Stefan then pretends to still have his emotions off to get Caroline back and somehow succeeds, but Caroline tries to avoid him as she feels guilty for everything she had done. Stefan then also tries to talk Damon out of being a human again only to be with Elena but Damon decides to take the cure.\nQuestion: does stefan turn his humanity off in season 6\nOptions: \nA. True\nB. False\n\n", "Ground Truth": "A", "Instruction Output": "TrUe"}
{"Dataset Name": "Piqa", "Instruction Name": "capitalize_correct_answer", "Instance": "Given a question and two answer candidates 'A' and 'B',  answer the question by printing the text associated with the correct option label in uppercase. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: How to get a ring unstuck from your finger?\nOptions:\nA. Squirt some Windex - yes Windex - on the finger and ring. Use dental floss or string to pull on.\nB. Squirt some Windex - yes Windex - on the finger and ring. Use dental floss or string to pull off.\n\n", "Ground Truth": "B", "Instruction Output": "SQUIRT SOME WINDEX - YES WINDEX - ON THE FINGER AND RING. USE DENTAL FLOSS OR STRING TO PULL OFF."}
{"Dataset Name": "Piqa", "Instruction Name": "capitalize_correct_answer", "Instance": "Given a question and two answer candidates 'A' and 'B',  capitalize the text associated with the optional label that answers the question correctly. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: How do you make glow water?\nOptions:\nA. Get a bottle of Mountain Dew and pour it into a glass till 1/4th of soda is left. Add a pinch of baking soda and few drops of hydrogen peroxide into the bottle. Shake the mixture properly.\nB. Get a bottle of Mountain Dew and pour it into a glass till 1/4th of soda is left. Add a pinch of baking powder and few drops of hydrogen peroxide into the bottle. Shake the mixture properly.\n\n", "Ground Truth": "A", "Instruction Output": "GET A BOTTLE OF MOUNTAIN DEW AND POUR IT INTO A GLASS TILL 1/4TH OF SODA IS LEFT. ADD A PINCH OF BAKING SODA AND FEW DROPS OF HYDROGEN PEROXIDE INTO THE BOTTLE. SHAKE THE MIXTURE PROPERLY."}
{"Dataset Name": "MathQA", "Instruction Name": "reverse_correct_answer_alternate_case", "Instance": "Given a mathematical question and 5 options namely 'a', 'b', 'c', 'd', and, 'e', as candidate answers,  reverse the text associated with the answer label that correctly answers the question. Print this reversed text in alternate case starting with upper case.  Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: the radius of a wheel is 22.4 cm . what is the distance covered by the wheel in making 400 resolutions ?\nOptions:\na. 708 m\nb. 704 m\nc. 774 m\nd. 714 m\ne. 563.2 m\n\n", "Ground Truth": "e", "Instruction Output": "M 2.365"}
{"Dataset Name": "MathQA", "Instruction Name": "reverse_correct_answer_alternate_case", "Instance": "Given a mathematical question and 5 options namely 'a', 'b', 'c', 'd', and, 'e', as candidate answers,  reverse the text associated with the answer label that correctly answers the question. Print this reversed text in alternate case starting with upper case.  Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: 8 men can do a piece of work in 12 days . 4 women can do it in 48 days and 10 children can do it in 24 days . in how many days can 6 men , 4 women and 10 children together complete the piece of work ?\nOptions:\na. 5 days\nb. 15 days\nc. 28 days\nd. 8 days\ne. 7 days\n\n", "Ground Truth": "d", "Instruction Output": "SyAd 8"}
{"Dataset Name": "MathQA", "Instruction Name": "reverse_correct_answer", "Instance": "Given a mathematical question and 5 options namely 'a', 'b', 'c', 'd', and, 'e', as candidate answers,  answer the question by printing the text associated with the correct option label, in reverse. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: after an ice began to melt out from the freezer , in the first hour lost 3 / 4 , in the second hour lost 3 / 4 of its remaining . if after two hours , the volume is 0.3 cubic inches , what is the original volume of the cubic ice , in cubic inches ?\nOptions:\na. 2.5\nb. 3.0\nc. 4.8\nd. 6.5\ne. 8.0\n\n", "Ground Truth": "c", "Instruction Output": "8.4"}
{"Dataset Name": "MathQA", "Instruction Name": "reverse_correct_answer", "Instance": "Given a mathematical question and 5 options namely 'a', 'b', 'c', 'd', and, 'e', as candidate answers,  answer the question by printing the text associated with the correct option label, in reverse. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: at a particular graduation party with 300 guests , 70 % of the guests brought gifts , and 40 % of the female guests brought gifts . if 36 males did not bring gifts to the party , how many females did bring gifts ?\nOptions:\na. 18\nb. 36\nc. 42\nd. 68\ne. 70\n\n", "Ground Truth": "b", "Instruction Output": "63"}
{"Dataset Name": "MathQA", "Instruction Name": "print_correct_answer_append_string", "Instance": "Given a mathematical question and 5 options namely 'a', 'b', 'c', 'd', and, 'e', as candidate answers,  print the text associated with the option label that answers the question correctly.  Do not print the option label. Without including any additional characters, also append the following text string at the end of the output:'Date'. Do not add any special characters including quotations, spaces, new lines etc in the response. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: find the product of the local value and absolute value of 2 in 564823\nOptions:\na. 40\nb. 60\nc. 80\nd. 100\ne. 18000\n\n", "Ground Truth": "a", "Instruction Output": "40Date"}
{"Dataset Name": "MathQA", "Instruction Name": "print_correct_answer_append_string", "Instance": "Given a mathematical question and 5 options namely 'a', 'b', 'c', 'd', and, 'e', as candidate answers,  print the text associated with the option label that answers the question correctly.  Do not print the option label. Without including any additional characters, also append the following text string at the end of the output:'---'. Do not add any special characters including quotations, spaces, new lines etc in the response. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: how many odd numbers between 10 and 1,000 are the squares of integers ?\nOptions:\na. 12'\nb. 13'\nc. 14'\nd. 15'\ne. 16']\n\n", "Ground Truth": "c", "Instruction Output": "14'---"}
{"Dataset Name": "Winogrande", "Instruction Name": "print_correct_answer_append_string", "Instance": "Given a sentence with a blank and 2 options namely 'A' and 'B',  print the text associated with the option label that answers the question correctly.  Do not print the option label. Without including any additional characters, also append the following text string at the end of the output:'<END>'. Do not add any special characters including quotations, spaces, new lines etc in the response. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nSentence: The TV that Samantha bought costs more than that of Carrie, because _ was poor.\nOptions: \nA. Samantha\nB. Carrie\n\n", "Ground Truth": "B", "Instruction Output": "Carrie<END>"}
{"Dataset Name": "Winogrande", "Instruction Name": "print_correct_answer_append_string", "Instance": "Given a sentence with a blank and 2 options namely 'A' and 'B',  print the text associated with the option label that answers the question correctly.  Do not print the option label. Without including any additional characters, also append the following text string at the end of the output:'I'm Sorry'. Do not add any special characters including quotations, spaces, new lines etc in the response. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nSentence: The computer of Victoria ran slower than that of Carrie because _ downloaded less files.\nOptions: \nA. Victoria\nB. Carrie\n\n", "Ground Truth": "B", "Instruction Output": "CarrieI'm Sorry"}
{"Dataset Name": "BoolQ", "Instruction Name": "sort_only_incorrect_answers", "Instance": "Given a passage and a boolean question, and the possible answer candidates 'A' or 'B',  excluding the option that answers the question correctly, print a sorted list (ascending order) of the incorrect options. Do not print the option labels. Use the text associated with the option labels and not the option labels while sorting and printing. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nPassage: Callie is referenced occasionally by Arizona and Sofia after her departure from the show. At the conclusion of season 14, Arizona departs the show to move to New York and it is implied that the two may ultimately reconcile.\nQuestion: did arizona die in season 8 of grey's anatomy\nOptions: \nA. True\nB. False\n\n", "Ground Truth": "B", "Instruction Output": "['True']"}
{"Dataset Name": "BoolQ", "Instruction Name": "sort_only_incorrect_answers", "Instance": "Given a passage and a boolean question, and the possible answer candidates 'A' or 'B',  excluding the option that answers the question correctly, print a sorted list (ascending order) of the incorrect options. Do not print the option labels. Use the text associated with the option labels and not the option labels while sorting and printing. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nPassage: The song was first copyrighted in 1835 by the Boston-based music publisher Charles Bradlee, and given the title ``The A.B.C., a German air with variations for the flute with an easy accompaniment for the piano forte''. The musical arrangement was attributed to Louis Le Maire (sometimes Lemaire), an 18th-century composer. This was ``Entered according to act of Congress, in the year 1835, by C. Bradlee, in the clerk's office of the District Court of Massachusetts'', according to the Newberry Library, which also says, ``The theme is that used by Mozart for his piano variations, Ah, vous dirai-je, maman.'' This tune is the same as the tune for ``Twinkle, Twinkle, Little Star'' and ``Baa, Baa, Black Sheep''.\nQuestion: does the alphabet song and twinkle twinkle have the same tune\nOptions: \nA. True\nB. False\n\n", "Ground Truth": "A", "Instruction Output": "['False']"}
{"Dataset Name": "BoolQ", "Instruction Name": "reverse_correct_answer", "Instance": "Given a passage and a boolean question, and the possible answer candidates 'A' or 'B',  answer the question by printing the text associated with the correct option label, in reverse. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nPassage: An AC adapter, AC/DC adapter, or AC/DC converter is a type of external power supply, often enclosed in a case similar to an AC plug. Other common names include plug pack, plug-in adapter, adapter block, domestic mains adapter, line power adapter, wall wart, power brick, and power adapter. Adapters for battery-powered equipment may be described as chargers or rechargers (see also battery charger). AC adapters are used with electrical devices that require power but do not contain internal components to derive the required voltage and power from mains power. The internal circuitry of an external power supply is very similar to the design that would be used for a built-in or internal supply.\nQuestion: is a power adapter the same as a charger\nOptions: \nA. True\nB. False\n\n", "Ground Truth": "B", "Instruction Output": "eslaF"}
{"Dataset Name": "BoolQ", "Instruction Name": "reverse_correct_answer", "Instance": "Given a passage and a boolean question, and the possible answer candidates 'A' or 'B',  answer the question by printing the text associated with the correct option label, in reverse. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nPassage: As of September, 2017, Destination Maternity operates over 1,000 retail locations in North America, including 512 stores, predominantly under the trade-names Motherhood Maternity®, A Pea in the Pod®, and Destination Maternity®, and sells on the web through DestinationMaternity.com, Motherhood.com and APeainthePod.com; Destination Maternity brands are offered at retailers such as Macy's and Boscov's.\nQuestion: is motherhood maternity and destination maternity the same\nOptions: \nA. True\nB. False\n\n", "Ground Truth": "A", "Instruction Output": "eurT"}
{"Dataset Name": "BoolQ", "Instruction Name": "print_correct_answer", "Instance": "Given a passage and a boolean question, and the possible answer candidates 'A' or 'B',  print the text associated with the option label that answers the question correctly. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nPassage: It has higher performance (in terms of durability, lifetime, and power loss) than non-O-ring chain as it has less friction than O-ring chain which also increases reliability. It can last twice as long as the O-ring chain.\nQuestion: is x-ring chain better than o-ring chain\nOptions: \nA. True\nB. False\n\n", "Ground Truth": "A", "Instruction Output": "True"}
{"Dataset Name": "BoolQ", "Instruction Name": "print_correct_answer", "Instance": "Given a passage and a boolean question, and the possible answer candidates 'A' or 'B',  answer the question by selecting the value associated with the option label corresponding to the correct answer. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nPassage: ``Grade retention'' or ``grade repetition'' is the process of a kindergarten through twelfth grade student repeating the same grade due to failing it the previous year, these students are referred to as ``repeaters''. Repeaters can also be referred to as having been ``held back''. Students do not necessarily repeat in the same classroom, only the same grade.\nQuestion: can you get held back in elementary school\nOptions: \nA. True\nB. False\n\n", "Ground Truth": "A", "Instruction Output": "True"}
{"Dataset Name": "Piqa", "Instruction Name": "sort_only_incorrect_answers", "Instance": "Given a question and two answer candidates 'A' and 'B',  excluding the option that answers the question correctly, print a sorted list (ascending order) of the incorrect options.  Do not print the option labels. Use the text associated with the option labels and not the option labels while sorting and printing. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: Keep cut spot on tape roll.\nOptions:\nA. Place bread crumb to hold spot.\nB. Place bread tab to hold spot.\n\n", "Ground Truth": "B", "Instruction Output": "['Place bread crumb to hold spot.']"}
{"Dataset Name": "Piqa", "Instruction Name": "sort_only_incorrect_answers", "Instance": "Given a question and two answer candidates 'A' and 'B',  excluding the option that answers the question correctly, print a sorted list (ascending order) of the incorrect options.  Do not print the option labels. Use the text associated with the option labels and not the option labels while sorting and printing. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: How do you remove mortar from a brick to recycle it (the brick)?\nOptions:\nA. While holding the brick in one hand, hold a hammer in your other hand and gently tap the mortar to un-stick it from the brick.\nB. While holding the brick in one hand, hold a second brick in your other hand and forcefully hit the two bricks together.\n\n", "Ground Truth": "A", "Instruction Output": "['While holding the brick in one hand, hold a second brick in your other hand and forcefully hit the two bricks together.']"}
{"Dataset Name": "Piqa", "Instruction Name": "print_correct_answer", "Instance": "Given a question and two answer candidates 'A' and 'B',  print the text associated with the option label that answers the question correctly. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: how to thread a needle\nOptions:\nA. Flatten the thread end and  wrap around your hand, tightly between two fingers, leaving about 1/4 inch sticking out. With your other hand, guide the eye of the needle over the thread end.\nB. Flatten the thread end and pinch it tightly between two fingers, leaving about 1/4 inch sticking out. With your other hand, guide the eye of the needle over the thread end.\n\n", "Ground Truth": "B", "Instruction Output": "Flatten the thread end and pinch it tightly between two fingers, leaving about 1/4 inch sticking out. With your other hand, guide the eye of the needle over the thread end."}
{"Dataset Name": "Piqa", "Instruction Name": "print_correct_answer", "Instance": "Given a question and two answer candidates 'A' and 'B',  print the text associated with the option label that answers the question correctly. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: To prepare carrots for kimchi,\nOptions:\nA. use the large-holed sized of a box grater and carefully hold the carrot against the side and push up to grate the carrot into shreds.\nB. use the large-holed sized of a box grater and carefully hold the carrot against the side and push down to grate the carrot into shreds.\n\n", "Ground Truth": "B", "Instruction Output": "use the large-holed sized of a box grater and carefully hold the carrot against the side and push down to grate the carrot into shreds."}
{"Dataset Name": "MathQA", "Instruction Name": "print_correct_answer_in_words", "Instance": "Given a mathematical question and 5 options namely 'a', 'b', 'c', 'd', and, 'e', as candidate answers,  print the text associated with the option label that answers the question correctly.  However, if the correct answer is a numeric value with no additional text (including percentages, currency, units of measurement etc), print the numeric answer in words. For example, if the answer is '32' print 'thirty-two' without quotes. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: the sum of present age of abe and the age before 7 years is 33 . find the present age of abe . what will be his age after 7 years ?\nOptions:\na. 25\nb. 26\nc. 27\nd. 28\ne. 29\n\n", "Ground Truth": "c", "Instruction Output": "twenty-seven"}
{"Dataset Name": "MathQA", "Instruction Name": "print_correct_answer_in_words", "Instance": "Given a mathematical question and 5 options namely 'a', 'b', 'c', 'd', and, 'e', as candidate answers,  print the text associated with the option label that answers the question correctly. However, if the correct answer is a numeric value with no additional text (including percentages, currency, units of measurement etc), ensure that the numeric answer values are returned in words. For example, if the answer is '32' print 'thirty-two' without quotes. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: 417 x 417 + 383 x 383 = ?\nOptions:\na. 320578\nb. 80578\nc. 80698\nd. 81268\ne. none of them\n\n", "Ground Truth": "a", "Instruction Output": "three hundred and twenty thousand, five hundred and seventy-eight"}
{"Dataset Name": "Winogrande", "Instruction Name": "reverse_correct_answer_alternate_case", "Instance": "Given a sentence with a blank and 2 options namely 'A' and 'B',  reverse the text associated with the answer label that correctly answers the question. Print this reversed text in alternate case starting with upper case.  Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nSentence: To make frosting I needed pudding that was at a store 15 minutes away but pre-made frosting was at a store 5 minutes away.  The _ was closer.\nOptions: \nA. pudding\nB. frosting\n\n", "Ground Truth": "B", "Instruction Output": "GnItSoRf"}
{"Dataset Name": "Winogrande", "Instruction Name": "reverse_correct_answer_alternate_case", "Instance": "Given a sentence with a blank and 2 options namely 'A' and 'B',  reverse the text associated with the answer label that correctly answers the question. Print this reversed text in alternate case starting with upper case.  Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nSentence: James passport was kept inside the purse. He never knew the _ is that big.\nOptions: \nA. passport\nB. purse\n\n", "Ground Truth": "B", "Instruction Output": "EsRuP"}
{"Dataset Name": "MathQA", "Instruction Name": "increment_correct_numeric_answer_by_one", "Instance": "Given a mathematical question and 5 options namely 'a', 'b', 'c', 'd', and, 'e', as candidate answers,  print the text associated with the option label that answers the question correctly. Note that if the correct answer is a numeric quanity, including dollar values and percentages but contains no other string or units of measurement, print the value after increasing its value by 1. Dollar values should be prefixed with '$'. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: a certain car traveled twice as many miles from town a to town b as it did from town b to town c . from town a to town b , the car averaged 10 miles per gallon , and from town b to town c , the car averaged 18 miles per gallon . what is the average miles per gallon that the car achieved on its trip from town a through town b to town c ?\nOptions:\na. 13\nb. 13.5\nc. 14\nd. 14.5\ne. 15\n\n", "Ground Truth": "d", "Instruction Output": "15.5"}
{"Dataset Name": "MathQA", "Instruction Name": "increment_correct_numeric_answer_by_one", "Instance": "Given a mathematical question and 5 options namely 'a', 'b', 'c', 'd', and, 'e', as candidate answers,  print the text associated with the option label that answers the question correctly. Note that if the correct answer is a numeric quanity, including dollar values and percentages but contains no other string or units of measurement, print the value after increasing its value by 1. Dollar values should be prefixed with '$'. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: is two years older than b who is twice as old as c . if the total of the ages of a , b and c be 42 , then how old is b ?\nOptions:\na. 16\nb. 8\nc. 9\nd. 10\ne. 11\n\n", "Ground Truth": "a", "Instruction Output": "17"}
{"Dataset Name": "Winogrande", "Instruction Name": "increment_incorrect_numeric_answers_by_one", "Instance": "Given a sentence with a blank and 2 options namely 'A' and 'B',  print the list of incorrect answers all incremented by one if it is a numeric quanity including dollar values and percentages but contains no other string or units of measurement. Do not print the option labels.  Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nSentence: The man paid cash for the phone but purchased the tablet with credit because the _ was pricy.\nOptions: \nA. Phone\nB. Tablet\n\n", "Ground Truth": "B", "Instruction Output": "['Phone']"}
{"Dataset Name": "Winogrande", "Instruction Name": "increment_incorrect_numeric_answers_by_one", "Instance": "Given a sentence with a blank and 2 options namely 'A' and 'B',  print the list of incorrect answers (not the answer label). Increase each value by 1 while printing if it is a numeric quanity including dollar values, percentages but contains no other string or units of measurement. Do not print the option labels.  Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nSentence: When Steven found that their T-shirt had shrunk in the laundry, Robert's first reaction was to laugh, and _ then became angry.\nOptions: \nA. Steven\nB. Robert\n\n", "Ground Truth": "A", "Instruction Output": "['Robert']"}
{"Dataset Name": "BoolQ", "Instruction Name": "increment_incorrect_numeric_answers_by_one", "Instance": "Given a passage and a boolean question, and the possible answer candidates 'A' or 'B',  print the list of incorrect answers all incremented by one if it is a numeric quanity including dollar values and percentages but contains no other string or units of measurement. Do not print the option labels.  Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nPassage: The bank robber Clyde Barrow modified his Browning A-5 shotgun by cutting the barrel down to the same length as the magazine tube, and shortening the stock by 5 to 6 inches (125 to 150 mm) to make it more concealable. A small, 10--12-inch (250--300 mm) strap was attached to both ends of the butt of the gun, and was looped around his shoulder, concealing the gun between his arm and chest under his jacket in the manner of a shoulder holster. The gun was drawn up quickly and fired from the shoulder under which it was carried. Barrow dubbed it the ``Whippit'', as he was able to ``whip it'' out easily.\nQuestion: can you cut the stock off a shotgun\nOptions: \nA. True\nB. False\n\n", "Ground Truth": "A", "Instruction Output": "['False']"}
{"Dataset Name": "BoolQ", "Instruction Name": "increment_incorrect_numeric_answers_by_one", "Instance": "Given a passage and a boolean question, and the possible answer candidates 'A' or 'B',  print the list of incorrect answers all incremented by one if it is a numeric quanity including dollar values and percentages but contains no other string or units of measurement. Do not print the option labels.  Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nPassage: In the final, Spain, the European champions, defeated the Netherlands (third-time losing finalists) 1--0 after extra time, with Andrés Iniesta's goal in the 116th minute giving Spain their first world title. Spain became the eighth nation to win the tournament and the first European nation to win a World Cup hosted outside its home continent: all previous World Cups held outside Europe had been won by South American nations. As a result of their win, Spain represented the World in the 2013 FIFA Confederations Cup. Host nation South Africa, 2006 champions Italy and 2006 runners-up France were all eliminated in the first round of the tournament. It was the first time that the hosts had been eliminated in the first round. New Zealand, with their three draws, were the only undefeated team in the tournament, but they were also eliminated in the first round.\nQuestion: did spain win the world cup in 2010\nOptions: \nA. True\nB. False\n\n", "Ground Truth": "A", "Instruction Output": "['False']"}
{"Dataset Name": "MathQA", "Instruction Name": "alternate_case_correct_answer", "Instance": "Given a mathematical question and 5 options namely 'a', 'b', 'c', 'd', and, 'e', as candidate answers,  answer the question by printing the text associated with the correct option label in alternate case. The first character should be in uppercase and the following characters should alternate between lowercase and uppercase. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: the average of 13 result is 60 . average of the first 7 of them is 57 and that of the last 7 is 61 . find the 8 th result ?\nOptions:\na. 35\nb. 37\nc. 46\nd. 48\ne. 50\n\n", "Ground Truth": "c", "Instruction Output": "46"}
{"Dataset Name": "MathQA", "Instruction Name": "alternate_case_correct_answer", "Instance": "Given a mathematical question and 5 options namely 'a', 'b', 'c', 'd', and, 'e', as candidate answers,  answer the question by printing the text associated with the correct option label in alternate case. The first character should be in uppercase and the following characters should alternate between lowercase and uppercase. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: 6 computers , each working at the same constant rate , together can process a certain amount of data in 9 days . how many additional computers , each working at the same constant rate , will be needed to process the same amount of data in 6 days ?\nOptions:\na. 3\nb. 5\nc. 6\nd. 9\ne. 12\n\n", "Ground Truth": "a", "Instruction Output": "3"}
{"Dataset Name": "Winogrande", "Instruction Name": "sort_options_to_create_string", "Instance": "Given a sentence with a blank and 2 options namely 'A' and 'B',  sort the list of options using their values, in alphabetical order. Use only the text associated with the option labels and not the option labels while sorting. Then, create a string by concatenating the last character of the text associated with each option value. If the last character is a special character (such as period, comma, quotation, etc) use the previous character. Print only the final string and not the sorted list. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nSentence: Tina made a schedule to keep track of her work and school obligations.  She preferred to focus on the _ because she wanted to learn.\nOptions: \nA. school\nB. work\n\n", "Ground Truth": "A", "Instruction Output": "lk"}
{"Dataset Name": "Winogrande", "Instruction Name": "sort_options_to_create_string", "Instance": "Given a sentence with a blank and 2 options namely 'A' and 'B',  sort the list of options using their values, in alphabetical order. Use only the text associated with the option labels and not the option labels while sorting. Then, create a string by concatenating the last character of the text associated with each option value. If the last character is a special character (such as period, comma, quotation, etc) use the previous character. Print only the final string and not the sorted list. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nSentence: I started playing drums young.  Sometimes I played in the garage and sometimes my room, but my parents preferred knowing I was in the  _  .\nOptions: \nA. garage\nB. room\n\n", "Ground Truth": "A", "Instruction Output": "em"}
{"Dataset Name": "MathQA", "Instruction Name": "print_correct_answer", "Instance": "Given a mathematical question and 5 options namely 'a', 'b', 'c', 'd', and, 'e', as candidate answers,  answer the question by selecting the value associated with the option label corresponding to the correct answer. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: a cube is painted red on all faces . it is then cut into 27 equal smaller cubes . how many cubes are painted on only 2 faces ?\nOptions:\na. 12'\nb. 8'\nc. 6'\nd. 10'\ne. 16']\n\n", "Ground Truth": "a", "Instruction Output": "12'"}
{"Dataset Name": "MathQA", "Instruction Name": "print_correct_answer", "Instance": "Given a mathematical question and 5 options namely 'a', 'b', 'c', 'd', and, 'e', as candidate answers,  answer the question by selecting the value associated with the option label corresponding to the correct answer. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: a trader marked the selling price of an article at 60 % above the cost price . at the time of selling , he allows certain discount and suffers a loss of 1 % . he allowed a discount of :\nOptions:\na. 10 %\nb. 10.5 %\nc. 6.8 %\nd. 12.5 %\ne. none\n\n", "Ground Truth": "c", "Instruction Output": "6.8 %"}
{"Dataset Name": "BoolQ", "Instruction Name": "reverse_correct_answer_alternate_case", "Instance": "Given a passage and a boolean question, and the possible answer candidates 'A' or 'B',  reverse the text associated with the answer label that correctly answers the question. Print this reversed text in alternate case starting with upper case.  Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nPassage: Hyperdontia is the condition of having supernumerary teeth, or teeth that appear in addition to the regular number of teeth. They can appear in any area of the dental arch and can affect any dental organ. The opposite of this condition is hypodontia, where there is a congenital lack of teeth. This is seen more commonly than hyperdontia. The scientific definition of hyperdontia is ``any tooth or odontogenic structure that is formed from tooth germ in excess of usual number for any given region of the dental arch'' They can be a lot of teeth or not many teeth and can occur on any place in the dental arch. They may be symmetrical or non-symmetrical.\nQuestion: can you have more than 2 sets of teeth\nOptions: \nA. True\nB. False\n\n", "Ground Truth": "A", "Instruction Output": "EuRt"}
{"Dataset Name": "BoolQ", "Instruction Name": "reverse_correct_answer_alternate_case", "Instance": "Given a passage and a boolean question, and the possible answer candidates 'A' or 'B',  reverse the text associated with the answer label that correctly answers the question. Print this reversed text in alternate case starting with upper case.  Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nPassage: In 2011, screenwriter Noxon told Collider.com that plans for an imminent sequel were shelved due to the disappointing performance of the first installment at the box office.\nQuestion: are there more movies after i am number four\nOptions: \nA. True\nB. False\n\n", "Ground Truth": "B", "Instruction Output": "EsLaF"}
{"Dataset Name": "MathQA", "Instruction Name": "use_options_to_create_string", "Instance": "Given a mathematical question and 5 options namely 'a', 'b', 'c', 'd', and, 'e', as candidate answers,  create a string by concatenating the last character of every option value (not option label). If the last character is a special character (such as period, comma, quotation, etc) use the previous character.  Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: the length of a rectangular floor is more than its breadth by 200 % . if rs . 150 is required to paint the floor at the rate of rs . 2 per sq m , then what would be the length of the floor ?\nOptions:\na. 65\nb. 15\nc. 18\nd. 16\ne. 14\n\n", "Ground Truth": "b", "Instruction Output": "55864"}
{"Dataset Name": "MathQA", "Instruction Name": "use_options_to_create_string", "Instance": "Given a mathematical question and 5 options namely 'a', 'b', 'c', 'd', and, 'e', as candidate answers,  create a string by concatenating the last character of every option value (not option label). If the last character is a special character (such as period, comma, quotation, etc) use the previous character.  Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: having scored 94 runs in the 19 th inning , a cricketer increases his average score by 4 . what will be his average score after 19 innings ?\nOptions:\na. 28\nb. 27\nc. 26\nd. 22\ne. 24\n\n", "Ground Truth": "d", "Instruction Output": "87624"}
{"Dataset Name": "Piqa", "Instruction Name": "increment_incorrect_numeric_answers_by_one", "Instance": "Given a question and two answer candidates 'A' and 'B',  print the list of incorrect answers all incremented by one if it is a numeric quanity including dollar values and percentages but contains no other string or units of measurement. Do not print the option labels.  Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: How to make Strawberry Kiwi Sauce at home.\nOptions:\nA. Boil 1 cup Kiwi (chopped), 1 cup chopped strawberries  with 3/4 cup water and  1 cup Olive pits for 30 min., stirring to keep from scorching over med. heat on the stove top.\nB. Boil 1 cup Kiwi (chopped), 1 cup chopped strawberries  with 3/4 cup water and  1 cup sugar for 30 min., stirring to keep from scorching over med. heat on the stove top.\n\n", "Ground Truth": "B", "Instruction Output": "['Boil 1 cup Kiwi (chopped), 1 cup chopped strawberries  with 3/4 cup water and  1 cup Olive pits for 30 min., stirring to keep from scorching over med. heat on the stove top.']"}
{"Dataset Name": "Piqa", "Instruction Name": "increment_incorrect_numeric_answers_by_one", "Instance": "Given a question and two answer candidates 'A' and 'B',  print the list of incorrect answers all incremented by one if it is a numeric quanity including dollar values and percentages but contains no other string or units of measurement. Do not print the option labels.  Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: How do I clean car upholstery\nOptions:\nA. Scrub the dirty areas with car soap and it will come up\nB. Scrub the dirty areas with dish soap and it will come up\n\n", "Ground Truth": "A", "Instruction Output": "['Scrub the dirty areas with dish soap and it will come up']"}
{"Dataset Name": "Winogrande", "Instruction Name": "print_correct_answer_label", "Instance": "Given a sentence with a blank and 2 options namely 'A' and 'B',  answer the question by printing the correct option label. Do not include the text associated with the option label in the answer. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nSentence: The Windows software attempted to install the upgrades onto the computer, but the _ was outdated.\nOptions: \nA. software\nB. computer\n\n", "Ground Truth": "B", "Instruction Output": "B"}
{"Dataset Name": "Winogrande", "Instruction Name": "print_correct_answer_label", "Instance": "Given a sentence with a blank and 2 options namely 'A' and 'B',  answer the question by printing the correct option label. Do not include the text associated with the option label in the answer. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nSentence: The assertive commander told the privates to change their boots but not their shirts because the _ were fine.\nOptions: \nA. boots\nB. shirts\n\n", "Ground Truth": "B", "Instruction Output": "B"}
{"Dataset Name": "Winogrande", "Instruction Name": "print_correct_answer", "Instance": "Given a sentence with a blank and 2 options namely 'A' and 'B',  answer the question by selecting value instead of the option label corresponding to the correct answer. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nSentence: Betty assisted Kayla with the daily analysis task they were assigned and then _ ask for help herself.\nOptions: \nA. Betty\nB. Kayla\n\n", "Ground Truth": "A", "Instruction Output": "Betty"}
{"Dataset Name": "Winogrande", "Instruction Name": "print_correct_answer", "Instance": "Given a sentence with a blank and 2 options namely 'A' and 'B',  print the text associated with the option label that answers the question correctly. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nSentence: Joel plans to go to Latin America next year where Dennis is currently living, _ has never been to Latin America.\nOptions: \nA. Joel\nB. Dennis\n\n", "Ground Truth": "A", "Instruction Output": "Joel"}
{"Dataset Name": "Piqa", "Instruction Name": "numformat_numeric_answer", "Instance": "Given a question and two answer candidates 'A' and 'B',  print the text associated with the option label that answers the question correctly. If the answer is numeric print it in two decimal places as long as it contains no other string or units of measurement.  Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: How to make a payment with a card chip\nOptions:\nA. Bring your purchases to the register and tell the clerk you will pay with a debit card. Once you are asked to insert your card, place your card in the machine with the exposed chip facing up. Press all buttons needed if the card reader demands credentials\nB. Bring your purchases to the register and tell the clerk you will pay with a debit card. Once you are asked to insert your card, place your card in the machine with the exposed chip facing up. Press any buttons needed if the card reader demands credentials.\n\n", "Ground Truth": "B", "Instruction Output": "Bring your purchases to the register and tell the clerk you will pay with a debit card. Once you are asked to insert your card place your card in the machine with the exposed chip facing up. Press any buttons needed if the card reader demands credentials."}
{"Dataset Name": "Piqa", "Instruction Name": "numformat_numeric_answer", "Instance": "Given a question and two answer candidates 'A' and 'B',  print the text associated with the option label that answers the question correctly. If the answer is numeric print it in two decimal places as long as it contains no other string or units of measurement.  Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: What surface can I use to properly iron a shirt?\nOptions:\nA. Use the counter top, it is works just as well as an ironing board and doesn't cost anything to have.\nB. Use an ironing board, it is designed to handle the moisture and form of the clothes that need ironing.\n\n", "Ground Truth": "B", "Instruction Output": "Use an ironing board it is designed to handle the moisture and form of the clothes that need ironing."}
{"Dataset Name": "BoolQ", "Instruction Name": "capitalize_correct_answer", "Instance": "Given a passage and a boolean question, and the possible answer candidates 'A' or 'B',  answer the question by printing the text associated with the correct option label in uppercase. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nPassage: The County Court Business Centre (CCBC) is a centre of the County Court of England and Wales created to deal with claims by the use of various electronic media. Unlike other County Court centres the CCBC does not physically hear cases. If any case might require a hearing it is transferred to another centre.\nQuestion: is the county court business centre a real court\nOptions: \nA. True\nB. False\n\n", "Ground Truth": "B", "Instruction Output": "FALSE"}
{"Dataset Name": "BoolQ", "Instruction Name": "capitalize_correct_answer", "Instance": "Given a passage and a boolean question, and the possible answer candidates 'A' or 'B',  answer the question by printing the text associated with the correct option label in uppercase. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nPassage: In Marbury v. Madison, 5 U.S. 137 (1803), the Supreme Court held that Congress cannot pass laws that are contrary to the Constitution, and it is the role of the Judicial system to interpret what the Constitution permits. Citing the Supremacy Clause, the Court found Section 13 of the Judiciary Act of 1789 to be unconstitutional to the extent it purported to enlarge the original jurisdiction of the Supreme Court beyond that permitted by the Constitution.\nQuestion: do the courts have the power to overrule the laws written by the legislative branch of government\nOptions: \nA. True\nB. False\n\n", "Ground Truth": "A", "Instruction Output": "TRUE"}
{"Dataset Name": "Piqa", "Instruction Name": "print_correct_answer_label", "Instance": "Given a question and two answer candidates 'A' and 'B',  answer the question by printing the correct option label. Do not include the text associated with the option label in the answer. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: How can you you ensure that wood planks do not split when drilling holes in them?\nOptions:\nA. Drill the hole carefully.\nB. Drill a smaller hole first.\n\n", "Ground Truth": "B", "Instruction Output": "B"}
{"Dataset Name": "Piqa", "Instruction Name": "print_correct_answer_label", "Instance": "Given a question and two answer candidates 'A' and 'B',  answer the question by selecting the option label corresponding to the correct answer. Do not include the text associated with the option label in the answer. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: Make stick arms for Frozen Olaf Cake Topper.\nOptions:\nA. Cut your fine gauge wire into small pieces, roll out your black sugarpaste into sausages with balls on the end. Carefully stick your wire through the arms then use your spoon to cut the hands out.\nB. Cut your fine gauge wire into small pieces, roll out your black sugarpaste into sausages with balls on the end. Carefully stick your wire through the arms then use your scalpel to cut the hands out.\n\n", "Ground Truth": "B", "Instruction Output": "B"}
{"Dataset Name": "Piqa", "Instruction Name": "use_incorrect_options_to_create_string", "Instance": "Given a question and two answer candidates 'A' and 'B',  create a string by concatenating the last character of every option value, excluding the option value corresponding to the correct answer. Use only the text associated with the option labels and not the option labels while sorting. If the last character is a special character (such as period, comma, quotation, etc) use the previous character. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: To save plants from rot root.\nOptions:\nA. Use a 3% hydrogen peroxide solution once a day and spritz the seed every time you re-moisten. You can also use a mixture of 20 parts hydrogen peroxide to 32 parts water to improve your plants’ root system.\nB. Use a 3% hydrogen peroxide solution once a day and spritz the seed every time you re-moisten. You can also use a mixture of 1 part hydrogen peroxide to 32 parts water to improve your plants’ root system.\n\n", "Ground Truth": "B", "Instruction Output": "m"}
{"Dataset Name": "Piqa", "Instruction Name": "use_incorrect_options_to_create_string", "Instance": "Given a question and two answer candidates 'A' and 'B',  create a string by concatenating the last character of every option value, excluding the option value corresponding to the correct answer. Use only the text associated with the option labels and not the option labels while sorting. If the last character is a special character (such as period, comma, quotation, etc) use the previous character. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: How can I charge my phone in a hotel room if I forgot my charger?\nOptions:\nA. There is usually a USB plug-in on the back of the hotel alarm clock.\nB. There is usually a USB plug-in on the back of the hotel television.\n\n", "Ground Truth": "B", "Instruction Output": "k"}
{"Dataset Name": "Winogrande", "Instruction Name": "alternate_case_correct_answer", "Instance": "Given a sentence with a blank and 2 options namely 'A' and 'B',  answer the question by printing the text associated with the correct option label in alternate case. The first character should be in uppercase and the following characters should alternate between lowercase and uppercase. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nSentence: Maria wanted to play poker with Jennifer because _ was a better player and she won a lot of money.\nOptions: \nA. Maria\nB. Jennifer\n\n", "Ground Truth": "A", "Instruction Output": "MaRiA"}
{"Dataset Name": "Winogrande", "Instruction Name": "alternate_case_correct_answer", "Instance": "Given a sentence with a blank and 2 options namely 'A' and 'B',  answer the question by printing the text associated with the correct option label in alternate case. The first character should be in uppercase and the following characters should alternate between lowercase and uppercase. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nSentence: Joseph wanted to sell lemonade at a stand in the summer months instead of the spring months since people in the _ would be thirstier.\nOptions: \nA. summer\nB. spring\n\n", "Ground Truth": "A", "Instruction Output": "SuMmEr"}
{"Dataset Name": "Winogrande", "Instruction Name": "sort_only_incorrect_answers", "Instance": "Given a sentence with a blank and 2 options namely 'A' and 'B',  excluding the option that answers the question correctly, print a sorted list (ascending order) of the incorrect options. Do not print the option labels. Use the text associated with the option labels and not the option labels while sorting and printing. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nSentence: Going to a restaurant is something that Patricia does very often but Carrie rarely goes to one because _ is a very good cook.\nOptions: \nA. Patricia\nB. Carrie\n\n", "Ground Truth": "B", "Instruction Output": "['Patricia']"}
{"Dataset Name": "Winogrande", "Instruction Name": "sort_only_incorrect_answers", "Instance": "Given a sentence with a blank and 2 options namely 'A' and 'B',  excluding the option that answers the question correctly, print a sorted list (ascending order) of the incorrect options.  Do not print the option labels. Use the text associated with the option labels and not the option labels while sorting and printing. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nSentence: Beautiful roses were bought by Jason for Dennis because _ is a receiver in general.\nOptions: \nA. Jason\nB. Dennis\n\n", "Ground Truth": "B", "Instruction Output": "['Jason']"}
{"Dataset Name": "Piqa", "Instruction Name": "print_correct_answer_append_string", "Instance": "Given a question and two answer candidates 'A' and 'B',  print the text associated with the option label that answers the question correctly.  Do not print the option label. Without including any additional characters, also append the following text string at the end of the output:'Date'. Do not add any special characters including quotations, spaces, new lines etc in the response. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: How to remove garlic odor from hands?\nOptions:\nA. Rub your hands on your stainless steel sink\nB. use lye soap with a little lemon juice mixed in and rub for two hours.\n\n", "Ground Truth": "A", "Instruction Output": "Rub your hands on your stainless steel sinkDate"}
{"Dataset Name": "Piqa", "Instruction Name": "print_correct_answer_append_string", "Instance": "Given a question and two answer candidates 'A' and 'B',  print the text associated with the option label that answers the question correctly.  Do not print the option label. Without including any additional characters, also append the following text string at the end of the output:'123'. Do not add any special characters including quotations, spaces, new lines etc in the response. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: How to start a small flame without wood or a candle?\nOptions:\nA. Place a small bottle of hand sanitizer on a metal tray and set a match to it\nB. Spray a small amount of hand sanitizer on a metal tray and set a match to it\n\n", "Ground Truth": "B", "Instruction Output": "Spray a small amount of hand sanitizer on a metal tray and set a match to it123"}
{"Dataset Name": "MathQA", "Instruction Name": "sort_options_to_create_string", "Instance": "Given a mathematical question and 5 options namely 'a', 'b', 'c', 'd', and, 'e', as candidate answers,  sort the list of options using their values, in alphabetical order. Use only the text associated with the option labels and not the option labels while sorting. Then, create a string by concatenating the last character of the text associated with each option value. If the last character is a special character (such as period, comma, quotation, etc) use the previous character. Print only the final string and not the sorted list. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: a cat leaps 6 leaps for every 5 leaps of a dog , but 2 leaps of the dog are equal to 3 leaps of the cat . what is the ratio of the speed of the cat to that of the dog ?\nOptions:\na. 4 : 5\nb. 2 : 3\nc. 4 : 1\nd. 1 : 9\ne. 3 : 2\n\n", "Ground Truth": "a", "Instruction Output": "93215"}
{"Dataset Name": "MathQA", "Instruction Name": "sort_options_to_create_string", "Instance": "Given a mathematical question and 5 options namely 'a', 'b', 'c', 'd', and, 'e', as candidate answers,  sort the list of options using their values, in alphabetical order. Use only the text associated with the option labels and not the option labels while sorting. Then, create a string by concatenating the last character of the text associated with each option value. If the last character is a special character (such as period, comma, quotation, etc) use the previous character. Print only the final string and not the sorted list. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: a distributor sells a product through an online store , which take a commission of 20 % of the price set by the distributor . the distributor obtains the product from a producer at the price of $ 16 per item . what is the price that the buyer observers online if the distributor wants to maintain a 20 % profit on the cost of the item ?\nOptions:\na. $ 20\nb. $ 21\nc. $ 22\nd. $ 23\ne. $ 24\n\n", "Ground Truth": "e", "Instruction Output": "01234"}
{"Dataset Name": "Piqa", "Instruction Name": "use_options_to_create_string", "Instance": "Given a question and two answer candidates 'A' and 'B',  create a string by concatenating the last character of every option value (not option label). If the last character is a special character (such as period, comma, quotation, etc) use the previous character.  Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: Is it better to use large scissors or small scissors?\nOptions:\nA. It depends on what you're cutting, smaller scissors make more precise cuts.\nB. It depends on what you're cutting, smaller scissors make more powerful cuts.\n\n", "Ground Truth": "A", "Instruction Output": "ss"}
{"Dataset Name": "Piqa", "Instruction Name": "use_options_to_create_string", "Instance": "Given a question and two answer candidates 'A' and 'B',  create a string by concatenating the last character of every option value (not option label). If the last character is a special character (such as period, comma, quotation, etc) use the previous character.  Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: Organize tiny office supplies.\nOptions:\nA. Keep a muffin pan inside drawer.\nB. Keep a cake pan inside drawer.\n\n", "Ground Truth": "A", "Instruction Output": "rr"}
{"Dataset Name": "Winogrande", "Instruction Name": "capitalize_correct_answer", "Instance": "Given a sentence with a blank and 2 options namely 'A' and 'B',  answer the question by printing the text associated with the correct option label in uppercase. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nSentence: Since Craig threw aluminum cans in the trash and Benjamin recycled, _ was environmentally irresponsible.\nOptions: \nA. Craig\nB. Benjamin\n\n", "Ground Truth": "A", "Instruction Output": "CRAIG"}
{"Dataset Name": "Winogrande", "Instruction Name": "capitalize_correct_answer", "Instance": "Given a sentence with a blank and 2 options namely 'A' and 'B',  capitalize the text associated with the optional label that answers the question correctly. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nSentence: Dennis is changing the sand in a pool filter for Joseph, because _ is a bit lazy.\nOptions: \nA. Dennis\nB. Joseph\n\n", "Ground Truth": "B", "Instruction Output": "JOSEPH"}
{"Dataset Name": "MathQA", "Instruction Name": "print_correct_answer_label", "Instance": "Given a mathematical question and 5 options namely 'a', 'b', 'c', 'd', and, 'e', as candidate answers,  answer the question by selecting the option label corresponding to the correct answer. Do not include the text associated with the option label in the answer. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: village a ’ s population is 300 greater than village b ' s population . if village b ’ s population were reduced by 600 people , then village a ’ s population would be 4 times as large as village b ' s population . what is village b ' s current population ?\nOptions:\na. 900\nb. 1000\nc. 1100\nd. 1200\ne. 1300\n\n", "Ground Truth": "a", "Instruction Output": "a"}
{"Dataset Name": "MathQA", "Instruction Name": "print_correct_answer_label", "Instance": "Given a mathematical question and 5 options namely 'a', 'b', 'c', 'd', and, 'e', as candidate answers,  answer the question by selecting the option label corresponding to the correct answer. Do not include the text associated with the option label in the answer. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: a certain barrel , which is a right circular cylinder , is filled to capacity with 60 gallons of oil . the first barrel is poured into a second barrel , also a right circular cylinder , which is empty . the second barrel is twice as tall as the first barrel and has twice the diameter of the first barrel . if all of the oil in the first barrel is poured into the second barrel , how much empty capacity , in gallons , is left in the second barrel ?\nOptions:\na. there is no empty capacity'\nb. 100 gallons'\nc. 300 gallons'\nd. 420 gallons'\ne. 840 gallons']\n\n", "Ground Truth": "d", "Instruction Output": "d"}
{"Dataset Name": "Winogrande", "Instruction Name": "use_options_to_create_string", "Instance": "Given a sentence with a blank and 2 options namely 'A' and 'B',  create a string by concatenating the last character of every option value (not option label). If the last character is a special character (such as period, comma, quotation, etc) use the previous character.  Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nSentence: Nick smelled much worse than Kenneth after the baseball game although _ was wearing deodorant.\nOptions: \nA. Nick\nB. Kenneth\n\n", "Ground Truth": "A", "Instruction Output": "kh"}
{"Dataset Name": "Winogrande", "Instruction Name": "use_options_to_create_string", "Instance": "Given a sentence with a blank and 2 options namely 'A' and 'B',  create a string by concatenating the last character of every option value (not option label). If the last character is a special character (such as period, comma, quotation, etc) use the previous character.  Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nSentence: I handed the documents over to immigration with the passports, but they only accepted the passports, because the _ were complete.\nOptions: \nA. documents\nB. passports\n\n", "Ground Truth": "B", "Instruction Output": "ss"}
{"Dataset Name": "Piqa", "Instruction Name": "reverse_correct_answer", "Instance": "Given a question and two answer candidates 'A' and 'B',  answer the question by printing the text associated with the correct option label, in reverse. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: To make the most of an exotic trip,\nOptions:\nA. research ahead of time to create a strict itinerary of plans and stick to it diligently so that there is no free time leftover.\nB. don't over plan, keep things fluid so that you can explore and take advantage of any special opportunities you might find.\n\n", "Ground Truth": "B", "Instruction Output": ".dnif thgim uoy seitinutroppo laiceps yna fo egatnavda ekat dna erolpxe nac uoy taht os diulf sgniht peek ,nalp revo t'nod"}
{"Dataset Name": "Piqa", "Instruction Name": "reverse_correct_answer", "Instance": "Given a question and two answer candidates 'A' and 'B',  answer the question by printing the text associated with the correct option label, in reverse. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: To use a readily available item as a pen to run electricity through for DIY metal etching.\nOptions:\nA. Use a toothpick.\nB. Use a cotton swab.\n\n", "Ground Truth": "B", "Instruction Output": ".baws nottoc a esU"}
{"Dataset Name": "Piqa", "Instruction Name": "increment_correct_numeric_answer_by_one", "Instance": "Given a question and two answer candidates 'A' and 'B',  print the text associated with the option label that answers the question correctly. Note that if the correct answer is a numeric quanity, including dollar values and percentages but contains no other string or units of measurement, print the value after increasing its value by 1. Dollar values should be prefixed with '$'. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: To harvest kale from the plant,\nOptions:\nA. take a pair of scissors and cut the desired kale leaves off the plant as close to the stem as possible\nB. take a pair of scissors and cut the desired kale leaves off the plant as far from the stem as possible\n\n", "Ground Truth": "A", "Instruction Output": "take a pair of scissors and cut the desired kale leaves off the plant as close to the stem as possible"}
{"Dataset Name": "Piqa", "Instruction Name": "increment_correct_numeric_answer_by_one", "Instance": "Given a question and two answer candidates 'A' and 'B',  print the text associated with the option label that answers the question correctly. Note that if the correct answer is a numeric quanity, including dollar values and percentages but contains no other string or units of measurement, print the value after increasing its value by 1. Dollar values should be prefixed with '$'. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: How to properly dry and clean a knife?\nOptions:\nA. After rising the knife with water and soap, use a kitchen towel and then a piece of paper towel to dry the knife\nB. After rising the knife with water and soap, use a standard towel to dry and polish the knife properly\n\n", "Ground Truth": "A", "Instruction Output": "After rising the knife with water and soap, use a kitchen towel and then a piece of paper towel to dry the knife"}
{"Dataset Name": "Piqa", "Instruction Name": "print_correct_answer_in_words", "Instance": "Given a question and two answer candidates 'A' and 'B',  print the text associated with the option label that answers the question correctly.  However, if the correct answer is a numeric value with no additional text (including percentages, currency, units of measurement etc), print the numeric answer in words. For example, if the answer is '32' print 'thirty-two' without quotes. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: How do you harvest chicken eggs?\nOptions:\nA. Collect eggs at least twice a day, especially in hot weather. The first collection should be before 10 a.m. Start by collecting those eggs with no chickens on them. Then carefully lift up those hens sitting in nests, quickly remove any eggs from the nest and put the hens back on their roosts.\nB. Collect eggs at least once a day, especially in hot weather. The first collection should be before 10 a.m. Start by collecting those eggs with no chickens on them. Then carefully lift up those hens sitting in nests, quickly remove any eggs from the nest and put the hens back on their roosts.\n\n", "Ground Truth": "A", "Instruction Output": "Collect eggs at least twice a day, especially in hot weather. The first collection should be before 10 a.m. Start by collecting those eggs with no chickens on them. Then carefully lift up those hens sitting in nests, quickly remove any eggs from the nest and put the hens back on their roosts."}
{"Dataset Name": "Piqa", "Instruction Name": "print_correct_answer_in_words", "Instance": "Given a question and two answer candidates 'A' and 'B',  print the text associated with the option label that answers the question correctly. However, if the correct answer is a numeric value with no additional text (including percentages, currency, units of measurement etc), ensure that the numeric answer values are returned in words. For example, if the answer is '32' print 'thirty-two' without quotes. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nQuestion: dresser\nOptions:\nA. replace drawer with bobby pin \nB. finish, woodgrain with  bobby pin \n\n", "Ground Truth": "B", "Instruction Output": "finish, woodgrain with  bobby pin "}
{"Dataset Name": "BoolQ", "Instruction Name": "print_correct_answer_in_words", "Instance": "Given a passage and a boolean question, and the possible answer candidates 'A' or 'B',  print the text associated with the option label that answers the question correctly.  However, if the correct answer is a numeric value with no additional text (including percentages, currency, units of measurement etc), print the numeric answer in words. For example, if the answer is '32' print 'thirty-two' without quotes. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nPassage: The others remotely guide Wednesday to safety but are interrupted when Adrian shows up at the sisters' apartment, concerned about ``Karen.'' They deduce that Adrian has been having a long-term sexual relationship with one of the sisters, and Thursday convinces Saturday to leave with him. Saturday, pretending to be Karen, has sex with Adrian at his apartment and covertly links their bracelets, allowing Friday to hack into the C.A.B. On a video feed, the sisters believe they have found Monday in a holding cell. Meanwhile, C.A.B. agents corner and kill Wednesday. After Adrian leaves his apartment, C.A.B. agents kill Saturday as she tells them Monday was dating Adrian. The sisters' apartment are raided simultaneously by a C.A.B. squad led by Joe. Admitting that she cannot survive on her own, Friday sacrifices herself to allow Thursday to escape and rescue Monday.\nQuestion: does wednesday die in what happened to monday\nOptions: \nA. True\nB. False\n\n", "Ground Truth": "A", "Instruction Output": "True"}
{"Dataset Name": "BoolQ", "Instruction Name": "print_correct_answer_in_words", "Instance": "Given a passage and a boolean question, and the possible answer candidates 'A' or 'B',  print the text associated with the option label that answers the question correctly. However, if the correct answer is a numeric value with no additional text (including percentages, currency, units of measurement etc), ensure that the numeric answer values are returned in words. For example, if the answer is '32' print 'thirty-two' without quotes. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nPassage: The Justice League is an assemblage of superheroes who join together as a team. The seven original members were Aquaman, Batman, The Flash, Green Lantern, Martian Manhunter, Superman, and Wonder Woman. The team roster has rotated throughout the years, consisting of various superheroes from the DC Universe, such as The Atom, Big Barda, Black Canary, Cyborg, Green Arrow, Elongated Man, The Flash, Green Lantern, Hawkgirl, Hawkman, Metamorpho, Plastic Man, Power Girl, Orion, Red Tornado, Stargirl, Captain Marvel/Shazam, and Zatanna, among many others.\nQuestion: is iron man part of the justice league\nOptions: \nA. True\nB. False\n\n", "Ground Truth": "B", "Instruction Output": "False"}
{"Dataset Name": "BoolQ", "Instruction Name": "numformat_numeric_answer", "Instance": "Given a passage and a boolean question, and the possible answer candidates 'A' or 'B',  print the text associated with the option label that answers the question correctly. If the answer is numeric print it in two decimal places as long as it contains no other string or units of measurement.  Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nPassage: Historically, sales of the Five-seven pistol were restricted by FN to military and law enforcement customers, but in 2004 the new Five-seven IOM model was introduced and offered to civilian shooters for use with 5.7×28mm sporting ammunition. The IOM model incorporated several modifications to the weapon's design, such as the addition of an M1913 accessory rail, a magazine safety mechanism, and fully adjustable sights. Although offered only with sporting ammunition, the Five-seven's introduction to civilian shooters was met with strong opposition from gun control organizations such as the Brady Campaign.\nQuestion: can a civilian own a fn five seven\nOptions: \nA. True\nB. False\n\n", "Ground Truth": "A", "Instruction Output": "True"}
{"Dataset Name": "BoolQ", "Instruction Name": "numformat_numeric_answer", "Instance": "Given a passage and a boolean question, and the possible answer candidates 'A' or 'B',  print the text associated with the option label that answers the question correctly. Numeric answer values should be printed in two decimal places as long as it contains no other string or units of measurement.  Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nPassage: The fifth season began airing on December 1, 2017, and ran for 22 episodes on ABC until May 18, 2018. The two-part premiere debuted to 2.54 million viewers, marking the lowest-rated season premiere of the series. Despite consistently low viewership, critical reception of the season was positive, with many commending the series for its ambition, in particular praising the futuristic space setting during its first half and exploration of time travel. Critics also praised the performances, character development and writing. The series was renewed for a sixth season on May 14, 2018.\nQuestion: is season 5 the last season of agents of shield\nOptions: \nA. True\nB. False\n\n", "Ground Truth": "B", "Instruction Output": "False"}
{"Dataset Name": "Winogrande", "Instruction Name": "print_correct_answer_in_words", "Instance": "Given a sentence with a blank and 2 options namely 'A' and 'B',  print the text associated with the option label that answers the question correctly.  However, if the correct answer is a numeric value with no additional text (including percentages, currency, units of measurement etc), print the numeric answer in words. For example, if the answer is '32' print 'thirty-two' without quotes. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nSentence: Emily bought a piece of jewelry from Natalie, later _ thought she sold too low but kept a cool attitude.\nOptions: \nA. Emily\nB. Natalie\n\n", "Ground Truth": "B", "Instruction Output": "Natalie"}
{"Dataset Name": "Winogrande", "Instruction Name": "print_correct_answer_in_words", "Instance": "Given a sentence with a blank and 2 options namely 'A' and 'B',  print the text associated with the option label that answers the question correctly. However, if the correct answer is a numeric value with no additional text (including percentages, currency, units of measurement etc), ensure that the numeric answer values are returned in words. For example, if the answer is '32' print 'thirty-two' without quotes. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nSentence: When Leslie told Aaron he only wanted to eat food that was healthy, he was laughed at. The teacher comforted _ immediately.\nOptions: \nA. Leslie\nB. Aaron\n\n", "Ground Truth": "A", "Instruction Output": "Leslie"}
{"Dataset Name": "Winogrande", "Instruction Name": "increment_correct_numeric_answer_by_one", "Instance": "Given a sentence with a blank and 2 options namely 'A' and 'B',  print the text associated with the option label that answers the question correctly. Note that if the correct answer is a numeric quanity, including dollar values and percentages but contains no other string or units of measurement, print the value after increasing its value by 1. Dollar values should be prefixed with '$'. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nSentence: Lindsey was considered more masculine than Amy because every year _ went into the woods to French kiss deer.\nOptions: \nA. Lindsey\nB. Amy\n\n", "Ground Truth": "B", "Instruction Output": "Amy"}
{"Dataset Name": "Winogrande", "Instruction Name": "increment_correct_numeric_answer_by_one", "Instance": "Given a sentence with a blank and 2 options namely 'A' and 'B',  print the text associated with the option label that answers the question correctly. Note that if the correct answer is a numeric quanity, including dollar values and percentages but contains no other string or units of measurement, print the value after increasing its value by 1. Dollar values should be prefixed with '$'. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nSentence: Logan reads a lot, while Donald almost never picks up a book because _ is a plumber.\nOptions: \nA. Logan\nB. Donald\n\n", "Ground Truth": "B", "Instruction Output": "Donald"}
{"Dataset Name": "BoolQ", "Instruction Name": "increment_correct_numeric_answer_by_one", "Instance": "Given a passage and a boolean question, and the possible answer candidates 'A' or 'B',  print the text associated with the option label that answers the question correctly. Note that if the correct answer is a numeric quanity, including dollar values and percentages but contains no other string or units of measurement, print the value after increasing its value by 1. Dollar values should be prefixed with '$'. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nPassage: Predators is a 2010 American science-fiction action film directed by Nimród Antal and starring Adrien Brody, Topher Grace, Alice Braga, Walton Goggins, and Laurence Fishburne. It was distributed by 20th Century Fox. It is the third installment of the Predator franchise (the fifth counting the two Alien vs. Predator films), following Predator (1987) and Predator 2 (1990). Another film, The Predator, is set for release in 2018.\nQuestion: will there be a sequel to the movie predators\nOptions: \nA. True\nB. False\n\n", "Ground Truth": "A", "Instruction Output": "True"}
{"Dataset Name": "BoolQ", "Instruction Name": "increment_correct_numeric_answer_by_one", "Instance": "Given a passage and a boolean question, and the possible answer candidates 'A' or 'B',  print the text associated with the option label that answers the question correctly. Note that if the correct answer is a numeric quanity, including dollar values and percentages but contains no other string or units of measurement, print the value after increasing its value by 1. Dollar values should be prefixed with '$'. Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nPassage: Holes in one most commonly occur on par 3 holes, the shortest distance holes on a standard size golf course. Longer hitters have also accomplished this feat on longer holes, though nearly all par 4 and par 5 holes are too long for golfers to reach in a single shot. While well known outside of golf and often requiring a well hit shot and significant power, holes in one are considered to also contain an element of luck. As such, they are more common and considered less impressive than other hole accomplishments such as completing a par 5 in two shots (an albatross). As of October 2008, a condor (four under par) hole-in-one on a par 5 hole had been recorded on four occasions, aided by thin air at high altitude, or by cutting the corner on a doglegged or horseshoe-shaped hole.\nQuestion: has anyone ever hit a hole-in-one on a par 5\nOptions: \nA. True\nB. False\n\n", "Ground Truth": "A", "Instruction Output": "True"}
{"Dataset Name": "Winogrande", "Instruction Name": "numformat_numeric_answer", "Instance": "Given a sentence with a blank and 2 options namely 'A' and 'B',  print the text associated with the option label that answers the question correctly. Numeric answer values should be printed in two decimal places as long as it contains no other string or units of measurement.  Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nSentence: Kevin was worried about Eric's infections so _ decided to go to an emergency clinic.\nOptions: \nA. Kevin\nB. Eric\n\n", "Ground Truth": "B", "Instruction Output": "Eric"}
{"Dataset Name": "Winogrande", "Instruction Name": "numformat_numeric_answer", "Instance": "Given a sentence with a blank and 2 options namely 'A' and 'B',  print the text associated with the option label that answers the question correctly. If the answer is numeric print it in two decimal places as long as it contains no other string or units of measurement.  Do not print the option label. Think step by step and in the end, finish your response with 'Response:$RESPONSE' where $RESPONSE (without quotes) is the final output expected.\nSentence: After walking barefooted, James washed his foot in the bowl of water and the _ got dirty.\nOptions: \nA. foot\nB. water\n\n", "Ground Truth": "B", "Instruction Output": "water"}
