[
    {
        "question": "Passage: The series includes 12 books and three spin-offs, and won a Disney Adventures Kids' Choice Award on April 4, 2006. As of 2016, the series had been translated into over 20 languages, with more than 70 million books sold worldwide, including over 50 million in the United States. DreamWorks Animation acquired rights to the series to make an animated feature film adaptation, which was released on June 2, 2017 to positive reviews. Question: is there going to be a 13th captain underpants book?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: While the term was in use as early as 1933, it became official only after the formation of the NCAA Division I athletic conference in 1954. Seven of the eight schools were founded during the colonial period (Cornell was founded in 1865). Ivy League institutions account for seven of the nine Colonial Colleges chartered before the American Revolution; the other two are Rutgers University and the College of William & Mary. Question: is college of william and mary an ivy league school?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The fluid ounce is distinct from the ounce as a unit of weight or mass, although it is sometimes referred to simply as an ``ounce'' where context makes the meaning clear, such as ounces in a bottle. Question: is 1 ounce the same as 1 fluid ounce?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: In addition to a regular and 'light' spread, Unilever also uses the brand name to market a liquid butter substitute contained in a spray-bottle. This product is an emulsion of vegetable oil in water formulated with a 'hint' of butter flavor (derived from buttermilk) and is marketed as having zero calories and zero fat content. In 2017, Unilever announced two new varieties, ``It's Vegan'' and ``It's Organic''. Question: is i cant believe its not butter margarine?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Usually, one of the first items in an order of business or an agenda for a meeting is the reading and approval of the minutes from the previous meeting. If the members of the group agree (usually by unanimous consent) that the written minutes reflect what happened at the previous meeting, then they are approved, and the fact of their approval is recorded in the minutes of the current meeting. If there are significant errors or omissions, then the minutes may be redrafted and submitted again at a later date. Minor changes may be made immediately using the normal amendment procedures, and the amended minutes may be approved ``as amended''. It is normally appropriate to send a draft copy of the minutes to all the members in advance of the meeting so that the meeting is not delayed by a reading of the draft. Question: do minutes of a meeting have to be approved?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 95"
    },
    {
        "question": "Passage: The work was initially intended by Tolkien to be one volume of a two-volume set, the other to be The Silmarillion, but this idea was dismissed by his publisher. For economic reasons, The Lord of the Rings was published in three volumes over the course of a year from 29 July 1954 to 20 October 1955. The three volumes were titled The Fellowship of the Ring, The Two Towers and The Return of the King. Structurally, the novel is divided internally into six books, two per volume, with several appendices of background material included at the end. Some editions combine the entire work into a single volume. The Lord of the Rings has since been reprinted numerous times and translated into 38 languages. Question: was the lord of the rings originally one book?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: In mathematics, and more specifically set theory, the empty set or null set is the unique set having no elements; its size or cardinality (count of elements in a set) is zero. Some axiomatic set theories ensure that the empty set exists by including an axiom of empty set; in other theories, its existence can be deduced. Many possible properties of sets are vacuously true for the empty set. Question: is the empty set an element of the set containing the empty set?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The Commonwealth was first officially formed in 1931 when the Statute of Westminster gave legal recognition to the sovereignty of dominions. Known as the ``British Commonwealth'', the original members were the United Kingdom, Canada, Australia, New Zealand, South Africa, Irish Free State, and Newfoundland, although Australia and New Zealand did not adopt the statute until 1942 and 1947 respectively. In 1949, the London Declaration was signed and marked the birth of the modern Commonwealth and the adoption of its present name. The newest member is Rwanda, which joined on 29 November 2009. The most recent departure was the Maldives, which severed its connection with the Commonwealth on 13 October 2016. Question: is canada part of the commonwealth of england?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: A ganglionic blocker (or ganglioplegic) is a type of medication that inhibits transmission between preganglionic and postganglionic neurons in the Autonomic Nervous System, often by acting as a nicotinic receptor antagonist. Nicotinic acetylcholine receptors are found on skeletal muscle, but also within the route of transmission for the parasympathetic and sympathetic nervous system (which together comprise the autonomic nervous system). More specifically, nicotinic receptors are found within the ganglia of the autonomic nervous system, allowing outgoing signals to be transmitted from the presynaptic to the postsynaptic cells. Thus, for example, blocking nicotinic acetylcholine receptors blocks both sympathetic (excitatory) and parasympathetic (calming) stimulation of the heart. The nicotinic antagonist hexamethonium, for example, does this by blocking the transmission of outgoing signals across the autonomic ganglia at the postsynaptic nicotinic acetylcholine receptor. Question: can nicotine be classified as a ganglion blocker?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: For a standard game of Klondike, drawing three cards at a time and placing no limit on the number of re-deals, the number of possible hands is over 7067800000000000000\u26608\u00d710, or an 8 followed by 67 zeros. About 79% of the games are theoretically winnable, but in practice, human players do not win 79% of games played, due to wrong moves that cause the game to become unwinnable. If one allows cards from the foundation to be moved back to the tableau, then between 82% and 91.5% are theoretically winnable. Note that these results depend on complete knowledge of the positions of all 52 cards, which a player does not possess. Another recent study has found the Draw 3, Re-Deal Infinite to have a 83.6% win rate after 1000 random games were solved by a computer solver. The issue is that a wrong move cannot be known in advance whenever more than one move is possible. The number of games a skilled player can probabilistically expect to win is at least 43%. In addition, some games are ``unplayable'' in which no cards can be moved to the foundations even at the start of the game; these occur in only 0.25% (1 in 400) of hands dealt. Question: is there always a way to win solitaire?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 95"
    },
    {
        "question": "Passage: Six Flags Great America is an amusement park located in Gurnee, Illinois. Part of the Six Flags chain, Great America was first opened in 1976 by the Marriott Corporation as Marriott's Great America. Six Flags has owned and operated the park since 1984, making it the seventh park in the chain. The park offers ten themed areas, as well as Hurricane Harbor, a 20-acre (81,000 m) water park, and three specially themed children's areas. Question: is great america the same as six flags?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: Whole brain emulation (WBE), mind upload or brain upload (sometimes called ``mind copying'' or ``mind transfer'') is the hypothetical futuristic process of scanning the mental state (including long-term memory and ``self'') of a particular brain substrate and copying it to a computer. The computer could then run a simulation model of the brain's information processing, such that it responds in essentially the same way as the original brain (i.e., indistinguishable from the brain for all relevant purposes) and experiences having a conscious mind. Question: is it possible to connect your brain to a computer?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: In the United Kingdom and the Crown dependencies, any household watching or recording live television transmissions as they are being broadcast (terrestrial, satellite, cable, or Internet) is required to hold a television licence. Businesses, hospitals, schools and a range of other organisations are also required to hold television licences to watch and record live TV broadcasts. A television licence is also required to receive video on demand programme services provided by the BBC, on the iPlayer catch-up service. Question: do you have to have a license to own a tv in england?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The distinction between ``doves'' and ``pigeons'' is not consistent. In modern everyday speech, as opposed to scientific usage or formal usage, ``dove'' frequently indicates a pigeon that is white or nearly white. However, some people use the terms ``dove'' and ``pigeon'' interchangeably. In contrast, in scientific and ornithological practice, ``dove'' tends to be used for smaller species and ``pigeon'' for larger ones, but this is in no way consistently applied. Historically, the common names for these birds involve a great deal of variation between the terms. The species most commonly referred to as ``pigeon'' is the species known by scientists as the rock dove, one subspecies of which, the domestic pigeon, is common in many cities as the feral pigeon. Question: is there a difference between doves and pigeons?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: There are currently certain restrictions on the possession of airsoft replicas, which came in with the introduction of the ASBA (Anti-Social Behaviour Act 2003) Amendments, prohibiting the possession of any firearms replica in a public place without good cause (to be concealed in a gun case or container only, not to be left in view of public at any time). Question: do you need a license for airsoft guns uk?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The United States Congress is the bicameral legislature of the Federal government of the United States. The legislature consists of two chambers: the Senate and the House of Representatives. Question: is the house of representatives the same as congress?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: In the United Kingdom, there is not an equivalent of a vehicle title. Instead, there is a document known as the 'vehicle registration document', and is issued by the Driver and Vehicle Licensing Agency (DVLA). The current version has the reference number V5C. Prior to computerisation, the title document was the 'log book', and this term is sometimes still used to describe the V5C. The V5 document records who the Registered Keeper of the vehicle is; it does not establish legal ownership of the vehicle. These documents used to be blue on the front. However, they were changed to red in 2010/11 after approximately 2.2 million blank blue V5 documents were stolen, allowing thieves to clone stolen vehicles much more easily. Question: is a title and registration the same thing?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Before the Civil War, President James Buchanan took a weak position amid a looming South secession crisis. Secretary of State Lewis Cass of Michigan, a 78-year-old elder statesman who has been Michigan's U.S. senator and governor of Michigan Territory, resigned from Buchanan's cabinet in protest, remarking that ``he had seen the Constitution born and now feared he was seeing it die''. Question: was michigan a state during the civil war?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The production of milk requires that the cow be in lactation, which is a result of the cow having given birth to a calf. The cycle of insemination, pregnancy, parturition, and lactation is followed by a ``dry'' period of about two months before calving, which allows udder tissue to regenerate. A dry period that falls outside this time frames can result in decreased milk production in subsequent lactation. Dairy operations therefore include both the production of milk and the production of calves. Bull calves are either castrated and raised as steers for beef production or used for veal. Question: do cows have to be pregnant to get milk?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The subsequent trophy, called the ``FIFA World Cup Trophy'', was introduced in 1974. Made of 18 carat gold with bands of malachite on its base, it stands 36.8 centimetres high and weighs 6.1 kilograms. The trophy was made by Stabilimento Artistico Bertoni company in Italy. It depicts two human figures holding up the Earth. The current holders of the trophy are France, winners of the 2018 World Cup. Question: does the world cup trophy have a name?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Shower gels for men may contain the ingredient menthol, which gives a cooling and stimulating sensation on the skin, and some men's shower gels are also designed specifically for use on hair and body. Shower gels contain milder surfactant bases than shampoos, and some also contain gentle conditioning agents in the formula. This means that shower gels can also double as an effective and perfectly acceptable substitute to shampoo, even if they are not labelled as a hair and body wash. Washing hair with shower gel should give approximately the same result as using a moisturising shampoo. Question: is it bad to wash your hair with shower gel?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: A cordon bleu or schnitzel cordon bleu is a dish of meat wrapped around cheese (or with cheese filling), then breaded and pan-fried or deep-fried. Veal or pork cordon bleu is made of veal or pork pounded thin and wrapped around a slice of ham and a slice of cheese, breaded, and then pan fried or baked. For chicken cordon bleu chicken breast is used instead of veal. Ham cordon bleu is ham stuffed with mushrooms and cheese. Question: is chicken cordon bleu made with blue cheese?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: In the United States, an honours degree (or honors degree in US spelling) requires a thesis or project work beyond that needed for the normal bachelor's program. Honours programs in the US are taken alongside the rest of the degree and often have a minimum GPA requirement for entry, which can vary between institutions. Some institutions do not have a separate honours program, but instead refer to bachelor's degrees awarded with Latin honours, which may be based either on GPA or class position, as honours degrees. Question: is a master's degree higher than an honours degree?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Greece in the Roman era describes the period of Greek history when it was dominated by the Roman republic, the Roman Empire and the Byzantine Empire (collectively, the Roman era). It began with the Roman victory over the Corinthians, at the Battle of Corinth (146 BC). It continued with the adoption of the city of Byzantium by the Emperor Constantine the Great as the capital of the Roman Empire (as Nova Roma, later Constantinople) in AD 330. After this date, the Eastern Empire became largely Greek speaking. Question: were greece and rome around at the same time?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Although it is widely believed that a worker honey bee can sting only once, this is a partial misconception: although the stinger is in fact barbed so that it lodges in the victim's skin, tearing loose from the bee's abdomen and leading to its death in minutes, this only happens if the skin of the victim is sufficiently thick, such as a mammal's. Honey bees are the only hymenoptera with a strongly barbed sting, though yellow jackets and some other wasps have small barbs. Question: is there always a stinger in a bee sting?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The natural logarithm of a number is its logarithm to the base of the mathematical constant e, where e is an irrational and transcendental number approximately equal to 7000271828182845899\u26602.718281828459. The natural logarithm of x is generally written as ln x, log x, or sometimes, if the base e is implicit, simply log x. Parentheses are sometimes added for clarity, giving ln(x), log(x) or log(x). This is done in particular when the argument to the logarithm is not a single symbol, to prevent ambiguity. Question: is log x the same as ln x?",
        "pred_ans": " Yes, log x and ln x are the same thing.",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: Developmental psychology is the scientific study of how and why human beings change over the course of their life. Originally concerned with infants and children, the field has expanded to include adolescence, adult development, aging, and the entire lifespan. Developmental psychologists aim to explain how thinking, feeling and behaviour change throughout life. This field examines change across three major dimensions: physical development, cognitive development, and socioemotional development. Within these three dimensions are a broad range of topics including motor skills, executive functions, moral understanding, language acquisition, social change, personality, emotional development, self-concept and identity formation. Question: is child psychology the same as developmental psychology?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Ordinarily, a baseball game consists of nine innings (in softball and high school baseball games there are typically seven innings; in Little League, six), each of which is divided into halves: the visiting team bats first, after which the home team takes its turn at bat. However, if the score remains tied at the end of the regulation number of complete innings, the rules provide that ``play shall continue until (1) the visiting team has scored more total runs than the home team at the end of a completed inning; or (2) the home team scores the winning run in an uncompleted inning.'' Question: can an mlb game end in a tie?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 95"
    },
    {
        "question": "Passage: Later Gemma comes to see her and confesses that it was Clay who wanted her dead because of the letters, which she gives to Gemma. She reveals later to Gemma that she was fully aware of her plan to have Clay killed by Jax and hide the letters that involve her and Unser, to prompt Jax to stay as President. Tara gives Jax a blood thinner to inject into Clay, who is recuperating at St. Thomas. She then tells Jax, in front of Gemma, that he will kill Clay and come get her and the boys, so they can leave Charming forever. This plan fails as Jax is forced to stay due to the influence of Romeo Parada and Luis Torres who are undercover CIA agents. They need SAMCRO to provide weapons and transport drugs or they will crush the club. Understanding that Jax has to become President, Tara decides to remain with him. In the season finale, with Gemma watching Tara stands behind Jax at the head of the table, one arm draped around him, mirroring a photo of Gemma and John Teller. Question: does tara die in sons of anarchy season 4?",
        "pred_ans": " No",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The first film was followed by two successful sequels, The Empire Strikes Back (1980) and Return of the Jedi (1983); these three films constitute the original Star Wars trilogy. A prequel trilogy was released between 1999 and 2005, albeit to mixed reactions from critics and fans. A sequel trilogy concluding the main story of the nine-episode saga began in 2015 with The Force Awakens. The first eight films were nominated for Academy Awards (with wins going to the first two released) and were commercially successful, with a combined box office revenue of over US$8.5 billion. Together with the theatrical spin-off films The Clone Wars (2008), Rogue One (2016) and Solo: A Star Wars Story (2018), Star Wars is the second highest-grossing film series ever. Question: is there a star wars movie this year?",
        "pred_ans": " False. The latest Star Wars movie, Solo: A Star Wars Story, was released in 2018. There are no new Star Wars movies scheduled for release in 2021.",
        "ground_truth": true,
        "score": " Therefore the score is: 95"
    },
    {
        "question": "Passage: The mishap in The Guardian where Randall loses his crew is loosely based on an actual U.S. Coast Guard aviation mishap in Alaska. The aircraft was an HH-3F Pelican (USCG variant of the Jolly Green Giant) instead of the HH-60J Jayhawk (USCG variant of the Blackhawk/Seahawk) pictured in the movie. Question: is the movie the guardian based on a true story?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The turkey vulture received its common name from the resemblance of the adult's bald red head and its dark plumage to that of the male wild turkey, while the name ``vulture'' is derived from the Latin word vulturus, meaning ``tearer'', and is a reference to its feeding habits. The word buzzard is used by North Americans to refer to this bird, yet in the Old World that term refers to members of the genus Buteo. The generic term Cathartes means ``purifier'' and is the Latinized form from the Greek kathart\u0113s/\u03ba\u03b1\u03b8\u03b1\u03c1\u03c4\u03b7\u03c2. The turkey vulture was first formally described by Linnaeus as Vultur aura in his Systema Naturae in 1758, and characterised as V. fuscogriseus, remigibus nigris, rostro albo (``brown-gray vulture, with black wings and a white beak''). It is a member of the family Cathartidae, along with the other six species of New World vultures, and included in the genus Cathartes, along with the greater yellow-headed vulture and the lesser yellow-headed vulture. Like other New World vultures, the turkey vulture has a diploid chromosome number of 80. Question: is a vulture the same as a buzzard?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The World Health Organization (WHO) is a specialized agency of the United Nations that is concerned with international public health. It was established on 7 April 1948, and is headquartered in Geneva, Switzerland. The WHO is a member of the United Nations Development Group. Its predecessor, the Health Organization, was an agency of the League of Nations. Question: is the world health organization a government organization?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Turtle may either refer to the order as a whole, or to particular turtles that make up a form taxon that is not monophyletic, or may be limited to only aquatic species. Tortoise usually refers to any land-dwelling, non-swimming chelonian. Terrapin is used to describe several species of small, edible, hard-shell turtles, typically those found in brackish waters. Question: is a turtle the same as a tortoise?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The standard error (SE) of a statistic (usually an estimate of a parameter) is the standard deviation of its sampling distribution or an estimate of that standard deviation of estimate. If the parameter or the statistic is the mean, it is called the standard error of the mean (SEM). Question: is the standard error the same as standard deviation?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: From 1976 to 1983, several states voluntarily raised their purchase ages to 19 (or, less commonly, 20 or 21), in part to combat drunk driving fatalities. In 1984, Congress passed the National Minimum Drinking Age Act, which required states to raise their ages for purchase and public possession to 21 by October 1986 or lose 10% of their federal highway funds. By mid-1988, all 50 states and the District of Columbia had raised their purchase ages to 21 (but not Puerto Rico, Guam, or the Virgin Islands, see Additional Notes below). South Dakota and Wyoming were the final two states to comply with the age 21 mandate. The current drinking age of 21 remains a point of contention among many Americans, because of it being higher than the age of majority (18 in most states) and higher than the drinking ages of most other countries. The National Minimum Drinking Age Act is also seen as a congressional sidestep of the tenth amendment. Although debates have not been highly publicized, a few states have proposed legislation to lower their drinking age, while Guam has raised its drinking age to 21 in July 2010. Question: does the drinking age vary from state to state?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: The sixth and final season premiered on 19 August 2018. Question: is a place to call home finished for good?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: On 13 July 1967, British cyclist Tom Simpson died climbing Mont Ventoux after taking amphetamine. Question: has anyone ever died doing the tour de france?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The hosts of the World Cup receive an automatic berth. Unlike many other sports, results of the previous World Cups or of the continental championships are not taken into account. Until 2002, the defending champions also received an automatic berth, but starting from the 2006 World Cup this is no longer the case. Question: does host country have to qualify for world cup?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: Article II of the Constitution establishes the executive branch of the federal government. It vests the executive power of the United States in the president. The power includes the execution and enforcement of federal law, alongside the responsibility of appointing federal executive, diplomatic, regulatory and judicial officers, and concluding treaties with foreign powers with the advice and consent of the Senate. The president is further empowered to grant federal pardons and reprieves, and to convene and adjourn either or both houses of Congress under extraordinary circumstances. The president directs the foreign and domestic policies of the United States, and takes an active role in promoting his policy priorities to members of Congress. In addition, as part of the system of checks and balances, Article One of the United States Constitution gives the president the power to sign or veto federal legislation. Since the office of president was established in 1789, its power has grown substantially, as has the power of the federal government as a whole. Question: is the president the only member of the executive branch?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": "[1301][\u7cfb\u7edf\u68c0\u6d4b\u5230\u8f93\u5165\u6216\u751f\u6210\u5185\u5bb9\u53ef\u80fd\u5305\u542b\u4e0d\u5b89\u5168\u6216\u654f\u611f\u5185\u5bb9\uff0c\u8bf7\u60a8\u907f\u514d\u8f93\u5165\u6613\u4ea7\u751f\u654f\u611f\u5185\u5bb9\u7684\u63d0\u793a\u8bed\uff0c\u611f\u8c22\u60a8\u7684\u914d\u5408\u3002]"
    },
    {
        "question": "Passage: The Spanish language is written using the Spanish alphabet, which is the Latin script with one additional letter: e\u00f1e ``\u00f1'', for a total of 27 letters. Although the letters ``k'' and ``w'' are part of the alphabet, they appear only in loanwords such as karate, kilo, waterpolo and wolframio (tungsten). Each letter has a single official name according to the Real Academia Espa\u00f1ola's new 2010 Common Orthography, but in some regions alternative traditional names coexist as explained below. The digraphs ``ch'' and ``ll'' were considered letters of the alphabet from 1754 to 2010 (and sorted separately from ``c'' and ``l'' from 1803 to 1994). The digraph ``rr'' is occasionally considered a letter, but officially it was never so. Question: does the spanish alphabet have more letters than the english alphabet?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: In Madrid, Ronaldo won 15 trophies, including two La Liga titles, two Copas del Rey, four UEFA Champions League titles, two UEFA Super Cups, and three FIFA Club World Cups. Real Madrid's all-time top goalscorer, Ronaldo scored a record 34 La Liga hat-tricks, including a record-tying eight hat-tricks in the 2014--15 season and is the only player to reach 30 goals in six consecutive La Liga seasons. After joining Madrid, Ronaldo finished runner-up for the Ballon d'Or three times, behind Lionel Messi, his perceived career rival, before winning back-to-back Ballons d'Or in 2013 and 2014. After winning the 2016 and 2017 Champions Leagues, Ronaldo secured back-to-back Ballons d'Or again in 2016 and 2017. A historic third consecutive Champions League followed, making Ronaldo the first player to win the trophy five times. In 2018, he signed for Juventus in a transfer worth \u20ac100 million, the highest fee ever paid for a player over 30 years old, and the highest ever paid by an Italian club. Question: has christiano ronaldo ever won the world cup?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 95"
    },
    {
        "question": "Passage: Permission to publicly perform a song must be obtained from the copyright holder or a collective rights organization. Question: can i perform a copyrighted song in public?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Greece were subsequently drawn against Croatia in the play-off round, where they were knocked out over two legs; a 4--1 away defeat set the tone for Greece's campaign, and in the second leg they drew a blank in a 0--0 stalemate against the Croats to signify the end of their World Cup hopes. Kostas Mitroglou finished as Greece's top scorer throughout their campaign, scoring six goals. Question: is greece not in the world cup 2018?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: In all criminal prosecutions, the accused shall enjoy the right to a speedy and public trial, by an impartial jury of the State and district wherein the crime shall have been committed, which district shall have been previously ascertained by law, and to be informed of the nature and cause of the accusation; to be confronted with the witnesses against him; to have compulsory process for obtaining witnesses in his favor, and to have the Assistance of Counsel for his defence. Question: is the right to a fair trial in the constitution?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The Tribute in Light is an art installation of 88 searchlights placed six blocks south of the World Trade Center on top of the Battery Parking Garage in New York City to create two vertical columns of light to represent the Twin Towers in remembrance of the September 11, 2001 attacks. Tribute in Light began initially as a temporary commemoration of the attacks in early 2002 but became an annual commemoration, currently produced on September 11th by the Municipal Art Society of New York. Question: do they light up the twin towers every night?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: In Mexico, Belgium, Germany and Austria, the philosophy of the law holds that it is human nature to want to escape. In those countries, escapees who do not break any other laws are not charged for anything and no extra time is added to their sentence. However, in Mexico, officers are allowed to shoot prisoners attempting to escape and an escape is illegal if violence is used against prison personnel or property, or if prison inmates or officials aid the escape. Question: is it legal to escape from prison in germany?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 95"
    },
    {
        "question": "Passage: Access courses are generally tailored as pathways; that is, they prepare students with the necessary skills and imbue the appropriate knowledge required for a specific undergraduate career. For example, there are 'access to law', 'access to medicine' and 'access to nursing' pathways that prepare students to study law, medicine and nursing at undergraduate level, respectively. Question: is an access course classed as higher education?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Eidetic memory (/a\u026a\u02c8d\u025bt\u026ak/; sometimes called photographic memory) is an ability to vividly recall images from memory after only a few instances of exposure, with high precision for a brief time after exposure, without using a mnemonic device. Although the terms eidetic memory and photographic memory are popularly used interchangeably, they are also distinguished, with eidetic memory referring to the ability to view memories like photographs for a few minutes, and photographic memory referring to the ability to recall pages of text or numbers, or similar, in great detail. When the concepts are distinguished, eidetic memory is reported to occur in a small number of children and as something generally not found in adults, while true photographic memory has never been demonstrated to exist. Question: are eidetic memory and photographic memory the same?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: Pierce's home, ``Port Waldo'', is (real-life) Waldoboro, Maine, and the FinestKind Clinic is just up U.S. Route 1 in Rockland. ``Crabapple Cove'' is actually Broad Cove, in Bremen just down the Medomak River from Waldoboro Village. Author Richard Hooker (Hornberger) owned an old farmhouse on Heath Point. The reader will note Wreck Island, Thief Island, and other Muscongus Bay landmarks in the book. It is possible that the Pierce family is modeled after the (real-life) Spear family, who had a number of different branches in the area, in the 1950s. Question: is there such a place as crabapple cove maine?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: In Japanese martial arts the further subdivisions of black belt ranks may be linked to dan grades and indicated by 'stripes' on the belt. Y\u016bdansha (roughly translating from Japanese to ``person who holds a dan grade'') is often used to describe those who hold a black belt rank. While the belt remains black, stripes or other insignia may be added to denote seniority, in some arts, very senior grades will wear differently colored belts. In judo and some forms of karate, a sixth dan will wear a red and white belt. The red and white belt is often reserved only for ceremonial occasions, and a regular black belt is still worn during training. At 9th or 10th dan some schools award red. In some schools of Jujutsu, the Shihan rank and higher wear purple belts. These other colors are often still referred to collectively as ``black belts''. Question: is there a belt above black in karate?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The abdomen (less formally called the belly, stomach, tummy or midriff) constitutes the part of the body between the thorax (chest) and pelvis, in humans and in other vertebrates. The region occupied by the abdomen is termed the abdominal cavity. In arthropods it is the posterior tagma of the body; it follows the thorax or cephalothorax. The abdomen stretches from the thorax at the thoracic diaphragm to the pelvis at the pelvic brim. The pelvic brim stretches from the lumbosacral joint (the intervertebral disc between L5 and S1) to the pubic symphysis and is the edge of the pelvic inlet. The space above this inlet and under the thoracic diaphragm is termed the abdominal cavity. The boundary of the abdominal cavity is the abdominal wall in the front and the peritoneal surface at the rear. Question: is the abdomen the same as the stomach?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Sweden have been one of the more successful national teams in the history of the World Cup, having reached 4 semi-finals, and becoming runners-up on home ground in 1958. They have been present at 11 out of 20 World Cups by 2014. Question: has sweden ever been in a world cup final?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Call of Duty: WWII is a first-person shooter video game developed by Sledgehammer Games and published by Activision. It was released worldwide on November 3, 2017 for Microsoft Windows, PlayStation 4 and Xbox One. It is the fourteenth main installment in the Call of Duty series and the first title in the series to be set primarily during World War II since Call of Duty: World at War in 2008. Question: was call of duty ww2 based on a true story?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: In the television series, The Governor's disturbing motives are reflected in his authoritarian ways in dealing with threats to his community, primarily by executing most large groups and only accepting lone survivors into his community. His dark nature escalates when he comes into conflict with Rick Grimes and the latter's group, who are occupying the nearby prison. The Governor vows to eliminate the prison group, and in that pursuit, he leaves several key characters dead both in Rick's group and his own. The Governor has a romantic relationship with Andrea, who unsuccessfully seeks to broker a truce between the two groups. In season 4, The Governor attempts to redeem himself upon meeting a new family, to whom he introduces himself as Brian Heriot. However, he commits several brutal acts to ensure the family's survival. This leads to more characters' deaths and forces Rick and his group to abandon the prison. Question: does the governor die on the walking dead?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: National Car Rental is an American rental car agency based in Clayton, Missouri, United States. National is owned by Enterprise Holdings, along with other agencies including Enterprise Rent-A-Car, and Alamo Rent a Car. Question: is national and enterprise car rental the same company?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: Christopher Robert Evans (born June 13, 1981) is an American actor. Evans is known for his superhero roles as the Marvel Comics characters Captain America in the Marvel Cinematic Universe and Human Torch in Fantastic Four (2005) and its 2007 sequel. Question: is the human torch the same guy as captain america?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Tanjore painting is an important form of classical South Indian painting native to the town of Tanjore in Tamil Nadu. The art form dates back to the early 9th century, a period dominated by the Chola rulers, who encouraged art and literature. These paintings are known for their elegance, rich colours, and attention to detail. The themes for most of these paintings are Hindu Gods and Goddesses and scenes from Hindu mythology. In modern times, these paintings have become a much sought-after souvenir during festive occasions in South India. Question: is tanjore a traditional indian folk art form?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Princeton Theological Seminary (PTS) is a private, nonprofit, and independent graduate school of theology in Princeton, New Jersey. Founded in 1812 under the auspices of Archibald Alexander, the General Assembly of the Presbyterian Church, and the College of New Jersey (now Princeton University), it is the second-oldest seminary in the United States. It is also the largest of ten seminaries associated with the Presbyterian Church (USA). Question: is princeton theological seminary part of princeton university?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The United States Congress is the bicameral legislature of the Federal government of the United States. The legislature consists of two chambers: the Senate and the House of Representatives. Question: is the house of representatives also called congress?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: In Little League, in the Tee-Ball and Minor League divisions, the batter is out after the third strike regardless of whether the pitched ball is caught cleanly by the catcher. In Little League (or the Major Division), Junior, Senior, and Big League divisions, a batter may attempt to advance to first base on an uncaught third strike. Little League Major Division Softball and many other youth baseball leagues (such as the USSSA) also follow the rule. Question: can you run on a dropped third strike in little league?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: Sanders played football primarily at cornerback, but also as a kick returner, punt returner, and occasionally wide receiver. He played in the National Football League (NFL) for the Atlanta Falcons, the San Francisco 49ers, the Dallas Cowboys, the Washington Redskins and the Baltimore Ravens, winning the Super Bowl with both the 49ers and the Cowboys. An outfielder in baseball, he played professionally for the New York Yankees, the Atlanta Braves, the Cincinnati Reds and the San Francisco Giants, and participated in the 1992 World Series with the Braves. He attended Florida State University, where he was recognized as a two-time All-American in football, and also played baseball and ran track. Question: did deion sanders ever win a world series?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: The United Kingdom European Union membership referendum, also known as the EU referendum and the Brexit referendum, took place on 23 June 2016 in the United Kingdom (UK) and Gibraltar to gauge support for the country either remaining a member of, or leaving, the European Union (EU) under the provisions of the European Union Referendum Act 2015 and also the Political Parties, Elections and Referendums Act 2000. The referendum resulted in a simple majority of 51.9% (of people who voted) being in favour of leaving the EU. Although legally the referendum was non-binding, the government of that time had promised to implement the result, and it initiated the official EU withdrawal process on 29 March 2017, which put the UK on course to leave the EU by 30 March 2019, after a period of Brexit negotiations. Question: is britain still a member of european union?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: A hydraulic fluid or hydraulic liquid is the medium by which power is transferred in hydraulic machinery. Common hydraulic fluids are based on mineral oil or water. Examples of equipment that might use hydraulic fluids are excavators and backhoes, hydraulic brakes, power steering systems, transmissions, garbage trucks, aircraft flight control systems, lifts, and industrial machinery. Question: is mineral oil the same as hydraulic oil?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Robert Westbrook adapted the screenplay to novel form, which was published by Alex in May 2002. Question: was the movie insomnia based on a book?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: While some gold medals are solid gold, others are gold-plated or silver-gilt, like those of the Olympic Games, the Lorentz Medal, the United States Congressional Gold Medal and the Nobel Prize medal. Nobel Prize medals consist of 18 karat green gold plated with 24 karat gold. Before 1980 they were struck in 23 karat gold. Question: is the olympic gold medal made of gold?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The ``Little House'' Books is a series of American children's novels written by Laura Ingalls Wilder, based on her childhood and adolescence in the American Midwest (Wisconsin, Kansas, Minnesota, South Dakota, and Missouri) between 1870 and 1894. Eight of the novels were completed by Wilder, and published by Harper & Brothers. The appellation ``Little House'' books comes from the first and third novels in the series of eight published in her lifetime. The second novel was about her husband's childhood. The first draft of a ninth novel was published posthumously in 1971 and is commonly included in the series. Question: is the little house on the prairie fiction?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The main toll plaza for the Dulles Greenway is located just west of the exits for Route 28 and Dulles Airport. Additional toll plazas are located on westbound entrance ramps and eastbound exit ramps with the exception of Battlefield Parkway (Exit 2) in Leesburg. The toll varies depending on the toll plaza traversed. As of January 2013, the base toll collected for two-axle vehicles ranges from $3.00 ($2.55 with E-ZPass) at the Shreve Mill Rd plaza to $5.10 at the main plaza to and from the Dulles Toll Road (which includes the $1.00 toll for the Dulles Toll Road). Vehicles with more than two axles are charged higher rates. The maximum toll rises to $5.90 (including the 75\u00a2 Dulles Toll Road toll) during congestion pricing hours, which are 6:30 am to 9:00 am eastbound and 4:00 pm to 6:30 pm westbound. A previous increase in the base fare and the introduction of congestion pricing occurred in January 2009, and tolls rose an additional 30 cents per trip on January 1, 2012. Vehicles traveling through the main toll plaza to or from the Dulles Toll Road are charged two tolls: one for the Dulles Toll Road, and one for the Dulles Greenway. Cash tolls are accepted during limited hours, and credit cards and E-ZPass transponder payments are accepted at all times. The Greenway is also one of two routes where a subscription membership (exclusive to E-ZPass) allows for an additional discount. Alternate (free) routes include State Route 7 and State Route 28, both of which are generally more congested. Question: does the dulles toll road take credit cards?",
        "pred_ans": " Yes, the Dulles Toll Road accepts credit cards.",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The building used in the show for the firehouse exteriors is a working Chicago Fire Department firehouse, and is the headquarters of Engine 18, located at 1360 South Blue Island Avenue at Maxwell Street, between 13th & Racine. Housed here is ALS Engine 18, 2--2--1 (Deputy District Chief -- 1st District), 2--1--21 (1st District Chief), 6--4--16 (High-Rise Response Unit), and ALS Ambulance 65. The interiors of Firehouse 51 are filmed at Cinespace Chicago Film Studios. The station house used for exteriors in Chicago PD is just a few blocks away at 949 West Maxwell Street at Morgan Street (interiors likewise filmed at Cinespace). Question: is chicago fire filmed in a real firehouse?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: The Bill of Rights is the first ten amendments to the United States Constitution. Proposed following the often bitter 1787--88 battle over ratification of the U.S. Constitution, and crafted to address the objections raised by Anti-Federalists, the Bill of Rights amendments add to the Constitution specific guarantees of personal freedoms and rights, clear limitations on the government's power in judicial and other proceedings, and explicit declarations that all powers not specifically delegated to Congress by the Constitution are reserved for the states or the people. The concepts codified in these amendments are built upon those found in several earlier documents, including the Virginia Declaration of Rights and the English Bill of Rights, along with earlier documents such as Magna Carta (1215). In practice, the amendments had little impact on judgments by the courts for the first 150 years after ratification. Question: was the bill of rights in the constitution?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: Though not all of its rules represent law, the Highway Code states ``Only flash your headlights to let other road users know that you are there. Do not flash your headlights in an attempt to intimidate other road users''. Drivers warning others about speed traps have been fined in the past for ``misuse of headlights''. Question: is it illegal to flash your headlights to warn of police uk?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": "[1301][\u7cfb\u7edf\u68c0\u6d4b\u5230\u8f93\u5165\u6216\u751f\u6210\u5185\u5bb9\u53ef\u80fd\u5305\u542b\u4e0d\u5b89\u5168\u6216\u654f\u611f\u5185\u5bb9\uff0c\u8bf7\u60a8\u907f\u514d\u8f93\u5165\u6613\u4ea7\u751f\u654f\u611f\u5185\u5bb9\u7684\u63d0\u793a\u8bed\uff0c\u611f\u8c22\u60a8\u7684\u914d\u5408\u3002]"
    },
    {
        "question": "Passage: Nigella sativa (black caraway, also known as black cumin, nigella, and kalonji) is an annual flowering plant in the family Ranunculaceae, native to south and southwest Asia. Question: is black cumin seed same as black seed?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: It is possible to have extra, or ``supernumerary,'' teeth. This phenomenon is called hyperdontia and is often erroneously referred to as ``a third set of teeth.'' These teeth may erupt into the mouth or remain impacted in the bone. Hyperdontia is often associated with syndromes such as cleft lip and palate, trichorhinophalangeal syndrome, cleidocranial dysplasia, and Gardner's syndrome. Question: can you get a 3rd set of teeth?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 95"
    },
    {
        "question": "Passage: Friction tape is a type of adhesive tape made from cloth impregnated with a rubber-based adhesive, mainly used to insulate splices in electric wires and cables. Because the adhesive is impregnated in the cloth, friction tape is sticky on both sides. The rubber-based adhesive makes it an electrical insulator and provides a degree of protection from liquids and corrosion. In the past, friction tape was widely used by electricians, but PVC electrical tape has replaced it in most applications today. The frictional properties of the tape come from the cloth material, which is usually made from cotton, while the fabric base protects electrical splices against punctures and abrasion. Question: is friction tape the same as electrical tape?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: To maintain lactation, a dairy cow must be bred and produce calves. Depending on market conditions, the cow may be bred with a ``dairy bull'' or a ``beef bull.'' Female calves (heifers) with dairy breeding may be kept as replacement cows for the dairy herd. If a replacement cow turns out to be a substandard producer of milk, she then goes to market and can be slaughtered for beef. Male calves can either be used later as a breeding bull or sold and used for veal or beef. Dairy farmers usually begin breeding or artificially inseminating heifers around 13 months of age. A cow's gestation period is approximately nine months. Newborn calves are removed from their mothers quickly, usually within three days, as the mother/calf bond intensifies over time and delayed separation can cause extreme stress on both cow and calf. Question: does a cow have to be pregnant to lactate?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Panama has qualified once for the finals of a FIFA World Cup, the 2018 edition. They directly qualified after securing the third spot in the hexagonal on the final round. This meant that after 10 failed qualification campaigns, Panama would appear at the World Cup for the first time in their history. Question: have panama qualified for the world cup before?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The name ``corpse flower'' applied to Rafflesia can be confusing because this common name also refers to the titan arum (Amorphophallus titanum) of the family Araceae. Moreover, because Amorphophallus has the world's largest unbranched inflorescence, it is sometimes mistakenly credited as having the world's largest flower. Both Rafflesia and Amorphophallus are flowering plants, but they are only distantly related. Rafflesia arnoldii has the largest single flower of any flowering plant, at least in terms of weight. Amorphophallus titanum has the largest unbranched inflorescence, while the talipot palm (Corypha umbraculifera) forms the largest branched inflorescence, containing thousands of flowers; the talipot is monocarpic, meaning the individual plants die after flowering. Question: is rafflesia the largest flower in the world?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Historically, tin and copper as well as a few other metals (e.g. arsenic, silver, and zinc) have been mined in Cornwall and Devon. As of 2007 there are no active metalliferous mines remaining. However, tin deposits still exist in Cornwall, and there has been talk of reopening the South Crofty tin mine. In addition, work has begun on re-opening the Hemerdon tungsten and tin mine in south-west Devon. In view of the economic importance of mines and quarries, geological studies have been conducted: about forty distinct minerals have been identified from type localities in Cornwall (e.g. endellionite from St Endellion). Quarrying of the igneous and metamorphic rocks has also been a significant industry. In the 20th century the extraction of kaolin was important economically. Question: are there any tin mines left in cornwall?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: As all executive authority is vested in the sovereign, their assent is required to allow for bills to become law and for letters patent and orders in council to have legal effect. While the power for these acts stems from the Canadian people through the constitutional conventions of democracy, executive authority remains vested in the Crown and is only entrusted by the sovereign to their government on behalf of the people, underlining the Crown's role in safeguarding the rights, freedoms, and democratic system of government of Canadians, and reinforcing the fact that ``governments are the servants of the people and not the reverse''. Thus, within a constitutional monarchy the sovereign's direct participation in any of these areas of governance is limited, with the sovereign normally exercising executive authority only on the advice of the executive committee of the Queen's Privy Council for Canada, with the sovereign's legislative and judicial responsibilities largely carried out through parliamentarians as well as judges and justices of the peace. The Crown today primarily functions as a guarantor of continuous and stable governance and a nonpartisan safeguard against abuse of power, the sovereign acting as a custodian of the Crown's democratic powers and a representation of the ``power of the people above government and political parties''. Question: does the british monarchy have any power in canada?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": "[1301][\u7cfb\u7edf\u68c0\u6d4b\u5230\u8f93\u5165\u6216\u751f\u6210\u5185\u5bb9\u53ef\u80fd\u5305\u542b\u4e0d\u5b89\u5168\u6216\u654f\u611f\u5185\u5bb9\uff0c\u8bf7\u60a8\u907f\u514d\u8f93\u5165\u6613\u4ea7\u751f\u654f\u611f\u5185\u5bb9\u7684\u63d0\u793a\u8bed\uff0c\u611f\u8c22\u60a8\u7684\u914d\u5408\u3002]"
    },
    {
        "question": "Passage: The vagus nerve (/\u02c8ve\u026a\u0261\u0259s/ VAY-g\u0259s), historically cited as the pneumogastric nerve, is the tenth cranial nerve or CN X, and interfaces with parasympathetic control of the heart, lungs, and digestive tract. The vagus nerves are paired; however, they are normally referred to in the singular. It is the longest nerve of the autonomic nervous system in the human body. The vagus nerve also has a sympathetic function via the peripheral chemoreceptors. Question: is the vagus nerve part of the cns?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: On December 2, 2011, Universal Orlando Resort announced that the Jaws attraction along with the entire Amity area of Universal Studios Florida would close permanently on January 2, 2012 to ``make room for an exciting, NEW, experience.'' (the second phase of The Wizarding World Of Harry Potter.) severe backlash followed after the announcement. The attraction officially closed on January 2, 2012 at 9:00 pm with Michael Skipper aka ``Skip'' giving the final voyage to the last lucky group of 48 guests. By the next morning, the entire Amity area was walled off and completely demolished in the following months. The hanging shark statue from the town square remains as a tribute to the ride and can be found in the Fisherman's Wharf area of the San Francisco section of the park. The attraction remains open at Universal Studios Japan as well as the original tram stop at Universal Studios Hollywood. Question: is the jaws ride at universal orlando closed?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: A 12-episode second season of The Affair premiered on October 4, 2015. On December 9, 2015, the series was renewed for a third season, which debuted on November 20, 2016. On January 9, 2017, Showtime renewed the series for a fourth season, which premiered on June 17, 2018. On July 26, 2018, Showtime announced it had renewed the series for a fifth and final season to debut in 2019. Question: will there be another season of the affiar?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: There are eleven official languages of South Africa: Afrikaans, English, Ndebele, Northern Sotho, Sotho, SiSwati, Tsonga, Tswana, Venda, Xhosa and Zulu. Fewer than two percent of South Africans speak a first language other than an official one. Most South Africans can speak more than one language. Dutch and English were the first official languages of South Africa from 1910 to 1925. Afrikaans was added as a part of Dutch in 1925, although in practice, Afrikaans effectively replaced Dutch, which fell into disuse. When South Africa became a republic in 1961, the official relationship changed such that Afrikaans was considered to include Dutch, and Dutch was dropped in 1984, so between 1984 and 1994, South Africa had two official languages: English and Afrikaans. Question: is english the official language of south africa?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Destin--Fort Walton Beach Airport (IATA: VPS, ICAO: KVPS, FAA LID: VPS) is an airport located within Eglin Air Force Base, near Destin and Fort Walton Beach in Okaloosa County, Florida. No private aircraft are allowed, so Destin Executive Airport is used instead for non-commercial operations by general aviation and business aircraft. The airport was previously named Northwest Florida Regional Airport until February 17, 2015 and Okaloosa Regional Airport until September 2008. Question: is there an airport in fort walton beach florida?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: In biology, tissue is a cellular organizational level between cells and a complete organ. A tissue is an ensemble of similar cells and their extracellular matrix from the same origin that together carry out a specific function. Organs are then formed by the functional grouping together of multiple tissues. Question: is tissue composed of one type of cell?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Alex's past love interest. Though they had a very rough start from the beginning they eventually started to soften towards each other. Sean has a hard time getting Alex to bring down her walls. Alex herself tries not to get emotional and Sean sometimes gets Alex to recognize that he likes her. Despite Sean giving many signs of attraction to Alex, she either ignored them or she was oblivious to them since she was emotionally not ready to commit to a relationship with all the things happening in her life. He and Alex share a first kiss in a car with Birkoff driving and Ryan in the passenger seat. In the second-season finale, he tries to ask her out on a date four times, but Alex never lets him finish due to being in action, criticizing that he said that she was a goal, Alex passing out due to a broken arm and being electrocuted, and Nikita interrupting Sean right before he was going to ask Alex while she was in a Division medical facility. At the start of season 3, it appears Alex and Sean are in a relationship. However, by episode 3 it is revealed that Sean is only at Division for Alex, because he loved her. But Alex is at Division because it is the only place she knows as home, where she can be herself, and where her ``family'' (Nikita) is. After a toxin is released in the lab, Sean returns to ask Alex why she's not returning his calls, and asks her again why she's still there. ``I can't tell you what to do, Alex, but I'm not going to stand by and watch this place destroy another person that I love,'' he says before he kisses her. ``I love you, but if that's not enough of a reason for you to leave, I've got no reason to stay.'' After he leaves, she pops a pill and heads for the operations floor, insisting that she ought to be included in the hunt for Amanda. They eventually make up after an emotional scene in a medical room, they then entered a storage closet and make love, for the first time. In ``Black Badge'', Amanda framed Sean for the death of the head of the CIA. As a result, they faked his death and Sean was officially welcomed into Division by Alex. Sean died in season 3 episode 18, as a result of a bullet nicking his artery. They shared their last moments together in where they met, operations. When Nikita enters OPS, she finds Birkoff near Sean and Alex gone. She presumably went to get revenge for the current events. Sean later died in Alex's arms towards the end of season 3. Question: do sean and alex get together on nikita?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: There is no offside offence if a player receives the ball directly from a goal kick, a corner kick, a throw-in, or a dropped-ball. It is also not an offence if the ball was last deliberately played by an opponent (except for a deliberate save). In this context, according to the IFAB, ``A 'save' is when a player stops, or attempts to stop, a ball which is going into or very close to the goal with any part of the body except the hands/arms (unless the goalkeeper within the penalty area).'' Question: can you be offside on a corner kick?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: Ross-on-Wye railway station is a former junction railway station on the Hereford, Ross and Gloucester Railway constructed just to the north of the Herefordshire town of Ross-on-Wye. It was the terminus of the Ross and Monmouth Railway which joined the Hereford, Ross and Gloucester Railway just south of the station. Question: does ross on wye have a train station?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Additionally, although fees for debit card ATM usage are very rare in countries such as the UK, where cashback originated , this is not the case in some other countries. In Canada and the United States, fees of $1~2 are typical when using an ATM from a different bank than the one with which the customer has an account. The fees in some other countries are even higher. In Germany, for instance, usual fees are \u20ac4~5 when using an ATM of another bank network than the one of his bank. This gives rise to another potential cashback advantage for the consumer: by making use of the cashback procedure, this ATM fee can be avoided for the cardholder. Question: does it cost money to get cash back?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Although Karev and Stevens had an on-off romantic history in previous seasons, it was not until the fifth season that the two formed a lasting relationship. Despite Stevens' advanced skin cancer, the two marry. Justin Chambers commented on his character's inability to tell Stevens that he loves her in the beginning of their relationship, saying that Karev has difficulty expressing himself. Though Stevens departs in season 6, series creator Shonda Rhimes has said that she would like the chance to create closure for both Karev and Stevens. Rhimes later retracted her comments and stated that she has no plans to ever re-approach Izzie's storyline again. In the ninth season, Karev befriends intern Jo Wilson. They begin a relationship in the tenth season until their break-up in the twelfth season. However, they get back together again and marry in the fourteenth season. Question: does alex come back to grey sloan memorial?",
        "pred_ans": " Yes, Karev (Alex) does come back to Grey Sloan Memorial Hospital in the later seasons of the show.",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The kilowatt hour (symbol kWh, kW\u22c5h or kW h) is a unit of energy equal to 3.6 megajoules. If energy is transmitted or used at a constant rate (power) over a period of time, the total energy in kilowatt hours is equal to the power in kilowatts multiplied by the time in hours. The kilowatt hour is commonly used as a billing unit for energy delivered to consumers by electric utilities. Question: is a kilowatt hour a unit of power?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: At the Liberation of France in the summer of 1944, Metropolitan France kept GMT+2 as it was the time then used by the Allies (British Double Summer Time). In the winter of 1944--1945, Metropolitan France switched to GMT+1, same as in the United Kingdom, and switched again to GMT+2 in April 1945 like its British ally. In September 1945, Metropolitan France returned to GMT+1 (pre-war summer time), which the British had already done in July 1945. Metropolitan France was officially scheduled to return to GMT+0 on November 18, 1945 (the British returned to GMT+0 in on October 7, 1945), but the French government canceled the decision on November 5, 1945, and GMT+1 has since then remained the official time of Metropolitan France. Question: is france the same timezone as the uk?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The chocolate bar is structured in two layers; a lightly-whipped nougat layer, with a lower layer of cereal 'crispies', these are then coated in milk chocolate. Originally the bar contained raisins within the base layer; however, consumer research in the mid-1980s led to these being removed and the current formulation being introduced. Television adverts in the 1970s featured Willie Rushton before a mascot named Dougie the Double Decker Dog was introduced. Question: did a double deckers have raisins in it?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Discretionary income is disposable income (after-tax income), minus all payments that are necessary to meet current bills. It is total personal income after subtracting taxes and minimal survival expenses (such as food, medicine, rent or mortgage, utilities, insurance, transportation, property maintenance, child support, etc.) to maintain a certain standard of living. It is the amount of an individual's income available for spending after the essentials have been taken care of: Question: is discretionary income the same as disposable income?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The original script by Ilya Tilkin does not have any literary source. The screenwriter studied diaries of the participants of the Battle of Stalingrad. He also used museum archives, documents and recorded stories of its participants. Question: is the movie stalingrad based on a true story?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The four-second fall from the Golden Gate Bridge sends a person plunging 245 feet (75 m) at 75 miles per hour (121 km/h) to hit the waters of the San Francisco Bay ``with the force of a speeding truck meeting a concrete building.'' Jumping off the bridge holds a 98 percent fatality rate; As of 2005, it is estimated that 26 people have survived after jumping. Some die instantly from internal injuries, while others drown or die of hypothermia. The Golden Gate bridge's death toll has since been surpassed only by the Nanjing Yangtze River Bridge in China. In 2013, 118 potential jumpers were talked down from their attempt and did not jump. Question: could you survive jumping off the golden gate bridge?",
        "pred_ans": " It is extremely unlikely that someone could survive jumping off the Golden Gate Bridge. The fall is too long, the impact is too hard, and the water is too cold. Furthermore, the passage states that the fatality rate is 98 percent. However, 26 people have survived the jump as of 2005.",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Ursa Major is primarily known from the asterism of its main seven relatively bright stars comprising the ``Big Dipper'', ``the Wagon'', ``Charles's Wain'' or ``the Plough'' (among others), with its stellar configuration mimicking the shape of the ``Little Dipper''. Question: is the big dipper the same as ursa major?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: Mount Evans is the highest summit of the Chicago Peaks in the Front Range of the Rocky Mountains of North America. The prominent 14,271-foot (4350 m) fourteener is located in the Mount Evans Wilderness, 13.4 miles (21.6 km) southwest by south (bearing 214\u00b0) of the City of Idaho Springs in Clear Creek County, Colorado, United States, on the drainage divide between Arapaho National Forest and Pike National Forest. Question: is mt evans part of rocky mountain national park?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: In no-limit and pot-limit games, there is a minimum amount that is required to be bet in order to open the action. In games with blinds, this amount is usually the amount of the big blind. Standard poker rules require that raises must be at least equal to the amount of the previous bet or raise. For example, if an opponent bets $5, a player must raise by at least another $5, and they may not raise by only $2. If a player raises a bet of $5 by $7 (for a total of $12), the next re-raise would have to be by at least another $7 (the previous raise) more than the $12 (for a total of at least $19). The primary purpose of the minimum raise rule is to avoid game delays caused by ``nuisance'' raises (small raises of large bets, such as an extra $1 over a current bet of $50, that have little effect on the action but take time as all others must call). This rule is overridden by table stakes rules, so that a player may in fact raise a $5 bet by $2 if that $2 is his entire remaining stake. Question: do you have to raise double in poker?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: In humans, a single transverse palmar crease is a single crease that extends across the palm of the hand, formed by the fusion of the two palmar creases (known in palmistry as the ``heart line'' and the ``head line'') and is found in people with Down syndrome. However, it is not an indication that a person with single transverse palmar crease has to have Down syndrome. It is also found in 1.5% of the general population in at least one hand. Question: do all down syndrome babies have simian crease?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: On 28 September 2016, Nine renewed the program for a second season after just two episodes having been aired. On 11 October 2017, the series was renewed for a third season at Nine's upfronts. and premiered on Monday, 6 August 2018, instead of the previous Wednesday night slot. On 17 October 2018 the series was renewed for a fourth season. Question: is there a fourth season of doctor doctor?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: The standard error (SE) of a statistic (usually an estimate of a parameter) is the standard deviation of its sampling distribution or an estimate of that standard deviation of estimate. If the parameter or the statistic is the mean, it is called the standard error of the mean (SEM). Question: is sampling error the same as standard deviation?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The game features a nonlinear story, that follows the unnamed player character as they go to become a racing icon in the United States by winning in all racing disciplines available in the game. There are four disciplines: Street Racing, Off Road, Freestyle and Pro Racing. In Street Racing, the player is assisted by Latrell. In Off Road, the player is assisted by Tucker ``Tuck'' Morgan. In Freestyle, the player is assisted by Sofia and her father. In Pro Racing, the player is assisted by Alexis. Question: does the crew 2 have a story line?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Nuclear power is the use of nuclear reactions that release nuclear energy to generate heat, which most frequently is then used in steam turbines to produce electricity in a nuclear power plant. Nuclear power can be obtained from nuclear fission, nuclear decay and nuclear fusion. Presently, the vast majority of electricity from nuclear power is produced by nuclear fission of elements in the actinide series of the periodic table. Nuclear decay processes are used in niche applications such as radioisotope thermoelectric generators. The possibility of generating electricity from nuclear fusion is still at a research phase with no commercial applications. This article mostly deals with nuclear fission power for electricity generation. Question: is nuclear power the same as nuclear energy?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: A pitaya (/p\u026a\u02c8ta\u026a.\u0259/) or pitahaya (/\u02ccp\u026at\u0259\u02c8ha\u026a.\u0259/) is the fruit of several different cactus species indigenous to the Americas. Pitaya usually refers to fruit of the genus Stenocereus, while pitahaya or dragon fruit refers to fruit of the genus Hylocereus, both in the Cactaceae family. The dragon fruit is cultivated in Southeast Asia, Florida, the Caribbean, Australia, and throughout tropical and subtropical world regions. Question: is dragon fruit and pitaya the same thing?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The Fire Tablet, formerly called the Kindle Fire, is a tablet computer developed by Amazon.com. Built with Quanta Computer, the Kindle Fire was first released in November 2011, featuring a color 7-inch multi-touch display with IPS technology and running a custom version of Google's Android operating system called Fire OS. The Kindle Fire HD followed in September 2012, and the Kindle Fire HDX in September 2013. In September 2014, when the fourth generation was introduced, the name ``Kindle'' was dropped. In September 2015, the fifth generation Fire 7 was released, followed by the sixth generation Fire HD 8, in September 2016. The seventh generation Fire 7 was released in June 2017. Question: is a fire 7 the same as a kindle?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: Toys ``R'' Us expanded as a chain, becoming predominant in its niche field of toy retail. Represented by cartoon mascot Geoffrey the Giraffe from 1969, Toys ``R'' Us eventually branched out into launching the stores Babies ``R'' Us, Toys ``R'' Us Express, and the now-defunct Kids ``R'' Us. Question: is babies r us and toys r us the same?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: In the series 11 finale Kepner tells Avery that she is leaving with Owen Hunt to serve as a trauma surgeon in the Army; it will help her grieve for their son. Avery lets her go and wonders how he can deal with his own grief. After discussions over the phone via Facetime, Kepner tells Avery that she is extending her service time. The sound of gunfire and explosions are heard at April's base camp, leaving her to quickly terminate the call. On Valentine's Day, Kepner returns to the hospital, where she and Avery embrace in the foyer. In season 12, their marriage begins to fall apart and they grow estranged. In episode 11, they file for a civil divorce. After their divorce is completed, Kepner reveals that she is pregnant with Avery's child. In Season 14 Jackson begins a relationship with Maggie Pierce. Question: does jackson avery die on grey's anatomy?",
        "pred_ans": " No",
        "ground_truth": false,
        "score": " Therefore the score is: 95"
    },
    {
        "question": "Passage: A game consists of a sequence of points played with the same player serving, and is won by the first side to have won at least four points with a margin of two points or more over their opponent. Normally the server's score is always called first and the opponent's score second. Score calling in tennis is unusual in that each point has a corresponding call that is different from its point value. Question: do you have to serve to score in tennis?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Physically and functionally, the auditory system of an absolute listener does not appear to be different from that of a non-absolute listener. Rather, ``it reflects a particular ability to analyze frequency information, presumably involving high-level cortical processing.'' Absolute pitch is an act of cognition, needing memory of the frequency, a label for the frequency (such as ``B-flat''), and exposure to the range of sound encompassed by that categorical label. Absolute pitch may be directly analogous to recognizing colors, phonemes (speech sounds), or other categorical perception of sensory stimuli. Just as most people have learned to recognize and name the color blue by the range of frequencies of the electromagnetic radiation that are perceived as light, it is possible that those who have been exposed to musical notes together with their names early in life will be more likely to identify, for example, the note C. Absolute pitch may also be related to certain genes, possibly an autosomal dominant genetic trait, though it ``might be nothing more than a general human capacity whose expression is strongly biased by the level and type of exposure to music that people experience in a given culture.'' Question: do you have to be born with perfect pitch?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The Book of Jasher (also, Jashar) or the Book of the Upright or the Book of the Just Man (Hebrew: \u05e1\u05b5\u05e4\u05b6\u05e8 \u05d4\u05b7\u05d9\u05c7\u05bc\u05e9\u05c7\u05c1\u05e8\u202c; transliteration: s\u0113fer hayy\u0101\u0161\u0101r) is an unknown book mentioned in the Hebrew Bible. The translation ``Book of the Just Man'' is the traditional Greek and Latin translation, while the transliterated form ``Jasher'' is found in the King James Bible, 1611. Question: is the book of jasher in the bible?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: If discharged administratively for any of the above reasons, the service member normally receives an honorable or a general (under honorable conditions) discharge. If misconduct is involved the service member may receive an Other Than Honorable (OTH) Discharge service characterization. Question: is under honorable conditions the same as honorable discharge?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: Peptidoglycan, also known as murein, is a polymer consisting of sugars and amino acids that forms a mesh-like layer outside the plasma membrane of most bacteria, forming the cell wall. The sugar component consists of alternating residues of \u03b2-(1,4) linked N-acetylglucosamine (NAG) and N-acetylmuramic acid (NAM) . Attached to the N-acetylmuramic acid is a peptide chain of three to five amino acids. The peptide chain can be cross-linked to the peptide chain of another strand forming the 3D mesh-like layer. Peptidoglycan serves a structural role in the bacterial cell wall, giving structural strength, as well as counteracting the osmotic pressure of the cytoplasm. A common misconception is that peptidoglycan gives the cell its shape; however, whereas peptidoglycan helps maintain the structural strength of the cell, it is actually the MreB protein that facilitates cell shape. Peptidoglycan is also involved in binary fission during bacterial cell reproduction. Question: do all bacteria have peptidoglycan in their cell walls?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 95"
    },
    {
        "question": "Passage: All other nations have some form of legislation meant to prevent the illegal trading of organs, whether by an outright ban or through legislation that limits how and by whom donations can be made. Many countries, including Belgium and France, use a system of presumed consent to increase the amount of legal organs available for transplant. . In the United States, federal law prohibits the sale of organs; however, the government has created initiatives to encourage organ gifting and to compensate those who freely donate their organs. In 2004, the state of Wisconsin began providing tax deductions to living donors. Question: is it legal to sell human body parts?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 95"
    },
    {
        "question": "Passage: The redback is one of the few spider species that can be seriously harmful to humans, and its liking for habitats in built structures has led it to being responsible for a large number of serious spider bites in Australia. Predominantly neurotoxic to vertebrates, the venom gives rise to the syndrome of latrodectism in humans; this starts with pain around the bite site, which typically becomes severe and progresses up the bitten limb and persists for over 24 hours. Sweating in localised patches of skin occasionally occurs and is highly indicative of latrodectism. Generalised symptoms of nausea, vomiting, headache, and agitation may also occur and indicate severe envenomation. An antivenom has been available since 1956. There have been no deaths directly due to redback bites since its introduction, however Isbister et al. have suggested patients for whom antivenom is considered should be fully informed ``there is considerable weight of evidence to suggest it is no better than placebo'', and in light of a risk of anaphylaxis and serum sickness, ``routine use of the antivenom is therefore not recommended''. As of the 2013 (updated 2014) edition of the Snakebite & Spiderbite Clinical Management Guidelines from NSW HEALTH (latest available in 2017), Red-back spider bites were considered not life-threatening but capable of causing severe pain and systemic symptoms that could continue for hours to days. Question: can a red back spider bite kill you?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 95"
    },
    {
        "question": "Passage: Canada is one of the oldest continuing monarchies in the world. Initially established in the 16th century, monarchy in Canada has evolved through a continuous succession of French and British sovereigns into the independent Canadian sovereigns of today, whose institution is sometimes colloquially referred to as the Maple Crown. Question: is canada still part of the british monarchy?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Before 1973, the NCAA's smaller schools were grouped together in the College Division. In 1973, the College Division split in two when the NCAA began using numeric designations for its competitions. The College Division members who wanted to offer athletic scholarships or compete against those who did became Division II, while those who chose not to offer athletic scholarships became Division III. Question: do ncaa division 2 schools offer athletic scholarships?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: The first installment, Divergent (2014), grossed over $288 million worldwide, while the second installment, The Divergent Series: Insurgent (2015), grossed over $297 million worldwide. Insurgent was also the first Divergent film to be released in IMAX 3D. The third installment, The Divergent Series: Allegiant (2016), grossed $179 million. Thus, the first three films of the series have grossed over $765 million worldwide. A fourth film, The Divergent Series: Ascendant was to be released theatrically, but due to Allegiant's poor showing at the box office, it was announced it would be released as a television film that could lead into a potential episodic spin-off series on Starz. However, Woodley, along with other cast members, expressed no interest in returning. Question: is there a 4th film in divergent series?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Ordinarily, a baseball game consists of nine innings (in softball and high school baseball games there are typically seven innings; in Little League Baseball, six), each of which is divided into halves: the visiting team bats first, after which the home team takes its turn at bat. However, if the score remains tied at the end of the regulation number of complete innings, the rules provide that ``play shall continue until (1) the visiting team has scored more total runs than the home team at the end of a completed inning; or (2) the home team scores the winning run in an uncompleted inning.'' (Since the home team bats second, condition (2) implies that the visiting team will not have the opportunity to score more runs before the end of the inning.) Question: can a home team win by 2 in extra innings?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 95"
    },
    {
        "question": "Passage: The Open Door Policy is a term in foreign affairs initially used to refer to the United States policy established in the late 19th century and the early 20th century that would allow for a system of trade in China open to all countries equally. It was used mainly to mediate the competing interests of different colonial powers in China. In more recent times, Open Door policy describes the economic policy initiated by Deng Xiaoping in 1978 to open up China to foreign businesses that wanted to invest in the country. This later policy set into motion the economic transformation of modern China. Question: is the open door policy still used today?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": "[1301][\u7cfb\u7edf\u68c0\u6d4b\u5230\u8f93\u5165\u6216\u751f\u6210\u5185\u5bb9\u53ef\u80fd\u5305\u542b\u4e0d\u5b89\u5168\u6216\u654f\u611f\u5185\u5bb9\uff0c\u8bf7\u60a8\u907f\u514d\u8f93\u5165\u6613\u4ea7\u751f\u654f\u611f\u5185\u5bb9\u7684\u63d0\u793a\u8bed\uff0c\u611f\u8c22\u60a8\u7684\u914d\u5408\u3002]"
    },
    {
        "question": "Passage: In December of 2015, Bad Lip Reading simultaneously released three new videos, one for each of the three films in the original Star Wars trilogy. These videos found BLR using guest voices for the first time, featuring Jack Black as Darth Vader, Maya Rudolph as Princess Leia, and Bill Hader in multiple roles. The Empire Strikes Back BLR video featured a scene of Yoda singing to Luke about an unfortunate encounter with a seagull on the beach. BLR would later expand this scene into a full-length standalone song known as ``Seagulls! (Stop It Now)'', which was released in November 2016 (eventually hitting #1 on the Billboard Comedy Digital Tracks chart.) As of late 2017, the ``Seagulls!'' video is Bad Lip Reading's second most viewed YouTube upload, and most popular musical production. In the song, Yoda sings to Luke Skywalker about the dangers posed by vicious seagulls if one dares to go to the beach. Mark Hamill, who played Luke Skywalker in the Star Wars films, publicly praised ``Seagulls!'' (and Bad Lip Reading in general) while speaking at Star Wars Celebration in 2017: ``I love them, and I showed Carrie (Fisher) the Yoda one... we were dying. I showed it to her in her trailer. She loved it. I retweeted it... and (BLR) contacted me and said 'Do you want to do Bad Lip Reading?' And I said, 'I'd love to...'''. Hamill and Bad Lip Reading would go on to collaborate on Bad Lip Reading's version of The Force Awakens, with Hamill providing the voice of Han Solo. Question: is seagulls stop it now a real song?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": "[1301][\u7cfb\u7edf\u68c0\u6d4b\u5230\u8f93\u5165\u6216\u751f\u6210\u5185\u5bb9\u53ef\u80fd\u5305\u542b\u4e0d\u5b89\u5168\u6216\u654f\u611f\u5185\u5bb9\uff0c\u8bf7\u60a8\u907f\u514d\u8f93\u5165\u6613\u4ea7\u751f\u654f\u611f\u5185\u5bb9\u7684\u63d0\u793a\u8bed\uff0c\u611f\u8c22\u60a8\u7684\u914d\u5408\u3002]"
    },
    {
        "question": "Passage: The Constitution of India designates the official language of the Government of India as Hindi written in the Devanagari script, as well as English. There is no national language as declared by the Constitution of India. Hindi is used for official purposes such as parliamentary proceedings, judiciary, communications between the Central Government and a State Government. States within India have the liberty and powers to specify their own official language(s) through legislation and therefore there are 22 officially recognized languages in India of which Hindi is the most used. The number of native Hindi speakers is about 25% of the total Indian population; however, including dialects of Hindi termed as Hindi languages, the total is around 44% of Indians, mostly accounted from the states falling under the Hindi belt. Other Indian languages are each spoken by around 10% or less of the population. Question: is hindi is our national language of india?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Carbon (from Latin: carbo ``coal'') is a chemical element with symbol C and atomic number 6. It is nonmetallic and tetravalent--making four electrons available to form covalent chemical bonds. It belongs to group 14 of the periodic table. Three isotopes occur naturally, C and C being stable, while C is a radionuclide, decaying with a half-life of about 5,730 years. Carbon is one of the few elements known since antiquity. Question: is carbon a metal on the periodic table?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: Date and time notation in the United Kingdom records the date using the day-month-year format (21 October 2011 or 21/10/11). The ISO 8601 format (2011-08-21) is increasingly used for all-numeric dates. The time can be written using either the 24-hour clock (16:10) or 12-hour clock (4.10 p.m.). Question: does the uk use a 24 hour clock?",
        "pred_ans": " Yes, the UK uses a 24-hour clock for time notation.",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Broken heart (also known as a heartbreak or heartache) is a metaphor for the intense emotional--and sometimes physical--stress or pain one feels at experiencing great longing. The concept is cross-cultural, often cited with reference to a desired or lost lover, and dates back at least 3,000 years. Question: is there a such thing as a broken heart?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: On June 1, 2017, it was announced that Seyfried would return as Sophie. Later that month, Dominic Cooper confirmed that he would return for the sequel, along with Streep, Firth and Brosnan as Sky, Donna, Harry, and Sam, respectively. In July 2017, Baranski was also confirmed to return as Tanya. On July 12, 2017, Lily James was cast to play the role of young Donna. On August 3, 2017, Jeremy Irvine and Alexa Davies were also cast in the film, with Irvine playing Brosnan's character Sam in a past era, and Hugh Skinner to play Young Harry, Davies as a young Rosie, played by Julie Walters. On August 16, 2017, it was announced that Jessica Keenan Wynn had been cast as a young Tanya, who is played by Baranski. Julie Walters and Stellan Skarsg\u00e5rd also reprised their roles as Rosie and Bill, respectively. On October 16, 2017, it was announced that singer and actress Cher had joined the cast, in her first on-screen film role since 2010, and her first film with Streep since Silkwood. Question: is the cast of mama mia the same?",
        "pred_ans": " Yes, the cast of Mama Mia! Here We Go Again includes many of the original cast members, such as Meryl Streep, Pierce Brosnan, Colin Firth, and Julie Walters, along with some new additions like Lily James, Jeremy Irvine, and Alexa Davies.",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Citizens of countries in the European Economic Area (other than British and Irish citizens) and Swiss citizens obtain permanent residence status automatically after five years' residence in the United Kingdom exercising Treaty rights rather than ILR. The rights of EEA citizens are not governed by UK Immigration Regulations but rather the EEA Regulations. Question: do eu citizens have indefinite leave to remain?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": "[1301][\u7cfb\u7edf\u68c0\u6d4b\u5230\u8f93\u5165\u6216\u751f\u6210\u5185\u5bb9\u53ef\u80fd\u5305\u542b\u4e0d\u5b89\u5168\u6216\u654f\u611f\u5185\u5bb9\uff0c\u8bf7\u60a8\u907f\u514d\u8f93\u5165\u6613\u4ea7\u751f\u654f\u611f\u5185\u5bb9\u7684\u63d0\u793a\u8bed\uff0c\u611f\u8c22\u60a8\u7684\u914d\u5408\u3002]"
    },
    {
        "question": "Passage: The passport card is a limited travel document, valid only for land and sea travel within North America (Canada, the United States, Mexico, the Caribbean, and Bermuda). It cannot be used for international air travel. The Department of State indicates that this is because ``designing a card format passport for wide use, including by air travelers, would inadvertently undercut the broad based international effort to strengthen civil aviation security and travel document specifications to address the post 9/11 threat environment''. Question: can a passport card be used to fly to canada?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: In order to recover construction and operating costs, the bridge was electronically tolled when originally built. The toll rates increased to $1.60 for motorcycle, $3.15 for cars, $6.30 for small trucks and $9.45 for large trucks on August 15, 2015. Through increased prices and greater traffic, Transportation Investment Corporation (TI Corp), the public Crown corporation responsible for toll operations on the Port Mann Bridge, forecast its revenue would grow by 85% between fiscal years 2014 and 2017. These fees were assessed using radio-frequency identification (RFID) decals or licence plate photos. A B.C. licensed driver who owes more than $25 in tolls outstanding 90 days is penalized $20 and is unable to purchase vehicle insurance or renew drivers permits without payment of the debt. Out-of-province drivers were also contacted for payment by a US-based contractor. A licence plate processing fee of $2.30 per trip was added to the toll rate for unregistered users who did not pay their toll within seven days of their passage. Monthly passes, which allowed unlimited crossing on the bridge, were available for purchase. Users may have set up an account for online payment of tolls. Users who opted for this method received a decal with an embedded RFID to place on their vehicle's windshield or headlight and avoid paying a processing fee. Tolls were expected to be removed by the year of 2050 or after collecting $3.3 billion. As announced by B.C. Premier John Horgan a few days earlier, all tolls on the Port Mann Bridge were removed on September 1, 2017. Debt service was transferred to the province of British Columbia at a cost of $135 million per year. Question: is the port mann bridge a toll bridge?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: The group winners, Serbia, qualified directly for the 2018 FIFA World Cup. The group runners-up, Republic of Ireland, advanced to the play-offs as one of the best 8 runners-up, where they lost to Denmark and thus failed to qualify. Question: has ireland qualified for the world cup 2018?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: On October 20, 1977 -- three days after the release of the band's fifth studio album Street Survivors -- a chartered plane on which the members and crew were travelling crashed in Gillsburg, Mississippi. Six people died in the accident, including band members Ronnie Van Zant, Steve Gaines and Cassie Gaines; many of the other passengers onboard were seriously injured, including Wilkeson who was left in a critical condition and reportedly declared dead three times. The group disbanded after the crash. In 1978, a collection of previously unreleased recordings from 1971 and 1972 was released as Skynyrd's First and... Last. The following year, the surviving members (with the exception of Wilkeson) reunited at Volunteer Jam for a performance of ``Free Bird'' with Charlie Daniels and his band. Question: are any original members of lynyrd skynyrd still alive?",
        "pred_ans": " Yes, some original members of Lynyrd Skynyrd are still alive.",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    }
][
    {
        "question": "Passage: While the term was in use as early as 1933, it became official only after the formation of the NCAA Division I athletic conference in 1954. Seven of the eight schools were founded during the colonial period (Cornell was founded in 1865). Ivy League institutions account for seven of the nine Colonial Colleges chartered before the American Revolution; the other two are Rutgers University and the College of William & Mary. Question: is college of william and mary an ivy league school?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The fluid ounce is distinct from the ounce as a unit of weight or mass, although it is sometimes referred to simply as an ``ounce'' where context makes the meaning clear, such as ounces in a bottle. Question: is 1 ounce the same as 1 fluid ounce?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Usually, one of the first items in an order of business or an agenda for a meeting is the reading and approval of the minutes from the previous meeting. If the members of the group agree (usually by unanimous consent) that the written minutes reflect what happened at the previous meeting, then they are approved, and the fact of their approval is recorded in the minutes of the current meeting. If there are significant errors or omissions, then the minutes may be redrafted and submitted again at a later date. Minor changes may be made immediately using the normal amendment procedures, and the amended minutes may be approved ``as amended''. It is normally appropriate to send a draft copy of the minutes to all the members in advance of the meeting so that the meeting is not delayed by a reading of the draft. Question: do minutes of a meeting have to be approved?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The work was initially intended by Tolkien to be one volume of a two-volume set, the other to be The Silmarillion, but this idea was dismissed by his publisher. For economic reasons, The Lord of the Rings was published in three volumes over the course of a year from 29 July 1954 to 20 October 1955. The three volumes were titled The Fellowship of the Ring, The Two Towers and The Return of the King. Structurally, the novel is divided internally into six books, two per volume, with several appendices of background material included at the end. Some editions combine the entire work into a single volume. The Lord of the Rings has since been reprinted numerous times and translated into 38 languages. Question: was the lord of the rings originally one book?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Nationals of any country may visit Montenegro without a visa for up to 30 days if they hold a passport with visas issued by Ireland, a Schengen Area member state, the United Kingdom or the United States or if they are permanent residents of those countries. Residents of the United Arab Emirates do not require a visa for up to 10 days, if they hold a return ticket and proof of accommodation. Question: can i go to montenegro with a schengen visa?",
        "pred_ans": "NILL",
        "ground_truth": true,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: A batsman may not be given out bowled, leg before wicket, caught, stumped or hit wicket off a no-ball. A batsman may be given out run out, hit the ball twice, or obstructing the field. Thus the call of no-ball protects the batsman against losing his wicket in ways that are attributed to the bowler, but not in ways that are attributed to running, or to the batsman's own conduct. Question: can a batsman be run out on a no ball?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: A ganglionic blocker (or ganglioplegic) is a type of medication that inhibits transmission between preganglionic and postganglionic neurons in the Autonomic Nervous System, often by acting as a nicotinic receptor antagonist. Nicotinic acetylcholine receptors are found on skeletal muscle, but also within the route of transmission for the parasympathetic and sympathetic nervous system (which together comprise the autonomic nervous system). More specifically, nicotinic receptors are found within the ganglia of the autonomic nervous system, allowing outgoing signals to be transmitted from the presynaptic to the postsynaptic cells. Thus, for example, blocking nicotinic acetylcholine receptors blocks both sympathetic (excitatory) and parasympathetic (calming) stimulation of the heart. The nicotinic antagonist hexamethonium, for example, does this by blocking the transmission of outgoing signals across the autonomic ganglia at the postsynaptic nicotinic acetylcholine receptor. Question: can nicotine be classified as a ganglion blocker?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Six Flags Great America is an amusement park located in Gurnee, Illinois. Part of the Six Flags chain, Great America was first opened in 1976 by the Marriott Corporation as Marriott's Great America. Six Flags has owned and operated the park since 1984, making it the seventh park in the chain. The park offers ten themed areas, as well as Hurricane Harbor, a 20-acre (81,000 m) water park, and three specially themed children's areas. Question: is great america the same as six flags?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: In the United Kingdom and the Crown dependencies, any household watching or recording live television transmissions as they are being broadcast (terrestrial, satellite, cable, or Internet) is required to hold a television licence. Businesses, hospitals, schools and a range of other organisations are also required to hold television licences to watch and record live TV broadcasts. A television licence is also required to receive video on demand programme services provided by the BBC, on the iPlayer catch-up service. Question: do you have to have a license to own a tv in england?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: There are currently certain restrictions on the possession of airsoft replicas, which came in with the introduction of the ASBA (Anti-Social Behaviour Act 2003) Amendments, prohibiting the possession of any firearms replica in a public place without good cause (to be concealed in a gun case or container only, not to be left in view of public at any time). Question: do you need a license for airsoft guns uk?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The United States Congress is the bicameral legislature of the Federal government of the United States. The legislature consists of two chambers: the Senate and the House of Representatives. Question: is the house of representatives the same as congress?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: In the United Kingdom, there is not an equivalent of a vehicle title. Instead, there is a document known as the 'vehicle registration document', and is issued by the Driver and Vehicle Licensing Agency (DVLA). The current version has the reference number V5C. Prior to computerisation, the title document was the 'log book', and this term is sometimes still used to describe the V5C. The V5 document records who the Registered Keeper of the vehicle is; it does not establish legal ownership of the vehicle. These documents used to be blue on the front. However, they were changed to red in 2010/11 after approximately 2.2 million blank blue V5 documents were stolen, allowing thieves to clone stolen vehicles much more easily. Question: is a title and registration the same thing?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Before the Civil War, President James Buchanan took a weak position amid a looming South secession crisis. Secretary of State Lewis Cass of Michigan, a 78-year-old elder statesman who has been Michigan's U.S. senator and governor of Michigan Territory, resigned from Buchanan's cabinet in protest, remarking that ``he had seen the Constitution born and now feared he was seeing it die''. Question: was michigan a state during the civil war?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The League's greatest extent was from 28 September 1934 (when Ecuador joined) to 23 February 1935 (when Paraguay withdrew) with 58 members. At this time, only Costa Rica (22 January 1925), Brazil (14 June 1926), Japan (27 March 1933) and Germany (19 September 1933) had withdrawn and only Egypt was left to join (on 26 May 1937). The members (listed from earliest joining and alphabetically if they joined on the same day) at this time were Argentina, Australia, Belgium, Bolivia, the British Empire, Canada, Chile, China, Colombia, Cuba, Czechoslovakia, Denmark, El Salvador, France, Greece, Guatemala, Haiti, Honduras, India, Italy, Liberia, the Netherlands, New Zealand, Nicaragua, Norway, Panama, Paraguay, Persia/Iran, Peru, Poland, Portugal, Romania, Siam, South Africa, Spain, Sweden, Switzerland, Uruguay, Venezuela, Yugoslavia, Austria, Bulgaria, Finland, Luxembourg, Albania, Estonia, Latvia, Lithuania, Hungary, the Irish Free State, Ethiopia, the Dominican Republic, Mexico, Turkey, Iraq, the Soviet Union, Afghanistan, and Ecuador. Question: was poland part of the league of nations?",
        "pred_ans": "NILL",
        "ground_truth": true,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: The United States men's national soccer team is controlled by the United States Soccer Federation and competes in the Confederation of North, Central American and Caribbean Association Football. The team has appeared in ten FIFA World Cups, including the first in 1930, where they reached the semi-finals. The U.S. participated in the 1934 and 1950 World Cups, winning 1--0 against England in the latter. After 1950, the U.S. did not qualify for the World Cup until 1990. The U.S. hosted the 1994 World Cup, where they lost to Brazil in the round of sixteen. They qualified for five more consecutive World Cups after 1990 (for a total of seven straight appearances, a feat shared with only seven other nations), becoming one of the tournament's regular competitors and often advancing to the knockout stage. The U.S. reached the quarter-finals of the 2002 World Cup, where they lost to Germany. In the 2009 Confederations Cup, they eliminated top-ranked Spain in the semi-finals before losing to Brazil in the final, their only appearance in a final. The team failed to qualify for the 2018 World Cup, having been eliminated in continental qualifying, ending the streak of consecutive World Cups at seven. Question: does america have a team in the world cup?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The production of milk requires that the cow be in lactation, which is a result of the cow having given birth to a calf. The cycle of insemination, pregnancy, parturition, and lactation is followed by a ``dry'' period of about two months before calving, which allows udder tissue to regenerate. A dry period that falls outside this time frames can result in decreased milk production in subsequent lactation. Dairy operations therefore include both the production of milk and the production of calves. Bull calves are either castrated and raised as steers for beef production or used for veal. Question: do cows have to be pregnant to get milk?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The subsequent trophy, called the ``FIFA World Cup Trophy'', was introduced in 1974. Made of 18 carat gold with bands of malachite on its base, it stands 36.8 centimetres high and weighs 6.1 kilograms. The trophy was made by Stabilimento Artistico Bertoni company in Italy. It depicts two human figures holding up the Earth. The current holders of the trophy are France, winners of the 2018 World Cup. Question: does the world cup trophy have a name?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: A cordon bleu or schnitzel cordon bleu is a dish of meat wrapped around cheese (or with cheese filling), then breaded and pan-fried or deep-fried. Veal or pork cordon bleu is made of veal or pork pounded thin and wrapped around a slice of ham and a slice of cheese, breaded, and then pan fried or baked. For chicken cordon bleu chicken breast is used instead of veal. Ham cordon bleu is ham stuffed with mushrooms and cheese. Question: is chicken cordon bleu made with blue cheese?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: In the United States, an honours degree (or honors degree in US spelling) requires a thesis or project work beyond that needed for the normal bachelor's program. Honours programs in the US are taken alongside the rest of the degree and often have a minimum GPA requirement for entry, which can vary between institutions. Some institutions do not have a separate honours program, but instead refer to bachelor's degrees awarded with Latin honours, which may be based either on GPA or class position, as honours degrees. Question: is a master's degree higher than an honours degree?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Greece in the Roman era describes the period of Greek history when it was dominated by the Roman republic, the Roman Empire and the Byzantine Empire (collectively, the Roman era). It began with the Roman victory over the Corinthians, at the Battle of Corinth (146 BC). It continued with the adoption of the city of Byzantium by the Emperor Constantine the Great as the capital of the Roman Empire (as Nova Roma, later Constantinople) in AD 330. After this date, the Eastern Empire became largely Greek speaking. Question: were greece and rome around at the same time?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Although it is widely believed that a worker honey bee can sting only once, this is a partial misconception: although the stinger is in fact barbed so that it lodges in the victim's skin, tearing loose from the bee's abdomen and leading to its death in minutes, this only happens if the skin of the victim is sufficiently thick, such as a mammal's. Honey bees are the only hymenoptera with a strongly barbed sting, though yellow jackets and some other wasps have small barbs. Question: is there always a stinger in a bee sting?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The natural logarithm of a number is its logarithm to the base of the mathematical constant e, where e is an irrational and transcendental number approximately equal to 7000271828182845899\u26602.718281828459. The natural logarithm of x is generally written as ln x, log x, or sometimes, if the base e is implicit, simply log x. Parentheses are sometimes added for clarity, giving ln(x), log(x) or log(x). This is done in particular when the argument to the logarithm is not a single symbol, to prevent ambiguity. Question: is log x the same as ln x?",
        "pred_ans": " Yes, log x and ln x are the same thing.",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: Later Gemma comes to see her and confesses that it was Clay who wanted her dead because of the letters, which she gives to Gemma. She reveals later to Gemma that she was fully aware of her plan to have Clay killed by Jax and hide the letters that involve her and Unser, to prompt Jax to stay as President. Tara gives Jax a blood thinner to inject into Clay, who is recuperating at St. Thomas. She then tells Jax, in front of Gemma, that he will kill Clay and come get her and the boys, so they can leave Charming forever. This plan fails as Jax is forced to stay due to the influence of Romeo Parada and Luis Torres who are undercover CIA agents. They need SAMCRO to provide weapons and transport drugs or they will crush the club. Understanding that Jax has to become President, Tara decides to remain with him. In the season finale, with Gemma watching Tara stands behind Jax at the head of the table, one arm draped around him, mirroring a photo of Gemma and John Teller. Question: does tara die in sons of anarchy season 4?",
        "pred_ans": " No",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: The mishap in The Guardian where Randall loses his crew is loosely based on an actual U.S. Coast Guard aviation mishap in Alaska. The aircraft was an HH-3F Pelican (USCG variant of the Jolly Green Giant) instead of the HH-60J Jayhawk (USCG variant of the Blackhawk/Seahawk) pictured in the movie. Question: is the movie the guardian based on a true story?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The World Health Organization (WHO) is a specialized agency of the United Nations that is concerned with international public health. It was established on 7 April 1948, and is headquartered in Geneva, Switzerland. The WHO is a member of the United Nations Development Group. Its predecessor, the Health Organization, was an agency of the League of Nations. Question: is the world health organization a government organization?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The United States Federal Food, Drug, and Cosmetic Act (abbreviated as FFDCA, FDCA, or FD&C), is a set of laws passed by Congress in 1938 giving authority to the U.S. Food and Drug Administration (FDA) to oversee the safety of food, drugs, and cosmetics. A principal author of this law was Royal S. Copeland, a three-term U.S. Senator from New York. In 1968, the Electronic Product Radiation Control provisions were added to the FD&C. Also in that year the FDA formed the Drug Efficacy Study Implementation (DESI) to incorporate into FD&C regulations the recommendations from a National Academy of Sciences investigation of effectiveness of previously marketed drugs. The act has been amended many times, most recently to add requirements about bioterrorism preparations. Question: was the food drug and cosmetic act successful?",
        "pred_ans": " It is difficult to objectively determine if the Food, Drug, and Cosmetic Act (FFDCA) has been successful. However, the law has been in effect for over 80 years and has been amended many times, which suggests that it has had a significant impact on the regulation of food, drugs, and cosmetics in the United States. The FDA, which is responsible for enforcing the FFDCA, has played a crucial role in ensuring the safety of these products and protecting the public from unsafe or ineffective products. That being said, there have been instances where the FDA has been criticized for not acting quickly or effectively enough to address safety concerns or for being too lenient in its approval process. Overall, while the FFDCA has had its flaws and challenges, it has been a critical piece of legislation in shaping the regulatory landscape of food, drugs, and cosmetics in the United States.",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The standard error (SE) of a statistic (usually an estimate of a parameter) is the standard deviation of its sampling distribution or an estimate of that standard deviation of estimate. If the parameter or the statistic is the mean, it is called the standard error of the mean (SEM). Question: is the standard error the same as standard deviation?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The NBA high school draftees are players who have been drafted to the National Basketball Association (NBA) straight out of high school without playing basketball at the collegiate level. The process of jumping directly from high school to the professional level is also known as going prep-to-pro. Since 2006, the practice of drafting high school players has been prohibited by the new collective bargaining agreement, which requires that players who entered the draft be 19 years of age and at least one year removed from high school. Contrary to popular belief, the player does not have to play at least a year in college basketball, as the player can choose to instead play in another professional league (like the NBA G League or especially somewhere overseas) like Brandon Jennings or Emmanuel Mudiay in Italy and China respectively, simply take the year off, such as the case with Mitchell Robinson, or even hold themselves back a year in high school before declaring for the draft, like with Satnam Singh Bhamara or Thon Maker. Question: can you go to nba out of high school?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 95"
    },
    {
        "question": "Passage: From 1976 to 1983, several states voluntarily raised their purchase ages to 19 (or, less commonly, 20 or 21), in part to combat drunk driving fatalities. In 1984, Congress passed the National Minimum Drinking Age Act, which required states to raise their ages for purchase and public possession to 21 by October 1986 or lose 10% of their federal highway funds. By mid-1988, all 50 states and the District of Columbia had raised their purchase ages to 21 (but not Puerto Rico, Guam, or the Virgin Islands, see Additional Notes below). South Dakota and Wyoming were the final two states to comply with the age 21 mandate. The current drinking age of 21 remains a point of contention among many Americans, because of it being higher than the age of majority (18 in most states) and higher than the drinking ages of most other countries. The National Minimum Drinking Age Act is also seen as a congressional sidestep of the tenth amendment. Although debates have not been highly publicized, a few states have proposed legislation to lower their drinking age, while Guam has raised its drinking age to 21 in July 2010. Question: does the drinking age vary from state to state?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: Exclusions: The letters I, O, and Q are not used in the first or third alpha positions of the 7-digit alpha-numeric series. Question: does california use the letter o on license plates?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The sixth and final season premiered on 19 August 2018. Question: is a place to call home finished for good?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: On 13 July 1967, British cyclist Tom Simpson died climbing Mont Ventoux after taking amphetamine. Question: has anyone ever died doing the tour de france?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The hosts of the World Cup receive an automatic berth. Unlike many other sports, results of the previous World Cups or of the continental championships are not taken into account. Until 2002, the defending champions also received an automatic berth, but starting from the 2006 World Cup this is no longer the case. Question: does host country have to qualify for world cup?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: The Spanish language is written using the Spanish alphabet, which is the Latin script with one additional letter: e\u00f1e ``\u00f1'', for a total of 27 letters. Although the letters ``k'' and ``w'' are part of the alphabet, they appear only in loanwords such as karate, kilo, waterpolo and wolframio (tungsten). Each letter has a single official name according to the Real Academia Espa\u00f1ola's new 2010 Common Orthography, but in some regions alternative traditional names coexist as explained below. The digraphs ``ch'' and ``ll'' were considered letters of the alphabet from 1754 to 2010 (and sorted separately from ``c'' and ``l'' from 1803 to 1994). The digraph ``rr'' is occasionally considered a letter, but officially it was never so. Question: does the spanish alphabet have more letters than the english alphabet?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: Continuous positive airway pressure (CPAP) is a form of positive airway pressure ventilator, which applies mild air pressure on a continuous basis to keep the airways continuously open in people who are able to breathe spontaneously on their own. It is an alternative to positive end-expiratory pressure (PEEP). Both modalities stent the lungs' alveoli open and thus recruit more of the lung's surface area for ventilation. But while PEEP refers to devices tvt impose positive pressure only at the end of the exhalation, CPAP devices apply continuous positive airway pressure throughout the breathing cycle. Thus, the ventilator itself does not cycle during CPAP, no additional pressure above the level of CPAP is provided, and patients must initiate all of their breaths. Question: is a cpap the same as a ventilator?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Greece were subsequently drawn against Croatia in the play-off round, where they were knocked out over two legs; a 4--1 away defeat set the tone for Greece's campaign, and in the second leg they drew a blank in a 0--0 stalemate against the Croats to signify the end of their World Cup hopes. Kostas Mitroglou finished as Greece's top scorer throughout their campaign, scoring six goals. Question: is greece not in the world cup 2018?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: In all criminal prosecutions, the accused shall enjoy the right to a speedy and public trial, by an impartial jury of the State and district wherein the crime shall have been committed, which district shall have been previously ascertained by law, and to be informed of the nature and cause of the accusation; to be confronted with the witnesses against him; to have compulsory process for obtaining witnesses in his favor, and to have the Assistance of Counsel for his defence. Question: is the right to a fair trial in the constitution?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: The Tribute in Light is an art installation of 88 searchlights placed six blocks south of the World Trade Center on top of the Battery Parking Garage in New York City to create two vertical columns of light to represent the Twin Towers in remembrance of the September 11, 2001 attacks. Tribute in Light began initially as a temporary commemoration of the attacks in early 2002 but became an annual commemoration, currently produced on September 11th by the Municipal Art Society of New York. Question: do they light up the twin towers every night?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Season 1 was shot inside of an actual grocery store, Field's Market in West Hills, California. For Season 2, the market was built in a 15,500 square foot warehouse in Santa Rosa, CA. It was built over two weeks and stocked with over $700,000 of food. After each episode, the perishable items were donated to local food banks and local farmers. Question: is guys grocery games in real grocery store?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: In Mexico, Belgium, Germany and Austria, the philosophy of the law holds that it is human nature to want to escape. In those countries, escapees who do not break any other laws are not charged for anything and no extra time is added to their sentence. However, in Mexico, officers are allowed to shoot prisoners attempting to escape and an escape is illegal if violence is used against prison personnel or property, or if prison inmates or officials aid the escape. Question: is it legal to escape from prison in germany?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: U.S. Marshals is a 1998 American action crime thriller film directed by Stuart Baird. The storyline was conceived from a screenplay written by Roy Huggins and John Pogue. The film is a spin-off to the 1993 motion picture The Fugitive, which in turn was based on the television series of the same name, created by Huggins. The story does not involve the character of Dr. Richard Kimble, portrayed by Harrison Ford in the initial film, but instead the plot centers on United States Deputy Marshal Sam Gerard, once again played by Tommy Lee Jones. The plot follows Gerard and his team as they pursue another fugitive, Mark Sheridan, played by Wesley Snipes, who attempts to escape government officials following an international conspiracy scandal. The cast features Robert Downey, Jr., Joe Pantoliano, Daniel Roebuck, Tom Wood, and LaTanya Richardson, several of whom portrayed Deputy Marshals in the previous film. Question: is us marshals a sequel to the fugitive?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Eidetic memory (/a\u026a\u02c8d\u025bt\u026ak/; sometimes called photographic memory) is an ability to vividly recall images from memory after only a few instances of exposure, with high precision for a brief time after exposure, without using a mnemonic device. Although the terms eidetic memory and photographic memory are popularly used interchangeably, they are also distinguished, with eidetic memory referring to the ability to view memories like photographs for a few minutes, and photographic memory referring to the ability to recall pages of text or numbers, or similar, in great detail. When the concepts are distinguished, eidetic memory is reported to occur in a small number of children and as something generally not found in adults, while true photographic memory has never been demonstrated to exist. Question: are eidetic memory and photographic memory the same?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 95"
    },
    {
        "question": "Passage: ``Eye of the Tiger'' is a song by American rock band Survivor. It was released as a single from their third album of the same name Eye of the Tiger and was also the theme song for the film Rocky III, which was released a day before the single. The song was written by Survivor guitarist Frankie Sullivan and keyboardist Jim Peterik, and was recorded at the request of Rocky III star, writer, and director Sylvester Stallone, after Queen denied him permission to use ``Another One Bites the Dust'', the song Stallone intended as the Rocky III theme. Originally, the song was made for the movie The Karate Kid. The director of both Rocky and The Karate Kid planned to use the song for a fighting montage towards the end of the feature. John G. Avildsen opted to using ``You're the Best'' by Joe Esposito. The version of the song that appears in the movie is the demo version of the song. The movie version also contained tiger growls, something that did not appear on the album version. It features original Survivor singer Dave Bickler on lead vocals. Question: was eye of the tiger written for rocky?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Some tax protesters such as Edward Brown and tax protester organizations such as the We the People Foundation have used the phrase ``show me the law'' to argue that the Internal Revenue Service refuses to disclose the laws that impose the legal obligation to file Federal income tax returns or pay Federal income taxes--and to argue that there must be no law imposing Federal income taxes. Question: is there a law that says we have to pay taxes?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Pierce's home, ``Port Waldo'', is (real-life) Waldoboro, Maine, and the FinestKind Clinic is just up U.S. Route 1 in Rockland. ``Crabapple Cove'' is actually Broad Cove, in Bremen just down the Medomak River from Waldoboro Village. Author Richard Hooker (Hornberger) owned an old farmhouse on Heath Point. The reader will note Wreck Island, Thief Island, and other Muscongus Bay landmarks in the book. It is possible that the Pierce family is modeled after the (real-life) Spear family, who had a number of different branches in the area, in the 1950s. Question: is there such a place as crabapple cove maine?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The abdomen (less formally called the belly, stomach, tummy or midriff) constitutes the part of the body between the thorax (chest) and pelvis, in humans and in other vertebrates. The region occupied by the abdomen is termed the abdominal cavity. In arthropods it is the posterior tagma of the body; it follows the thorax or cephalothorax. The abdomen stretches from the thorax at the thoracic diaphragm to the pelvis at the pelvic brim. The pelvic brim stretches from the lumbosacral joint (the intervertebral disc between L5 and S1) to the pubic symphysis and is the edge of the pelvic inlet. The space above this inlet and under the thoracic diaphragm is termed the abdominal cavity. The boundary of the abdominal cavity is the abdominal wall in the front and the peritoneal surface at the rear. Question: is the abdomen the same as the stomach?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The National Minimum Drinking Age Act of 1984 (23 U.S.C. \u00a7 158) was passed by the United States Congress on July 17, 1984. It was a controversial bill that punished every state that allowed persons below 21 years to purchase and publicly possess alcoholic beverages by reducing its annual federal highway apportionment by 10 percent. The law was later amended, lowering the penalty to 8 percent from fiscal year 2012 and beyond. Question: is the legal drinking age a federal law?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Colocasia esculenta is thought to be native to Southern India and Southeast Asia, but is widely naturalised. It is a perennial, tropical plant primarily grown as a root vegetable for its edible starchy corm, and as a leaf vegetable. It is a food staple in African, Oceanic and Indian cultures and is believed to have been one of the earliest cultivated plants. Colocasia is thought to have originated in the Indomalaya ecozone, perhaps in East India, Nepal, and Bangladesh, and spread by cultivation eastward into Southeast Asia, East Asia and the Pacific Islands; westward to Egypt and the eastern Mediterranean Basin; and then southward and westward from there into East Africa and West Africa, where it spread to the Caribbean and Americas. It is known by many local names and often referred to as ``elephant ears'' when grown as an ornamental plant. At around 3.3 million metric tons per year, Nigeria is the largest producer of taro in the world. Question: is taro root the same as elephant ears?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The game takes place in the same fictional world as the comic, with events occurring shortly after the onset of the zombie apocalypse in Georgia. However, most of the characters are original to the game, which centers on university professor and convicted criminal Lee Everett, who helps to rescue and subsequently care for a young girl named Clementine. Kirkman provided oversight for the game's story to ensure it corresponded to the tone of the comic, but allowed Telltale to handle the bulk of the developmental work and story specifics. Some characters from the original comic book series also make in-game appearances. Question: is the walking dead game the same as the show?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Call of Duty: WWII is a first-person shooter video game developed by Sledgehammer Games and published by Activision. It was released worldwide on November 3, 2017 for Microsoft Windows, PlayStation 4 and Xbox One. It is the fourteenth main installment in the Call of Duty series and the first title in the series to be set primarily during World War II since Call of Duty: World at War in 2008. Question: was call of duty ww2 based on a true story?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 95"
    },
    {
        "question": "Passage: National Car Rental is an American rental car agency based in Clayton, Missouri, United States. National is owned by Enterprise Holdings, along with other agencies including Enterprise Rent-A-Car, and Alamo Rent a Car. Question: is national and enterprise car rental the same company?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: Princeton Theological Seminary (PTS) is a private, nonprofit, and independent graduate school of theology in Princeton, New Jersey. Founded in 1812 under the auspices of Archibald Alexander, the General Assembly of the Presbyterian Church, and the College of New Jersey (now Princeton University), it is the second-oldest seminary in the United States. It is also the largest of ten seminaries associated with the Presbyterian Church (USA). Question: is princeton theological seminary part of princeton university?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: Eastbound vehicles must pay a toll to cross the bridge; as with all Hudson River crossings along the North River, westbound vehicles cross for free. As of December 6, 2015, the cash tolls going from New Jersey to New York are $15 for both cars and motorcycles. E-ZPass users are charged $10.50 for cars and $9.50 for motorcycles during off-peak hours, and $12.50 for cars and $11.50 for motorcycles during peak hours. Trucks are charged cash tolls of $20.00 per axle, with discounted peak, off-peak, and overnight E-ZPass tolls. A discounted carpool toll ($6.50) is available at all times for cars with three or more passengers using NY or NJ E-ZPass, who proceed through a staffed toll lane (provided they have registered with the free ``Carpool Plan''). There is an off-peak toll of $7.00 for qualified low-emission passenger vehicles, which have received a Green E-ZPass based on registering for the Port Authority Green Pass Discount Plan. Question: do you have to pay both ways on the george washington bridge?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: The United States Congress is the bicameral legislature of the Federal government of the United States. The legislature consists of two chambers: the Senate and the House of Representatives. Question: is the house of representatives also called congress?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: In Little League, in the Tee-Ball and Minor League divisions, the batter is out after the third strike regardless of whether the pitched ball is caught cleanly by the catcher. In Little League (or the Major Division), Junior, Senior, and Big League divisions, a batter may attempt to advance to first base on an uncaught third strike. Little League Major Division Softball and many other youth baseball leagues (such as the USSSA) also follow the rule. Question: can you run on a dropped third strike in little league?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: Sanders played football primarily at cornerback, but also as a kick returner, punt returner, and occasionally wide receiver. He played in the National Football League (NFL) for the Atlanta Falcons, the San Francisco 49ers, the Dallas Cowboys, the Washington Redskins and the Baltimore Ravens, winning the Super Bowl with both the 49ers and the Cowboys. An outfielder in baseball, he played professionally for the New York Yankees, the Atlanta Braves, the Cincinnati Reds and the San Francisco Giants, and participated in the 1992 World Series with the Braves. He attended Florida State University, where he was recognized as a two-time All-American in football, and also played baseball and ran track. Question: did deion sanders ever win a world series?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The United Kingdom European Union membership referendum, also known as the EU referendum and the Brexit referendum, took place on 23 June 2016 in the United Kingdom (UK) and Gibraltar to gauge support for the country either remaining a member of, or leaving, the European Union (EU) under the provisions of the European Union Referendum Act 2015 and also the Political Parties, Elections and Referendums Act 2000. The referendum resulted in a simple majority of 51.9% (of people who voted) being in favour of leaving the EU. Although legally the referendum was non-binding, the government of that time had promised to implement the result, and it initiated the official EU withdrawal process on 29 March 2017, which put the UK on course to leave the EU by 30 March 2019, after a period of Brexit negotiations. Question: is britain still a member of european union?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: A hydraulic fluid or hydraulic liquid is the medium by which power is transferred in hydraulic machinery. Common hydraulic fluids are based on mineral oil or water. Examples of equipment that might use hydraulic fluids are excavators and backhoes, hydraulic brakes, power steering systems, transmissions, garbage trucks, aircraft flight control systems, lifts, and industrial machinery. Question: is mineral oil the same as hydraulic oil?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Robert Westbrook adapted the screenplay to novel form, which was published by Alex in May 2002. Question: was the movie insomnia based on a book?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: While some gold medals are solid gold, others are gold-plated or silver-gilt, like those of the Olympic Games, the Lorentz Medal, the United States Congressional Gold Medal and the Nobel Prize medal. Nobel Prize medals consist of 18 karat green gold plated with 24 karat gold. Before 1980 they were struck in 23 karat gold. Question: is the olympic gold medal made of gold?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The ``Little House'' Books is a series of American children's novels written by Laura Ingalls Wilder, based on her childhood and adolescence in the American Midwest (Wisconsin, Kansas, Minnesota, South Dakota, and Missouri) between 1870 and 1894. Eight of the novels were completed by Wilder, and published by Harper & Brothers. The appellation ``Little House'' books comes from the first and third novels in the series of eight published in her lifetime. The second novel was about her husband's childhood. The first draft of a ninth novel was published posthumously in 1971 and is commonly included in the series. Question: is the little house on the prairie fiction?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The building used in the show for the firehouse exteriors is a working Chicago Fire Department firehouse, and is the headquarters of Engine 18, located at 1360 South Blue Island Avenue at Maxwell Street, between 13th & Racine. Housed here is ALS Engine 18, 2--2--1 (Deputy District Chief -- 1st District), 2--1--21 (1st District Chief), 6--4--16 (High-Rise Response Unit), and ALS Ambulance 65. The interiors of Firehouse 51 are filmed at Cinespace Chicago Film Studios. The station house used for exteriors in Chicago PD is just a few blocks away at 949 West Maxwell Street at Morgan Street (interiors likewise filmed at Cinespace). Question: is chicago fire filmed in a real firehouse?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: The World Is Still Beautiful, also in subtitle as Still World Is Beautiful (Japanese: \u305d\u308c\u3067\u3082\u4e16\u754c\u306f\u7f8e\u3057\u3044, Hepburn: Soredemo Sekai wa Utsukushii, lit. ``Even So, The World is Beautiful'') is a Japanese manga series by Dai Shiina, serialized in Hakusensha's sh\u014djo manga magazine Hana to Yume from 2012. Soredemo Sekai wa Utsukushii was first published as a one-shot in the same magazine in 2009, with a second one-shot published in 2011. It has been collected in 14 tank\u014dbon volumes as of October 2016. An anime television series adaptation by Pierrot aired between April 6, 2014 and June 29, 2014 on NTV. Question: will there be a season 2 of soredemo sekai wa utsukushii?",
        "pred_ans": " There is no information in the passage about whether a season 2 of Soredemo Sekai wa Utsukushii will be produced. The passage only discusses the manga series, its anime adaptation, and its tankobon volumes.",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The Bill of Rights is the first ten amendments to the United States Constitution. Proposed following the often bitter 1787--88 battle over ratification of the U.S. Constitution, and crafted to address the objections raised by Anti-Federalists, the Bill of Rights amendments add to the Constitution specific guarantees of personal freedoms and rights, clear limitations on the government's power in judicial and other proceedings, and explicit declarations that all powers not specifically delegated to Congress by the Constitution are reserved for the states or the people. The concepts codified in these amendments are built upon those found in several earlier documents, including the Virginia Declaration of Rights and the English Bill of Rights, along with earlier documents such as Magna Carta (1215). In practice, the amendments had little impact on judgments by the courts for the first 150 years after ratification. Question: was the bill of rights in the constitution?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: Nigella sativa (black caraway, also known as black cumin, nigella, and kalonji) is an annual flowering plant in the family Ranunculaceae, native to south and southwest Asia. Question: is black cumin seed same as black seed?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: It is possible to have extra, or ``supernumerary,'' teeth. This phenomenon is called hyperdontia and is often erroneously referred to as ``a third set of teeth.'' These teeth may erupt into the mouth or remain impacted in the bone. Hyperdontia is often associated with syndromes such as cleft lip and palate, trichorhinophalangeal syndrome, cleidocranial dysplasia, and Gardner's syndrome. Question: can you get a 3rd set of teeth?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 95"
    },
    {
        "question": "Passage: California is a ``may issue'' state for permits to carry concealed guns. The willingness of issuing authorities in California ranges from No Issue in most urban areas to Shall Issue in rural counties. Additionally, the issuing authority can also impose restrictions on the CCW permit-holder, such as limiting concealed carry only to the purposes listed on the approved CCW permit application. However, concealed carry permits are valid statewide, regardless of where they were issued. This creates a situation where residents in presumptively No Issue locations such as Los Angeles and San Francisco cannot lawfully carry a concealed firearm, but residents from other counties with more permissive CCW issuance policies can lawfully carry within these same jurisdictions. California does not recognize concealed carry permits issued by other states, and non-residents are generally forbidden from obtaining a California concealed carry permit. Those eligible to carry a rifle, shotgun, or handgun under the federal Law Enforcement Officers Safety Act are not subject to some California laws. Question: can you carry a gun on you in california?",
        "pred_ans": " It is possible to carry a gun on you in California, but it depends on the specific circumstances and whether you have a valid concealed carry permit. California is a \\\"may issue\\\" state for permits to carry concealed guns, and the willingness of issuing authorities in California ranges from No Issue in most urban areas to Shall Issue in rural counties. Additionally, the issuing authority can also impose restrictions on the CCW permit-holder, such as limiting concealed carry only to the purposes listed on the approved CCW permit application. Concealed carry permits are valid statewide, regardless of where they were issued. However, California does not recognize concealed carry permits issued by other states, and non-residents are generally forbidden from obtaining a California concealed carry permit. Those eligible to carry a rifle, shotgun, or handgun under the federal Law Enforcement Officers Safety Act are not subject to some California laws.",
        "ground_truth": true,
        "score": " Therefore the score is: 95"
    },
    {
        "question": "Passage: Friction tape is a type of adhesive tape made from cloth impregnated with a rubber-based adhesive, mainly used to insulate splices in electric wires and cables. Because the adhesive is impregnated in the cloth, friction tape is sticky on both sides. The rubber-based adhesive makes it an electrical insulator and provides a degree of protection from liquids and corrosion. In the past, friction tape was widely used by electricians, but PVC electrical tape has replaced it in most applications today. The frictional properties of the tape come from the cloth material, which is usually made from cotton, while the fabric base protects electrical splices against punctures and abrasion. Question: is friction tape the same as electrical tape?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: To maintain lactation, a dairy cow must be bred and produce calves. Depending on market conditions, the cow may be bred with a ``dairy bull'' or a ``beef bull.'' Female calves (heifers) with dairy breeding may be kept as replacement cows for the dairy herd. If a replacement cow turns out to be a substandard producer of milk, she then goes to market and can be slaughtered for beef. Male calves can either be used later as a breeding bull or sold and used for veal or beef. Dairy farmers usually begin breeding or artificially inseminating heifers around 13 months of age. A cow's gestation period is approximately nine months. Newborn calves are removed from their mothers quickly, usually within three days, as the mother/calf bond intensifies over time and delayed separation can cause extreme stress on both cow and calf. Question: does a cow have to be pregnant to lactate?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Panama has qualified once for the finals of a FIFA World Cup, the 2018 edition. They directly qualified after securing the third spot in the hexagonal on the final round. This meant that after 10 failed qualification campaigns, Panama would appear at the World Cup for the first time in their history. Question: have panama qualified for the world cup before?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 95"
    },
    {
        "question": "Passage: Historically, tin and copper as well as a few other metals (e.g. arsenic, silver, and zinc) have been mined in Cornwall and Devon. As of 2007 there are no active metalliferous mines remaining. However, tin deposits still exist in Cornwall, and there has been talk of reopening the South Crofty tin mine. In addition, work has begun on re-opening the Hemerdon tungsten and tin mine in south-west Devon. In view of the economic importance of mines and quarries, geological studies have been conducted: about forty distinct minerals have been identified from type localities in Cornwall (e.g. endellionite from St Endellion). Quarrying of the igneous and metamorphic rocks has also been a significant industry. In the 20th century the extraction of kaolin was important economically. Question: are there any tin mines left in cornwall?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The subsequent trophy, called the ``FIFA World Cup Trophy'', was introduced in 1974. Made of 18 carat gold with a malachite base, it stands 36.8 centimetres high and weighs 6.1 kilograms. The trophy was made by Stabilimento Artistico Bertoni company in Italy. It depicts two human figures holding up the Earth. The current holders of the trophy are France, winners of the 2018 World Cup. Question: is the world cup made out of solid gold?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The Nuggets finished the 2007--08 season with exactly 50 wins (50--32 overall record, tied for the third-best all-time Nuggets record since the team officially joined the NBA in 1976), following a 120--111 home victory over the Memphis Grizzlies in the last game of the season. It was the first time since the 1987--88 NBA season that the Nuggets finished with at least 50 wins in a season. Denver ended up as the 8th seed in the Western Conference of the 2008 Playoffs, and their 50 wins marked the highest win total for an 8th seed in NBA history. It also meant that for the first time in NBA history, all eight playoff seeds in a conference had at least 50 wins. The Nuggets faced the top-seeded Los Angeles Lakers (57--25 overall record) in the first round of the Playoffs. The seven games separating the Nuggets overall record and the Lakers overall record is the closest margin between an eighth seed and a top seed since the NBA went to a 16-team playoff format in 1983--84. The Lakers swept the Nuggets in four games, marking the second time in NBA history that a 50-win team was swept in a best-of-seven playoff series in the first round. For the series, Anthony averaged 22.5 ppg, 9.5 rpg (playoff career-high), 2.0 apg and 0.5 spg. Question: did carmelo anthony go to the western conference finals?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The vagus nerve (/\u02c8ve\u026a\u0261\u0259s/ VAY-g\u0259s), historically cited as the pneumogastric nerve, is the tenth cranial nerve or CN X, and interfaces with parasympathetic control of the heart, lungs, and digestive tract. The vagus nerves are paired; however, they are normally referred to in the singular. It is the longest nerve of the autonomic nervous system in the human body. The vagus nerve also has a sympathetic function via the peripheral chemoreceptors. Question: is the vagus nerve part of the cns?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: On December 2, 2011, Universal Orlando Resort announced that the Jaws attraction along with the entire Amity area of Universal Studios Florida would close permanently on January 2, 2012 to ``make room for an exciting, NEW, experience.'' (the second phase of The Wizarding World Of Harry Potter.) severe backlash followed after the announcement. The attraction officially closed on January 2, 2012 at 9:00 pm with Michael Skipper aka ``Skip'' giving the final voyage to the last lucky group of 48 guests. By the next morning, the entire Amity area was walled off and completely demolished in the following months. The hanging shark statue from the town square remains as a tribute to the ride and can be found in the Fisherman's Wharf area of the San Francisco section of the park. The attraction remains open at Universal Studios Japan as well as the original tram stop at Universal Studios Hollywood. Question: is the jaws ride at universal orlando closed?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: A 12-episode second season of The Affair premiered on October 4, 2015. On December 9, 2015, the series was renewed for a third season, which debuted on November 20, 2016. On January 9, 2017, Showtime renewed the series for a fourth season, which premiered on June 17, 2018. On July 26, 2018, Showtime announced it had renewed the series for a fifth and final season to debut in 2019. Question: will there be another season of the affiar?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 95"
    },
    {
        "question": "Passage: There are eleven official languages of South Africa: Afrikaans, English, Ndebele, Northern Sotho, Sotho, SiSwati, Tsonga, Tswana, Venda, Xhosa and Zulu. Fewer than two percent of South Africans speak a first language other than an official one. Most South Africans can speak more than one language. Dutch and English were the first official languages of South Africa from 1910 to 1925. Afrikaans was added as a part of Dutch in 1925, although in practice, Afrikaans effectively replaced Dutch, which fell into disuse. When South Africa became a republic in 1961, the official relationship changed such that Afrikaans was considered to include Dutch, and Dutch was dropped in 1984, so between 1984 and 1994, South Africa had two official languages: English and Afrikaans. Question: is english the official language of south africa?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Destin--Fort Walton Beach Airport (IATA: VPS, ICAO: KVPS, FAA LID: VPS) is an airport located within Eglin Air Force Base, near Destin and Fort Walton Beach in Okaloosa County, Florida. No private aircraft are allowed, so Destin Executive Airport is used instead for non-commercial operations by general aviation and business aircraft. The airport was previously named Northwest Florida Regional Airport until February 17, 2015 and Okaloosa Regional Airport until September 2008. Question: is there an airport in fort walton beach florida?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: In biology, tissue is a cellular organizational level between cells and a complete organ. A tissue is an ensemble of similar cells and their extracellular matrix from the same origin that together carry out a specific function. Organs are then formed by the functional grouping together of multiple tissues. Question: is tissue composed of one type of cell?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Alex's past love interest. Though they had a very rough start from the beginning they eventually started to soften towards each other. Sean has a hard time getting Alex to bring down her walls. Alex herself tries not to get emotional and Sean sometimes gets Alex to recognize that he likes her. Despite Sean giving many signs of attraction to Alex, she either ignored them or she was oblivious to them since she was emotionally not ready to commit to a relationship with all the things happening in her life. He and Alex share a first kiss in a car with Birkoff driving and Ryan in the passenger seat. In the second-season finale, he tries to ask her out on a date four times, but Alex never lets him finish due to being in action, criticizing that he said that she was a goal, Alex passing out due to a broken arm and being electrocuted, and Nikita interrupting Sean right before he was going to ask Alex while she was in a Division medical facility. At the start of season 3, it appears Alex and Sean are in a relationship. However, by episode 3 it is revealed that Sean is only at Division for Alex, because he loved her. But Alex is at Division because it is the only place she knows as home, where she can be herself, and where her ``family'' (Nikita) is. After a toxin is released in the lab, Sean returns to ask Alex why she's not returning his calls, and asks her again why she's still there. ``I can't tell you what to do, Alex, but I'm not going to stand by and watch this place destroy another person that I love,'' he says before he kisses her. ``I love you, but if that's not enough of a reason for you to leave, I've got no reason to stay.'' After he leaves, she pops a pill and heads for the operations floor, insisting that she ought to be included in the hunt for Amanda. They eventually make up after an emotional scene in a medical room, they then entered a storage closet and make love, for the first time. In ``Black Badge'', Amanda framed Sean for the death of the head of the CIA. As a result, they faked his death and Sean was officially welcomed into Division by Alex. Sean died in season 3 episode 18, as a result of a bullet nicking his artery. They shared their last moments together in where they met, operations. When Nikita enters OPS, she finds Birkoff near Sean and Alex gone. She presumably went to get revenge for the current events. Sean later died in Alex's arms towards the end of season 3. Question: do sean and alex get together on nikita?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 95"
    },
    {
        "question": "Passage: There is no offside offence if a player receives the ball directly from a goal kick, a corner kick, a throw-in, or a dropped-ball. It is also not an offence if the ball was last deliberately played by an opponent (except for a deliberate save). In this context, according to the IFAB, ``A 'save' is when a player stops, or attempts to stop, a ball which is going into or very close to the goal with any part of the body except the hands/arms (unless the goalkeeper within the penalty area).'' Question: can you be offside on a corner kick?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: Ross-on-Wye railway station is a former junction railway station on the Hereford, Ross and Gloucester Railway constructed just to the north of the Herefordshire town of Ross-on-Wye. It was the terminus of the Ross and Monmouth Railway which joined the Hereford, Ross and Gloucester Railway just south of the station. Question: does ross on wye have a train station?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: Although Karev and Stevens had an on-off romantic history in previous seasons, it was not until the fifth season that the two formed a lasting relationship. Despite Stevens' advanced skin cancer, the two marry. Justin Chambers commented on his character's inability to tell Stevens that he loves her in the beginning of their relationship, saying that Karev has difficulty expressing himself. Though Stevens departs in season 6, series creator Shonda Rhimes has said that she would like the chance to create closure for both Karev and Stevens. Rhimes later retracted her comments and stated that she has no plans to ever re-approach Izzie's storyline again. In the ninth season, Karev befriends intern Jo Wilson. They begin a relationship in the tenth season until their break-up in the twelfth season. However, they get back together again and marry in the fourteenth season. Question: does alex come back to grey sloan memorial?",
        "pred_ans": " Yes, Karev (Alex) does come back to Grey Sloan Memorial Hospital in the later seasons of the show.",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The kilowatt hour (symbol kWh, kW\u22c5h or kW h) is a unit of energy equal to 3.6 megajoules. If energy is transmitted or used at a constant rate (power) over a period of time, the total energy in kilowatt hours is equal to the power in kilowatts multiplied by the time in hours. The kilowatt hour is commonly used as a billing unit for energy delivered to consumers by electric utilities. Question: is a kilowatt hour a unit of power?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: At the Liberation of France in the summer of 1944, Metropolitan France kept GMT+2 as it was the time then used by the Allies (British Double Summer Time). In the winter of 1944--1945, Metropolitan France switched to GMT+1, same as in the United Kingdom, and switched again to GMT+2 in April 1945 like its British ally. In September 1945, Metropolitan France returned to GMT+1 (pre-war summer time), which the British had already done in July 1945. Metropolitan France was officially scheduled to return to GMT+0 on November 18, 1945 (the British returned to GMT+0 in on October 7, 1945), but the French government canceled the decision on November 5, 1945, and GMT+1 has since then remained the official time of Metropolitan France. Question: is france the same timezone as the uk?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The original script by Ilya Tilkin does not have any literary source. The screenwriter studied diaries of the participants of the Battle of Stalingrad. He also used museum archives, documents and recorded stories of its participants. Question: is the movie stalingrad based on a true story?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 95"
    },
    {
        "question": "Passage: The four-second fall from the Golden Gate Bridge sends a person plunging 245 feet (75 m) at 75 miles per hour (121 km/h) to hit the waters of the San Francisco Bay ``with the force of a speeding truck meeting a concrete building.'' Jumping off the bridge holds a 98 percent fatality rate; As of 2005, it is estimated that 26 people have survived after jumping. Some die instantly from internal injuries, while others drown or die of hypothermia. The Golden Gate bridge's death toll has since been surpassed only by the Nanjing Yangtze River Bridge in China. In 2013, 118 potential jumpers were talked down from their attempt and did not jump. Question: could you survive jumping off the golden gate bridge?",
        "pred_ans": " It is highly unlikely that a person could survive jumping off the Golden Gate Bridge, given its height, speed, and force of impact with the water. The passage states that the fatality rate for jumping off the bridge is 98 percent, and most who die do so instantly from internal injuries or drowning/hypothermia. While some people have survived the jump, it is not common and is considered extremely dangerous. Therefore, the answer is False.",
        "ground_truth": true,
        "score": " Therefore the score is: 95"
    },
    {
        "question": "Passage: Ursa Major is primarily known from the asterism of its main seven relatively bright stars comprising the ``Big Dipper'', ``the Wagon'', ``Charles's Wain'' or ``the Plough'' (among others), with its stellar configuration mimicking the shape of the ``Little Dipper''. Question: is the big dipper the same as ursa major?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: A perfect game is defined by Major League Baseball as a game in which a pitcher (or combination of pitchers) pitches a victory that lasts a minimum of nine innings in which no opposing player reaches base. Thus, the pitcher (or pitchers) cannot allow any hits, walks, hit batsmen, or any opposing player to reach base safely for any other reason and the fielders cannot make an error that allows an opposing player to reach a base; in short, ``27 up, 27 down.'' The feat has been achieved 23 times in MLB history -- 21 times since the modern era began in 1900, most recently by F\u00e9lix Hern\u00e1ndez of the Seattle Mariners on August 15, 2012. A perfect game is also a no-hitter and a shutout. A fielding error that does not allow a batter to reach base, such as a misplayed foul ball, does not spoil a perfect game. Weather-shortened contests in which a team has no baserunners and games in which a team reaches first base only in extra innings do not qualify as perfect games under the present definition. Question: can there be an error in a perfect game?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 95"
    },
    {
        "question": "Passage: In humans, a single transverse palmar crease is a single crease that extends across the palm of the hand, formed by the fusion of the two palmar creases (known in palmistry as the ``heart line'' and the ``head line'') and is found in people with Down syndrome. However, it is not an indication that a person with single transverse palmar crease has to have Down syndrome. It is also found in 1.5% of the general population in at least one hand. Question: do all down syndrome babies have simian crease?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: On 28 September 2016, Nine renewed the program for a second season after just two episodes having been aired. On 11 October 2017, the series was renewed for a third season at Nine's upfronts. and premiered on Monday, 6 August 2018, instead of the previous Wednesday night slot. On 17 October 2018 the series was renewed for a fourth season. Question: is there a fourth season of doctor doctor?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 95"
    },
    {
        "question": "Passage: The standard error (SE) of a statistic (usually an estimate of a parameter) is the standard deviation of its sampling distribution or an estimate of that standard deviation of estimate. If the parameter or the statistic is the mean, it is called the standard error of the mean (SEM). Question: is sampling error the same as standard deviation?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Orange is the colour between yellow and red on the spectrum of visible light. Human eyes perceive orange when observing light with a dominant wavelength between roughly 585 and 620 nanometres. In painting and traditional colour theory, it is a secondary colour of pigments, created by mixing yellow and red. It is named after the fruit of the same name. Question: is the fruit orange named after the color?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: The game features a nonlinear story, that follows the unnamed player character as they go to become a racing icon in the United States by winning in all racing disciplines available in the game. There are four disciplines: Street Racing, Off Road, Freestyle and Pro Racing. In Street Racing, the player is assisted by Latrell. In Off Road, the player is assisted by Tucker ``Tuck'' Morgan. In Freestyle, the player is assisted by Sofia and her father. In Pro Racing, the player is assisted by Alexis. Question: does the crew 2 have a story line?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Nuclear power is the use of nuclear reactions that release nuclear energy to generate heat, which most frequently is then used in steam turbines to produce electricity in a nuclear power plant. Nuclear power can be obtained from nuclear fission, nuclear decay and nuclear fusion. Presently, the vast majority of electricity from nuclear power is produced by nuclear fission of elements in the actinide series of the periodic table. Nuclear decay processes are used in niche applications such as radioisotope thermoelectric generators. The possibility of generating electricity from nuclear fusion is still at a research phase with no commercial applications. This article mostly deals with nuclear fission power for electricity generation. Question: is nuclear power the same as nuclear energy?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: A pitaya (/p\u026a\u02c8ta\u026a.\u0259/) or pitahaya (/\u02ccp\u026at\u0259\u02c8ha\u026a.\u0259/) is the fruit of several different cactus species indigenous to the Americas. Pitaya usually refers to fruit of the genus Stenocereus, while pitahaya or dragon fruit refers to fruit of the genus Hylocereus, both in the Cactaceae family. The dragon fruit is cultivated in Southeast Asia, Florida, the Caribbean, Australia, and throughout tropical and subtropical world regions. Question: is dragon fruit and pitaya the same thing?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The Fire Tablet, formerly called the Kindle Fire, is a tablet computer developed by Amazon.com. Built with Quanta Computer, the Kindle Fire was first released in November 2011, featuring a color 7-inch multi-touch display with IPS technology and running a custom version of Google's Android operating system called Fire OS. The Kindle Fire HD followed in September 2012, and the Kindle Fire HDX in September 2013. In September 2014, when the fourth generation was introduced, the name ``Kindle'' was dropped. In September 2015, the fifth generation Fire 7 was released, followed by the sixth generation Fire HD 8, in September 2016. The seventh generation Fire 7 was released in June 2017. Question: is a fire 7 the same as a kindle?",
        "pred_ans": "NILL",
        "ground_truth": true,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: Toys ``R'' Us expanded as a chain, becoming predominant in its niche field of toy retail. Represented by cartoon mascot Geoffrey the Giraffe from 1969, Toys ``R'' Us eventually branched out into launching the stores Babies ``R'' Us, Toys ``R'' Us Express, and the now-defunct Kids ``R'' Us. Question: is babies r us and toys r us the same?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Although initially happy in her relationship with Jackson, Lexie grows increasingly distraught and frustrated when she discovers that Mark has started dating an ophthalmologist named Julia. When she sees Mark and Julia flirting during a charity softball match, Lexie's jealousy gets the better of her and she throws a ball at Julia, injuring the latter's chest. Jackson senses that Lexie is still in love with Mark and ends their relationship. Lexie begins working under Derek's service and becomes increasingly proficient in neurosurgery, helping Derek with a set of ``hopeless cases'' - high risk surgeries for patients who had otherwise run out of options. During a surgery, Derek is called away on an emergency, leaving Lexie and Meredith to carry out the procedure on their own. Though Derek had instructed them to merely reduce the patient's brain tumor, Meredith allows Lexie to remove it completely, despite not being authorized by either the patient or Derek to do so. The sisters celebrate the successful surgery but Lexie is devastated when she discovers that the patient suffered severe brain damage, thus losing the ability to speak. Alex, Jackson and April move out of Meredith's house without inviting Lexie to join them, and with Derek and Meredith settling down with baby Zola, Lexie begins to feel lonely and isolated. After being left babysitting Zola on Valentine's Day, she contemplates confessing her true feelings to Mark. However, after plucking up the courage to visit his apartment, she finds Mark studying with Jackson and loses her nerve, instead claiming that she wanted to set up a play date for Zola and Sofia. When Mark confides in Derek that he and Julia have been discussing moving in together, Derek warns Lexie not to miss her chance again, resulting in her professing her love to a shell-shocked Mark, who merely thanks her for her candor. Mark later confesses to Derek that he feels the same way about Lexie, but is unsure of how to go about things. Days later, Lexie is named as part of a team of surgeons that will be sent to Boise to separate conjoined twins, along with Mark, Meredith, Derek, Cristina and Arizona Robbins (Jessica Capshaw). However, while flying to their destination, the doctors' plane crashes in the wilderness and Lexie is crushed under debris from the aircraft but manages to alert Mark and Cristina to help her. The pair try in vain to free Lexie, who realizes that she is suffering from a hemothorax and is unlikely to survive. While Cristina tries to find an oxygen tank and water to save Lexie, Mark holds Lexie's hand and professes his love for her, telling her that they will get married, have kids and live the best life together, as they are ``meant to be''. While fantasizing about the future that she and Mark could have had together, Lexie succumbs to her injuries and dies moments before Meredith arrives. The remaining doctors are left stranded in the woods waiting for rescue, with a devastated Meredith crying profusely and Mark refusing to let go of Lexie's hand. Question: do lexie and mark ever get back together?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 95"
    },
    {
        "question": "Passage: The fastest land animal is the cheetah, which has a recorded speed of 109.4--120.7 km/h (68.0--75.0 mph). The peregrine falcon is the fastest bird and the fastest member of the animal kingdom with a diving speed of 389 km/h (242 mph). The fastest animal in the sea is the black marlin, which has a recorded speed of 129 km/h (80 mph). Question: is there a bird faster than a cheetah?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 95"
    },
    {
        "question": "Passage: In the series 11 finale Kepner tells Avery that she is leaving with Owen Hunt to serve as a trauma surgeon in the Army; it will help her grieve for their son. Avery lets her go and wonders how he can deal with his own grief. After discussions over the phone via Facetime, Kepner tells Avery that she is extending her service time. The sound of gunfire and explosions are heard at April's base camp, leaving her to quickly terminate the call. On Valentine's Day, Kepner returns to the hospital, where she and Avery embrace in the foyer. In season 12, their marriage begins to fall apart and they grow estranged. In episode 11, they file for a civil divorce. After their divorce is completed, Kepner reveals that she is pregnant with Avery's child. In Season 14 Jackson begins a relationship with Maggie Pierce. Question: does jackson avery die on grey's anatomy?",
        "pred_ans": " No",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: Farhad locates Daniel's house and waits in ambush. As he confronts Daniel with his gun in hand, Lara jumps into Daniel's arms, attempting to protect her father with the ``invisible cloak.'' Daniel's wife Elizabeth runs out the front door and watches in horror as Farhad shoots Lara. Daniel clutches his daughter as Fahrad stands shocked as both realized she has not been hit by any bullet. Daniel runs into the house with his wife and daughter, leaving Fahrad outside. Farhad later tells his daughter that he believes the little girl was his guardian angel, preventing him from committing a terrible crime. Dorri is then shown with the box of bullets she had purchased, labeled as blanks. Question: does the little girl in the movie crash die?",
        "pred_ans": "NILL",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: A game consists of a sequence of points played with the same player serving, and is won by the first side to have won at least four points with a margin of two points or more over their opponent. Normally the server's score is always called first and the opponent's score second. Score calling in tennis is unusual in that each point has a corresponding call that is different from its point value. Question: do you have to serve to score in tennis?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 95"
    },
    {
        "question": "Passage: Physically and functionally, the auditory system of an absolute listener does not appear to be different from that of a non-absolute listener. Rather, ``it reflects a particular ability to analyze frequency information, presumably involving high-level cortical processing.'' Absolute pitch is an act of cognition, needing memory of the frequency, a label for the frequency (such as ``B-flat''), and exposure to the range of sound encompassed by that categorical label. Absolute pitch may be directly analogous to recognizing colors, phonemes (speech sounds), or other categorical perception of sensory stimuli. Just as most people have learned to recognize and name the color blue by the range of frequencies of the electromagnetic radiation that are perceived as light, it is possible that those who have been exposed to musical notes together with their names early in life will be more likely to identify, for example, the note C. Absolute pitch may also be related to certain genes, possibly an autosomal dominant genetic trait, though it ``might be nothing more than a general human capacity whose expression is strongly biased by the level and type of exposure to music that people experience in a given culture.'' Question: do you have to be born with perfect pitch?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The Korean language has changed between the two states due to the length of time that North and South Korea have been separated. Question: does north and south korea speak the same?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: If discharged administratively for any of the above reasons, the service member normally receives an honorable or a general (under honorable conditions) discharge. If misconduct is involved the service member may receive an Other Than Honorable (OTH) Discharge service characterization. Question: is under honorable conditions the same as honorable discharge?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Peptidoglycan, also known as murein, is a polymer consisting of sugars and amino acids that forms a mesh-like layer outside the plasma membrane of most bacteria, forming the cell wall. The sugar component consists of alternating residues of \u03b2-(1,4) linked N-acetylglucosamine (NAG) and N-acetylmuramic acid (NAM) . Attached to the N-acetylmuramic acid is a peptide chain of three to five amino acids. The peptide chain can be cross-linked to the peptide chain of another strand forming the 3D mesh-like layer. Peptidoglycan serves a structural role in the bacterial cell wall, giving structural strength, as well as counteracting the osmotic pressure of the cytoplasm. A common misconception is that peptidoglycan gives the cell its shape; however, whereas peptidoglycan helps maintain the structural strength of the cell, it is actually the MreB protein that facilitates cell shape. Peptidoglycan is also involved in binary fission during bacterial cell reproduction. Question: do all bacteria have peptidoglycan in their cell walls?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The redback is one of the few spider species that can be seriously harmful to humans, and its liking for habitats in built structures has led it to being responsible for a large number of serious spider bites in Australia. Predominantly neurotoxic to vertebrates, the venom gives rise to the syndrome of latrodectism in humans; this starts with pain around the bite site, which typically becomes severe and progresses up the bitten limb and persists for over 24 hours. Sweating in localised patches of skin occasionally occurs and is highly indicative of latrodectism. Generalised symptoms of nausea, vomiting, headache, and agitation may also occur and indicate severe envenomation. An antivenom has been available since 1956. There have been no deaths directly due to redback bites since its introduction, however Isbister et al. have suggested patients for whom antivenom is considered should be fully informed ``there is considerable weight of evidence to suggest it is no better than placebo'', and in light of a risk of anaphylaxis and serum sickness, ``routine use of the antivenom is therefore not recommended''. As of the 2013 (updated 2014) edition of the Snakebite & Spiderbite Clinical Management Guidelines from NSW HEALTH (latest available in 2017), Red-back spider bites were considered not life-threatening but capable of causing severe pain and systemic symptoms that could continue for hours to days. Question: can a red back spider bite kill you?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Canada is one of the oldest continuing monarchies in the world. Initially established in the 16th century, monarchy in Canada has evolved through a continuous succession of French and British sovereigns into the independent Canadian sovereigns of today, whose institution is sometimes colloquially referred to as the Maple Crown. Question: is canada still part of the british monarchy?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Before 1973, the NCAA's smaller schools were grouped together in the College Division. In 1973, the College Division split in two when the NCAA began using numeric designations for its competitions. The College Division members who wanted to offer athletic scholarships or compete against those who did became Division II, while those who chose not to offer athletic scholarships became Division III. Question: do ncaa division 2 schools offer athletic scholarships?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: Miniature pig (also micro-pig, teacup pig, Michelle Davila, etc.) is an erroneous term that is used to refer to small breeds of domestic pig, such as Pot-bellied pigs, G\u00f6ttingen minipigs, Juliana pigs, Choctaw Hogs, or Kunekune (and specimens derived by cross-breeding with these). Notable features of most miniature pigs distinguishing them from other pigs may be defined by their possession of small, perked-back ears, a potbelly, sway back, chubby figure, rounded head, short snout, legs, and neck, and a short tail with thick hair at the end. Typically, most breeds of mini pigs will range from the minimum weight of 75 pounds (34 kg) to 200 pounds (91 kg). Question: is there such a thing as a miniature pig?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 95"
    },
    {
        "question": "Passage: The first installment, Divergent (2014), grossed over $288 million worldwide, while the second installment, The Divergent Series: Insurgent (2015), grossed over $297 million worldwide. Insurgent was also the first Divergent film to be released in IMAX 3D. The third installment, The Divergent Series: Allegiant (2016), grossed $179 million. Thus, the first three films of the series have grossed over $765 million worldwide. A fourth film, The Divergent Series: Ascendant was to be released theatrically, but due to Allegiant's poor showing at the box office, it was announced it would be released as a television film that could lead into a potential episodic spin-off series on Starz. However, Woodley, along with other cast members, expressed no interest in returning. Question: is there a 4th film in divergent series?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: Ordinarily, a baseball game consists of nine innings (in softball and high school baseball games there are typically seven innings; in Little League Baseball, six), each of which is divided into halves: the visiting team bats first, after which the home team takes its turn at bat. However, if the score remains tied at the end of the regulation number of complete innings, the rules provide that ``play shall continue until (1) the visiting team has scored more total runs than the home team at the end of a completed inning; or (2) the home team scores the winning run in an uncompleted inning.'' (Since the home team bats second, condition (2) implies that the visiting team will not have the opportunity to score more runs before the end of the inning.) Question: can a home team win by 2 in extra innings?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 95"
    },
    {
        "question": "Passage: The Constitution of India designates the official language of the Government of India as Hindi written in the Devanagari script, as well as English. There is no national language as declared by the Constitution of India. Hindi is used for official purposes such as parliamentary proceedings, judiciary, communications between the Central Government and a State Government. States within India have the liberty and powers to specify their own official language(s) through legislation and therefore there are 22 officially recognized languages in India of which Hindi is the most used. The number of native Hindi speakers is about 25% of the total Indian population; however, including dialects of Hindi termed as Hindi languages, the total is around 44% of Indians, mostly accounted from the states falling under the Hindi belt. Other Indian languages are each spoken by around 10% or less of the population. Question: is hindi is our national language of india?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Carbon (from Latin: carbo ``coal'') is a chemical element with symbol C and atomic number 6. It is nonmetallic and tetravalent--making four electrons available to form covalent chemical bonds. It belongs to group 14 of the periodic table. Three isotopes occur naturally, C and C being stable, while C is a radionuclide, decaying with a half-life of about 5,730 years. Carbon is one of the few elements known since antiquity. Question: is carbon a metal on the periodic table?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: Date and time notation in the United Kingdom records the date using the day-month-year format (21 October 2011 or 21/10/11). The ISO 8601 format (2011-08-21) is increasingly used for all-numeric dates. The time can be written using either the 24-hour clock (16:10) or 12-hour clock (4.10 p.m.). Question: does the uk use a 24 hour clock?",
        "pred_ans": " Yes, the UK uses a 24-hour clock.",
        "ground_truth": true,
        "score": " Therefore the score is: 100"
    },
    {
        "question": "Passage: Broken heart (also known as a heartbreak or heartache) is a metaphor for the intense emotional--and sometimes physical--stress or pain one feels at experiencing great longing. The concept is cross-cultural, often cited with reference to a desired or lost lover, and dates back at least 3,000 years. Question: is there a such thing as a broken heart?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: Citizens of countries in the European Economic Area (other than British and Irish citizens) and Swiss citizens obtain permanent residence status automatically after five years' residence in the United Kingdom exercising Treaty rights rather than ILR. The rights of EEA citizens are not governed by UK Immigration Regulations but rather the EEA Regulations. Question: do eu citizens have indefinite leave to remain?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The passport card is a limited travel document, valid only for land and sea travel within North America (Canada, the United States, Mexico, the Caribbean, and Bermuda). It cannot be used for international air travel. The Department of State indicates that this is because ``designing a card format passport for wide use, including by air travelers, would inadvertently undercut the broad based international effort to strengthen civil aviation security and travel document specifications to address the post 9/11 threat environment''. Question: can a passport card be used to fly to canada?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: In order to recover construction and operating costs, the bridge was electronically tolled when originally built. The toll rates increased to $1.60 for motorcycle, $3.15 for cars, $6.30 for small trucks and $9.45 for large trucks on August 15, 2015. Through increased prices and greater traffic, Transportation Investment Corporation (TI Corp), the public Crown corporation responsible for toll operations on the Port Mann Bridge, forecast its revenue would grow by 85% between fiscal years 2014 and 2017. These fees were assessed using radio-frequency identification (RFID) decals or licence plate photos. A B.C. licensed driver who owes more than $25 in tolls outstanding 90 days is penalized $20 and is unable to purchase vehicle insurance or renew drivers permits without payment of the debt. Out-of-province drivers were also contacted for payment by a US-based contractor. A licence plate processing fee of $2.30 per trip was added to the toll rate for unregistered users who did not pay their toll within seven days of their passage. Monthly passes, which allowed unlimited crossing on the bridge, were available for purchase. Users may have set up an account for online payment of tolls. Users who opted for this method received a decal with an embedded RFID to place on their vehicle's windshield or headlight and avoid paying a processing fee. Tolls were expected to be removed by the year of 2050 or after collecting $3.3 billion. As announced by B.C. Premier John Horgan a few days earlier, all tolls on the Port Mann Bridge were removed on September 1, 2017. Debt service was transferred to the province of British Columbia at a cost of $135 million per year. Question: is the port mann bridge a toll bridge?",
        "pred_ans": " True",
        "ground_truth": false,
        "score": " Therefore the score is: 95"
    },
    {
        "question": "Passage: The group winners, Serbia, qualified directly for the 2018 FIFA World Cup. The group runners-up, Republic of Ireland, advanced to the play-offs as one of the best 8 runners-up, where they lost to Denmark and thus failed to qualify. Question: has ireland qualified for the world cup 2018?",
        "pred_ans": " False",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    },
    {
        "question": "Passage: The fourth season of NCIS: New Orleans premiered on September 26, 2017 on CBS. The series continues to air following Bull, Tuesday at 10:00 p.m. (ET) and contained 24 episodes. The season concluded on May 15, 2018. Question: is ncis new orleans over for the season?",
        "pred_ans": "NILL",
        "ground_truth": true,
        "score": " Therefore the score is: 90"
    }
]