[
    {
        "problem_id": 527,
        "domain": [
            "Mathematics -> Applied Mathematics -> Statistics -> Probability -> Counting Methods -> Combinations"
        ],
        "difficulty": 4.5,
        "problem_text": "A deck of 100 cards is labeled $1,2, \\ldots, 100$ from top to bottom. The top two cards are drawn; one of them is discarded at random, and the other is inserted back at the bottom of the deck. This process is repeated until only one card remains in the deck. Compute the expected value of the label of the remaining card.",
        "sample_id": 1,
        "final_answer": "The final answer is \\boxed{\\frac{467}{8}}",
        "steps": [
            {
                "step_id": 1,
                "edge": "We begin by applying the fundamental linearity of expectation, which states that the expected value of a random variable can be expressed as the sum of each possible outcome multiplied by its probability. Here, the final label is a random variable determined by which initial card survives the process, so the expectation decomposes into the weighted sum of all initial labels where weights are precisely the probabilities that each card becomes the last remaining one. This principle provides the structural foundation for computing the expectation without needing to track complex intermediate states.",
                "direct_dependent_steps": null,
                "node": "The expected final label equals the weighted sum of initial labels with weights equal to their probabilities to be the last card."
            },
            {
                "step_id": 2,
                "edge": "The described process—repeatedly drawing the top two cards, randomly discarding one, and returning the other to the bottom—naturally induces a recursive averaging mechanism. At each step, the surviving card's label represents a random selection between the two drawn values with equal probability, effectively averaging their contributions probabilistically. This recursive elimination continues until one card remains, mirroring how iterative pairwise averaging propagates influence from all initial positions toward the final outcome.",
                "direct_dependent_steps": null,
                "node": "The process can be modeled as recursive averaging of the two top cards until one remains."
            },
            {
                "step_id": 3,
                "edge": "Building on Step 2's recursive averaging interpretation, we recognize this process as mathematically equivalent to a random single-elimination tournament. In such a tournament, each 'match' corresponds to drawing two cards: one 'wins' (is returned to the deck) and the other 'loses' (is discarded), with outcomes decided by fair coin flips. The entire elimination path until one card remains aligns perfectly with tournament dynamics where every participant must win successive matches to claim victory, translating the deck process into a combinatorial tournament framework.",
                "direct_dependent_steps": [
                    2
                ],
                "node": "Recursive averaging is equivalent to a random single-elimination tournament on the 100 cards."
            },
            {
                "step_id": 4,
                "edge": "From Step 3's tournament analogy, a card survives to be the last remaining only if it 'wins' every match it participates in. Since each match is an independent fair coin flip (50% chance to advance), the probability of winning all matches is $(1/2)^m$ where $m$ is the number of matches played. This exponential decay in survival probability based on match count is a direct consequence of independent Bernoulli trials governing each elimination step.",
                "direct_dependent_steps": [
                    3
                ],
                "node": "Each card’s probability of winning in such a tournament is $(1/2)$ raised to the number of matches it plays."
            },
            {
                "step_id": 5,
                "edge": "To determine the number of rounds in Step 3's tournament model, we calculate the smallest integer $k$ such that $2^k \\geq 100$, as single-elimination tournaments require power-of-two participants for balanced brackets. Since $2^6 = 64 < 100$ and $2^7 = 128 \\geq 100$, we take $k = \\lceil \\log_2(100) \\rceil = 7$. This establishes that 7 rounds are necessary to reduce 100 participants to one winner, accounting for byes in the initial round.",
                "direct_dependent_steps": [
                    3
                ],
                "node": "The number of rounds in a single-elimination tournament for 100 participants is $k=\\lceil\\log_2(100)\\rceil=7$."
            },
            {
                "step_id": 6,
                "edge": "Using Step 5's $k=7$, a fully balanced single-elimination bracket requires exactly $2^k = 128$ participants to avoid byes. This follows from the structural requirement that each round halves the number of competitors, necessitating a power-of-two starting size for perfect pairing throughout all $k$ rounds.",
                "direct_dependent_steps": [
                    5
                ],
                "node": "A full tournament bracket requires $2^k=128$ participants."
            },
            {
                "step_id": 7,
                "edge": "Given Step 6's requirement of 128 participants for a full bracket but only 100 cards available, the difference $128 - 100 = 28$ represents the number of 'byes' needed. Byes are automatic advancements for certain participants in the first round, allowing the tournament to begin with 100 active competitors while maintaining the bracket structure for subsequent rounds.",
                "direct_dependent_steps": [
                    6
                ],
                "node": "The number of byes in the first round is $128-100=28$."
            },
            {
                "step_id": 8,
                "edge": "In Step 5's $k=7$-round tournament, participants without byes must compete in all 7 rounds to win, while those receiving a bye skip the first round and thus play only $k-1=6$ matches. This is standard tournament design: byes reduce the total matches required for advancement by exactly one, as the bye constitutes a free pass to the second round.",
                "direct_dependent_steps": [
                    5
                ],
                "node": "Players assigned a bye play one fewer match than those without a bye."
            },
            {
                "step_id": 9,
                "edge": "The 28 byes from Step 7 are assigned to the bottom 28 cards in the initial deck (labels 73 through 100), as deck position corresponds to bracket seeding. In tournament seeding conventions, lower-seeded players (here, higher-numbered cards at the deck bottom) typically receive byes to balance competitive fairness, directly mapping the deck's physical order to the tournament's structural requirements.",
                "direct_dependent_steps": [
                    7
                ],
                "node": "The bottom 28 cards in the deck correspond to those that receive byes."
            },
            {
                "step_id": 10,
                "edge": "Combining Step 5's $k=7$ rounds with Step 8's rule that byes reduce match count by one, cards receiving byes play exactly $7 - 1 = 6$ matches. This deduction follows necessarily from the tournament structure: skipping the first round means competing in only the remaining six rounds to reach the final.",
                "direct_dependent_steps": [
                    5,
                    8
                ],
                "node": "A bye reduces the number of matches played to $k-1=6$ for those cards."
            },
            {
                "step_id": 11,
                "edge": "From Step 9, the bottom 28 cards (73–100) receive byes, and Step 10 confirms they play 6 matches. Thus, each card labeled 73 to 100 must win 6 successive matches to become the last remaining card, with their survival path determined by the bracket assignment derived from deck position.",
                "direct_dependent_steps": [
                    9,
                    10
                ],
                "node": "Each bottom card labeled from 73 to 100 therefore plays 6 matches."
            },
            {
                "step_id": 12,
                "edge": "Applying Step 4's survival probability rule $(1/2)^m$ to Step 11's 6-match requirement, each bottom card (73–100) has probability $(1/2)^6 = 1/64$ of winning the tournament. This calculation directly links the fixed match count for bye recipients to their individual survival probabilities through independent match outcomes.",
                "direct_dependent_steps": [
                    4,
                    11
                ],
                "node": "Each bottom card has probability $(1/2)^6$ of winning the tournament."
            },
            {
                "step_id": 13,
                "edge": "For cards without byes, Step 5's $k=7$ rounds and Step 8's rule (no bye implies full participation) dictate they must play all 7 matches. This follows from the tournament structure: without a first-round bye, every non-bye card must compete in every round to emerge victorious.",
                "direct_dependent_steps": [
                    5,
                    8
                ],
                "node": "Cards without a bye play $k=7$ matches."
            },
            {
                "step_id": 14,
                "edge": "Using Step 4's probability formula $(1/2)^m$ with Step 13's 7-match count, each of the 72 non-bye cards (labels 1–72, since $100 - 28 = 72$) has survival probability $(1/2)^7 = 1/128$. This precisely quantifies the reduced likelihood for non-bye cards due to their additional required match.",
                "direct_dependent_steps": [
                    4,
                    13
                ],
                "node": "Each card labeled from 1 to 72 therefore has probability $(1/2)^7$ of winning the tournament."
            },
            {
                "step_id": 15,
                "edge": "We verify the weights form a valid probability distribution by summing contributions: Step 7's 28 bye cards each contribute $1/64$ (from Step 12), and Step 14's 72 non-bye cards each contribute $1/128$. Calculating $28 \\times (1/64) = 7/16$ and $72 \\times (1/128) = 9/16$, their sum $7/16 + 9/16 = 1$ confirms all survival probabilities are properly normalized, a necessary consistency check for expectation calculations.",
                "direct_dependent_steps": [
                    7,
                    12,
                    14
                ],
                "node": "The weights sum to $28\\times(1/2^6)+72\\times(1/2^7)=1$."
            },
            {
                "step_id": 16,
                "edge": "Synthesizing key elements: Step 1 establishes expectation as a weighted sum, Step 9 partitions cards into groups (1–72 and 73–100), Steps 12 and 14 provide their respective probabilities, and Step 15 validates the weights. Thus, the expected value combines $\\sum_{i=1}^{72} i \\cdot (1/128)$ for non-bye cards and $\\sum_{i=73}^{100} i \\cdot (1/64)$ for bye recipients, directly implementing the probabilistic framework developed in prior steps.",
                "direct_dependent_steps": [
                    1,
                    9,
                    12,
                    14,
                    15
                ],
                "node": "Hence the expected value is $\\sum_{i=1}^{72}i\\,(1/2^7)+\\sum_{i=73}^{100}i\\,(1/2^6)$."
            },
            {
                "step_id": 17,
                "edge": "To compute Step 16's first summation, we apply the triangular number formula $\\sum_{i=1}^n i = n(n+1)/2$ with $n=72$. Calculating $72 \\times 73 / 2 = 2628$ yields the exact sum. Quick verification: $70 \\times 71 / 2 = 2485$ and adding $71 + 72 = 143$ gives $2485 + 143 = 2628$, confirming correctness.",
                "direct_dependent_steps": [
                    16
                ],
                "node": "The sum of integers from 1 to 72 is $(72\\times73)/2=2628$."
            },
            {
                "step_id": 18,
                "edge": "The sum of the first 100 positive integers uses the same triangular number formula: $100 \\times 101 / 2 = 5050$. This standard result requires no dependencies beyond basic arithmetic knowledge and serves as a reference for partitioning sums in later steps.",
                "direct_dependent_steps": null,
                "node": "The sum of the first 100 positive integers is $(100\\times101)/2=5050$."
            },
            {
                "step_id": 19,
                "edge": "Subtracting Step 17's sum (1–72) from Step 18's total (1–100) isolates the sum for labels 73–100: $5050 - 2628 = 2422$. Cross-check: the average of 73–100 is $(73+100)/2 = 86.5$, multiplied by 28 terms gives $86.5 \\times 28 = 2422$, validating the result.",
                "direct_dependent_steps": [
                    17,
                    18
                ],
                "node": "Therefore the sum of integers from 73 to 100 equals $5050-2628=2422$."
            },
            {
                "step_id": 20,
                "edge": "Multiplying Step 17's sum (2628) by Step 14's probability weight ($1/128$) gives the non-bye group's contribution: $2628 / 128$. This directly implements Step 16's expectation formula for cards 1–72, converting the summed labels into their probabilistic share of the total expectation.",
                "direct_dependent_steps": [
                    14,
                    17
                ],
                "node": "The contribution from labels 1 to 72 is $2628\\times(1/128)=2628/128$."
            },
            {
                "step_id": 21,
                "edge": "Similarly, Step 19's sum (2422) multiplied by Step 12's weight ($1/64$) yields the bye group's contribution: $2422 / 64$. To facilitate addition, we note $1/64 = 2/128$, so this equals $4844 / 128$, preparing both contributions for common-denominator summation in the next step.",
                "direct_dependent_steps": [
                    12,
                    19
                ],
                "node": "The contribution from labels 73 to 100 is $2422\\times(1/64)=2422/64$."
            },
            {
                "step_id": 22,
                "edge": "Combining Step 20's $2628/128$ and Step 21's $4844/128$ (converted from $2422/64$) gives $(2628 + 4844)/128 = 7472/128$. Arithmetic verification: $2600 + 4800 = 7400$ and $28 + 44 = 72$, so $7400 + 72 = 7472$, confirming the numerator calculation.",
                "direct_dependent_steps": [
                    20,
                    21
                ],
                "node": "Summing these contributions yields $2628/128+2422/64=2628/128+4844/128=7472/128$."
            },
            {
                "step_id": 23,
                "edge": "Simplifying $7472/128$ by dividing numerator and denominator by their greatest common divisor 16: $7472 \\div 16 = 467$ (since $16 \\times 467 = 7472$) and $128 \\div 16 = 8$. This reduces the fraction to $467/8$, the exact expected value, with no further simplification possible as 467 is prime.",
                "direct_dependent_steps": [
                    22
                ],
                "node": "Simplifying gives $7472/128=467/8$."
            }
        ]
    }
]
