[
    {
        "problem_id": 778,
        "domain": [
            "Mathematics -> Applied Mathematics -> Statistics -> Probability -> Other",
            "Mathematics -> Discrete Mathematics -> Combinatorics"
        ],
        "difficulty": 4.0,
        "problem_text": "There are 21 competitors with distinct skill levels numbered $1,2, \\ldots, 21$. They participate in a pingpong tournament as follows. First, a random competitor is chosen to be \"active\", while the rest are \"inactive.\" Every round, a random inactive competitor is chosen to play against the current active one. The player with the higher skill will win and become (or remain) active, while the loser will be eliminated from the tournament. The tournament lasts for 20 rounds, after which there will only be one player remaining. Alice is the competitor with skill 11. What is the expected number of games that she will get to play?",
        "sample_id": 1,
        "final_answer": "The final answer is \\boxed{47/42}",
        "steps": [
            {
                "step_id": 1,
                "edge": "We establish $n=21$ as the total number of competitors since the problem explicitly states there are 21 distinct skill levels. This variable substitution simplifies subsequent expressions by avoiding repeated numerical references and aligns with standard combinatorial notation for tournament size.",
                "direct_dependent_steps": null,
                "node": "Let $n=21$ be the total number of competitors."
            },
            {
                "step_id": 2,
                "edge": "We assign $k=11$ to represent Alice's skill level as given in the problem statement. This notation allows us to generalize the solution for any skill level while specifically tracking Alice's participation throughout the tournament.",
                "direct_dependent_steps": null,
                "node": "Let $k=11$ be the skill level of Alice."
            },
            {
                "step_id": 3,
                "edge": "We model the random tournament progression using a uniform random permutation $\\pi$ of $\\{1,\\dots,n\\}$. This representation is valid because the initial active player and subsequent challengers are selected uniformly at random from remaining competitors, and a random permutation captures all possible orderings with equal probability—a standard technique in combinatorial probability for sequential random selection.",
                "direct_dependent_steps": null,
                "node": "Represent the tournament by choosing a random permutation $\\pi$ of $\\{1,\\dots,n\\}$."
            },
            {
                "step_id": 4,
                "edge": "Building on the permutation model from Step 3, we interpret $\\pi_1$ as the initial active player. This assignment directly corresponds to the tournament rule where one random competitor is chosen to be active at the start, and the first element of the permutation naturally encodes this initial selection.",
                "direct_dependent_steps": [
                    3
                ],
                "node": "In this permutation $\\pi_1$ is the initial active player."
            },
            {
                "step_id": 5,
                "edge": "Continuing from the permutation framework in Step 3, we designate $\\pi_2,\\dots,\\pi_n$ as the sequence of challengers for rounds 2 through $n$. This follows the tournament mechanics where inactive competitors are randomly selected in subsequent rounds, and the permutation order precisely defines the challenger arrival sequence.",
                "direct_dependent_steps": [
                    3
                ],
                "node": "Denote $\\pi_2,\\dots,\\pi_n$ as the challengers arriving in rounds 2 through $n$."
            },
            {
                "step_id": 6,
                "edge": "Using the permutation structure from Step 3, we define $M_j = \\max(\\pi_1,\\dots,\\pi_j)$ for $1 \\leq j \\leq n$. This tracks the highest skill level among the first $j$ competitors processed, which is critical because the active player after $j$ competitors have been involved must be the one with maximum skill (as winners retain activity).",
                "direct_dependent_steps": [
                    3
                ],
                "node": "Define $M_j=\\max(\\pi_1,\\dots,\\pi_j)$ for each $j$ with $1\\le j\\le n$."
            },
            {
                "step_id": 7,
                "edge": "We combine Step 4 (where $\\pi_1$ is initial active) and Step 6 (defining $M_j$ as the running maximum) to determine the active player in round $m$. After $m-1$ competitors have participated (initial active plus $m-2$ challengers), the active player must be the maximum skill among them, hence $M_{m-1}$. This holds for $2 \\leq m \\leq n$ since round $m$ begins after $m-1$ competitors have been processed.",
                "direct_dependent_steps": [
                    4,
                    6
                ],
                "node": "In round $m$ with $2\\le m\\le n$ the active player is $M_{m-1}$."
            },
            {
                "step_id": 8,
                "edge": "From Step 5, which identifies $\\pi_2,\\dots,\\pi_n$ as the challenger sequence, we directly assign $\\pi_m$ as the challenger in round $m$ for $2 \\leq m \\leq n$. This is a straightforward mapping of the permutation index to the round number, consistent with the tournament's challenger selection process.",
                "direct_dependent_steps": [
                    5
                ],
                "node": "In round $m$ with $2\\le m\\le n$ the challenger is $\\pi_m$."
            },
            {
                "step_id": 9,
                "edge": "We synthesize Step 2 (Alice's skill $k=11$), Step 7 (active player is $M_{m-1}$ in round $m$), and Step 8 (challenger is $\\pi_m$) to characterize Alice's participation. Alice plays in round $m$ if she is the challenger ($\\pi_m = k$) or the active player ($M_{m-1} = k$), as these are the only roles in a match. This condition precisely captures when Alice is involved in a game.",
                "direct_dependent_steps": [
                    2,
                    7,
                    8
                ],
                "node": "Alice plays in round $m$ if and only if $\\pi_m=k$ or $M_{m-1}=k$."
            },
            {
                "step_id": 10,
                "edge": "Using Step 3 (permutation uniqueness) and Step 9 (Alice's participation condition), we verify mutual exclusivity: $\\pi_m = k$ implies $k$ appears at position $m$, so $k$ cannot be in the first $m-1$ positions (making $M_{m-1} \\neq k$), while $M_{m-1} = k$ requires $k$ in the first $m-1$ positions (so $\\pi_m \\neq k$). Thus, the events cannot co-occur for any fixed $m$.",
                "direct_dependent_steps": [
                    3,
                    9
                ],
                "node": "The events $\\pi_m=k$ and $M_{m-1}=k$ are mutually exclusive for any given $m$."
            },
            {
                "step_id": 11,
                "edge": "Given Step 9 (Alice plays if $\\pi_m=k$ or $M_{m-1}=k$) and Step 10 (mutual exclusivity), the indicator variable for Alice playing in round $m$ equals the sum $1_{\\pi_m=k} + 1_{M_{m-1}=k}$. This is valid because for mutually exclusive events, the indicator of the union is the sum of the individual indicators, avoiding double-counting.",
                "direct_dependent_steps": [
                    9,
                    10
                ],
                "node": "Hence the indicator of Alice playing in round $m$ equals $1_{\\pi_m=k}+1_{M_{m-1}=k}$."
            },
            {
                "step_id": 12,
                "edge": "Summing the indicator from Step 11 over all rounds $m=2$ to $n$ gives the total games Alice plays. This leverages the linearity of summation: the count of games is the sum of indicators for each round, a standard technique for converting event occurrences into a cumulative count.",
                "direct_dependent_steps": [
                    11
                ],
                "node": "Therefore the total number of games Alice plays equals $\\sum_{m=2}^n\\bigl(1_{\\pi_m=k}+1_{M_{m-1}=k}\\bigr)$."
            },
            {
                "step_id": 13,
                "edge": "From Step 3 (permutation contains all skills exactly once) and Step 12 (sum over $m=2$ to $n$), $k$ appears exactly once in positions $\\pi_2$ through $\\pi_n$ (since position $\\pi_1$ is excluded). Thus, $\\sum_{m=2}^n 1_{\\pi_m=k} = 1$, as the indicator is 1 precisely at the position where $k$ occurs in the challenger sequence.",
                "direct_dependent_steps": [
                    3,
                    12
                ],
                "node": "Since $k$ appears exactly once among $\\pi_2,\\dots,\\pi_n$ we have $\\sum_{m=2}^n1_{\\pi_m=k}=1$."
            },
            {
                "step_id": 14,
                "edge": "Reindexing the sum $\\sum_{m=2}^n 1_{M_{m-1}=k}$ from Step 12 using $j = m-1$ (so $m=2 \\to j=1$, $m=n \\to j=n-1$) yields $\\sum_{j=1}^{n-1} 1_{M_j=k}$. This substitution, justified by Step 6's definition of $M_j$, simplifies the expression by aligning the index with the running maximum definition.",
                "direct_dependent_steps": [
                    6,
                    12
                ],
                "node": "We also have $\\sum_{m=2}^n1_{M_{m-1}=k}=\\sum_{j=1}^{n-1}1_{M_j=k}$."
            },
            {
                "step_id": 15,
                "edge": "Combining Step 13 (challenger contribution is 1) and Step 14 (active player contribution is $\\sum_{j=1}^{n-1} 1_{M_j=k}$), Alice's total games equal $1 + \\sum_{j=1}^{n-1} 1_{M_j=k}$. This consolidates the two components of her participation—becoming active via a challenge and subsequent games as active player—into a single expression.",
                "direct_dependent_steps": [
                    13,
                    14
                ],
                "node": "Thus Alice’s total games equals $1+\\sum_{j=1}^{n-1}1_{M_j=k}$."
            },
            {
                "step_id": 16,
                "edge": "Taking expectations of Step 15's equation, linearity of expectation gives $E[\\#\\mathrm{games}] = 1 + \\sum_{j=1}^{n-1} P(M_j = k)$. This converts the random count into a deterministic sum of probabilities, a crucial simplification for computing the expected value.",
                "direct_dependent_steps": [
                    15
                ],
                "node": "Taking expectations yields $E[\\#\\mathrm{games}]=1+\\sum_{j=1}^{n-1}P(M_j=k)$."
            },
            {
                "step_id": 17,
                "edge": "To compute $P(M_j = k)$ from Step 6 and Step 16, note that $M_j = k$ iff $k$ is in the first $j$ positions and all higher skills ($k+1$ to $n$) are excluded from these positions. The number of favorable permutations is $\\binom{k-1}{j-1}$ (choosing $j-1$ skills below $k$ to join $k$ in the first $j$ positions), while total permutations of $j$ positions is $\\binom{n}{j}$. Thus, $P(M_j = k) = \\binom{k-1}{j-1} / \\binom{n}{j}$.",
                "direct_dependent_steps": [
                    6,
                    16
                ],
                "node": "The probability $P(M_j=k)$ equals $\\binom{k-1}{j-1}/\\binom{n}{j}$."
            },
            {
                "step_id": 18,
                "edge": "From Step 17's probability formula, $\\binom{k-1}{j-1} = 0$ when $j-1 > k-1$ (i.e., $j > k$) because we cannot choose more elements than available from the $k-1$ skills below $k$. This binomial coefficient property eliminates terms beyond $j=k$ in subsequent sums.",
                "direct_dependent_steps": [
                    17
                ],
                "node": "For $j>k$ we have $\\binom{k-1}{j-1}=0$."
            },
            {
                "step_id": 19,
                "edge": "Applying Step 16 (expectation sum), Step 17 (probability formula), and Step 18 (zero terms for $j > k$), we truncate the sum to $j=1$ to $k$: $\\sum_{j=1}^{n-1} P(M_j=k) = \\sum_{j=1}^{k} \\binom{k-1}{j-1} / \\binom{n}{j}$. This reduces the upper limit from $n-1$ to $k$, simplifying computation without changing the value.",
                "direct_dependent_steps": [
                    16,
                    17,
                    18
                ],
                "node": "Hence $\\sum_{j=1}^{n-1}P(M_j=k)=\\sum_{j=1}^{k}\\frac{\\binom{k-1}{j-1}}{\\binom{n}{j}}$."
            },
            {
                "step_id": 20,
                "edge": "Substituting Step 19's truncated sum into Step 16's expectation formula gives $E[\\#\\mathrm{games}] = 1 + \\sum_{m=1}^{k} \\binom{k-1}{m-1} / \\binom{n}{m}$ (renaming $j$ to $m$ for clarity). This expresses the expectation as a finite sum over $m$, ready for combinatorial simplification.",
                "direct_dependent_steps": [
                    16,
                    19
                ],
                "node": "Therefore $E[\\#\\mathrm{games}]=1+\\sum_{m=1}^{k}\\frac{\\binom{k-1}{m-1}}{\\binom{n}{m}}$."
            },
            {
                "step_id": 21,
                "edge": "We apply a standard combinatorial identity for the sum $\\sum_{m=1}^{k} \\binom{k-1}{m-1} / \\binom{n}{m} = (n+1) / [(n-k+1)(n-k+2)]$. This identity, derivable via hypergeometric series or combinatorial arguments, converts the sum into a closed rational expression, avoiding direct summation.",
                "direct_dependent_steps": [
                    20
                ],
                "node": "A standard identity is $\\sum_{m=1}^{k}\\frac{\\binom{k-1}{m-1}}{\\binom{n}{m}}=\\frac{n+1}{(n-k+1)(n-k+2)}$."
            },
            {
                "step_id": 22,
                "edge": "Substituting Step 21's identity into Step 20's expression yields $E[\\#\\mathrm{games}] = 1 + (n+1) / [(n-k+1)(n-k+2)]$. This closed form is algebraically simpler and prepares for numerical evaluation, though it requires adjustment for the initial active case as we'll see.",
                "direct_dependent_steps": [
                    20,
                    21
                ],
                "node": "Substituting this identity gives $E[\\#\\mathrm{games}]=1+\\frac{n+1}{(n-k+1)(n-k+2)}$."
            },
            {
                "step_id": 23,
                "edge": "In a random permutation (Step 3), any specific skill level is equally likely to occupy position $\\pi_1$, so $P(\\pi_1 = k) = 1/n$. This basic probability follows from the uniform distribution over all $n!$ permutations.",
                "direct_dependent_steps": [
                    3
                ],
                "node": "The probability that $\\pi_1=k$ is $1/n$."
            },
            {
                "step_id": 24,
                "edge": "Step 22's expression overcounts by 1 when Alice is initial active (Step 23: probability $1/n$), because Step 13 assumed the challenger sum is 1 (requiring $k$ not in $\\pi_1$). When $\\pi_1 = k$, Alice never plays as challenger, so we subtract $1/n$ to correct the expectation for this scenario.",
                "direct_dependent_steps": [
                    22,
                    23
                ],
                "node": "When $\\pi_1=k$ Alice does not play as a challenger so we subtract $1/n$ in the expectation."
            },
            {
                "step_id": 25,
                "edge": "Combining Step 22 (base expectation) and Step 24 (correction for initial active case), we write the adjusted expectation as $E[\\#\\mathrm{games}] = (n+1)/[(n-k+1)(n-k+2)] + 1 - 1/n$. This accounts for all participation scenarios by removing the overcounted challenger game when Alice starts active.",
                "direct_dependent_steps": [
                    22,
                    24
                ],
                "node": "Hence $E[\\#\\mathrm{games}]=\\frac{n+1}{(n-k+1)(n-k+2)}+1-\\frac{1}{n}$."
            },
            {
                "step_id": 26,
                "edge": "Since Step 1 sets $n=21$ and Step 2 sets $k=11$ with $k < n$, Alice cannot be the sole player (which would require $k=n$), so no additional subtraction terms are needed. This ensures the correction in Step 24 is sufficient.",
                "direct_dependent_steps": [
                    1,
                    2
                ],
                "node": "Since $k<n$ there is no additional subtraction term."
            },
            {
                "step_id": 27,
                "edge": "Substituting $n=21$ (Step 1), $k=11$ (Step 2), and Step 26's confirmation of no extra terms into Step 25's formula gives $E[\\#\\mathrm{games}] = 22/(11 \\cdot 12) + 1 - 1/21$. This numerical instantiation prepares for arithmetic simplification.",
                "direct_dependent_steps": [
                    1,
                    2,
                    25,
                    26
                ],
                "node": "Substituting $n=21$ and $k=11$ yields $E[\\#\\mathrm{games}]=\\frac{22}{11\\cdot12}+1-\\frac{1}{21}$."
            },
            {
                "step_id": 28,
                "edge": "Simplifying $22/(11 \\cdot 12)$ from Step 27: $22 / 11 = 2$, so $2 / 12 = 1/6$. Sanity check: $11 \\cdot 12 = 132$, $22 / 132 = 1/6$, which is correct.",
                "direct_dependent_steps": [
                    27
                ],
                "node": "Simplifying $\\frac{22}{11\\cdot12}$ gives $\\frac{1}{6}$."
            },
            {
                "step_id": 29,
                "edge": "Adding 1 to $1/6$ (from Step 27 and Step 28) yields $7/6$: $1 = 6/6$, so $6/6 + 1/6 = 7/6$. This combines the constant term with the simplified fraction.",
                "direct_dependent_steps": [
                    27,
                    28
                ],
                "node": "Adding $1$ to $\\frac{1}{6}$ yields $\\frac{7}{6}$."
            },
            {
                "step_id": 30,
                "edge": "Expressing $7/6$ (from Step 29) with denominator 42: multiply numerator and denominator by 7 ($6 \\cdot 7 = 42$), giving $(7 \\cdot 7)/(6 \\cdot 7) = 49/42$. This common denominator facilitates subtraction with the next term.",
                "direct_dependent_steps": [
                    29
                ],
                "node": "Expressing $\\frac{7}{6}$ as a fraction over $42$ gives $\\frac{49}{42}$."
            },
            {
                "step_id": 31,
                "edge": "Expressing $1/21$ (from Step 27) with denominator 42: multiply numerator and denominator by 2 ($21 \\cdot 2 = 42$), yielding $2/42$. This matches the denominator used in Step 30 for direct subtraction.",
                "direct_dependent_steps": [
                    27
                ],
                "node": "Expressing $\\frac{1}{21}$ as a fraction over $42$ gives $\\frac{2}{42}$."
            },
            {
                "step_id": 32,
                "edge": "Subtracting $2/42$ (Step 31) from $49/42$ (Step 30) gives $(49 - 2)/42 = 47/42$. This final arithmetic step computes the exact expected value, which is the solution to the problem.",
                "direct_dependent_steps": [
                    30,
                    31
                ],
                "node": "Subtracting $\\frac{2}{42}$ from $\\frac{49}{42}$ yields $\\frac{47}{42}$."
            }
        ]
    }
]
