[
    {
        "problem_id": 2472,
        "domain": [
            "Mathematics -> Applied Mathematics -> Probability -> Other",
            "Mathematics -> Discrete Mathematics -> Combinatorics"
        ],
        "difficulty": 5.0,
        "problem_text": "You are repeatedly flipping a fair coin. What is the expected number of flips until the first time that your previous 2012 flips are 'HTHT...HT'?",
        "sample_id": 1,
        "final_answer": "The final answer is \\boxed{\\frac{2^{2014}-4}{3}}",
        "steps": [
            {
                "step_id": 1,
                "edge": "We define $S$ as the specific alternating sequence 'HTHT...HT' with length exactly 2012. This establishes the target pattern we are tracking in coin flips, where 'H' and 'T' alternate starting with 'H'. The problem statement requires this precise string for the stopping condition, so this definition sets the foundation for all subsequent combinatorial arguments about substring occurrences.",
                "direct_dependent_steps": null,
                "node": "Let $S$ be the alternating string $HTHT\\ldots HT$ of length 2012."
            },
            {
                "step_id": 2,
                "edge": "Building on the definition of $S$ from Step 1, we introduce $f(n)$ to count binary strings of length $n$ that completely avoid $S$ as a contiguous substring. This function is critical because it quantifies the 'safe' sequences where the stopping condition hasn't yet occurred. The dependency on Step 1 is explicit: without defining $S$, we could not specify what constitutes an avoided substring.",
                "direct_dependent_steps": [
                    1
                ],
                "node": "Let $f(n)$ be the number of binary strings of length $n$ that do not contain $S$ as a substring."
            },
            {
                "step_id": 3,
                "edge": "Using the definition of $S$ from Step 1, we define $g(n)$ as the count of binary strings where $S$ appears for the very first time ending at position $n$. This isolates strings where the prefix of length $n-1$ avoids $S$ (ensuring no prior occurrence), but appending the $n$-th character completes $S$. This distinction is necessary to model the exact stopping time $t=n$, which Step 12 will later formalize.",
                "direct_dependent_steps": [
                    1
                ],
                "node": "Let $g(n)$ be the number of binary strings of length $n$ that contain $S$ but whose prefix of length $n-1$ does not contain $S$."
            },
            {
                "step_id": 4,
                "edge": "Analyzing the structure of $S$ from Step 1 reveals its self-overlap properties: because $S$ alternates strictly as 'HTHT...HT', it can only overlap with a shifted copy of itself at even offsets (e.g., shifting by 2 positions aligns 'HT' with the next 'HT'). Specifically, overlaps occur precisely at shifts $k=2,4,\\ldots,2012$ since the alternating pattern repeats every 2 characters. This combinatorial property is intrinsic to $S$'s definition and will govern how occurrences of $S$ can interact in longer strings.",
                "direct_dependent_steps": [
                    1
                ],
                "node": "The string $S$ has self-overlaps exactly at shifts in $\\{2,4,\\ldots,2012\\}$."
            },
            {
                "step_id": 5,
                "edge": "Starting from the definition of $f(n)$ in Step 2 (strings avoiding $S$), we consider appending the full string $S$ to any such string. This creates a new string of length $n+2012$ that necessarily contains $S$ as a substring. The dependency on Step 2 is essential here: only strings that previously avoided $S$ can be extended to produce a string where $S$ appears, and the extension guarantees containment since $S$ is appended in full.",
                "direct_dependent_steps": [
                    2
                ],
                "node": "Appending $S$ to any string counted by $f(n)$ yields a string of length $n+2012$ that contains $S$."
            },
            {
                "step_id": 6,
                "edge": "Combining Step 4 (self-overlap shifts of $S$) and Step 5 (appending $S$ to $f(n)$-strings), we observe that in the concatenated string from Step 5, the first occurrence of $S$ must end at position $n+k$ where $k$ is an overlap shift from Step 4. This is because $S$'s self-overlaps imply that when appended to a string avoiding $S$, the earliest possible completion of $S$ occurs at these specific offsets—not earlier (due to no prior $S$) and not at odd shifts (due to Step 4's even-only overlaps).",
                "direct_dependent_steps": [
                    4,
                    5
                ],
                "node": "In each concatenated string the first occurrence of $S$ ends at position $n+k$ for some $k\\in\\{2,4,\\ldots,2012\\}$."
            },
            {
                "step_id": 7,
                "edge": "Linking Step 3 (definition of $g(m)$ as first-occurrence counters) with Step 6 (first occurrence ending at $n+k$), we recognize that strings of length $n+k$ where $S$ first appears at the end are exactly those counted by $g(n+k)$. This equivalence holds by Step 3's construction: $g(m)$ requires the prefix of length $m-1$ to avoid $S$, matching Step 6's condition that the first occurrence ends at position $n+k$ (so no earlier occurrence exists).",
                "direct_dependent_steps": [
                    3,
                    6
                ],
                "node": "The strings of length $n+k$ whose first occurrence of $S$ ends at position $n+k$ are exactly those counted by $g(n+k)$."
            },
            {
                "step_id": 8,
                "edge": "By synthesizing Step 5 (concatenation produces strings containing $S$) and Step 7 (such strings are counted by $g(n+k)$), we establish a bijection: every string counted by $f(n)$ maps uniquely to a string in the union of $g(n+k)$ for $k\\in\\{2,4,\\ldots,2012\\}$ via appending $S$, and vice versa. This bijectivity is crucial—it ensures no overcounting or undercounting, as each extended string has exactly one first-occurrence position determined by $S$'s overlaps from Step 4.",
                "direct_dependent_steps": [
                    5,
                    7
                ],
                "node": "The concatenation map from strings counted by $f(n)$ to the union of strings counted by $g(n+k)$ for $k\\in\\{2,4,\\ldots,2012\\}$ is bijective."
            },
            {
                "step_id": 9,
                "edge": "The bijection in Step 8 directly implies that the count $f(n)$ equals the sum of $g(n+k)$ over all valid overlap shifts $k$. Since Step 8 confirms a one-to-one correspondence between the sets counted by $f(n)$ and the union of sets counted by $g(n+k)$, their cardinalities must satisfy this additive relationship. This recurrence will later connect combinatorial counts to probabilities.",
                "direct_dependent_steps": [
                    8
                ],
                "node": "Therefore $f(n) = \\sum_{k\\in\\{2,4,\\ldots,2012\\}} g(n+k)$."
            },
            {
                "step_id": 10,
                "edge": "This step introduces a probabilistic interpretation: the event $\\{t>n\\}$ (stopping time exceeding $n$) means no occurrence of $S$ in the first $n$ flips. This is a fundamental rephrasing of the stopping condition using the definition of $t$ (the first time the previous 2012 flips match $S$), relying solely on problem context rather than prior steps.",
                "direct_dependent_steps": null,
                "node": "The event $\\{t>n\\}$ is equivalent to having no occurrence of $S$ in the first $n$ flips."
            },
            {
                "step_id": 11,
                "edge": "Combining Step 2 (definition of $f(n)$ as $S$-avoiding strings) and Step 10 (probabilistic interpretation of $\\{t>n\\}$), we derive $P(t>n) = f(n)/2^n$. Since all $2^n$ binary strings of length $n$ are equally likely for fair coin flips, and $f(n)$ counts those avoiding $S$ (i.e., satisfying $t>n$), this probability ratio follows directly from counting principles.",
                "direct_dependent_steps": [
                    2,
                    10
                ],
                "node": "Since there are $2^n$ total binary strings of length $n$ and $f(n)$ of them avoid $S$, we have $P(t>n) = f(n)/2^n$."
            },
            {
                "step_id": 12,
                "edge": "This step defines the event $\\{t=n+k\\}$ as the first occurrence of $S$ ending exactly at flip $n+k$. This is a precise characterization of the stopping time's realization, grounded in the problem's requirement that the previous 2012 flips match $S$ at position $n+k$. No prior steps are needed as this is a direct consequence of $t$'s definition.",
                "direct_dependent_steps": null,
                "node": "The event $\\{t=n+k\\}$ is equivalent to having the first occurrence of $S$ end at flip $n+k$."
            },
            {
                "step_id": 13,
                "edge": "Using Step 3 (definition of $g(m)$ as strings where $S$ first appears ending at $m$) and Step 12 (event $\\{t=m\\}$), we obtain $P(t=n+k) = g(n+k)/2^{n+k}$. This follows because $g(n+k)$ counts the favorable outcomes (first occurrence at $n+k$) among $2^{n+k}$ equally likely strings of length $n+k$, matching the probabilistic model of fair coin flips.",
                "direct_dependent_steps": [
                    3,
                    12
                ],
                "node": "Since there are $2^{n+k}$ total binary strings of length $n+k$ and $g(n+k)$ of them satisfy $t=n+k$, we have $P(t=n+k) = g(n+k)/2^{n+k}$."
            },
            {
                "step_id": 14,
                "edge": "Starting from Step 9's identity $f(n) = \\sum_{k\\in\\{2,4,\\ldots,2012\\}} g(n+k)$, we divide both sides by $2^n$ to normalize counts into probabilities. This algebraic manipulation preserves equality while preparing the equation for substitution with probabilistic terms from later steps. The division is valid since $2^n > 0$, and it directly transforms combinatorial counts into scaled values relevant to expectation calculations.",
                "direct_dependent_steps": [
                    9
                ],
                "node": "Dividing the identity from step 9 by $2^n$ yields $f(n)/2^n = \\sum_{k\\in\\{2,4,\\ldots,2012\\}} g(n+k)/2^n$."
            },
            {
                "step_id": 15,
                "edge": "From Step 13, we have $P(t=n+k) = g(n+k)/2^{n+k}$, which rearranges to $g(n+k)/2^n = 2^k P(t=n+k)$ via multiplying both sides by $2^k$. This rewrites the right-hand side of Step 14's normalized equation in terms of probabilities, introducing the $2^k$ scaling factor that accounts for the length difference between $n$ and $n+k$.",
                "direct_dependent_steps": [
                    13
                ],
                "node": "Because $g(n+k)/2^{n+k} = P(t=n+k)$, we have $g(n+k)/2^n = 2^k P(t=n+k)$."
            },
            {
                "step_id": 16,
                "edge": "Substituting Step 11 ($f(n)/2^n = P(t>n)$) and Step 15 ($g(n+k)/2^n = 2^k P(t=n+k)$) into Step 14's normalized equation yields $P(t>n) = \\sum_{k\\in\\{2,4,\\ldots,2012\\}} 2^k P(t=n+k)$. This key identity links the survival probability $P(t>n)$ to the point probabilities of future stopping times, leveraging all three dependencies to transition from combinatorial to probabilistic reasoning.",
                "direct_dependent_steps": [
                    11,
                    14,
                    15
                ],
                "node": "Substituting $f(n)/2^n = P(t>n)$ and $g(n+k)/2^n = 2^k P(t=n+k)$ into the result of step 14 gives $P(t>n) = \\sum_{k\\in\\{2,4,\\ldots,2012\\}} 2^k P(t=n+k)$."
            },
            {
                "step_id": 17,
                "edge": "Summing Step 16's identity over all $n \\geq 0$ aggregates the relationship across all possible string lengths. This summation is valid because both sides are non-negative, and it prepares for interchanging sums to isolate the expectation. The dependency on Step 16 ensures we are summing a proven equality, not an approximation.",
                "direct_dependent_steps": [
                    16
                ],
                "node": "Summing the identity from step 16 over $n$ from 0 to $\\infty$ yields $\\sum_{n\\ge 0} P(t>n) = \\sum_{n\\ge 0} \\sum_{k\\in\\{2,4,\\ldots,2012\\}} 2^k P(t=n+k)$."
            },
            {
                "step_id": 18,
                "edge": "Applying Fubini's theorem for non-negative series to Step 17's double sum, we interchange the order of summation to group terms by $k$. This yields $\\sum_{n\\ge 0} P(t>n) = \\sum_{k\\in\\{2,4,\\ldots,2012\\}} 2^k \\sum_{n\\ge 0} P(t=n+k)$, which simplifies the right-hand side by factoring out $2^k$ from the inner sum over $n$. The interchange is justified since all terms are probabilities (hence non-negative).",
                "direct_dependent_steps": [
                    17
                ],
                "node": "Interchanging the order of summation gives $\\sum_{n\\ge 0} P(t>n) = \\sum_{k\\in\\{2,4,\\ldots,2012\\}} 2^k \\sum_{n\\ge 0} P(t=n+k)$."
            },
            {
                "step_id": 19,
                "edge": "Since $S$ has length 2012 (from Step 1), the stopping time $t$ cannot be less than 2012—any sequence shorter than 2012 flips cannot contain $S$. Thus, $P(t<m) = 0$ for all $m < 2012$, which implies no valid first occurrences exist before position 2012. This fact, dependent on Step 1's string length, will later justify shifting summation indices.",
                "direct_dependent_steps": [
                    1
                ],
                "node": "Any occurrence of $S$ requires at least 2012 flips, so $P(t<m)=0$ for all integers $m<2012$."
            },
            {
                "step_id": 20,
                "edge": "For each $k \\in \\{2,4,\\ldots,2012\\}$, the sum $\\sum_{n\\ge 0} P(t=n+k)$ equals 1 because it aggregates probabilities over all possible stopping times $t \\geq k$. Step 19 ensures $t \\geq 2012 \\geq k$ (since $k \\leq 2012$), so no terms are zeroed out prematurely. This is a standard property of probability mass functions: the total probability over all outcomes is 1.",
                "direct_dependent_steps": [
                    19
                ],
                "node": "For each $k\\in\\{2,4,\\ldots,2012\\}$ we have $\\sum_{n\\ge 0} P(t=n+k) = 1$."
            },
            {
                "step_id": 21,
                "edge": "Substituting Step 20 ($\\sum_{n\\ge 0} P(t=n+k) = 1$) into Step 18's rearranged sum gives $\\sum_{n\\ge 0} P(t>n) = \\sum_{k\\in\\{2,4,\\ldots,2012\\}} 2^k$. This simplification collapses the double sum into a single geometric series, leveraging the unit probability from Step 20 to eliminate the inner summation. The result is pivotal as it expresses the expectation-relevant sum as a concrete series.",
                "direct_dependent_steps": [
                    18,
                    20
                ],
                "node": "Hence $\\sum_{n\\ge 0} P(t>n) = \\sum_{k\\in\\{2,4,\\ldots,2012\\}} 2^k$."
            },
            {
                "step_id": 22,
                "edge": "Rewriting Step 21's sum $\\sum_{k\\in\\{2,4,\\ldots,2012\\}} 2^k$ as $\\sum_{i=1}^{1006} 4^i$ exploits the even indices: letting $k=2i$, we have $2^k = (2^2)^i = 4^i$, and $i$ ranges from 1 to 1006 (since $k=2$ to $2012$ in steps of 2 gives 1006 terms). This reindexing prepares for application of the geometric series formula.",
                "direct_dependent_steps": [
                    21
                ],
                "node": "Let the series on the right be written as $\\sum_{k\\in\\{2,4,\\ldots,2012\\}} 2^k = \\sum_{i=1}^{1006}4^i$."
            },
            {
                "step_id": 23,
                "edge": "This step recalls the standard formula for a finite geometric series: $\\sum_{i=1}^n a r^{i-1} = a \\frac{r^n - 1}{r - 1}$. Here we use the variant starting at $i=1$ with first term $a$ and ratio $r$, which is a foundational result in sequences and series. No prior steps are needed as this is general mathematical knowledge.",
                "direct_dependent_steps": null,
                "node": "A finite geometric series with first term $a$, common ratio $r$, and $n$ terms sums to $\\frac{a(r^n-1)}{r-1}$."
            },
            {
                "step_id": 24,
                "edge": "Applying Step 23's geometric series formula to Step 22's sum $\\sum_{i=1}^{1006} 4^i$ with first term $a=4$, ratio $r=4$, and $n=1006$ terms gives $\\frac{4(4^{1006} - 1)}{4 - 1} = \\frac{4(4^{1006} - 1)}{3}$. Verification: the first term is $4^1=4$, the last term is $4^{1006}$, and the ratio between consecutive terms is 4, confirming correct parameterization.",
                "direct_dependent_steps": [
                    22,
                    23
                ],
                "node": "Applying this formula with $a=4$, $r=4$, and $n=1006$ yields $\\sum_{i=1}^{1006}4^i = \\frac{4(4^{1006}-1)}{3}$."
            },
            {
                "step_id": 25,
                "edge": "Simplifying Step 24's result using $4^{1006} = (2^2)^{1006} = 2^{2012}$, we compute $4 \\cdot 4^{1006} = 4^{1007} = 2^{2014}$. Thus, $\\frac{4(4^{1006} - 1)}{3} = \\frac{2^{2014} - 4}{3}$. Sanity check: $2^{2014} - 4$ is divisible by 3 since $2^{2} \\equiv 1 \\mod 3$, so $2^{2014} = (2^2)^{1007} \\equiv 1^{1007} = 1 \\mod 3$, and $1 - 4 = -3 \\equiv 0 \\mod 3$.",
                "direct_dependent_steps": [
                    24
                ],
                "node": "Since $4^{1006} = 2^{2012}$, we obtain $\\sum_{k\\in\\{2,4,\\ldots,2012\\}} 2^k = \\frac{2^{2014}-4}{3}$."
            },
            {
                "step_id": 26,
                "edge": "This step invokes the standard identity for nonnegative integer-valued random variables: $E(t) = \\sum_{n\\ge 0} P(t > n)$. This result, derived from interchanging sums in $E(t) = \\sum_{m=1}^\\infty m P(t=m)$, is a cornerstone of probability theory and requires no prior steps from this solution as it is general knowledge.",
                "direct_dependent_steps": null,
                "node": "For a nonnegative integer-valued random variable $t$, we have $E(t) = \\sum_{n\\ge 0} P(t>n)$."
            },
            {
                "step_id": 27,
                "edge": "Combining Step 25 ($\\sum_{n\\ge 0} P(t>n) = \\frac{2^{2014}-4}{3}$) and Step 26 ($E(t) = \\sum_{n\\ge 0} P(t>n)$) directly yields $E(t) = \\frac{2^{2014}-4}{3}$. This final substitution unifies the combinatorial-probabilistic derivation with the expectation formula, confirming the expected stopping time matches the computed series sum.",
                "direct_dependent_steps": [
                    25,
                    26
                ],
                "node": "Combining this result with $\\sum_{n\\ge 0} P(t>n) = \\frac{2^{2014}-4}{3}$ yields $E(t) = \\frac{2^{2014}-4}{3}$."
            },
            {
                "step_id": 28,
                "edge": "The conclusion in Step 27 is boxed as the final answer, adhering to mathematical convention for presenting exact results. This step merely formats the derived expectation $\\frac{2^{2014}-4}{3}$ as the solution to the problem, depending solely on Step 27's validated result.",
                "direct_dependent_steps": [
                    27
                ],
                "node": "The final answer is \\boxed{\\frac{2^{2014}-4}{3}}."
            }
        ]
    }
]
