[
    {
        "problem_id": 2709,
        "domain": [
            "Mathematics -> Applied Mathematics -> Probability -> Other",
            "Mathematics -> Discrete Mathematics -> Algorithms"
        ],
        "difficulty": 5.25,
        "problem_text": "Caroline starts with the number 1, and every second she flips a fair coin; if it lands heads, she adds 1 to her number, and if it lands tails she multiplies her number by 2. Compute the expected number of seconds it takes for her number to become a multiple of 2021.",
        "sample_id": 1,
        "final_answer": "The final answer is \\boxed{4040}",
        "steps": [
            {
                "step_id": 1,
                "edge": "We model Caroline's process modulo 2021 because we care only about when the number becomes a multiple of 2021 (i.e., reaches 0 modulo 2021). This reduces the infinite state space of natural numbers to a finite state space of residues modulo 2021, which is essential for applying Markov chain theory. The state space $\\mathbb{Z}/2021\\mathbb{Z}$ naturally captures all relevant information for future transitions, as the operations (add 1 or multiply by 2) depend only on the current residue.",
                "direct_dependent_steps": null,
                "node": "We model the process of Caroline’s number modulo 2021 as a Markov chain on the state space $\\mathbb{Z}/2021\\mathbb{Z}$."
            },
            {
                "step_id": 2,
                "edge": "Building on the Markov chain model from Step 1, we define the transition for a heads flip. The problem states that heads adds 1 to the current number. Since we work modulo 2021 (as established in Step 1), this operation is equivalent to adding 1 modulo 2021. This deterministic transition rule directly follows the problem statement and the modulo reduction chosen in Step 1, ensuring the state remains within $\\mathbb{Z}/2021\\mathbb{Z}$.",
                "direct_dependent_steps": [
                    1
                ],
                "node": "A flip of heads in this chain corresponds to adding $1$ modulo $2021$ to the current state."
            },
            {
                "step_id": 3,
                "edge": "Similarly to Step 2, we define the transition for a tails flip using the model from Step 1. The problem specifies that tails multiplies the current number by 2. Working modulo 2021 (as set up in Step 1), this becomes multiplication by 2 modulo 2021. This deterministic rule is consistent with the problem's description and maintains the state within the finite residue system $\\mathbb{Z}/2021\\mathbb{Z}$.",
                "direct_dependent_steps": [
                    1
                ],
                "node": "A flip of tails in this chain corresponds to multiplying the current state by $2$ modulo $2021$."
            },
            {
                "step_id": 4,
                "edge": "We establish irreducibility by leveraging the heads transition from Step 2. From any state $i$, repeated application of the heads operation (adding 1 modulo 2021) allows reaching any state $j$: specifically, adding 1 exactly $(j-i) \\mod 2021$ times moves from $i$ to $j$. Since heads occurs with probability $1/2 > 0$ at each step, there is a positive-probability path between any two states, satisfying the definition of irreducibility for Markov chains.",
                "direct_dependent_steps": [
                    2
                ],
                "node": "The chain is irreducible because repeated additions of $1$ can move from any residue to any other residue in $\\mathbb{Z}/2021\\mathbb{Z}$."
            },
            {
                "step_id": 5,
                "edge": "We demonstrate a key component of aperiodicity using the tails transition from Step 3. At state 0, a tails flip multiplies by 2 modulo 2021, and $0 \\cdot 2 \\equiv 0 \\pmod{2021}$, so state 0 transitions to itself with probability $1/2$. A state with a self-loop (positive probability of remaining in place) has period 1, meaning it is aperiodic. While full chain aperiodicity also requires irreducibility (Step 4), Step 5 relies solely on Step 3 to show state 0 is aperiodic, which is critical for later establishing global aperiodicity.",
                "direct_dependent_steps": [
                    3
                ],
                "node": "The chain is aperiodic because state $0$ transitions to itself with probability $1/2$ via the multiply-by-$2$ operation since $0\\cdot2\\equiv0\\pmod{2021}$."
            },
            {
                "step_id": 6,
                "edge": "We apply a fundamental Markov chain theorem: an irreducible and aperiodic finite-state chain has a unique stationary distribution. Step 4 confirmed irreducibility (via the heads transition enabling movement between any states), and Step 5 confirmed aperiodicity (via the self-loop at state 0 from the tails transition). This theorem guarantees the existence and uniqueness of the stationary distribution for our chain, which is essential for computing expected times.",
                "direct_dependent_steps": [
                    4,
                    5
                ],
                "node": "An irreducible aperiodic finite-state Markov chain has a unique stationary distribution."
            },
            {
                "step_id": 7,
                "edge": "We verify stationarity of the uniform distribution using the transition rules from Steps 2 and 3. For uniform $\\pi(i) = 1/2021$, we check the balance equation $\\pi(j) = \\sum_i \\pi(i) P(i,j)$ for all $j$. From any state $i$, with probability $1/2$ we go to $(i+1) \\mod 2021$ (Step 2, heads) and with $1/2$ to $(2i) \\mod 2021$ (Step 3, tails). For fixed $j$, predecessors are: (i) $i = j-1 \\mod 2021$ (from heads), and (ii) $i = 2^{-1}j \\mod 2021$ (from tails; unique since 2 and 2021 are coprime). Thus inflow to $j$ is $(1/2)(1/2021) + (1/2)(1/2021) = 1/2021 = \\pi(j)$, confirming stationarity.",
                "direct_dependent_steps": [
                    2,
                    3
                ],
                "node": "The uniform distribution on $\\mathbb{Z}/2021\\mathbb{Z}$ with probability $1/2021$ for each state is stationary for this chain."
            },
            {
                "step_id": 8,
                "edge": "We combine Step 6 (unique stationary distribution) and Step 7 (uniform distribution is stationary). Since Step 6 guarantees uniqueness and Step 7 exhibits a valid stationary distribution, the stationary distribution must be uniform. This is a direct application of the uniqueness property, eliminating the need for further verification of stationarity.",
                "direct_dependent_steps": [
                    6,
                    7
                ],
                "node": "By uniqueness the stationary distribution of this chain must be uniform."
            },
            {
                "step_id": 9,
                "edge": "We cite a standard result from Markov chain theory: for a positive recurrent state in an irreducible chain, the expected return time equals the reciprocal of its stationary probability. Our finite-state irreducible aperiodic chain (Steps 4, 5, 6) is positive recurrent, making this theorem applicable. This principle links stationary probabilities to expected hitting times, which is central to solving the problem.",
                "direct_dependent_steps": null,
                "node": "In a positive recurrent Markov chain the expected return time to a state equals the reciprocal of its stationary probability."
            },
            {
                "step_id": 10,
                "edge": "Using Step 8 (stationary distribution is uniform over 2021 states), each state has stationary probability $1/2021$. Specifically for state 0, this gives $\\pi(0) = 1/2021$. The uniformity established in Step 8 directly implies this value without additional computation.",
                "direct_dependent_steps": [
                    8
                ],
                "node": "The stationary probability of state $0$ under the uniform distribution is $1/2021$."
            },
            {
                "step_id": 11,
                "edge": "Applying Step 9's theorem, the expected return time to state 0 is $1 / \\pi(0)$. Substituting $\\pi(0) = 1/2021$ from Step 10 yields $1 / (1/2021) = 2021$. This reciprocal relationship is a direct consequence of the stationary distribution's interpretation as the long-run frequency of visits.",
                "direct_dependent_steps": [
                    9,
                    10
                ],
                "node": "Hence the expected return time to state $0$ is $2021$."
            },
            {
                "step_id": 12,
                "edge": "We define $E_{0\\to0}$ as the expected return time to state 0 starting from 0. This notation is conventional in Markov chain analysis and isolates the return time concept for algebraic manipulation. It is introduced here to facilitate the law of total expectation applied in later steps.",
                "direct_dependent_steps": null,
                "node": "Let $E_{0\\to0}$ denote the expected return time to state $0$ starting from $0$."
            },
            {
                "step_id": 13,
                "edge": "Similarly, we define $E_{1\\to0}$ as the expected hitting time to state 0 starting from state 1. This notation is necessary because Caroline begins at 1 (the initial number), and we ultimately need this specific hitting time. The definition sets up the key unknown for the final calculation.",
                "direct_dependent_steps": null,
                "node": "Let $E_{1\\to0}$ denote the expected hitting time of state $0$ starting from state $1$."
            },
            {
                "step_id": 14,
                "edge": "Using Step 5's observation about state 0, a tails flip (probability $1/2$) multiplies 0 by 2 modulo 2021, yielding $0 \\cdot 2 \\equiv 0 \\pmod{2021}$. Thus, starting from 0, tails immediately returns to 0 in one step. This self-loop probability is critical for decomposing the return time.",
                "direct_dependent_steps": [
                    5
                ],
                "node": "Starting from state $0$, if the first flip is tails then the chain returns to $0$ in one step with probability $1/2$."
            },
            {
                "step_id": 15,
                "edge": "Applying Step 2's heads transition rule, adding 1 modulo 2021 to state 0 gives $0 + 1 \\equiv 1 \\pmod{2021}$. Since heads occurs with probability $1/2$, starting from 0, the chain moves to state 1 in one step with this probability. This transition is foundational for the alternative path in the return time decomposition.",
                "direct_dependent_steps": [
                    2
                ],
                "node": "Starting from state $0$, if the first flip is heads then the chain moves to state $1$ in one step with probability $1/2$."
            },
            {
                "step_id": 16,
                "edge": "If the first flip is heads (Step 15), we spend one step to reach state 1, and then require $E_{1\\to0}$ additional steps (defined in Step 13) to hit 0 from there. Thus, the total expected time in this case is $1 + E_{1\\to0}$. This breaks down the return time into the initial step plus the subsequent hitting time from state 1.",
                "direct_dependent_steps": [
                    13,
                    15
                ],
                "node": "In the case of the first flip being heads the expected return time to $0$ is $1+E_{1\\to0}$ steps since one step was used to reach state $1$."
            },
            {
                "step_id": 17,
                "edge": "We apply the law of total expectation to $E_{0\\to0}$ (Step 12), conditioning on the first flip. From Step 14, tails (prob $1/2$) gives return time 1. From Step 16, heads (prob $1/2$) gives return time $1 + E_{1\\to0}$. Combining these, $E_{0\\to0} = (1/2) \\cdot 1 + (1/2) \\cdot (1 + E_{1\\to0})$. This equation captures all possible first-step outcomes.",
                "direct_dependent_steps": [
                    12,
                    14,
                    16
                ],
                "node": "By the law of total expectation we have $E_{0\\to0}=\\tfrac12\\cdot1+\\tfrac12\\cdot(1+E_{1\\to0})$."
            },
            {
                "step_id": 18,
                "edge": "Substituting $E_{0\\to0} = 2021$ from Step 11 into Step 17's equation gives $2021 = (1/2) \\cdot 1 + (1/2) \\cdot (1 + E_{1\\to0})$. This replacement uses the known return time (from stationary distribution theory) to create an equation solvable for the unknown $E_{1\\to0}$.",
                "direct_dependent_steps": [
                    11,
                    17
                ],
                "node": "Substituting $E_{0\\to0}=2021$ gives $2021=\\tfrac12+\\tfrac12(1+E_{1\\to0})$."
            },
            {
                "step_id": 19,
                "edge": "Simplifying Step 18's equation: $2021 = \\frac{1}{2} + \\frac{1}{2}(1 + E_{1\\to0}) = \\frac{1}{2} + \\frac{1}{2} + \\frac{1}{2}E_{1\\to0} = 1 + \\frac{1}{2}E_{1\\to0}$. Algebraic manipulation combines the constant terms, isolating the term with $E_{1\\to0}$ for solving.",
                "direct_dependent_steps": [
                    18
                ],
                "node": "Combining terms yields $2021=1+\\tfrac12E_{1\\to0}$."
            },
            {
                "step_id": 20,
                "edge": "Solving $2021 = 1 + \\frac{1}{2}E_{1\\to0}$ (Step 19): subtract 1 to get $2020 = \\frac{1}{2}E_{1\\to0}$, then multiply by 2 to obtain $E_{1\\to0} = 4040$. Sanity check: $2 \\times (2021 - 1) = 2 \\times 2020 = 4040$, which matches. This is the expected hitting time from state 1 to 0.",
                "direct_dependent_steps": [
                    19
                ],
                "node": "Solving for $E_{1\\to0}$ gives $E_{1\\to0}=2(2021-1)=4040$."
            },
            {
                "step_id": 21,
                "edge": "Step 20 computed $E_{1\\to0} = 4040$, which is the expected time to reach state 0 (multiple of 2021) starting from state 1. Since Caroline begins at 1 (initial number), this directly answers the problem. The boxed result follows from this hitting time calculation.",
                "direct_dependent_steps": [
                    20
                ],
                "node": "The final answer is \\boxed{4040}"
            }
        ]
    }
]
