[
    {
        "problem_id": 3163,
        "domain": [
            "Mathematics -> Applied Mathematics -> Probability -> Other"
        ],
        "difficulty": 4.5,
        "problem_text": "The integers \\(1,2,3,4,5,6,7,8,9,10\\) are written on a blackboard. Each day, a teacher chooses one of the integers uniformly at random and decreases it by 1. Let \\(X\\) be the expected value of the number of days which elapse before there are no longer positive integers on the board. Estimate \\(X\\). An estimate of \\(E\\) earns \\(\\left\\lfloor 20 \\cdot 2^{-|X-E| / 8}\\right\\rfloor\\) points.",
        "sample_id": 1,
        "final_answer": "\\boxed{120.75280458176904}",
        "steps": [
            {
                "step_id": 1,
                "edge": "We establish the initial state of the problem as given in the problem statement: there are exactly 10 distinct integers on the blackboard, specifically the sequence from 1 through 10. This foundational observation provides the complete set of values that will be subject to the daily decrement operations throughout the process.",
                "direct_dependent_steps": null,
                "node": "There are 10 integers initially with values $1,2,\\dots,10$."
            },
            {
                "step_id": 2,
                "edge": "We note the daily operation mechanism directly described in the problem: each day involves selecting one integer uniformly at random from the 10 available integers and reducing its value by exactly 1. This uniform random selection implies each integer has an equal probability of being chosen on any given day, which is critical for modeling the stochastic process.",
                "direct_dependent_steps": null,
                "node": "Each day, the teacher chooses one integer uniformly and decreases its value by $1$."
            },
            {
                "step_id": 3,
                "edge": "We identify the stopping condition as explicitly defined in the problem statement: the process terminates when no positive integers remain on the board. This means all integers must reach zero or negative values, which occurs precisely when each integer \\(i\\) has been decreased at least \\(i\\) times (since it starts at value \\(i\\)).",
                "direct_dependent_steps": null,
                "node": "The process stops when all integers on the board are non-positive."
            },
            {
                "step_id": 4,
                "edge": "We introduce the auxiliary random variable \\(T_i\\) to represent the number of days required for integer \\(i\\) to be decreased exactly \\(i\\) times. This definition is motivated by the need to track individual completion times for each integer, as the overall process depends on all integers reaching their respective termination thresholds. Since integer \\(i\\) starts at value \\(i\\), \\(T_i\\) corresponds to the time until it becomes non-positive.",
                "direct_dependent_steps": null,
                "node": "Let $T_i$ be the number of days until integer $i$ has been decreased $i$ times."
            },
            {
                "step_id": 5,
                "edge": "Building on Step 2, where each day involves a uniform random selection of one integer, we recognize that for a specific integer \\(i\\), each day constitutes an independent Bernoulli trial with 'success' defined as selecting \\(i\\). Since there are 10 integers, the success probability per trial is \\(1/10\\). This framing is essential because it characterizes the probability structure underlying the decrements for any single integer.",
                "direct_dependent_steps": [
                    2
                ],
                "node": "Each daily choice for integer $i$ is a Bernoulli trial with success probability $1/10$."
            },
            {
                "step_id": 6,
                "edge": "Combining Step 4 (which defines \\(T_i\\) as the days until \\(i\\) decrements for integer \\(i\\)) and Step 5 (which establishes each decrement for \\(i\\) as a Bernoulli trial with \\(p = 1/10\\)), we conclude \\(T_i\\) follows a negative binomial distribution. Specifically, it is the number of trials needed to achieve \\(i\\) successes in independent Bernoulli trials with success probability \\(p = 1/10\\), hence parameters \\((r = i, p = 1/10)\\).",
                "direct_dependent_steps": [
                    4,
                    5
                ],
                "node": "Therefore $T_i$ follows a negative binomial distribution with parameters $(i,p=1/10)$."
            },
            {
                "step_id": 7,
                "edge": "Integrating Step 3 (the process stops when all integers are non-positive) and Step 4 (where \\(T_i\\) is the time until integer \\(i\\) is decreased \\(i\\) times), we observe that the process halts only when every integer \\(i\\) has been hit at least \\(i\\) times. This is because integer \\(i\\) becomes non-positive exactly after \\(i\\) decrements, so the global stopping condition requires all individual completion events to occur.",
                "direct_dependent_steps": [
                    3,
                    4
                ],
                "node": "The process stops when each integer $i$ has been hit at least $i$ times."
            },
            {
                "step_id": 8,
                "edge": "From Step 7, which states the process stops when each integer has been hit at least \\(i\\) times, we deduce that the total days \\(X\\) must equal the maximum of the individual completion times \\(T_1, T_2, \\dots, T_{10}\\). This is because the process continues until the last integer reaches its required number of decrements, making \\(X = \\max(T_1, \\dots, T_{10})\\) the precise stopping time.",
                "direct_dependent_steps": [
                    7
                ],
                "node": "Hence the total number of days $X$ equals $\\max(T_1,T_2,\\dots,T_{10})$."
            },
            {
                "step_id": 9,
                "edge": "Using Step 8, which establishes \\(X = \\max(T_1, \\dots, T_{10})\\), we directly apply the linearity of expectation to the maximum function, yielding \\(E[X] = E[\\max(T_1, \\dots, T_{10})]\\). This rephrasing focuses the problem on computing the expectation of the maximum of the negative binomial variables.",
                "direct_dependent_steps": [
                    8
                ],
                "node": "Therefore $E[X]=E\\bigl[\\max(T_1,\\dots,T_{10})\\bigr]$."
            },
            {
                "step_id": 10,
                "edge": "To simplify the complex expectation in Step 9 (which involves the maximum of discrete negative binomial variables), we leverage Step 6 (where \\(T_i\\) is negative binomial) and Step 9 (the expectation target) by approximating each \\(T_i\\) with a continuous-time analog \\(\\tau_i\\) from a Poisson process. This approximation replaces the discrete trial structure with a continuous flow, enabling calculus-based methods for the maximum expectation.",
                "direct_dependent_steps": [
                    6,
                    9
                ],
                "node": "We approximate each $T_i$ by its continuous-time analog $\\tau_i$ from a Poisson process."
            },
            {
                "step_id": 11,
                "edge": "Based on Step 10's introduction of the continuous-time analog, we set the hit rate for each integer \\(i\\) to 1 in its Poisson process. This choice ensures the relative rates match the discrete uniform selection: since there are 10 integers (Step 1), and each has equal selection probability (Step 2), assigning rate 1 per integer maintains the proportional likelihood of hits across integers in continuous time.",
                "direct_dependent_steps": [
                    10
                ],
                "node": "In continuous time, integer $i$ receives hits at rate $1$ in a Poisson process."
            },
            {
                "step_id": 12,
                "edge": "Combining Step 10 (the continuous-time approximation) and Step 11 (hits at rate 1 per integer), we identify \\(\\tau_i = \\inf\\{t : N_i(t) = i\\}\\) as the time until the \\(i\\)-th hit for integer \\(i\\) in a Poisson process with rate 1. This waiting time for \\(i\\) events in a rate-1 Poisson process follows a Gamma distribution with shape parameter \\(i\\) and rate parameter 1, a standard result in stochastic processes.",
                "direct_dependent_steps": [
                    10,
                    11
                ],
                "node": "Hence $\\tau_i=\\inf\\{t:N_i(t)=i\\}$ has a Gamma distribution with shape $i$ and rate $1$."
            },
            {
                "step_id": 13,
                "edge": "Extending Step 12 (which defines each \\(\\tau_i\\) as the continuous completion time for integer \\(i\\)), we define the overall continuous finish time \\(\\tau = \\max(\\tau_1, \\dots, \\tau_{10})\\). This parallels the discrete stopping time \\(X = \\max(T_1, \\dots, T_{10})\\) from Step 8, providing a continuous analog for the maximum completion time across all integers.",
                "direct_dependent_steps": [
                    12
                ],
                "node": "The continuous finish time is $\\tau=\\max(\\tau_1,\\dots,\\tau_{10})$."
            },
            {
                "step_id": 14,
                "edge": "For the non-negative continuous random variable \\(\\tau\\) defined in Step 13, we apply the fundamental expectation identity \\(E[\\tau] = \\int_0^\\infty P(\\tau > t)  dt\\). This representation is advantageous because it transforms the expectation computation into an integral of survival probabilities, which are often easier to handle for maxima of random variables.",
                "direct_dependent_steps": [
                    13
                ],
                "node": "The expectation $E[\\tau]=\\int_0^\\infty P(\\tau>t)\\,dt$ by the identity for non-negative variables."
            },
            {
                "step_id": 15,
                "edge": "Using Step 11 (which establishes independent Poisson processes for each integer) and Step 13 (where \\(\\tau = \\max(\\tau_1, \\dots, \\tau_{10})\\)), we compute \\(P(\\tau \\leq t) = P(\\text{all } \\tau_i \\leq t)\\). By independence of the processes, this equals the product \\(\\prod_{i=1}^{10} P(\\tau_i \\leq t)\\), as the maximum is at most \\(t\\) only if every individual \\(\\tau_i\\) is at most \\(t\\).",
                "direct_dependent_steps": [
                    11,
                    13
                ],
                "node": "We have $P(\\tau\\le t)=\\prod_{i=1}^{10}P(\\tau_i\\le t)$ by independence of the Poisson processes."
            },
            {
                "step_id": 16,
                "edge": "From Step 12, where \\(\\tau_i\\) follows a Gamma\\((i, 1)\\) distribution, we recall that the cumulative distribution function for this Gamma distribution is equivalent to the complementary cumulative distribution of a Poisson random variable. Specifically, \\(P(\\tau_i \\leq t) = 1 - \\sum_{k=0}^{i-1} e^{-t} \\frac{t^k}{k!}\\), which gives the probability of at least \\(i\\) events in a rate-1 Poisson process by time \\(t\\).",
                "direct_dependent_steps": [
                    12
                ],
                "node": "We have $P(\\tau_i\\le t)=1-\\sum_{k=0}^{i-1}e^{-t}\\frac{t^k}{k!}$ from the Gamma cdf."
            },
            {
                "step_id": 17,
                "edge": "Integrating Step 14 (\\(E[\\tau] = \\int_0^\\infty P(\\tau > t)  dt = \\int_0^\\infty [1 - P(\\tau \\leq t)]  dt\\)), Step 15 (\\(P(\\tau \\leq t) = \\prod_{i=1}^{10} P(\\tau_i \\leq t)\\)), and Step 16 (\\(P(\\tau_i \\leq t) = 1 - \\sum_{k=0}^{i-1} e^{-t} \\frac{t^k}{k!}\\)), we substitute to obtain the explicit integral expression \\(E[\\tau] = \\int_0^\\infty \\left[1 - \\prod_{i=1}^{10} \\left(1 - \\sum_{k=0}^{i-1} e^{-t} \\frac{t^k}{k!}\\right)\\right]  dt\\). This consolidates all components into a computable form.",
                "direct_dependent_steps": [
                    14,
                    15,
                    16
                ],
                "node": "Therefore $E[\\tau]=\\int_0^\\infty\\bigl[1-\\prod_{i=1}^{10}\\bigl(1-\\sum_{k=0}^{i-1}e^{-t}\\frac{t^k}{k!}\\bigr)\\bigr]\\,dt$."
            },
            {
                "step_id": 18,
                "edge": "We calculate the total hit rate in continuous time by combining Step 1 (10 integers on the board) and Step 11 (each integer receives hits at rate 1). Summing the individual rates gives a total rate of \\(10 \\times 1 = 10\\) hits per unit time, which matches the discrete process where exactly one hit occurs per day (Step 2).",
                "direct_dependent_steps": [
                    1,
                    11
                ],
                "node": "The total rate of hits across all integers in continuous time is $10$ per unit time."
            },
            {
                "step_id": 19,
                "edge": "From Step 18, which establishes a total hit rate of 10 per unit time, we determine the scaling between continuous time and discrete days. Since the discrete process has one hit per day and the continuous process has 10 hits per unit time, one unit of continuous time corresponds to 10 discrete days. Thus, discrete days = 10 × continuous time units.",
                "direct_dependent_steps": [
                    18
                ],
                "node": "Discrete days correspond to continuous time multiplied by $10$ days per unit time."
            },
            {
                "step_id": 20,
                "edge": "We numerically evaluate the integral expression for \\(E[\\tau]\\) derived in Step 17. Computational methods (e.g., numerical integration) yield the approximate value \\(12.075280458176904\\). As a sanity check, this value is consistent with expectations: for a single integer starting at 10, the expected time would be 100 days in discrete terms (since \\(E[T_i] = i / p = 10 / 0.1 = 100\\)), but the maximum across multiple integers should be less than the sum and greater than the largest individual expectation, and \\(12.075\\) in continuous units (scaled to 120.75 days) aligns with this intuition.",
                "direct_dependent_steps": [
                    17
                ],
                "node": "Numerical evaluation of the integral yields $E[\\tau]\\approx12.075280458176904$."
            },
            {
                "step_id": 21,
                "edge": "Applying Step 19 (discrete days = 10 × continuous time) to Step 20's result (\\(E[\\tau] \\approx 12.075280458176904\\)), we scale the continuous expectation to discrete days: \\(E[X] \\approx 10 \\times 12.075280458176904 = 120.75280458176904\\). This scaling accounts for the rate difference identified in Step 18, converting the continuous-time approximation back to the original discrete-day framework.",
                "direct_dependent_steps": [
                    19,
                    20
                ],
                "node": "Therefore $E[X]\\approx10\\times12.075280458176904=120.75280458176904$."
            },
            {
                "step_id": 22,
                "edge": "Following Step 21, which provides the numerical approximation \\(E[X] \\approx 120.75280458176904\\) for the expected number of days, we present this value as the final answer. The boxed format adheres to the problem's requirement for the solution representation, and the precision matches the computational output from the integral evaluation.",
                "direct_dependent_steps": [
                    21
                ],
                "node": "The final answer is \\boxed{120.75280458176904}."
            }
        ]
    }
]
