[
    {
        "problem_id": 3062,
        "domain": [
            "Mathematics -> Applied Mathematics -> Statistics -> Probability -> Other"
        ],
        "difficulty": 5.25,
        "problem_text": "If you flip a fair coin 1000 times, what is the expected value of the product of the number of heads and the number of tails?",
        "sample_id": 1,
        "final_answer": "249750",
        "steps": [
            {
                "step_id": 1,
                "edge": "We define n as the total number of coin flips, which is given in the problem statement as 1000. This establishes a foundational constant for all subsequent calculations and aligns with the problem's context of flipping a coin 1000 times.",
                "direct_dependent_steps": null,
                "node": "Let n = 1000 denote the number of coin flips."
            },
            {
                "step_id": 2,
                "edge": "We introduce H as the random variable representing the count of heads in the sequence of flips. This is a standard modeling approach in probability for binomial experiments, where we track the outcome of interest (heads) across independent trials.",
                "direct_dependent_steps": null,
                "node": "Let H be the random variable for the number of heads."
            },
            {
                "step_id": 3,
                "edge": "We define T as the random variable for the count of tails. This complements the definition of H in Step 2, ensuring we have complete coverage of all possible outcomes in the coin-flipping experiment.",
                "direct_dependent_steps": null,
                "node": "Let T be the random variable for the number of tails."
            },
            {
                "step_id": 4,
                "edge": "Every possible outcome must satisfy H + T = n because each flip results in exactly one head or tail, and n represents the total flips from Step 1. This conservation law follows directly from the definitions of H (Step 2) and T (Step 3), as there are no other outcomes in a binary coin flip scenario.",
                "direct_dependent_steps": [
                    1,
                    2,
                    3
                ],
                "node": "Every outcome satisfies H + T = n."
            },
            {
                "step_id": 5,
                "edge": "Rearranging the equation H + T = n from Step 4 algebraically isolates T, yielding T = n - H. This substitution is valid because the relationship in Step 4 holds for all outcomes, and it expresses tails in terms of heads and total flips, simplifying the product we need to analyze.",
                "direct_dependent_steps": [
                    4
                ],
                "node": "Rearranging gives T = n - H."
            },
            {
                "step_id": 6,
                "edge": "The problem explicitly asks for the expected value of the product of heads and tails, so we formally state our target as E[H T]. This step captures the core objective of the problem without relying on prior computational steps.",
                "direct_dependent_steps": null,
                "node": "We seek the expectation E[H T]."
            },
            {
                "step_id": 7,
                "edge": "We substitute T = n - H from Step 5 into the product H T referenced in Step 6, resulting in H(n - H). This replacement is mathematically exact and transforms the product of two random variables into a single-variable expression, which is easier to handle for expectation calculations.",
                "direct_dependent_steps": [
                    5,
                    6
                ],
                "node": "Substituting T = n - H yields H T = H(n - H)."
            },
            {
                "step_id": 8,
                "edge": "Expanding the substituted expression H(n - H) from Step 7 using distributive algebra yields n H - H^2. This simplification breaks the product into a linear term and a quadratic term, preparing the expression for application of expectation properties.",
                "direct_dependent_steps": [
                    7
                ],
                "node": "Expanding yields H(n - H) = n H - H^2."
            },
            {
                "step_id": 9,
                "edge": "Applying the linearity of expectation to the expanded form n H - H^2 from Step 8, we decompose E[n H - H^2] into n E[H] - E[H^2]. Linearity holds here because expectation is linear for any random variables, even when they are dependent, and n is a constant.",
                "direct_dependent_steps": [
                    8
                ],
                "node": "By linearity of expectation, E[n H - H^2] = n E[H] - E[H^2]."
            },
            {
                "step_id": 10,
                "edge": "Since Step 9 established that E[n H - H^2] equals n E[H] - E[H^2], and Step 8 showed that n H - H^2 is equivalent to H(n - H), we directly equate E[H(n - H)] to n E[H] - E[H^2]. This bridges the substituted product form to the decomposed expectation expression.",
                "direct_dependent_steps": [
                    9
                ],
                "node": "Since H(n - H) = n H - H^2, we have E[H(n - H)] = n E[H] - E[H^2]."
            },
            {
                "step_id": 11,
                "edge": "For a fair coin, the number of heads follows a binomial distribution with parameters n and p = 0.5. The expected value of a binomial random variable is n p, so E[H] = n/2. This is a standard result from probability theory for symmetric Bernoulli trials.",
                "direct_dependent_steps": null,
                "node": "For a fair coin, the expected value of H is E[H] = n/2."
            },
            {
                "step_id": 12,
                "edge": "Substituting n = 1000 from Step 1 into the expectation formula E[H] = n/2 from Step 11 gives E[H] = 1000/2 = 500. This arithmetic is straightforward: dividing 1000 by 2 yields 500, which matches the intuitive expectation for a fair coin over many flips.",
                "direct_dependent_steps": [
                    1,
                    11
                ],
                "node": "Substituting n = 1000 gives E[H] = 1000/2 = 500."
            },
            {
                "step_id": 13,
                "edge": "The variance of a binomial random variable with p = 0.5 is n p (1 - p) = n * 0.5 * 0.5 = n/4. This is a fundamental property derived from the variance formula for Bernoulli trials, applicable here due to the fair coin assumption.",
                "direct_dependent_steps": null,
                "node": "For a fair coin, the variance of H is Var(H) = n/4."
            },
            {
                "step_id": 14,
                "edge": "Using n = 1000 from Step 1 in the variance formula Var(H) = n/4 from Step 13 gives Var(H) = 1000/4 = 250. The division is exact: 1000 divided by 4 equals 250, which is consistent with the variance scaling linearly with n for fixed p.",
                "direct_dependent_steps": [
                    1,
                    13
                ],
                "node": "Substituting n = 1000 gives Var(H) = 1000/4 = 250."
            },
            {
                "step_id": 15,
                "edge": "By the standard definition of variance for any random variable, Var(H) = E[H^2] - (E[H])^2. This identity relates the second moment to the variance and the square of the mean, serving as a key tool for computing E[H^2] when variance is known.",
                "direct_dependent_steps": null,
                "node": "By definition, Var(H) = E[H^2] - (E[H])^2."
            },
            {
                "step_id": 16,
                "edge": "Rearranging the variance identity from Step 15 algebraically isolates E[H^2], yielding E[H^2] = Var(H) + (E[H])^2. This rearrangement is necessary because we will compute E[H^2] using known values of variance and expectation.",
                "direct_dependent_steps": [
                    15
                ],
                "node": "Rearranging yields E[H^2] = Var(H) + (E[H])^2."
            },
            {
                "step_id": 17,
                "edge": "We substitute the specific values from prior steps into the rearranged formula: Var(H) = 250 from Step 14, E[H] = 500 from Step 12, and the formula E[H^2] = Var(H) + (E[H])^2 from Step 16. This gives E[H^2] = 250 + 500^2, combining all required components for the second moment calculation.",
                "direct_dependent_steps": [
                    12,
                    14,
                    16
                ],
                "node": "Substituting Var(H) = 250 and E[H] = 500 yields E[H^2] = 250 + 500^2."
            },
            {
                "step_id": 18,
                "edge": "Computing 500 squared from Step 17: 500 * 500 = 250000. This is a basic arithmetic operation where multiplying 500 by itself yields 250,000, and a quick sanity check confirms that 5 * 5 = 25 with four trailing zeros (since 500 = 5 * 10^2, so (5*10^2)^2 = 25*10^4 = 250,000).",
                "direct_dependent_steps": [
                    17
                ],
                "node": "Computing 500^2 = 250000."
            },
            {
                "step_id": 19,
                "edge": "Adding the variance value 250 from Step 14 to the squared mean 250000 from Step 18 gives 250 + 250000 = 250250. The addition is precise: 250,000 plus 250 equals 250,250, which is E[H^2] as established in Step 17.",
                "direct_dependent_steps": [
                    14,
                    18
                ],
                "node": "Adding yields 250 + 250000 = 250250."
            },
            {
                "step_id": 20,
                "edge": "We now substitute all computed values into the expectation expression from Step 10: E[H(n - H)] = n E[H] - E[H^2]. Using n = 1000 and E[H] = 500 from Step 12, and E[H^2] = 250250 from Step 19, this becomes 1000*500 - 250250. This step assembles the final arithmetic required to evaluate the original expectation.",
                "direct_dependent_steps": [
                    10,
                    12,
                    19
                ],
                "node": "Substituting into E[H(n - H)] = n E[H] - E[H^2] gives E[H(n - H)] = 1000*500 - 250250."
            },
            {
                "step_id": 21,
                "edge": "Computing the product 1000 * 500 from Step 20: 1000 * 500 = 500,000. This is a simple multiplication where 10^3 * 5 * 10^2 = 5 * 10^5 = 500,000, and a quick verification shows that 1000 * 500 is indeed half a million.",
                "direct_dependent_steps": [
                    20
                ],
                "node": "Computing 1000*500 = 500000."
            },
            {
                "step_id": 22,
                "edge": "Subtracting E[H^2] = 250250 from Step 19 from the product 500,000 computed in Step 21 yields 500,000 - 250,250 = 249,750. Performing the subtraction: 500,000 minus 250,000 is 250,000, then subtracting an additional 250 gives 249,750, which is the numerical value of the expectation.",
                "direct_dependent_steps": [
                    21,
                    19
                ],
                "node": "Subtracting yields 500000 - 250250 = 249750."
            },
            {
                "step_id": 23,
                "edge": "The result from Step 22, 249750, is the expected value of the product of heads and tails as required by the problem. We box this final answer to clearly indicate the solution to the original query about 1000 coin flips.",
                "direct_dependent_steps": [
                    22
                ],
                "node": "The final answer is \\boxed{249750}."
            }
        ]
    }
]
