[
    {
        "problem_id": 2532,
        "domain": [
            "Mathematics -> Applied Mathematics -> Statistics -> Probability -> Other"
        ],
        "difficulty": 5.25,
        "problem_text": "Sherry and Val are playing a game. Sherry has a deck containing 2011 red cards and 2012 black cards, shuffled randomly. Sherry flips these cards over one at a time, and before she flips each card over, Val guesses whether it is red or black. If Val guesses correctly, she wins 1 dollar; otherwise, she loses 1 dollar. In addition, Val must guess red exactly 2011 times. If Val plays optimally, what is her expected profit from this game?",
        "sample_id": 1,
        "final_answer": "The final answer is \\boxed{\\frac{1}{4023}}",
        "steps": [
            {
                "step_id": 1,
                "edge": "We introduce the function $f(r,b,g)$ to model the expected profit systematically. This definition establishes the core framework for the recursive solution, where $r$ represents remaining red cards, $b$ remaining black cards, and $g$ the required number of remaining red guesses Val must make. Defining this state function is standard in dynamic programming approaches to sequential decision problems under uncertainty.",
                "direct_dependent_steps": null,
                "node": "Let $f(r,b,g)$ denote the expected profit when there are $r$ red cards, $b$ black cards, and Val must guess red exactly $g$ times."
            },
            {
                "step_id": 2,
                "edge": "Building on Step 1's state definition, we propose a closed-form expression for $f(r,b,g)$ as a linear function in $g$. The term $(b-r)$ captures the inherent imbalance between black and red cards, while the coefficient $\\frac{2(r-b)}{r+b}$ scales the impact of the required red guesses $g$. This structure suggests the expected profit depends linearly on both the card count difference and the constraint on red guesses, which we will verify through base cases and induction.",
                "direct_dependent_steps": [
                    1
                ],
                "node": "We claim that $f(r,b,g) = (b-r) + \\frac{2(r-b)}{r+b}\\,g$."
            },
            {
                "step_id": 3,
                "edge": "This step establishes a base case where no red guesses are permitted ($g=0$) with one red card remaining. Since Val must guess black every time (as $g=0$), this scenario tests the boundary condition of the guessing constraint. The problem statement implies Val's guessing strategy must adhere strictly to the required count of red guesses, making this a valid starting point for verification.",
                "direct_dependent_steps": null,
                "node": "In the case $(r,b,g)=(1,0,0)$ Val must guess black."
            },
            {
                "step_id": 4,
                "edge": "Given the initial deck composition in Step 3 ($r=1,b=0$), the single remaining card must be red. This follows directly from the problem's setup where $r$ and $b$ represent the exact counts of remaining cards, so when $b=0$, all remaining cards are red by definition.",
                "direct_dependent_steps": [
                    3
                ],
                "node": "The single card is red."
            },
            {
                "step_id": 5,
                "edge": "Combining Step 3 (Val must guess black) and Step 4 (card is red), Val's guess is incorrect. The problem states incorrect guesses yield a $-1$ dollar outcome, so this specific outcome is determined by the game's payoff rules and the observed card color.",
                "direct_dependent_steps": [
                    3,
                    4
                ],
                "node": "Val loses $1$ dollar."
            },
            {
                "step_id": 6,
                "edge": "From Step 5's outcome where Val loses $1$ dollar, we directly assign this value to $f(1,0,0)$. This formalizes the base case result, translating the concrete game outcome into the expected profit function defined in Step 1 for this specific state.",
                "direct_dependent_steps": [
                    5
                ],
                "node": "Therefore $f(1,0,0)=-1$."
            },
            {
                "step_id": 7,
                "edge": "Evaluating the proposed formula from Step 2 at $(1,0,0)$, the term $(b-r)$ computes to $0-1 = -1$. This isolates the first component of the formula, showing how the card count difference alone contributes to the expected profit when no red guesses are required.",
                "direct_dependent_steps": [
                    2
                ],
                "node": "For $(1,0,0)$ the formula gives $b-r=-1$."
            },
            {
                "step_id": 8,
                "edge": "Continuing the formula evaluation from Step 2 at $(1,0,0)$, the second term $\\frac{2(r-b)}{r+b}g$ becomes $\\frac{2(1-0)}{1+0} \\cdot 0 = 0$ since $g=0$. This demonstrates how the constraint on red guesses vanishes when no red guesses are mandated, as expected.",
                "direct_dependent_steps": [
                    2
                ],
                "node": "For $(1,0,0)$ the formula gives $\\frac{2(r-b)}{r+b}g=0$."
            },
            {
                "step_id": 9,
                "edge": "Combining Step 6's computed value $f(1,0,0)=-1$, Step 7's result of $-1$ for the first term, and Step 8's $0$ for the second term, the formula's total $-1+0=-1$ matches the actual expected profit. This successful verification for the state $(1,0,0)$ provides initial evidence supporting the proposed formula.",
                "direct_dependent_steps": [
                    6,
                    7,
                    8
                ],
                "node": "Hence the formula yields $-1$ for $f(1,0,0)$."
            },
            {
                "step_id": 10,
                "edge": "This base case considers $g=0$ with one black card remaining ($r=0,b=1$). Similar to Step 3, Val must guess black throughout due to the $g=0$ constraint, creating a symmetric test case to Step 3 but with the opposite card color.",
                "direct_dependent_steps": null,
                "node": "In the case $(r,b,g)=(0,1,0)$ Val must guess black."
            },
            {
                "step_id": 11,
                "edge": "Given the state in Step 10 ($r=0,b=1$), the single remaining card must be black by definition of $b$. This follows directly from the card count parameters, analogous to Step 4's reasoning for the red card case.",
                "direct_dependent_steps": [
                    10
                ],
                "node": "The single card is black."
            },
            {
                "step_id": 12,
                "edge": "Merging Step 10 (Val must guess black) and Step 11 (card is black), Val's guess is correct. The problem's payoff rules specify a $+1$ dollar reward for correct guesses, yielding this positive outcome.",
                "direct_dependent_steps": [
                    10,
                    11
                ],
                "node": "Val wins $1$ dollar."
            },
            {
                "step_id": 13,
                "edge": "From Step 12's $+1$ dollar outcome, we assign $f(0,1,0)=1$. This formalizes the expected profit for this base state, mirroring Step 6's approach but for the black card scenario.",
                "direct_dependent_steps": [
                    12
                ],
                "node": "Therefore $f(0,1,0)=1$."
            },
            {
                "step_id": 14,
                "edge": "Evaluating Step 2's formula at $(0,1,0)$, the $(b-r)$ term gives $1-0=1$. This captures the positive contribution from having more black cards when no red guesses are required, consistent with Step 7's negative contribution in the red-dominated case.",
                "direct_dependent_steps": [
                    2
                ],
                "node": "For $(0,1,0)$ the formula gives $b-r=1$."
            },
            {
                "step_id": 15,
                "edge": "For the state $(0,1,0)$ in Step 2's formula, the second term $\\frac{2(r-b)}{r+b}g$ computes to $\\frac{2(0-1)}{0+1} \\cdot 0 = 0$ because $g=0$. This confirms the red-guess constraint term vanishes identically when $g=0$, as seen previously in Step 8.",
                "direct_dependent_steps": [
                    2
                ],
                "node": "For $(0,1,0)$ the formula gives $\\frac{2(r-b)}{r+b}g=0$."
            },
            {
                "step_id": 16,
                "edge": "Integrating Step 13's value $f(0,1,0)=1$, Step 14's $1$ from the first term, and Step 15's $0$ from the second term, the formula's total $1+0=1$ matches the actual outcome. This successful verification for $(0,1,0)$ further validates the proposed formula's correctness in base cases.",
                "direct_dependent_steps": [
                    13,
                    14,
                    15
                ],
                "node": "Hence the formula yields $1$ for $f(0,1,0)$."
            },
            {
                "step_id": 17,
                "edge": "This case tests $g=1$ with one red card remaining ($r=1,b=0$). Since Val must make exactly one red guess and only one card remains, she must guess red here. This explores the scenario where the required red guesses match the remaining card count, differing from Step 3's $g=0$ constraint.",
                "direct_dependent_steps": null,
                "node": "In the case $(r,b,g)=(1,0,1)$ Val must guess red."
            },
            {
                "step_id": 18,
                "edge": "Given $r=1,b=0$ in Step 17, the sole remaining card is necessarily red, identical to Step 4's reasoning. The card color is determined exclusively by the remaining counts $r$ and $b$.",
                "direct_dependent_steps": [
                    17
                ],
                "node": "The single card is red."
            },
            {
                "step_id": 19,
                "edge": "Combining Step 17 (Val guesses red) and Step 18 (card is red), the guess is correct, yielding a $+1$ dollar outcome per the problem's payoff rules. This contrasts with Step 5 where an incorrect guess occurred due to mismatched constraints.",
                "direct_dependent_steps": [
                    17,
                    18
                ],
                "node": "Val wins $1$ dollar."
            },
            {
                "step_id": 20,
                "edge": "From Step 19's $+1$ dollar result, we set $f(1,0,1)=1$. This defines the expected profit for the state where Val is forced to guess red on the only remaining card, which happens to be red.",
                "direct_dependent_steps": [
                    19
                ],
                "node": "Therefore $f(1,0,1)=1$."
            },
            {
                "step_id": 21,
                "edge": "Applying Step 2's formula to $(1,0,1)$, the $(b-r)$ term gives $0-1=-1$, identical to Step 7's calculation. This isolates the card count difference component before incorporating the $g$-dependent term.",
                "direct_dependent_steps": [
                    2
                ],
                "node": "For $(1,0,1)$ the formula gives $b-r=-1$."
            },
            {
                "step_id": 22,
                "edge": "For $(1,0,1)$ in Step 2's formula, the second term computes as $\\frac{2(1-0)}{1+0} \\cdot 1 = 2$. This shows how the required red guess ($g=1$) amplifies the effect of the card imbalance, contrasting with Step 8 where $g=0$ nullified this term.",
                "direct_dependent_steps": [
                    2
                ],
                "node": "For $(1,0,1)$ the formula gives $\\frac{2(r-b)}{r+b}g=2$."
            },
            {
                "step_id": 23,
                "edge": "Synthesizing Step 20's $f(1,0,1)=1$, Step 21's $-1$ from the first term, and Step 22's $2$ from the second term, the formula yields $-1+2=1$, matching the actual outcome. This confirms the formula works even when $g$ equals the total remaining cards.",
                "direct_dependent_steps": [
                    20,
                    21,
                    22
                ],
                "node": "Hence the formula yields $1$ for $f(1,0,1)$."
            },
            {
                "step_id": 24,
                "edge": "This case examines $g=1$ with one black card remaining ($r=0,b=1$). Val must guess red once despite the black card, testing the penalty when constrained guesses conflict with actual card colors.",
                "direct_dependent_steps": null,
                "node": "In the case $(r,b,g)=(0,1,1)$ Val must guess red."
            },
            {
                "step_id": 25,
                "edge": "With $r=0,b=1$ in Step 24, the single card must be black, consistent with Step 11's reasoning. The card color is dictated by the remaining counts regardless of Val's guessing constraint.",
                "direct_dependent_steps": [
                    24
                ],
                "node": "The single card is black."
            },
            {
                "step_id": 26,
                "edge": "Merging Step 24 (Val guesses red) and Step 25 (card is black), the guess is incorrect, resulting in a $-1$ dollar outcome per the problem's rules. This represents the cost of being forced to guess against the actual card distribution.",
                "direct_dependent_steps": [
                    24,
                    25
                ],
                "node": "Val loses $1$ dollar."
            },
            {
                "step_id": 27,
                "edge": "From Step 26's $-1$ dollar outcome, we define $f(0,1,1)=-1$. This establishes the expected profit for the scenario where Val must guess red on a guaranteed black card.",
                "direct_dependent_steps": [
                    26
                ],
                "node": "Therefore $f(0,1,1)=-1$."
            },
            {
                "step_id": 28,
                "edge": "Evaluating Step 2's formula at $(0,1,1)$, the $(b-r)$ term gives $1-0=1$, matching Step 14's calculation. This captures the baseline advantage from having more black cards before applying the $g$-dependent adjustment.",
                "direct_dependent_steps": [
                    2
                ],
                "node": "For $(0,1,1)$ the formula gives $b-r=1$."
            },
            {
                "step_id": 29,
                "edge": "For $(0,1,1)$ in Step 2's formula, the second term computes as $\\frac{2(0-1)}{0+1} \\cdot 1 = -2$. This shows how the required red guess ($g=1$) inverts the card imbalance effect, contrasting with Step 22 where $r>b$ made this term positive.",
                "direct_dependent_steps": [
                    2
                ],
                "node": "For $(0,1,1)$ the formula gives $\\frac{2(r-b)}{r+b}g=-2$."
            },
            {
                "step_id": 30,
                "edge": "Combining Step 27's $f(0,1,1)=-1$, Step 28's $1$ from the first term, and Step 29's $-2$ from the second term, the formula gives $1 + (-2) = -1$, matching the actual outcome. This completes the verification of all four critical base cases.",
                "direct_dependent_steps": [
                    27,
                    28,
                    29
                ],
                "node": "Hence the formula yields $-1$ for $f(0,1,1)$."
            },
            {
                "step_id": 31,
                "edge": "Having verified the formula in Steps 9, 16, 23, and 30 for states with one card remaining ($r+b=1$), we initiate mathematical induction on the total remaining cards $r+b$. This induction framework will extend the formula's validity to all nonnegative integer states by assuming correctness for smaller decks and proving it for larger ones.",
                "direct_dependent_steps": [
                    9,
                    16,
                    23,
                    30
                ],
                "node": "We proceed by induction on $r+b$."
            },
            {
                "step_id": 32,
                "edge": "Following Step 31's induction setup, we formally assume the formula holds for all states with fewer than $r+b$ total cards. This inductive hypothesis provides the foundation to compute $f(r-1,b,g-1)$, $f(r,b-1,g-1)$, and similar subproblems in subsequent steps, enabling algebraic verification for the current state.",
                "direct_dependent_steps": [
                    31
                ],
                "node": "Assume for all nonnegative $r',b',g'$ with $r'+b'<r+b$ that $f(r',b',g')=(b'-r')+\\frac{2(r'-b')}{r'+b'}g'$."
            },
            {
                "step_id": 33,
                "edge": "When Val guesses red first (as one possible optimal strategy), we compute her expected profit by conditioning on the next card's color. With probability $\\frac{r}{r+b}$ the card is red (yielding $+1$ plus recursive profit $f(r-1,b,g-1)$), and with probability $\\frac{b}{r+b}$ it's black (yielding $-1$ plus $f(r,b-1,g-1)$). This law-of-total-expectation approach models the immediate reward plus future expected value.",
                "direct_dependent_steps": [
                    31
                ],
                "node": "If Val guesses red first, her expected profit is $\\frac{r}{r+b}\\bigl(1+f(r-1,b,g-1)\\bigr)+\\frac{b}{r+b}\\bigl(-1+f(r,b-1,g-1)\\bigr)$."
            },
            {
                "step_id": 34,
                "edge": "Substituting Step 32's inductive hypothesis into Step 33's expression, we replace each recursive $f$ term with its closed-form. Algebraic simplification—combining fractions, factoring terms, and canceling common denominators—reveals the expression reduces exactly to $(b-r) + \\frac{2(r-b)}{r+b}g$. This confirms that guessing red first achieves the proposed formula's value.",
                "direct_dependent_steps": [
                    32,
                    33
                ],
                "node": "Substituting the induction hypothesis and simplifying algebraically shows that this equals $(b-r)+\\frac{2(r-b)}{r+b}g$."
            },
            {
                "step_id": 35,
                "edge": "Similarly, when Val guesses black first, we apply the law of total expectation: probability $\\frac{r}{r+b}$ for a red card (yielding $-1$ plus $f(r-1,b,g)$) and $\\frac{b}{r+b}$ for black (yielding $+1$ plus $f(r,b-1,g)$). This constructs the alternative expected profit expression for the other possible initial guess.",
                "direct_dependent_steps": [
                    31
                ],
                "node": "If Val guesses black first, her expected profit is $\\frac{r}{r+b}\\bigl(-1+f(r-1,b,g)\\bigr)+\\frac{b}{r+b}\\bigl(1+f(r,b-1,g)\\bigr)$."
            },
            {
                "step_id": 36,
                "edge": "Using Step 32's inductive hypothesis in Step 35's expression, we substitute closed-forms for all recursive terms. Through parallel algebraic simplification—mirroring Step 34's process but with sign adjustments for the black guess—we again obtain $(b-r) + \\frac{2(r-b)}{r+b}g$. This shows both guessing strategies yield identical expected profit, confirming optimality is irrelevant here as both choices are equally good.",
                "direct_dependent_steps": [
                    32,
                    35
                ],
                "node": "Substituting the induction hypothesis and simplifying algebraically shows that this also equals $(b-r)+\\frac{2(r-b)}{r+b}g$."
            },
            {
                "step_id": 37,
                "edge": "Since Steps 34 and 36 prove that both possible initial guesses (red or black) lead to the proposed formula's value under the inductive hypothesis, and Steps 9, 16, 23, 30 verified all base cases, the principle of mathematical induction establishes that $f(r,b,g) = (b-r) + \\frac{2(r-b)}{r+b}g$ holds universally for all valid states $r,b,g$.",
                "direct_dependent_steps": [
                    34,
                    36
                ],
                "node": "Therefore $f(r,b,g)=(b-r)+\\frac{2(r-b)}{r+b}g$ for all $r,b,g$ by induction."
            },
            {
                "step_id": 38,
                "edge": "Applying Step 37's universal formula to the original problem's parameters ($r=2011$ red cards, $b=2012$ black cards, $g=2011$ required red guesses), we compute $(2012-2011) + \\frac{2(2011-2012)}{2011+2012} \\cdot 2011$. Simplifying the constants gives $1 + \\frac{-2 \\cdot 2011}{4023}$, where $4023 = 2011 + 2012$ is the total card count.",
                "direct_dependent_steps": [
                    37
                ],
                "node": "Substituting $r=2011$, $b=2012$, and $g=2011$ gives $f(2011,2012,2011)=1+\\frac{-2\\cdot2011}{4023}$."
            },
            {
                "step_id": 39,
                "edge": "Simplifying the expression $1 + \\frac{-4022}{4023}$ from Step 38: rewrite $1$ as $\\frac{4023}{4023}$, then compute $\\frac{4023 - 4022}{4023} = \\frac{1}{4023}$. Sanity check: $4022/4023 \\approx 0.99975$, so $1 - 0.99975 \\approx 0.00025$, and $1/4023 \\approx 0.0002485$ matches this magnitude, confirming the arithmetic is reasonable.",
                "direct_dependent_steps": [
                    38
                ],
                "node": "Simplifying $1+\\frac{-4022}{4023}$ yields $\\frac{1}{4023}$."
            }
        ]
    }
]
