[
    {
        "problem_id": 877,
        "domain": [
            "Mathematics -> Algebra -> Intermediate Algebra -> Other",
            "Mathematics -> Applied Mathematics -> Statistics -> Mathematical Statistics"
        ],
        "difficulty": 4.5,
        "problem_text": "Find the largest real number $c$ such that $$\\sum_{i=1}^{101} x_{i}^{2} \\geq c M^{2}$$ whenever $x_{1}, \\ldots, x_{101}$ are real numbers such that $x_{1}+\\cdots+x_{101}=0$ and $M$ is the median of $x_{1}, \\ldots, x_{101}$.",
        "sample_id": 1,
        "final_answer": "The final answer is $\\boxed{\\frac{5151}{50}}$",
        "steps": [
            {
                "step_id": 1,
                "edge": "We sort the variables in non-decreasing order to leverage the definition of the median and simplify structural analysis. Sorting preserves both the sum and sum of squares (as these are symmetric in the variables), so it does not affect the problem's conditions or the inequality we aim to establish. This reorganization is a standard preparatory step when medians are involved, as it explicitly positions the median value within the sequence.",
                "direct_dependent_steps": null,
                "node": "We assume without loss of generality that $x_1\\le x_2\\le\\cdots\\le x_{101}$."
            },
            {
                "step_id": 2,
                "edge": "Given the sorted sequence from Step 1, the median of an odd-sized list (101 elements) is unambiguously the middle element. By definition, the 51st term in a 101-element sorted list serves as the median, so we identify $M = x_{51}$. This precise indexing is critical for later partitioning the sequence into elements below/above the median.",
                "direct_dependent_steps": [
                    1
                ],
                "node": "We define the median $M$ as $x_{51}$."
            },
            {
                "step_id": 3,
                "edge": "Building on the median definition in Step 2, we restrict to $M \\geq 0$ without loss of generality. The inequality depends on $M^2$, which is invariant under sign reversal of all variables. If $M$ were negative, replacing $x_i$ with $-x_i$ would yield a sequence with median $-M > 0$ and identical sum of squares, preserving the ratio $\\sum x_i^2 / M^2$. This symmetry argument justifies focusing on non-negative medians to simplify extremal analysis.",
                "direct_dependent_steps": [
                    2
                ],
                "node": "We note that $M\\ge0$."
            },
            {
                "step_id": 4,
                "edge": "This step states the problem's given constraint that the sum of all variables is zero. As a fundamental condition in the problem statement (not derived from prior steps), it anchors subsequent algebraic manipulations and ensures the sequence cannot be uniformly positive or negative, creating the tension between the median and the sum of squares that defines the optimization challenge.",
                "direct_dependent_steps": null,
                "node": "We note that $\\sum_{i=1}^{101}x_i=0$."
            },
            {
                "step_id": 5,
                "edge": "We recall the basic property that $f(t) = t^2$ is convex on $\\mathbb{R}$, verified by its non-negative second derivative ($f''(t) = 2 > 0$) or the inequality $(\\lambda a + (1-\\lambda)b)^2 \\leq \\lambda a^2 + (1-\\lambda)b^2$ for $\\lambda \\in [0,1]$. Convexity is essential for justifying extremal configurations via Jensen's inequality or smoothing arguments in later steps.",
                "direct_dependent_steps": null,
                "node": "We note that the function $f(t)=t^2$ is convex on the real numbers."
            },
            {
                "step_id": 6,
                "edge": "Combining the sorted order (Step 1), median definition (Step 2), non-negative median assumption (Step 3), zero-sum constraint (Step 4), and convexity of $t^2$ (Step 5), we apply convex smoothing to identify the extremal configuration. For fixed median $M = x_{51}$, the sum of squares is minimized when variables are equal within the lower and upper partitions (due to convexity penalizing spread). Setting $x_1 = \\cdots = x_{50} = a$ and $x_{51} = \\cdots = x_{101} = b$ with $b = M \\geq 0$, the zero-sum constraint $50a + 51b = 0$ implies $a = -51b/50$. Parameterizing $b = 50r$ (for later scaling convenience) yields $a = -51r$, producing the symmetric extremal sequence.",
                "direct_dependent_steps": [
                    1,
                    2,
                    3,
                    4,
                    5
                ],
                "node": "We apply convex smoothing to conclude that for some $r\\ge0$, $x_1=\\cdots=x_{50}=-51r$ and $x_{51}=\\cdots=x_{101}=50r$."
            },
            {
                "step_id": 7,
                "edge": "We observe that both $\\sum x_i^2$ and $M^2$ are homogeneous of degree 2 in the variables: scaling all $x_i$ by $k$ scales $\\sum x_i^2$ by $k^2$ and $M^2$ by $k^2$, leaving their ratio unchanged. This homogeneity is a foundational property of quadratic forms and enables normalization to simplify computations without affecting the critical ratio.",
                "direct_dependent_steps": null,
                "node": "We note that the inequality is homogeneous of degree two in the variables $x_i$."
            },
            {
                "step_id": 8,
                "edge": "Leveraging the homogeneity established in Step 7, we note that scaling the variables arbitrarily preserves the ratio $\\sum x_i^2 / M^2$. This allows us to choose a convenient scaling factor (e.g., setting $r=1$ or normalizing $M$) in subsequent steps, reducing computational complexity while maintaining the problem's integrity.",
                "direct_dependent_steps": [
                    7
                ],
                "node": "We note that we may thus scale the variables arbitrarily while preserving the ratio of $\\sum_i x_i^2$ to $M^2$."
            },
            {
                "step_id": 9,
                "edge": "From the median definition in Step 2 ($M = x_{51}$) and the extremal configuration in Step 6 ($x_{51} = \\cdots = x_{101} = 50r$), we directly substitute to find $M = 50r$. This explicit parameterization links the median to the scaling variable $r$, which will cancel out in the final ratio.",
                "direct_dependent_steps": [
                    2,
                    6
                ],
                "node": "We calculate that $M=x_{51}=50r$."
            },
            {
                "step_id": 10,
                "edge": "Using the extremal sequence from Step 6 (50 variables at $-51r$ and 51 variables at $50r$), we compute the sum of squares. Since squaring eliminates signs, this becomes $50 \\cdot (-51r)^2 + 51 \\cdot (50r)^2 = 50(51r)^2 + 51(50r)^2$. This decomposition cleanly separates contributions from the lower and upper partitions of the sorted sequence.",
                "direct_dependent_steps": [
                    6
                ],
                "node": "We write $\\sum_{i=1}^{101}x_i^2=50(51r)^2+51(50r)^2$."
            },
            {
                "step_id": 11,
                "edge": "We expand $(51r)^2$ as $51^2 r^2 = 2601r^2$. Verifying the arithmetic: $50^2 = 2500$ and $51^2 = (50+1)^2 = 2500 + 100 + 1 = 2601$, confirming the calculation. This intermediate step prepares the expression for coefficient multiplication in Step 12.",
                "direct_dependent_steps": [
                    10
                ],
                "node": "We compute that $(51r)^2=2601r^2$."
            },
            {
                "step_id": 12,
                "edge": "Computing $50 \\cdot 2601$, we break it into $50 \\cdot 2600 + 50 \\cdot 1 = 130{,}000 + 50 = 130{,}050$. A quick sanity check: $50 \\cdot 2600 = 130{,}000$ is correct, and adding the remaining $50$ yields the precise value needed for the sum of squares contribution from the lower partition.",
                "direct_dependent_steps": [
                    11
                ],
                "node": "We compute that $50\\cdot2601=130050$."
            },
            {
                "step_id": 13,
                "edge": "Similarly, $(50r)^2 = 50^2 r^2 = 2500r^2$. This straightforward computation leverages the known square $50^2 = 2500$, which is foundational for the upper partition's contribution to the sum of squares.",
                "direct_dependent_steps": [
                    10
                ],
                "node": "We compute that $(50r)^2=2500r^2$."
            },
            {
                "step_id": 14,
                "edge": "Calculating $51 \\cdot 2500$, we compute $50 \\cdot 2500 + 1 \\cdot 2500 = 125{,}000 + 2{,}500 = 127{,}500$. Cross-verifying: $51 \\times 2500$ is equivalent to $2500 \\times 50 + 2500 = 127{,}500$, ensuring accuracy for the upper partition's sum of squares component.",
                "direct_dependent_steps": [
                    13
                ],
                "node": "We compute that $51\\cdot2500=127500$."
            },
            {
                "step_id": 15,
                "edge": "Summing the partition contributions from Steps 12 ($130{,}050$) and 14 ($127{,}500$), we compute $130{,}050 + 127{,}500 = 257{,}550$. A rapid check: $130{,}000 + 127{,}500 = 257{,}500$, and adding the remaining $50$ confirms $257{,}550$, which aggregates the total coefficient for $r^2$ in the sum of squares.",
                "direct_dependent_steps": [
                    12,
                    14
                ],
                "node": "We compute that $130050+127500=257550$."
            },
            {
                "step_id": 16,
                "edge": "Combining the result from Step 15, we express the total sum of squares as $257{,}550r^2$. This consolidates all prior computational steps into a single compact expression, ready for ratio formation with $M^2$.",
                "direct_dependent_steps": [
                    15
                ],
                "node": "We conclude that $\\sum_{i=1}^{101}x_i^2=257550r^2$."
            },
            {
                "step_id": 17,
                "edge": "Using $M = 50r$ from Step 9 and $(50r)^2 = 2500r^2$ from Step 13, we compute $M^2 = 2500r^2$. This provides the denominator for the critical ratio, maintaining consistency with the scaling parameter $r$.",
                "direct_dependent_steps": [
                    9,
                    13
                ],
                "node": "We compute that $M^2=(50r)^2=2500r^2$."
            },
            {
                "step_id": 18,
                "edge": "Applying the scale invariance noted in Step 8, we form the ratio $\\sum x_i^2 / M^2$ using the expressions from Steps 16 ($257{,}550r^2$) and 17 ($2500r^2$). The $r^2$ terms will cancel, yielding a constant ratio independent of scaling—exactly the homogeneous property leveraged since Step 7.",
                "direct_dependent_steps": [
                    8,
                    16,
                    17
                ],
                "node": "We form the ratio $\\frac{\\sum_{i=1}^{101}x_i^2}{M^2}=\\frac{257550r^2}{2500r^2}$."
            },
            {
                "step_id": 19,
                "edge": "Canceling $r^2$ (valid for $r \\neq 0$, as $r=0$ implies all $x_i=0$—a trivial case where the inequality holds for any $c$), we simplify the ratio to $257{,}550 / 2500$. This reduction isolates the numerical constant defining the minimal ratio across all non-trivial sequences.",
                "direct_dependent_steps": [
                    18
                ],
                "node": "We cancel the factor $r^2$ to obtain $\\frac{257550}{2500}$."
            },
            {
                "step_id": 20,
                "edge": "Simplifying $257{,}550 / 2500$ by dividing numerator and denominator by their greatest common divisor (50), we compute $257{,}550 \\div 50 = 5{,}151$ and $2{,}500 \\div 50 = 50$. Verification: $50 \\times 5{,}151 = 257{,}550$ and $50 \\times 50 = 2{,}500$, confirming the simplified fraction $5{,}151/50$ is exact.",
                "direct_dependent_steps": [
                    19
                ],
                "node": "We simplify $\\frac{257550}{2500}$ by dividing numerator and denominator by $50$ to get $\\frac{5151}{50}$."
            },
            {
                "step_id": 21,
                "edge": "The ratio $5{,}151/50$ achieved in Step 20 is attainable by the extremal sequence constructed in Step 6. By convexity (Step 5) and the smoothing argument, this represents the minimal possible value of $\\sum x_i^2 / M^2$ over all valid sequences. Thus, it is the largest constant $c$ satisfying the inequality universally, as any larger $c$ would violate the inequality for this specific sequence.",
                "direct_dependent_steps": [
                    20
                ],
                "node": "We conclude that the largest constant $c$ satisfying the inequality is $\\frac{5151}{50}$."
            }
        ]
    }
]
