[
    {
        "problem_id": 850,
        "domain": [
            "Mathematics -> Applied Mathematics -> Probability -> Other"
        ],
        "difficulty": 5.25,
        "problem_text": "The spikiness of a sequence $a_{1}, a_{2}, \\ldots, a_{n}$ of at least two real numbers is the sum $\\sum_{i=1}^{n-1}\\left|a_{i+1}-a_{i}\\right|$. Suppose $x_{1}, x_{2}, \\ldots, x_{9}$ are chosen uniformly and randomly from the interval $[0,1]$. Let $M$ be the largest possible value of the spikiness of a permutation of $x_{1}, x_{2}, \\ldots, x_{9}$. Compute the expected value of $M$.",
        "sample_id": 1,
        "final_answer": "\\boxed{\\tfrac{79}{20}}",
        "steps": [
            {
                "step_id": 1,
                "edge": "We state the problem's given definition of spikiness as the sum of absolute differences between consecutive terms in a sequence. This foundational definition establishes the metric we will maximize over permutations and serves as the basis for all subsequent spikiness calculations.",
                "direct_dependent_steps": null,
                "node": "The spikiness of a sequence $a_1,\\dots,a_n$ is defined as $\\sum_{i=1}^{n-1}|a_{i+1}-a_i|$."
            },
            {
                "step_id": 2,
                "edge": "We note the problem's specification that the nine random variables are independently and uniformly distributed over [0,1]. This independence and uniformity are critical for later applying order statistic properties and expectation calculations without correlation concerns.",
                "direct_dependent_steps": null,
                "node": "The random variables $x_1,\\dots,x_9$ are chosen independently and uniformly from the interval $[0,1]$."
            },
            {
                "step_id": 3,
                "edge": "Building on Step 1's spikiness definition and Step 2's random variable setup, we formally define M as the maximum spikiness achievable through any permutation of the nine values. This clarifies our optimization objective: finding the permutation that maximizes the sum of absolute consecutive differences.",
                "direct_dependent_steps": [
                    1,
                    2
                ],
                "node": "Let $M$ be the maximum spikiness over all permutations of $x_1,\\dots,x_9$."
            },
            {
                "step_id": 4,
                "edge": "Using Step 2's uniform random variables, we sort them into non-decreasing order to obtain the order statistics y₁ ≤ y₂ ≤ ⋯ ≤ y₉. This sorted arrangement is essential because optimal spikiness permutations depend on the relative ordering of values rather than their original indices.",
                "direct_dependent_steps": [
                    2
                ],
                "node": "Sort the values as $y_1\\le y_2\\le\\dots\\le y_9$."
            },
            {
                "step_id": 5,
                "edge": "We introduce the standard combinatorial concept of a peak—a sequence element strictly greater than both adjacent neighbors—as background knowledge. This definition is necessary for characterizing the structure of maximum-spikiness permutations where interior points alternate between peaks and valleys.",
                "direct_dependent_steps": null,
                "node": "We call an element of a sequence a peak if it is greater than each of its neighbors."
            },
            {
                "step_id": 6,
                "edge": "We define a valley as an element strictly less than both neighbors, complementing Step 5's peak definition. Together these concepts form the alternating pattern required for maximum spikiness, as non-alternating sequences would waste potential differences between consecutive elements.",
                "direct_dependent_steps": null,
                "node": "We call an element of a sequence a valley if it is less than each of its neighbors."
            },
            {
                "step_id": 7,
                "edge": "Combining Step 3's definition of M with Step 5's peak and Step 6's valley concepts, we recognize that any permutation achieving maximum spikiness must have every interior element as either a peak or valley. If an element were neither, rearranging neighbors could increase the sum of absolute differences, contradicting optimality.",
                "direct_dependent_steps": [
                    3,
                    5,
                    6
                ],
                "node": "In a permutation achieving $M$, every element is either a peak or a valley."
            },
            {
                "step_id": 8,
                "edge": "From Step 7's requirement that all elements are peaks or valleys, we deduce that for an odd-length sequence (n=9), both endpoints must share the same type (both peaks or both valleys). With nine positions, the alternating pattern forces endpoints to align: positions 1 and 9 would both be odd-indexed in the peak-valley alternation.",
                "direct_dependent_steps": [
                    7
                ],
                "node": "Since $9$ is odd, the endpoints of the optimal permutation are either both peaks or both valleys."
            },
            {
                "step_id": 9,
                "edge": "Using Step 4's sorted order and Step 8's endpoint constraint (both peaks), we construct the specific arrangement (y₆,y₁,y₇,y₂,y₈,y₃,y₉,y₄,y₅). This zigzag pattern places low values (y₁,y₂,y₃,y₄) between high values (y₅-y₉) to maximize consecutive differences, with y₅ at the end as a peak since y₄ < y₅.",
                "direct_dependent_steps": [
                    4,
                    8
                ],
                "node": "Consider the arrangement $(y_6,y_1,y_7,y_2,y_8,y_3,y_9,y_4,y_5)$ which has both endpoints as peaks."
            },
            {
                "step_id": 10,
                "edge": "Applying Step 1's spikiness definition to Step 9's arrangement, we expand the sum into eight absolute difference terms. Each term corresponds to consecutive pairs in the permutation: |y₁-y₆|, |y₇-y₁|, etc., forming the complete expression for this candidate permutation's spikiness.",
                "direct_dependent_steps": [
                    1,
                    9
                ],
                "node": "The spikiness of that arrangement equals $|y_1-y_6|+|y_7-y_1|+|y_2-y_7|+|y_8-y_2|+|y_3-y_8|+|y_9-y_3|+|y_4-y_9|+|y_5-y_4|$."
            },
            {
                "step_id": 11,
                "edge": "Leveraging Step 4's sorted order (y₁ ≤ ⋯ ≤ y₉) and Step 10's absolute value expression, we simplify |y₁-y₆| to y₆-y₁ since y₆ ≥ y₁. This removal of absolute value signs is valid for all terms due to the deliberate ordering in the permutation relative to the sorted sequence.",
                "direct_dependent_steps": [
                    4,
                    10
                ],
                "node": "For sorted $y_i$ we have $|y_1-y_6|=y_6-y_1$."
            },
            {
                "step_id": 12,
                "edge": "Using Step 4's sorted sequence and Step 10's expression, we resolve |y₇-y₁| as y₇-y₁ because y₇ ≥ y₁. The pattern continues: high-indexed yᵢ appear before low-indexed yⱼ in the permutation, ensuring all absolute values simplify to positive differences.",
                "direct_dependent_steps": [
                    4,
                    10
                ],
                "node": "For sorted $y_i$ we have $|y_7-y_1|=y_7-y_1$."
            },
            {
                "step_id": 13,
                "edge": "With Step 4's ordering and Step 10's term, |y₂-y₇| becomes y₇-y₂ since y₇ ≥ y₂. This follows the consistent pattern where the second element in each absolute value pair has a higher index in the sorted sequence than the first.",
                "direct_dependent_steps": [
                    4,
                    10
                ],
                "node": "For sorted $y_i$ we have $|y_2-y_7|=y_7-y_2$."
            },
            {
                "step_id": 14,
                "edge": "Given Step 4's sorted yᵢ and Step 10's expression, |y₈-y₂| simplifies to y₈-y₂ as y₈ ≥ y₂. Each absolute value term systematically converts to a positive difference by leveraging the known ordering from the sorted sequence.",
                "direct_dependent_steps": [
                    4,
                    10
                ],
                "node": "For sorted $y_i$ we have $|y_8-y_2|=y_8-y_2$."
            },
            {
                "step_id": 15,
                "edge": "Using Step 4's non-decreasing order and Step 10's term, |y₃-y₈| equals y₈-y₃ because y₈ ≥ y₃. This maintains the pattern where higher-indexed order statistics minus lower-indexed ones appear consistently across all terms.",
                "direct_dependent_steps": [
                    4,
                    10
                ],
                "node": "For sorted $y_i$ we have $|y_3-y_8|=y_8-y_3$."
            },
            {
                "step_id": 16,
                "edge": "From Step 4's sorted sequence and Step 10's expression, |y₉-y₃| simplifies to y₉-y₃ since y₉ ≥ y₃. The structure confirms that in this permutation, every consecutive pair spans a significant range in the sorted sequence.",
                "direct_dependent_steps": [
                    4,
                    10
                ],
                "node": "For sorted $y_i$ we have $|y_9-y_3|=y_9-y_3$."
            },
            {
                "step_id": 17,
                "edge": "Applying Step 4's ordering to Step 10's term, |y₄-y₉| becomes y₉-y₄ as y₉ ≥ y₄. This is the penultimate term in the spikiness sum, continuing the established simplification pattern for absolute values.",
                "direct_dependent_steps": [
                    4,
                    10
                ],
                "node": "For sorted $y_i$ we have $|y_4-y_9|=y_9-y_4$."
            },
            {
                "step_id": 18,
                "edge": "Using Step 4's sorted sequence and Step 10's expression, |y₅-y₄| resolves to y₅-y₄ because y₅ ≥ y₄. This final term completes the set of eight differences, all now expressed without absolute values due to the sorted order.",
                "direct_dependent_steps": [
                    4,
                    10
                ],
                "node": "For sorted $y_i$ we have $|y_5-y_4|=y_5-y_4$."
            },
            {
                "step_id": 19,
                "edge": "Summing Step 11 through Step 18's simplified expressions, we combine all positive terms (y₆, y₇, y₇, y₈, y₈, y₉, y₉, y₅) and all negative terms (y₁, y₁, y₂, y₂, y₃, y₃, y₄, y₄). This regrouping separates the spikiness sum S into a difference of two sums: one containing higher-indexed order statistics and one containing lower-indexed ones.",
                "direct_dependent_steps": [
                    11,
                    12,
                    13,
                    14,
                    15,
                    16,
                    17,
                    18
                ],
                "node": "Summing these eight equalities yields $S=(y_6+y_7+y_7+y_8+y_8+y_9+y_9+y_5)-(y_1+y_1+y_2+y_2+y_3+y_3+y_4+y_4)$."
            },
            {
                "step_id": 20,
                "edge": "From Step 19's summed expression, we collect coefficients for each yᵢ: y₁ appears twice negatively, y₂ twice negatively, y₃ twice negatively, y₄ twice negatively, y₅ once positively, y₆ once positively, y₇ twice positively, y₈ twice positively, y₉ twice positively. This yields the explicit coefficient form S = -2y₁ -2y₂ -2y₃ -2y₄ + y₅ + y₆ + 2y₇ + 2y₈ + 2y₉.",
                "direct_dependent_steps": [
                    19
                ],
                "node": "Collecting like terms yields $S=-2y_1-2y_2-2y_3-2y_4+y_5+y_6+2y_7+2y_8+2y_9$."
            },
            {
                "step_id": 21,
                "edge": "Based on Step 20's coefficient assignment, we identify the pattern for the peaks-at-both-ends case: coefficients follow (-2,-2,-2,-2,1,1,2,2,2) corresponding to y₁ through y₉. This reflects how extreme values (low indices) contribute negatively while central-high values contribute positively to maximize differences.",
                "direct_dependent_steps": [
                    20
                ],
                "node": "Thus the peaks-at-both-ends case produces coefficient pattern $(-2,-2,-2,-2,1,1,2,2,2)$."
            },
            {
                "step_id": 22,
                "edge": "Extending Step 21's pattern reasoning to the valleys-at-both-ends scenario (the other case from Step 8), we derive the alternative coefficient pattern (-2,-2,-2,-1,-1,2,2,2,2). This symmetry arises because reversing the peak/valley roles shifts which order statistics get amplified in the sum.",
                "direct_dependent_steps": [
                    21
                ],
                "node": "Similarly the valleys-at-both-ends case produces coefficient pattern $(-2,-2,-2,-1,-1,2,2,2,2)$."
            },
            {
                "step_id": 23,
                "edge": "Combining Step 21's peaks-at-ends pattern and Step 22's valleys-at-ends pattern, we define a base coefficient pattern (-2,-2,-2,-1,0,1,2,2,2) that captures the common structure. The zero coefficient for y₅ indicates its neutral role in the base case before adjusting for endpoint effects.",
                "direct_dependent_steps": [
                    21,
                    22
                ],
                "node": "Define the base coefficient pattern as $(-2,-2,-2,-1,0,1,2,2,2)$."
            },
            {
                "step_id": 24,
                "edge": "Using Step 23's base pattern, we express M as the base sum plus a correction term max(y₆-y₅, y₅-y₄). This accounts for the endpoint ambiguity: the optimal choice between peaks-at-ends or valleys-at-ends depends on whether y₅ is closer to y₄ or y₆, resolved by taking the larger adjacent difference.",
                "direct_dependent_steps": [
                    23
                ],
                "node": "Then $M=\\sum_{i=1}^9 b_i\\,y_i+\\max(y_6-y_5,y_5-y_4)$ where $(b_1,\\dots,b_9)$ is the base pattern."
            },
            {
                "step_id": 25,
                "edge": "We recall the standard result for uniform order statistics: for n i.i.d. Uniform[0,1] variables, the expectation of the i-th smallest is E[yᵢ] = i/(n+1). Here n=9, so E[yᵢ] = i/10, a fundamental property we'll use for expectation calculations.",
                "direct_dependent_steps": null,
                "node": "The expected value of the $i$th order statistic is $E[y_i]=\\frac{i}{10}$."
            },
            {
                "step_id": 26,
                "edge": "Applying Step 23's base coefficients and Step 25's expectation formula, we compute E[∑bᵢyᵢ] by multiplying each coefficient by the corresponding E[yᵢ]. This gives -2(E[y₁]+E[y₂]+E[y₃]) -1·E[y₄] +1·E[y₆] +2(E[y₇]+E[y₈]+E[y₉]), substituting i/10 for each E[yᵢ].",
                "direct_dependent_steps": [
                    23,
                    25
                ],
                "node": "Hence $E[\\sum_{i=1}^9 b_i\\,y_i] = -\\frac{2(1+2+3)}{10} -\\frac{1\\cdot4}{10} +\\frac{1\\cdot6}{10} +2\\cdot\\frac{7+8+9}{10}$."
            },
            {
                "step_id": 27,
                "edge": "Evaluating Step 26's expression: -2(1/10+2/10+3/10) = -2(6/10) = -12/10; -1·(4/10) = -4/10; +1·(6/10) = 6/10; +2(7/10+8/10+9/10) = 2(24/10) = 48/10. Combining these yields the simplified fractional form -12/10 -4/10 +6/10 +48/10.",
                "direct_dependent_steps": [
                    26
                ],
                "node": "The arithmetic expression $-\\frac{2(1+2+3)}{10} -\\frac{1\\cdot4}{10} +\\frac{1\\cdot6}{10} +2\\cdot\\frac{7+8+9}{10}$ simplifies to $-\\frac{12}{10}-\\frac{4}{10}+\\frac{6}{10}+\\frac{48}{10}$."
            },
            {
                "step_id": 28,
                "edge": "Summing Step 27's fractions: (-12 -4 +6 +48)/10 = 38/10. Simplifying 38/10 by dividing numerator and denominator by 2 gives 19/5. Quick check: 38÷2=19, 10÷2=5, and 19/5=3.8 matches 38/10=3.8.",
                "direct_dependent_steps": [
                    27
                ],
                "node": "Summing $-\\frac{12}{10}-\\frac{4}{10}+\\frac{6}{10}+\\frac{48}{10}$ yields $\\frac{38}{10}=\\frac{19}{5}$."
            },
            {
                "step_id": 29,
                "edge": "Given Step 24's max term and fixed y₄, y₆, we consider y₅'s conditional uniform distribution on [y₄,y₆]. For a uniform random variable on [a,b], E[max(b-x,x-a)] = 3(b-a)/4, a standard result from order statistics or direct integration over the interval.",
                "direct_dependent_steps": [
                    24
                ],
                "node": "For fixed $y_4$ and $y_6$, the conditional distribution of $y_5$ on $[y_4,y_6]$ implies $E[\\max(y_6-y_5,y_5-y_4)\\mid y_4,y_6]=\\tfrac34(y_6-y_4)$."
            },
            {
                "step_id": 30,
                "edge": "Taking unconditional expectation of Step 29's conditional result, linearity of expectation allows us to write E[max(...)] = (3/4)E[y₆-y₄]. This removes the conditioning while preserving the proportional relationship between the expected max difference and the range y₆-y₄.",
                "direct_dependent_steps": [
                    29
                ],
                "node": "Taking expectation yields $E[\\max(y_6-y_5,y_5-y_4)] = \\tfrac34 E[y_6-y_4]$."
            },
            {
                "step_id": 31,
                "edge": "Using Step 25's expectation formula, E[y₆] = 6/10 and E[y₄] = 4/10. By linearity, E[y₆-y₄] = 6/10 - 4/10 = 2/10. Sanity check: the expected gap between 6th and 4th order statistics in 9 uniform points should be positive and small, which 0.2 confirms.",
                "direct_dependent_steps": [
                    25
                ],
                "node": "Linearity of expectation gives $E[y_6-y_4] = E[y_6] - E[y_4] = \\tfrac{6}{10}-\\tfrac{4}{10} = \\tfrac{2}{10}$."
            },
            {
                "step_id": 32,
                "edge": "Substituting Step 31's result into Step 30's expression: (3/4) × (2/10) = 6/40 = 3/20. Verification: 3/4 of 0.2 is 0.15, and 3/20=0.15, which is consistent with the conditional expectation derivation.",
                "direct_dependent_steps": [
                    30,
                    31
                ],
                "node": "Hence $E[\\max(y_6-y_5,y_5-y_4)] = \\tfrac34 \\cdot \\tfrac{2}{10} = \\tfrac{3}{20}$."
            },
            {
                "step_id": 33,
                "edge": "Combining Step 28's base expectation (19/5 = 76/20) and Step 32's correction term (3/20), we add 76/20 + 3/20 = 79/20. This final sum gives E[M] since M decomposes into the base sum and the max term as established in Step 24.",
                "direct_dependent_steps": [
                    28,
                    32
                ],
                "node": "Therefore $E[M] = \\tfrac{19}{5} + \\tfrac{3}{20} = \\tfrac{79}{20}$."
            },
            {
                "step_id": 34,
                "edge": "Based on Step 33's computed expectation E[M] = 79/20, we present the final answer in the required boxed format. This value represents the expected maximum spikiness over all permutations of nine uniform random variables on [0,1].",
                "direct_dependent_steps": [
                    33
                ],
                "node": "The final answer is $\\boxed{\\tfrac{79}{20}}$."
            }
        ]
    }
]
