{
       "Question number": "4",
       "Sub-Question number": "6",
       "Question": "You are now considering minimizing the absolute loss instead: $\\sum_{y \\in L}|y-t|$. Define $L_{\\leq}=\\{y \\in L: y \\leq t\\}$ and $L_{>}=\\{y \\in L: y>t\\}$. Prove that setting $t$ to the median of $L$ minimizes this loss. To simplify things you can assume you have an odd number of samples (i.e. $m=2 r+1$ ) and that all $y_{i} \\in L$ are distinct (i.e. $y_{i} \\neq y_{j}$ for any $y_{i}, y_{j} \\in L$ ). (Without loss of generality it is sufficient to show there is no better splitting value $t^{\\prime}$ that is larger than the median. )",
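       "Solution": "Write the loss as $\\ell(s)=\\sum_{y \\in L}|y-s|$ and let $t$ be the median of $L$. Since $m=2r+1$ and all $y_{i}$ are distinct, exactly $r$ samples lie strictly below $t$, one sample equals $t$, and $r$ samples lie strictly above $t$, so $|L_{\\leq}|=r+1$ and $|L_{>}|=r$. Take any $t^{\\prime}>t$ and set $\\delta=t^{\\prime}-t>0$. For every $y \\in L_{\\leq}$ we have $y \\leq t<t^{\\prime}$, so $|y-t^{\\prime}|=t^{\\prime}-y=(t-y)+\\delta=|y-t|+\\delta$. For every $y \\in L_{>}$ the triangle inequality gives $|y-t| \\leq |y-t^{\\prime}|+|t^{\\prime}-t|$, i.e. $|y-t^{\\prime}| \\geq |y-t|-\\delta$. Summing over both sets, $\\ell(t^{\\prime})-\\ell(t) \\geq (r+1)\\delta-r\\delta=\\delta>0$. Hence every $t^{\\prime}>t$ has strictly larger loss than the median. The case $t^{\\prime}<t$ is symmetric: with $\\delta=t-t^{\\prime}>0$, each of the $r+1$ samples with $y \\geq t$ has its term increase by exactly $\\delta$, while each of the $r$ samples with $y<t$ has its term decrease by at most $\\delta$, so again $\\ell(t^{\\prime})-\\ell(t) \\geq \\delta>0$. Therefore setting $t$ to the median of $L$ minimizes the absolute loss."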
}