{
       "Question number": "4",
       "Sub-Question number": "5",
       "Question": "You are building a regression tree with the squared loss impurity. i.e. the labels in the leaf are $L=\\left\\{y_{1}, \\ldots, y_{m}\\right\\}$ and the loss, under prediction $t$, is $\\sum_{y \\in L}(y-t)^{2}$. Prove that the average label $t=\\frac{1}{m} \\sum_{i=1}^{m} y_{i}$ minimizes the loss at a leaf.",
       "Solution": "$$\nt=\\operatorname{argmin}_{t} \\sum_{i=1}^{n}\\left(t-y_{i}\\right)^{2}\n$$\n\nTaking the derivative and eq. with 0 :\n\n$$\n\\begin{aligned}\n2 \\sum_{i=1}^{n}\\left(t-y_{i}\\right) &=0 \\\\\n2 n t-2 \\sum_{i=1}^{n} y_{i} &=0 \\\\\nt &=\\frac{1}{n} \\sum_{i=1}^{n} y_{i}\n\\end{aligned}\n$$"
}