{
       "Question number": "6",
       "Sub-Question number": "2",
       "Question": "Let the loss function be $\\ell(\\mathbf{w})=\\frac{1}{2 n} \\sum_{i=1}^{n}\\left(\\mathbf{x}_{i}^{\\top} \\mathbf{w}-y_{i}\\right)^{2}$. Write down the update for Stochastic Gradient Descent and Gradient Descent.",
       "Solution": "$G_{G D}=\\frac{1}{n} \\sum_{i=1}^{n}\\left(\\mathbf{x}_{i}^{\\top} \\mathbf{w}-\\right.$ $\\left.y_{i}\\right) \\mathbf{x}_{i}$ whereas the SGD update is $G_{S G D}=\\frac{1}{m} \\sum_{i=1}^{m}\\left(\\mathbf{x}_{s_{i}}^{\\top} \\mathbf{w}-y_{s_{i}}\\right) \\mathbf{x}_{s_{i}}$ for randomly picked $s_{i} \\in[n]$."
}