{
       "Question number": "3",
       "Sub-Question number": "1a.",
       "Question": "Suppose you are using a kernel SVM with the $\\mathrm{RBF}$ kernel $k(\\mathbf{x}, \\mathbf{z})=\\exp \\left(-\\frac{\\|\\mathbf{x}-\\mathbf{z}\\|_{2}^{2}}{\\sigma^{2}}\\right)$ to do classification. Recall that the kernel SVM is trained by solving the dual optimization problem: $$\n\\begin{aligned}\n\\min _{\\alpha_{1}, \\ldots, \\alpha_{n}} & \\frac{1}{2} \\sum_{i, j} \\alpha_{i} \\alpha_{j} y_{i} y_{j} \\mathbf{K}_{i j}-\\sum_{i} \\alpha_{i} \\\\\n\\text { s.t. } & 0 \\leq \\alpha_{i} \\leq C \\\\\n& \\sum_{i} \\alpha_{i} y_{i}=0\n\\end{aligned}\n$$ Assume you can either set $C$ and $\\sigma^{2}$ to a very large value $(\\gg 0)$ or a very small value $(\\epsilon)$. Provide a setting with high bias and one with high variance. Briefly explain your answers.",
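       "Solution": "High variance: small $\\sigma^{2}$ and large $C$. With a small $\\sigma^{2}$ the kernel value decays rapidly with distance, so $\\mathbf{K}$ is close to the identity and each training point influences only its immediate neighborhood; a large $C$ leaves the $\\alpha_{i}$ effectively unconstrained (little regularization), so the decision boundary can bend around individual training points and overfit. High bias: large $\\sigma^{2}$ and small $C$. With a large $\\sigma^{2}$ the kernel value is close to 1 for every pair of points, so the decision function is very smooth; a small $C$ forces every $\\alpha_{i}$ toward zero (heavy regularization), so the model underfits."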
}