{
       "Question number": "3",
       "Sub-Question number": "2",
       "Question": "For this question, you will find the following rules about recursively building kernels helpful. Given kernels $k_{1}(\\mathbf{x}, \\mathbf{z})$ and $k_{2}(\\mathbf{x}, \\mathbf{z})$, the following are well-defined kernels:\n\n$$\n\\begin{aligned}\nk(\\mathbf{x}, \\mathbf{z}) &=\\mathbf{x}^{\\top} A \\mathbf{z}, A \\succeq 0 \\\\\nk(\\mathbf{x}, \\mathbf{z}) &=c k_{1}(\\mathbf{x}, \\mathbf{z}) \\\\\nk(\\mathbf{x}, \\mathbf{z}) &=\\exp \\left(k_{1}(\\mathbf{x}, \\mathbf{z})\\right) \\\\\nk(\\mathbf{x}, \\mathbf{z}) &=f(\\mathbf{x}) k_{1}(\\mathbf{x}, \\mathbf{z}) f(\\mathbf{z}) \\\\\nk(\\mathbf{x}, \\mathbf{z}) &=k_{1}(\\mathbf{x}, \\mathbf{z})+k_{2}(\\mathbf{x}, \\mathbf{z})\n\\end{aligned}\n$$\n\nSuppose that $\\mathbf{x}, \\mathbf{z} \\in \\mathbb{R}^{2}$. Let $[\\mathbf{x}]_{1}$ and $[\\mathbf{x}]_{2}$ denote the first and second coordinates of $\\mathbf{x}$, respectively. Show that\n\n$$\nk(\\mathbf{x}, \\mathbf{z})=\\exp \\left(-\\frac{\\left\\|[\\mathbf{x}]_{1}-[\\mathbf{z}]_{1}\\right\\|_{2}^{2}}{\\sigma^{2}}\\right)+\\exp \\left(-\\frac{\\left\\|[\\mathbf{x}]_{2}-[\\mathbf{z}]_{2}\\right\\|_{2}^{2}}{\\sigma^{2}}\\right)\n$$\n\nis a kernel.\n\nHint: You may find the following two matrices helpful:\n\n$$\nA_{1}=\\left[\\begin{array}{ll}\n1 & 0 \\\\\n0 & 0\n\\end{array}\\right], \\quad A_{2}=\\left[\\begin{array}{ll}\n0 & 0 \\\\\n0 & 1\n\\end{array}\\right] .\n$$\n\nYou can assume they are positive semi-definite (i.e. $A_{1} \\succeq 0, A_{2} \\succeq 0$ ).",
       "Solution": "The trick here is to define\n\n$$\n\\begin{aligned}\nA_{1} &=\\left[\\begin{array}{ll}\n1 & 0 \\\\\n0 & 0\n\\end{array}\\right] \\\\\nA_{2} &=\\left[\\begin{array}{ll}\n0 & 0 \\\\\n0 & 1\n\\end{array}\\right]\n\\end{aligned}\n$$\n\nso that $\\mathbf{x}^{\\top} A_{1} \\mathbf{z}=[\\mathbf{x}]_{1}[\\mathbf{z}]_{1}$ and $\\mathbf{x}^{\\top} A_{2} \\mathbf{z}=[\\mathbf{x}]_{2}[\\mathbf{z}]_{2}$ (these matrices are psd with eigenvalues 0 and 1). The rest of the proof is identical to the proof for the RBF kernel in class:\n\n(a) $k_{1}(\\mathbf{x}, \\mathbf{z})=\\mathbf{x}^{\\top} A_{1} \\mathbf{z}=[\\mathbf{x}]_{1}[\\mathbf{z}]_{1}$, rule (1)\n\n(b) $k_{2}(\\mathbf{x}, \\mathbf{z})=\\frac{2}{\\sigma^{2}} k_{1}(\\mathbf{x}, \\mathbf{z})=\\frac{2}{\\sigma^{2}}[\\mathbf{x}]_{1}[\\mathbf{z}]_{1}$, rule $(2)$\n\n(c) $k_{3}(\\mathbf{x}, \\mathbf{z})=\\exp \\left(k_{2}(\\mathbf{x}, \\mathbf{z})\\right)=\\exp \\left(\\frac{2[\\mathbf{x}]_{1}[\\mathbf{z}]_{1}}{\\sigma^{2}}\\right), \\operatorname{rule}(3)$\n\n(d) $k_{4}(\\mathbf{x}, \\mathbf{z})=\\exp \\left(-\\frac{[\\mathbf{x}]_{1}[\\mathbf{x}]_{1}}{\\sigma^{2}}\\right) \\exp \\left(\\frac{2[\\mathbf{x}]_{1}[\\mathbf{z}]_{1}}{\\sigma^{2}}\\right) \\exp \\left(-\\frac{[\\mathbf{z}]_{1}[\\mathbf{z}]_{1}}{\\sigma^{2}}\\right)=\\exp \\left(-\\frac{\\left\\|[\\mathbf{x}]_{1}-[\\mathbf{z}]_{1}\\right\\|_{2}^{2}}{\\sigma^{2}}\\right)$, rule (4) with $f(\\mathbf{x})=\\exp \\left(-\\frac{[\\mathbf{x}]_{1}[\\mathbf{x}]_{1}}{\\sigma^{2}}\\right)$\n\n(e) Repeating the above with $A_{2}, k_{5}(\\mathbf{x}, \\mathbf{z})=\\exp \\left(-\\frac{\\left\\|[\\mathbf{x}]_{2}-[\\mathbf{z}]_{2}\\right\\|_{2}^{2}}{\\sigma^{2}}\\right)$ is a kernel.\n\n(f) Finally, $k_{4}+k_{5}$ is a kernel by rule (5)."
}