{
       "Question number": "5",
       "Sub-Question number": "4a",
       "Question": "Given a distribution $P$ you can sample a training set $D$ and obtain a classifier $h$. Imagine you train $m$ such classifiers $h_{1}, \\ldots, h_{m}$ on $m$ data sets $D_{1}, \\ldots, D_{m}$, each drawn i.i.d. from the data distribution $P$. As you increase $m$ from $m=1$ to $m \\gg 0$, show how you can use these models to obtain a low variance classifier $\\hat{h}$.",
       "Solution": "You average them: $\\hat{h}=\\frac{1}{m} \\sum_{i=1}^{m} h_{i}$."
}