{
       "Question number": "6",
       "Sub-Question number": "1",
       "Question": "Name two reasons why Newton's Method typically is not used to train deep neural networks.",
       "Solution": "1. too many parameters to store the Hessian; 2 . it converges quickly to the closest local minima / saddle point and not to a wide minimum"
}