{
       "Question number": "6",
       "Sub-Question number": "5",
       "Question": "ML practitioners tend to drop the learning rate during training. Explain why and what effect it has.",
       "Solution": "Starting out with a large learning rate has two advantages: 1. it keeps the optimizer from getting trapped in sharp local minima, because the weights \"jump around\" too much with each step to settle in a narrow basin; and 2. it moves the weights quickly \"downhill\" because each step is larger. Switching to a smaller learning rate later in training reduces this jumping around, so the network can converge to the local minimum closest to its current weight position instead of overshooting it."
}