{
       "Question number": "6",
       "Sub-Question number": "a",
       "Question": "What do we mean when we say that Q-learning and SARSA learning are \"model-free\" reinforcement learning methods?",
       "Solution": "Neither method learns $r(s,a)$ and $p(s'|s,a)$. They do not learn to predict the reward from an action or the next state distribution."
}