L09: Reinforcement Learning II: Bellman Equations, Q-Learning
Lecture Goals
- Be able to understand the difference between the Bellman Expectation Equation and the Bellman Optimality Equation
- Be able to explain the intuition behind the Q-Learning update rule
- Be able to identify the relationships between state value functions, state-action value functions, and policies
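
For orientation, the objects named in these goals can be sketched in their standard textbook forms. The notation here is an assumption and may differ from what the lecture uses: $\pi(a \mid s)$ is the policy, $p(s', r \mid s, a)$ the environment dynamics, $\gamma$ the discount factor, $\alpha$ the learning rate, $v$ a state value function, and $q$ (or $Q$) a state-action value function.

```latex
% Bellman Expectation Equation: value of state s under a fixed policy \pi
v_\pi(s) = \sum_{a} \pi(a \mid s) \sum_{s', r} p(s', r \mid s, a)\,\bigl[ r + \gamma\, v_\pi(s') \bigr]

% Bellman Optimality Equation: the average over \pi is replaced by a max over actions
v_*(s) = \max_{a} \sum_{s', r} p(s', r \mid s, a)\,\bigl[ r + \gamma\, v_*(s') \bigr]

% Relationship between state values, state-action values, and policies
v_\pi(s) = \sum_{a} \pi(a \mid s)\, q_\pi(s, a), \qquad v_*(s) = \max_{a} q_*(s, a)

% Q-Learning update after observing the transition (s, a, r, s')
Q(s, a) \leftarrow Q(s, a) + \alpha \,\bigl[ r + \gamma \max_{a'} Q(s', a') - Q(s, a) \bigr]
```

Read side by side, the expectation equation averages over the actions the policy would take, while the optimality equation takes the best action. The Q-Learning update uses the same max, but applies it to a single sampled transition instead of summing over the full dynamics.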