L09: Reinforcement Learning II: Bellman Equations, Q-Learning

Lecture Goals

  • Be able to understand the difference between the Bellman Expectation Equation and the Bellman Optimality Equation
  • Be able to reason intuitively about the Q-Learning update rule
  • Be able to identify relationships between state value functions, state-action value functions and policies