Lecture 9 :: Advanced Prediction Models for Business Applications

Lecture 9

L09 : Reinforcement Learning II: Bellman Equations, Q Learning

Lecture Goals

Be able to understand the difference between Bellman Expectation Equation and Bellman Optimality Equation
Intuitive reasoning for the Q-Learning update rule
Be able to identify relationships between state value functions, state-action value functions and policies