Topics
RL Overview
Introduction to reinforcement learning.
Model-Based RL
Learning and using environment models.
Generalized Policy Iteration
The GPI framework for control.
Monte Carlo Control
Learning from complete episodes.
SARSA
On-policy temporal difference control.
REINFORCE
Policy gradient methods.

