Skip to main content
This chapter covers reinforcement learning algorithms for learning optimal behavior through interaction with an environment.

Topics

RL Overview

Introduction to reinforcement learning.

Model-Based RL

Learning and using environment models.

Generalized Policy Iteration

The GPI framework for control.

Monte Carlo Control

Learning from complete episodes.

SARSA

On-policy temporal difference control.

REINFORCE

Policy gradient methods.