Chapter 8: Generalization and Function Approximation

11/11/99


Click here to start


Table of Contents

Chapter 8: Generalization and Function Approximation

Value Prediction with FA

Adapt Supervised Learning Algorithms

Backups as Training Examples

Any FA Method?

Gradient Descent Methods

Performance Measures

Gradient Descent

Gradient Descent Cont.

Gradient Descent Cont.

But We Donít have these Targets

What about TD(l) Targets?

On-Line Gradient-Descent TD(l)

Linear Methods

Nice Properties of Linear FA Methods

Coarse Coding

Learning and Coarse Coding

Tile Coding

Tile Coding Cont.

Radial Basis Functions (RBFs)

Can you beat the ìcurse of dimensionalityî?

Control with FA

GPI with Linear Gradient Descent Sarsa(l)

GPI Linear Gradient Descent Watkinsí Q(l)

Mountain-Car Task

Mountain-Car Results

Bairdís Counterexample

Bairdís Counterexample Cont.

Should We Bootstrap?

Summary

Author: Andy Barto

Email: barto@cs.umass.edu

Download presentation source