Table of ContentsChapter 8: Generalization and Function Approximation Value Prediction with FA Adapt Supervised Learning Algorithms Backups as Training Examples Any FA Method? Gradient Descent Methods Performance Measures Gradient Descent Gradient Descent Cont. Gradient Descent Cont. But We Donít have these Targets What about TD(l) Targets? On-Line Gradient-Descent TD(l) Linear Methods Nice Properties of Linear FA Methods Coarse Coding Learning and Coarse Coding Tile Coding Tile Coding Cont. Radial Basis Functions (RBFs) Can you beat the ìcurse of dimensionalityî? Control with FA GPI with Linear Gradient Descent Sarsa(l) GPI Linear Gradient Descent Watkinsí Q(l) Mountain-Car Task Mountain-Car Results Bairdís Counterexample Bairdís Counterexample Cont. Should We Bootstrap? Summary |
Author: Andy Barto
Email: barto@cs.umass.edu |