CMPSCI 687

Reinforcement Learning

Spring 2006


Course Information

This course will provide a comprehensive introduction to reinforcement learning, a powerful approach to learning from interaction to achieve goals in stochastic and incompletely-known environments. Reinforcement learning has adapted key ideas from machine learning, operations research, control theory, psychology, and neuroscience to produce some strikingly successful engineering applications. The focus is on algorithms for learning what actions to take, and when to take them, so as to optimize long-term performance. This may involve sacrificing immediate reward to obtain greater reward in the long-term or just to obtain more information about the environment. The course will cover Markov decision processes, dynamic programming, temporal-difference learning, Monte Carlo reinforcement learning methods, eligibility traces, the role of function approximation, and the integration of learning and planning. We will also introduce policy gradient methods, methods for partially observable problems, hierarchical learning, and connections to the brain's reward systems.

Lecture: Tuesday & Thursday 9:30-10:45, CMPS 150

Prerequisites: Interest in learning approaches to artificial intelligence; basic probability theory; computer programming ability. If you have passed Math 515 or equivalent, you have enough basic probility theory. If you have passed a programming course at the level of CMPSCI 287, you have enough programming ability; knowledge of C++ is recommended. Please talk with the instructor if you want to take the course but have doubts about your qualifications.

Credit: 3 units

Instructor: Andrew Barto, barto [at] cs [dot] umass [dot] edu, 545-2109

Teaching assistant: Andrew Stout, [andrew's last name]@cs.umass.edu

Required book: We will be using a textbook by R. S. Sutton and A. G. Barto: Reinforcement Learning: An Introduction. Cambridge, MA: MIT Press, 1998. Clicking on the title will take you to a full description of the book, from which you can obtain a detailed look at what will be covered in this course. I did not order the book through the Textbook Annex or a local bookstore. The full text of the book is on the web, so you don't really need to buy the book (though the book would be more convenient!)

The Plan: The plan is to cover the complete contents of the book, plus supplementary readings that will be made available when needed. Some of these will be assigned. The course schedule will indicate when you should be finished reading each of those. Others are suggested readings. See the detailed schedule by clicking here or the Schedule link at the bottom of the page. The schedule is subject to revision!

Required work:

Grading:

Related Courses Elsewhere

Another Really Useful Link