|
When: Monday 4:10-6:00 Where: 602 Northwest Corner Building Office Hours: Wednesday 4-5 on Zoom, or by appointment |
OverviewThe last decade has seen a surge of interest, both in research and in industry, in machine learning algorithms and systems for sequential decision making, in which an agent interacts with an unknown environment to accomplish some goal. In this course, we will investigate the algorithmic principles and theoretical foundations of sequential decision making, starting from the simplest problem settings and gradually increasing in complexity. We begin with multi-armed bandits, the simplest decision-making setting, and then add in the challenge of generalization through the frameworks of linear and contextual bandits. Finally, we consider general reinforcement learning setttings, both with and without function approximation. Requirements:
Prerequisites: This is an advanced, theory-heavy course. Exposure to algorithms and proofs at the level of an undergraduate algorithms course (CSOR 4231) is absolutely essential. A strong grasp of machine learning (COMS 4771), probability, and statistics are preferred. If you have taken, and were comfortable with, COMS 4773, then you should be well-prepared for this course. If you do not meet these requirements, please email the instructor. Readings and resourcesThere is no required textbook for this course. However you may find the following resources useful. They are listed roughly in order of relevance to the course.
Homeworks
ProjectsProject guidelines are available here. Important dates are:
Lecture Schedule
|