CMPSCI 687 Supplementary Readings

Samuel, A. L., "Some Studies in Machine Learning Using the Game of Checkers", IBM Journal of Research and Development, vol. 3, pp. 211-229, 1959. Reprinted in E. A. Feigenbaum and J. Feldman, editors, Computers and Thought, McGraw-Hill, New York, pp. 71-105, 1963.
Minsky, M. L., "Steps Toward Artificial Intelligence", Proceedings of the Institute of Radio Engineers, vol. 49, pp. 8-30, 1961. Reprinted in E. A. Feigenbaum and J. Feldman, editors, Computers and Thought, McGraw-Hill, New York, pp. 406-450, 1963.
Importance sampling reading. This is from Doina Precup's thesis.
Schultz, W., Dayan, P., and Montague, R., "A Neural Substrate of Prediction and Reward", Science, Vol. 275, March 14, 1997.
Redish, A. D., "Addiction as a Computational Process Gone Awry", Science, Vol. 306, Dec. 10, 2004.
Sutton, R. S., Precup, D., and Singh, S., "Between MDPs and semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning", Artificial Intelligence, Vol. 112, pp. 181-211, 1999.
Barto, A. G., Singh, S., and Chentanez, N. "Intrinsically Motivated Learning of Hierarchical Collections of Skills", International Conference on Developmental Learning (ICDL), La Jolla, CA, 2004.
Cassandra, A. R., Kaelbling, L. P., and Littman, M. L. "Acting Optimally in Partially Observable Stochastic Domains", 12th AAAI, 1994.
Williams, R. J. "Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning", Machine Learning, Vol. 8, pp. 229-256, 1992.
Kohl, N. and Stone, P. "Policy Gradient Reinforcement Learning for Fast Quadrupedal Locomotion", Proceedings of the IEEE International Conference on Robotics and Automation, May 2004.
Grudic, G. Z., Kumar, V., and Ungar, L. "Using Policy Gradient Reinforcement Learning on Autonomous Controllers", Proceedings of the 2003 IEEE/RSJ Intl. Conference on Intelligent Robots and Systems, pp. 406-411, October 2003.
Taylor, M. E., Stone, P., and Liu, Y. "Value Functions for RL-Based Behavior Transfer: A Comparative Study", Proceedings of the Twentieth National Conference on Artificial Intelligence, July 2005.
Konidaris, G. D. and Barto, A. G. "Autonomous Shaping: Knowledge Transfer in Reinforcement Learning", Proceedings of the Twenty-Third International Conference on Machine Learning (ICML 2006), to appear.