- Samuel, A. L., "Some Studies in Machine Learning Using the Game of Checkers" IBM Journal on Research and Development, vol. 3, pp. 211-229, 1959. Reprinted in E. A. Feigenbaum and J. Feldman, editors, Computers and Thought, McGraw-Hill, New York, pp. 71-105,1963.
-
Minsky, M. L., "Steps Toward Artificial Intelligence", Proceedings of the Institute of Radio Engineers, vol. 49, pp. 8-30, 1961. Reprinted in E. A. Feigenbaum and J. Feldman, editors, Computers and Thought, McGraw-Hill, New York, pp. 406-450, 1963.
-
Importance sampling reading. This is from Doina Precup's thesis.
-
Schultz, W., Dayan, P., and Montague, R.,
"A Neural Substrate of Prediction and Reward", Science, Vol. 275, March 14, 1997.
-
Redish, A. D., "Addiction as a Computational Process Gone Awry" , Science, Vol. 306, Dec, 10, 2004.
-
Sutton, R. S., Precup, D., and Singh, S., "Between MDPs and semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning", Artificial Intelligence, Vol. 112, pp. 181-211, 1999.
-
Barto, A. G., Singh, S., and Chentanez, N. "Intrinsically Motivated Learning of Hierarchical Collections of Skills", International Conference on Developmental Learning (ICDL), LaJolla, CA, 2004.
-
Cassandra, A. R., Kaelbling, L. P., and Littman, M. L.
"Acting Optimally in Partially Observable Stochastic Domains", 12th AAAI, 1994.
-
Williams, R. J. "Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Leanring, Machine Learning, Vol. 8, pp. 229-256, 1992.
-
Kohl, N. and Stone, P. Policy Gradient Reinforcement Learning for Fast Quadrupedal Locomotion", Proceedings of the IEEE International Conference on Robotics and Automation, May 2004.
-
Grudic, G. Z., Kumar, V., and Ungar, L. "Using Policy Gradient Reinforcement Learning on Autonomous Controllers", Proceedings of the 2003 IEEE/RSJ Intl. Conference on Intelligent Robots and Systems, pp. 406-411, October 2003.
-
Taylor, M. E., Stone, P., and Liu, Y. "Value Functions for RL-Based Behavior Transfer: A Comparative Study", Proceedings of the Twentieth National Conference on Artificial Intelligence, July 2005.
-
Konidaris, G. D. and Barto, A. G. " Autonomous Shaping: Knowledge Transfer in Reinforcement Learning", Proceedings of the Twenty Third International Conference on Machine Learning (ICML 2006), to appear.