Publications

Found 11 results

Filters: author is Littman  [Clear All Filters]
2007
A Hierarchy of Prescriptive Goals for Multiagent Learning Zinkevich, M.; Greenwald, A.; Littman, M. , Artificial Intelligence, May, Volume 171, p.440--447, (2007)
Online Linear Regression and Its Application to Model-Based Reinforcement Learning Strehl, A.L.; Littman, M.L. , NIPS, (2007)
2006
A Hierarchical Approach to Efficient Reinforcement Learning in Deterministic Domains Diuk, C.; Littman, M.; Strehl, A. , Fifth International Conference on Autonomous Agents and Multiagent Systems (AAMAS-06), (2006)
Experience-efficient learning in associative bandit problems Strehl, A.L.; Mesterharm, C.; Littman, M.L.; Hirsh, H. , ICML-06: Proceedings of the 23rd international conference on Machine learning, p.889--896, (2006)
Incremental Model-based Learners With Formal Learning-Time Guarantees Strehl, A.L.; Li, L.; Littman, M.L. , UAI-06: Proceedings of the 22nd conference on Uncertainty in Artificial Intelligence, p.485--493, (2006)
PAC model-free reinforcement learning Strehl, A.L.; Li, L.; Wiewiora, E.; Langford, J.; Littman, M.L. , ICML-06: Proceedings of the 23rd international conference on Machine learning, p.881--888, (2006)
An Efficient Optimal-Equilibrium Algorithm for Two-Player Game Trees Littman, M.; Ravi, N.; Talwar, A.; Zinkevich, M. , Twenty-Second Conference on Uncertainty in Artificial Intelligence (UAI-06), (2006)
2005
A Theoretical Analysis of Model-Based Interval Estimation Strehl, A.L.; Littman, M.L. , Proceedings of the Twenty-second International Conference on Machine Learning (ICML-05), p.857--864, (2005)
Efficient Exploration With Latent Structure Leffler, B.; Littman, M.L.; Strehl, A.L.; Walsh, T. , Robotics Science and Systems, 2005, (2005)
Cyclic equilibria in markov games Zinkevich, M.; Greenwald, A.; Littman, M. , Neural Information Processing Systems, (2005)
2004
An Empirical Evaluation of Interval Estimation for Markov Decision Processes Strehl, A.L.; Littman, M.L. , The 16th IEEE International Conference on Tools with Artificial Intelligence (ICTAI-2004), p.128--135, (2004)