|
Research Area: Machine Learning |
Profile
Lihong Li is a Research Scientist in the machine learning group at Yahoo! Research. He obtained a PhD degree in Computer Science from the Rutgers University, advised by Prof. Michael Littman. Before that, he obtained a MSc degree from the University of Alberta and BE from the Tsinghua University. His main research interests are in reinforcement learning and machine learning, including: exploration/exploitation tradeoff, value-function approximation, feature selection for reinforcement learning, online/bandit learning, computational learning theory, and decision-theoretic planning. In the summers of 2006-2008, he interned at Google, Yahoo! Research, and AT&T Shannon Labs.
Recent Publications, Projects and News
- An empirical evaluation of Thompson sampling Olivier Chapelle; Lihong Li, NIPS, 2011 [view abstract]
- Contextual bandits with linear payoff functions Wei Chu; Lihong Li; Lev Reyzin; Robert E. Schapire, AISTATS, 2011 [view abstract]
- Contextual Bandit Algorithms with Supervised Learning Guarantees Alina Beygelzimer; John Langford; Lihong Li; Lev Reyzin; Robert E. Schapire, AISTATS, 2011 [view abstract]
- Linear-time estimators for propensity scores Deepak Agarwal; Lihong Li; Alexander J. Smola, AISTATS, 2011 [view abstract]
- Unbiased online active learning in data streams Wei Chu; Martin Zinkevich; Lihong Li; Achint Thomas; Belle Tseng, KDD, 2011 [view abstract]
- Doubly robust policy evaluation and learning Miroslav Dudik; John Langford; Lihong Li, ICML, 2011 [view abstract]
- Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms Lihong Li; Wei Chu; John Langford; Xuanhui Wang, WSDM, 2011 [view abstract]
- Reducing reinforcement learning to KWIK online regression Lihong Li; Michael L. Littman, Annals of Mathematics and Artificial Intelligence, Springer, 2011, 3-4 [view abstract]
- Knows what it knows: A framework for self-aware learning Lihong Li; Michael L. Littman; Thomas J. Walsh; Alexander L. Strehl, Machine Learning Journal, 2011 [view abstract]
- Cloud control: Voluntary admission control for Intranet traffic management John Langford; Lihong Li; Preston McAfee; Kishore Papineni, Information Systems and E-Business Management, 2011 [view abstract]
- Online learning for recency search ranking using real-time user feedback Taesup Moon; Lihong Li; Wei Chu; Ciya Liao; Zhaohui Zheng; Yi Chang, CIKM, 2010 [view abstract]
- Learning from logged implicit exploration data Alexander L. Strehl; John Langford; Lihong Li; Sham M. Kakade, NIPS, 2010 [view abstract]
- Parallelized stochastic gradient descent Martin Zinkevich; Alexander J. Smola; Markus Weimer; Lihong Li, NIPS, 2010 [view abstract]
- Maintaining equilibria during exploration in sponsored search auctions John Langford; Lihong Li; Yevgeniy Vorobeychik; Jennifer Wortman, Algorithmica, Springer, 2010, 4 [view abstract]
- A contextual-bandit approach to personalized news article recommendation Lihong Li; Wei Chu; John Langford; Robert E. Schapire, WWW 2010: Proceedings of the 19th international conference on World Wide Web, 2010 [view abstract]
- The adaptive k-meteorologists problem and its application to structure learning and feature selection in reinforcement learning Carlos Diuk; Lihong Li; Bethany R. Leffler, ICML, 2009 [view abstract]
- Online exploration in least-squares policy iteration Lihong Li; Michael L. Littman; Christopher R. Mansley, AAMAS, 2009 [view abstract]
- A Bayesian sampling approach to exploration in reinforcement learning John Asmuth; Lihong Li; Michael L. Littman; Ali Nouri; David Wingate, UAI, 2009 [view abstract]
- Learning and planning in environments with delayed feedback Thomas J. Walsh; Ali Nouri; Lihong Li; Michael L. Littman, Autonomous Agents and Multi-Agent Systems, 2009, 1 [view abstract]
- Sparse online learning via truncated gradient John Langford; Lihong Li; Tong Zhang, Journal of Machine Learning Research, 2009 [view abstract]
- Provably efficient learning with typed parametric models Emma Brunskill; Bethany R. Leffler; Lihong Li; Michael L. Littman; Nichlos Roy, Journal of Machine Learning Research, 2009 [view abstract]
- Reinforcement learning in finite MDPs: PAC analysis Alexander L. Strehl; Lihong Li; Michael L. Littman, Journal of Machine Learning Research, 2009 [view abstract]
