Publications

  • Alexandra Carpentier and Rémi Munos. "Bandit Theory Meets Compressed Sensing for High-Dimensional Stochastic Linear Bandit". Proceedings of the International Conference on Artificial Intelligence and Statistics (AISTATS-2012), April 2012.
  • Matthew Hoffman, Alessandro Lazaric, Mohammad Ghavamzadeh, & Rémi Munos. "Regularized Least Squares Temporal Difference Learning with Nested L2 and L1 Penalization". Ninth European Workshop on Reinforcement Learning (EWRL-2011), Athens, Greece, September 2011.
  • Mohammad Ghavamzadeh, Alessandro Lazaric, Rémi Munos, & Matthew Hoffman. "Finite-Sample Analysis of Lasso-TD". Proceedings of the Twenty-Eighth International Conference on Machine Learning (ICML-2011), pp. 1177-1184, Bellevue, WA, June 2011.
  • Aviv Tamar, Dotan Di Castro, and Ron Meir. "Integrating Partial Model Knowledge in Model Free Reinforcement Learning Algorithms". Proceedings of the Twenty-Eighth International Conference on Machine Learning (ICML-2011), Bellevue, WA, June 2011.
  • Mohammad Ghavamzadeh, Alessandro Lazaric, Odalric Maillard, & Rémi Munos. "LSTD with Random Projections". Accepted for Spotlight Presentation (73 out of 1219 submissions). Proceedings of the Twenty-Fourth Annual Conference on Advances in Neural Information Processing Systems (NIPS), pp. 721-729, 2010.
  • Odalric Maillard and Rémi Munos. "Scrambled Objects for Least-Squares Regression". Proceedings of the Twenty-Fourth Annual Conference on Advances in Neural Information Processing Systems (NIPS), pp. 1549-1557, 2010.
  • Dotan Di Castro and Shie Mannor. "Adaptive Bases for Reinforcement Learning". Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, pp. 312-327, 2010.
  • Dotan Di Castro and Shie Mannor. "Adaptive Bases for Q-learning". Proceedings of the Forty-Ninth IEEE Conference on Decision and Control (CDC), 2010.
  • Dotan Di Castro and Shie Mannor. "Tutor Learning Using Linear Constraints in Approximate Dynamic Programming". Proceedings of the Forty-Eighth Allerton Conference on Communication, Control, and Computing, 2010.
  • Odalric Maillard and Rémi Munos. "Compressed Least-Squares Regression". Proceedings of the Twenty-Third Annual Conference on Advances in Neural Information Processing Systems (NIPS-2009), pp. 1213-1221, 2009.
Comments