publications
google scholar → | dblp →
2023
A definition of continual reinforcement learning
David Abel, André Barreto, Benjamin Van Roy, Doina Precup, Hado Philip van Hasselt, Satinder Singh
Advances in Neural Information Processing Systems (NeurIPS)
[ pdf ] [ talk & poster ]
Deep reinforcement learning with plasticity injection
Evgenii Nikishin, Junhyuk Oh, Georg Ostrovski, Clare Lyle, Razvan Pascanu, Will Dabney, André Barreto
Advances in Neural Information Processing Systems (NeurIPS) - selected as a spotlight
[ pdf ] [ talk & poster ]
Simulation-based search
David Silver, André Barreto
International Congress of Mathematicians (ICM)
Temporal abstraction in reinforcement learning with the successor representation
Marlos C. Machado, André Barreto, Doina Precup, Michael Bowling
Journal of Machine Learning Research, v. 24 (1), pp. 3531–3599
[ pdf ]
On the convergence of bounded agents
David Abel, André Barreto, Hado van Hasselt, Benjamin Van Roy, Doina Precup, Satinder Singh
arXiv
[ pdf ]
2022
Generalised policy improvement with geometric policy composition
Shantanu Thakoor, Mark Rowland, Diana Borsa, Will Dabney, Rémi Munos, André Barreto
Proceedings of International Conference on Machine Learning (ICML) - selected as a spotlight
[ pdf ] [ talk & slides ] [ talk 2 ]
Approximate value equivalence
Christopher Grimm, André Barreto, Satinder Singh
Advances in Neural Information Processing Systems (NeurIPS)
[ pdf ] [ talk & slides ]
The phenomenon of policy churn
Tom Schaul, André Barreto, John Quan, Georg Ostrovski
Advances in Neural Information Processing Systems (NeurIPS)
[ pdf ] [ talk & slides ]
Model-value inconsistency as a signal for epistemic uncertainty
Angelos Filos, Eszter Vértes, Zita Marinho, Gregory Farquhar, Diana Borsa, Abram L. Friesen, Feryal Behbahani, Tom Schaul, André Barreto, Simon Osindero
Proceedings of International Conference on Machine Learning (ICML)
[ pdf ] [ talk & slides ]
Expressing non-Markov reward to a Markov agent
David Abel, André Barreto, Michael Bowling, Will Dabney, Steven Hansen, Anna Harutyunyan, Mark K. Ho, Ramana Kumar, Michael L. Littman, Doina Precup, Satinder Singh
The Multi-disciplinary Conference on Reinforcement Learning and Decision Making (RLDM)
[ pdf ]
Efficient information diffusion in time-varying graphs through deep reinforcement learning
Matheus R. F. Mendonça, André Barreto, Artur Ziviani
World Wide Web, volume 25, 2535–2560
[ more info ]
2021
Proper value equivalence
Christopher Grimm, André Barreto, Gregory Farquhar, David Silver, Satinder Singh
Advances in Neural Information Processing Systems (NeurIPS) - selected as a spotlight
[ pdf ] [ talk & slides ] [ talk ] [ more info]
Risk-aware transfer in reinforcement learning using successor features
Michael Gimelfarb, André Barreto, Scott Sanner, Chi-Guhn Lee
Advances in Neural Information Processing Systems (NeurIPS)
[ pdf ] [ talk & slides ] [ more info]
Discovering a set of policies for the worst case reward
Tom Zahavy, André Barreto, Daniel J Mankowitz, Shaobo Hou, Brendan O'Donoghue, Iurii Kemaev, Satinder Baveja Singh
International Conference on Learning Representations (ICLR) - selected as a spotlight
[ pdf ] [ talk & slides ] [ more info]
Temporally-extended ε-greedy exploration
Will Dabney, Georg Ostrovski, André Barreto
International Conference on Learning Representations (ICLR)
[ pdf ] [ talk ] [ more info ]
The value-improvement path: towards better representations for reinforcement learning
Will Dabney, André Barreto, Mark Rowland, Robert Dadashi, John Quan, Marc G. Bellemare, David Silver
AAAI Conference on Artificial Intelligence (AAAI)
Expected eligibility traces
Hado van Hasselt, Sephora Madjiheurem, Matteo Hessel, David Silver, André Barreto, Diana Borsa
AAAI Conference on Artificial Intelligence (AAAI) - selected as a distinguished paper
Discovering diverse nearly optimal policies with successor features
Tom Zahavy, Brendan O'Donoghue, André Barreto, Volodymyr Mnih, Sebastian Flennerhag, Satinder Singh
arXiv
[ pdf ]
Beyond fine-tuning: transferring behavior in reinforcement learning
Víctor Campos, Pablo Sprechmann, Steven Hansen, André Barreto, Steven Kapturowski, Alex Vitvitskyi, Adrià Puigdomènech Badia, Charles Blundell
arXiv
[ pdf ]
New machine learning and physics-based scoring functions for drug discovery
Isabella Guedes, André Barreto, Diogo Marinho, Eduardo Krempser, Mélaine Kuenemann, Olivier Sperandio, Laurent Dardenne, Maria A. Miteva
Scientific Reports
[ more info]
2020
Fast reinforcement learning with generalized policy updates
André Barreto, Shaobo Hou, Diana Borsa, David Silver, Doina Precup
Proceedings of the National Academy of Sciences
[ pdf ] [ blog post] [ talk ] [ talk 2 ] [ more info ]
The value equivalence principle for model-based reinforcement learning
Christopher Grimm, André Barreto, Satinder Singh, David Silver
Advances in Neural Information Processing Systems (NeurIPS)
On efficiency in hierarchical reinforcement learning
Zheng Wen, Doina Precup, Morteza Ibrahimi, André Barreto, Benjamin Van Roy, Satinder Singh
Advances in Neural Information Processing Systems (NeurIPS)
Fast task inference with variational intrinsic successor features
Steven Hansen, Will Dabney, André Barreto, David Warde-Farley, Tom Van de Wiele, Volodymyr Mnih
Proceedings of the International Conference on Learning Representations (ICLR)
Approximating network centrality measures using node embedding and machine learning
Matheus Mendonça, André Barreto, Artur Ziviani
IEEE Transactions on Network Science and Engineering
[ more info ]
Efficient information diffusion in time-varying graphs through deep reinforcement learning
Matheus Mendonça, André Barreto, Artur Ziviani
arXiv
[ pdf ]
Temporal difference uncertainties as a signal for exploration
Sebastian Flennerhag, Jane X. Wang, Pablo Sprechmann, Francesco Visin, Alexandre Galashov, Steven Kapturowski, Diana L. Borsa, Nicolas Heess, André Barreto, Razvan Pascanu
arXiv
[ pdf ]
2019
The option keyboard: combining skills in reinforcement learning
André Barreto, Diana Borsa, Shaobo Hou, Gheorghe Comanici, Eser Aygün, Philippe Hamel, Daniel Toyama, Jonathan J. Hunt, Shibl Mourad, David Silver, Doina Precup
Advances in Neural Information Processing Systems (NeurIPS)
[ pdf ] [ video ] [ slides ] [ poster ] [ more info ]
Universal successor features approximators
Diana Borsa, André Barreto, John Quan, Daniel Mankowitz, Rémi Munos, Hado van Hasselt, David Silver, Tom Schaul
Proceedings of the International Conference on Learning Representations (ICLR)
[ pdf ] [ video 1 ] [ video 2 ] [ more info ]
Composing entropic policies using divergence correction
Jonathan Hunt, André Barreto, Timothy Lillicrap, Nicolas Heess
Proceedings of International Conference on Machine Learning (ICML)
[ pdf ] [ videos ] [ more info ]
Adaptive temporal-difference learning for policy evaluation with per-state uncertainty estimates
Carlos Riquelme, Hugo Penedones, Damien Vincent, Hartmut Maennel, Timothy A. Mann, André Barreto, Sylvain Gelly, Gergely Neu
Advances in Neural Information Processing Systems (NeurIPS)
[ pdf ] [ poster ] [ more info ]
General non-linear Bellman equations
Hado van Hasselt, John Quan, Matteo Hessel, Zhongwen Xu, Diana Borsa, André Barreto
The Multi-disciplinary Conference on Reinforcement Learning and Decision Making (RLDM)
Graph-based skill acquisition for reinforcement learning
Matheus Mendonça, Artur Ziviani, André Barreto
ACM Computing Surveys, 52 (1)
[ more info ]
2018
Transfer in deep reinforcement learning using successor features and generalised policy improvement
André Barreto, Diana Borsa, John Quan, Tom Schaul, David Silver, Matteo Hessel, Daniel Mankowitz, Augustin Žídek, Rémi Munos
International Conference on Machine Learning (ICML)
[ pdf ] [ video ] [ slides 1 ] [ slides 2 ] [ poster ] [ more info ]
Fast deep reinforcement learning using online adjustments from the past
Steven Hansen, Alexander Pritzel, Pablo Sprechmann, André Barreto, Charles Blundell
Advances in Neural Information Processing Systems (NeurIPS)
Online TD(λ) for discrete-time Markov jump linear systems
Rafael Beirigo, Marcos Todorov, André Barreto
IEEE Annual Conference on Decision and Control (CDC)
[ more info ]
Temporal difference learning with neural networks - study of the leakage propagation problem
Hugo Penedones, Damien Vincent, Hartmut Maennel, Sylvain Gelly, Timothy Mann, André Barreto
arXiv
Unicorn: continual learning with a universal, off-Policy, agent
Daniel J Mankowitz, Augustin Žídek, André Barreto, Dan Horgan, Matteo Hessel, John Quan, Junhyuk Oh, Hado van Hasselt, David Silver, Tom Schaul
The Multi-disciplinary Conference on Reinforcement Learning and Decision Making (RLDM)
Abstract state transition graphs for model-based reinforcement learning
Matheus Mendonça, Artur Ziviani, André Barreto
Brazilian Conference on Intelligent Systems (BRACIS)
[ more info ]
2017
Successor features for transfer in reinforcement learning
André Barreto, Will Dabney, Rémi Munos, Jonathan J. Hunt, Tom Schaul, Hado van Hasselt, David Silver
Advances in Neural Information Processing Systems (NIPS) - selected as a spotlight
[ pdf ] [ slides ] [poster ] [ workshop version ] [ more info ]
The Predictron: end-to-end learning and planning
David Silver, Hado van Hasselt, Matteo Hessel, Tom Schaul, Arthur Guez, Tim Harley, Gabriel Dulac-Arnold, David P. Reichert, Neil C. Rabinowitz, André Barreto, Thomas Degris
Proceedings of the International Conference on Machine Learning (ICML)
[ pdf ] [ video ] [ more info ]
Natural value approximators: learning when to trust past estimates
Zhongwen Xu, Joseph Modayil, Hado van Hasselt, André Barreto, David Silver, Tom Schaul
Advances in Neural Information Processing Systems (NIPS) - selected as a spotlight
Value-aware loss function for model-based reinforcement learning
Amir-Massoud Farahmand, André Barreto, Daniel Nikovski
International Conference on Artificial Intelligence and Statistics (AISTATS)
Count-based quadratic control of Markov jump linear systems with unknown transition probabilities
Rafael Beirigo, Marcos Todorov, André Barreto
IEEE Annual Conference on Decision and Control (CDC)
[ more info ]
Transfer on count-based quadratic control of Markov jump linear systems with unknown transition probabilities
Rafael Beirigo, Marcos Todorov, André Barreto
Conferência Brasileira de Dinâmica, Controle e Aplicações (DINCON)
[ pdf ]
2016
Practical kernel-based reinforcement learning
André Barreto, Doina Precup, Joelle Pineau
Journal of Machine Learning Research, v. 17 (67), pp. 1−70
Incremental stochastic factorization for online reinforcement learning
André Barreto, Rafael L. Beirigo, Joelle Pineau, Doina Precup
AAAI Conference on Artificial Intelligence (AAAI)
2015
An expectation-maximization algorithm to compute a stochastic factorization from data
André Barreto, Rafael L. Beirigo, Joelle Pineau, Doina Precup
International Joint Conference on Artificial Intelligence (IJCAI)
Classification-based approximate policy iteration
Amir-massoud Farahmand, Doina Precup, André Barreto, Mohammad Ghavamzadeh
IEEE Transactions on Automatic Control, v. 60 (12)
2014
Policy iteration based on stochastic factorization
André Barreto, Joelle Pineau, Doina Precup
Journal of Artificial Intelligence Research, v. 50, pp. 763−803
Tree-based on-line reinforcement learning
André Barreto
AAAI Conference on Artificial Intelligence (AAAI)
Starting to uncover the relationship between stochastic factorization and hidden Markov models
André Barreto, Borja Pigem, Joelle Pineau, Doina Precup
NIPS Workshop on Novel Trends and Applications in Reinforcement Learning
2013
CAPI: Generalized classification-based approximate policy iteration
Amir-massoud Farahmand, Doina Precup, André Barreto, Mohammad Ghavamzadeh
The Multidisciplinary Conference on Reinforcement Learning and Decision Making (RLDM)
Reinforcement learning competition 2013: Controllability and kernel-based stochastic factorization
Anwarissa Asbah, André Barreto, Clement Gehring, Joelle Pineau, Doina Precup
ICML Workshop on the Reinforcement Learning Competition
2012
On-line reinforcement learning using incremental kernel-based stochastic factorization
André Barreto, Doina Precup, Joelle Pineau
Advances in Neural Information Processing Systems (NIPS)
Analysis of composition-based metagenomic classification
Susan Higashi, André Barreto, Maurício Cantão, Ana Tereza Vasconcelos
BMC Genomics, v. 13, pp. 1–11
2011
Computing the stationary distribution of a finite Markov chain through stochastic factorization
André Barreto and Marcelo Fragoso
SIAM Journal on Matrix Analysis and Applications, v. 32, pp. 1513–1523
Reinforcement learning using kernel-based stochastic factorization
André Barreto, Doina Precup, Joelle Pineau
Advances in Neural Information Processing Systems (NIPS), pp.720–728
Lumping the states of a finite Markov chain through stochastic factorization
André Barreto and Marcelo Fragoso
World Congress of the International Federation of Automatic Control (IFAC)
Exploring performance profiles for analyzing benchmark experiments
Helio Barbosa, Heder Bernardino, André Barreto
Metaheuristics International Conference (MIC), 2011
[ more info ]
Evolving numerical constants in grammatical evolution with the ephemeral constant method
Douglas Augusto, Helio Barbosa, André Barreto, Heder Bernardino
Portuguese Conference on Artificial Intelligence (EPIA)
[ more info ]
A new approach for generating numerical constants in grammatical evolution
Douglas Augusto, Helio Barbosa, André Barreto, Heder Bernardino
Conference on Genetic and Evolutionary Computation (GECCO)
[ more info ]
...2010
Probabilistic performance profiles for the experimental evaluation of stochastic algorithms
André Barreto, Heder Bernardino, Helio Barbosa
Conference on Genetic and Evolutionary Computation (GECCO), 2010
Kernel-based stochastic factorization for batch reinforcement learning
André Barreto and Doina Precup
NIPS: Learning and Planning from Batch Time Series Data Workshop, 2010
On the characteristics of sequential decision problems and their impact on evolutionary computation and reinforcement learning
André Barreto, Douglas Augusto, Helio Barbosa
Artificial Evolution, volume 5975 of Lecture Notes in Computer Science, 2010
Using performance profiles to analyze the results of the 2006 CEC constrained optimization competition
Helio Barbosa, Heder Bernardino, André Barreto
IEEE World Congress on Computational Intelligence (WCCI), 2010
[ more info ]
On the characteristics of sequential decision problems and their impact on evolutionary computation
André Barreto, Douglas Augusto, Helio Barbosa
Conference on Genetic and Evolutionary Computation (GECCO), 2009
Restricted gradient-descent algorithm for value-function approximation in reinforcement learning
André Barreto and Charles Anderson
Artificial Intelligence, v. 172(4-5), pages 454–482, 2008
A note on the variance of rank-based selection strategies for genetic algorithms and genetic programming
Artem Sokolov, Darrell Whitley, André Barreto
Genetic Programming and Evolvable Machines, v. 8(3), pp. 221–237, 2007
GOLS—Genetic orthogonal least squares algorithm for training RBF networks
André Barreto, Helio Barbosa, Nelson Ebecken
Neurocomputing, v. 69 (16-18), pp. 2041–2064, 2006
Alternative evolutionary algorithms for evolving programs: evolution strategies and steady-state GP
Darrel Whitley, Marc Richards, Ross Beveridge, André Barreto
Conference on Genetic and Evolutionary Computation (GECCO), 2006 - winner best paper on genetic programming
An interactive genetic algorithm with coevolution of weights for multiobjective problems
Helio Barbosa and André Barreto
Conference on Genetic and Evolutionary Computation (GECCO), 2001
Growing compact RBF networks using a genetic algorithm
André Barreto, Helio Barbosa, Nelson Ebecken
Brazilian Symposium on Neural Networks (SBRN), 2002
Graph layout using a genetic algorithm
André Barreto and Helio Barbosa
Brazilian Symposium on Neural Networks (SBRN), 2000
google scholar → | dblp →