Publications

Journal Articles

Learning Reward Machines: A Study in Partially Observable Reinforcement Learning. Rodrigo Toro Icarte, Toryn Klassen, Richard Valenzano, Margarita Castro, Ethan Waldie, and Sheila McIlraith. Artificial Intelligence Journal, 2023.

Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning. Rodrigo Toro Icarte, Toryn Klassen, Richard Valenzano, and Sheila McIlraith. Journal of Artificial Intelligence Research, Volume 73, 2022.

Probably bounded suboptimal heuristic search. Roni Stern, Gal Dreiman, and Richard Valenzano. Artificial Intelligence, Volume 267, 2019.

On polygon numbers of circle graphs and distance hereditary graphs. Lorna Stewart and Richard Valenzano. Discrete Applied Mathematics, Volume 248, 2018.


Conference Papers

Type-WA*: Using Exploration in Bounded Suboptimal Planning. Eldan Cohen, Richard Valenzano, and Sheila McIlraith. IJCAI 2021.

Learning Reward Machines for Partially Observable Reinforcement Learning. Rodrigo Toro Icarte, Toryn Klassen, Richard Valenzano, Margarita Castro, and Sheila McIlraith. NeurIPS 2019.

LTL and Beyond: Formal Languages for Reward Function Specification in Reinforcement Learning. Alberto Camacho, Rodrigo Toro Icarte, Toryn Klassen, Richard Valenzano, and Sheila McIlraith. IJCAI 2019.

Using Reward Machines for High-Level Task Specification and Decomposition in Reinforcement Learning. Rodrigo Toro Icarte, Toryn Klassen, Richard Valenzano, and Sheila McIlraith. ICML 2018.

Teaching Multiple Tasks to an RL Agent using LTL. Rodrigo Toro Icarte, Toryn Klassen, Richard Valenzano, and Sheila McIlraith. AAMAS 2018.

Using Advice in Model-Based Reinforcement Learning. Rodrigo Toro Icarte, Toryn Klassen, Richard Valenzano, and Sheila McIlraith. CCAI 2018.

An Analysis and Enhancement of the Gap Heuristic for the Pancake Puzzle. Richard Valenzano and Danniel Sihui Yang. SoCS 2017.

Searching with a Corrupted Heuristic. Levi Lelis, Richard Valenzano, Gabriel Nazar, and Roni Stern. SOCS 2016.

On the Completeness of Best-First Search Variants That Use Random Exploration. Richard Valenzano and Fan Xie. AAAI 2016.

Using Metric Temporal Logic to Specify Scheduling Problems. Roy Luo, Richard Valenzano, Yi Li, Christopher Beck, and Sheila McIlraith. KR 2016.

Worst-Case Solution Quality Analysis When Not Re-Expanding Nodes in Best-First Search. Richard Valenzano, Nathan Sturtevant, and Jonathan Schaeffer. AAAI 2014.

A Comparison of Knowledge-Based GBFS Enhancements and Knowledge-Free Exploration. Richard Valenzano, Nathan Sturtevant, Jonathan Schaeffer, and Fan Xie. ICAPS 2014.

Using Alternative Suboptimality Bounds in Heuristic Search. Richard Valenzano, Shahab Jabbari Arfaee, Jordan Thayer, Roni Stern, and Nathan Sturtevant. ICAPS 2013.

Better Time Constrained Search via Randomization and Postprocessing. Fan Xie, Richard Valenzano, and Martin Müller. ICAPS 2013.

Evaluating State-Space Abstractions in Extensive-Form Games. Michael Johanson, Neil Burch, Richard Valenzano, and Michael Bowling. AAMAS 2013.

ArvandHerd: Parallel Planning with a Portfolio. Richard Valenzano, Hootan Nakhost, Martin Müller, Jonathan Schaeffer, and Nathan Sturtevant. ECAI 2012.

Simultaneously Searching with Multiple Settings: An Alternative to Parameter Tuning for Suboptimal Single-Agent Search Algorithms. Richard Valenzano, Nathan Sturtevant, Jonathan Schaeffer, Karen Buro, and Akihiro Kishimoto. ICAPS 2010.


Technical Reports

A Formal Characterization of the Local Search Topology of the Pancake Puzzle. Richard Valenzano and Danniel Sihui Yang. CoRR 2017.

ArvandHerd 2014. Richard Valenzano, Hootan Nakhost, Martin Müller, Jonathan Schaeffer, and Nathan Sturtevant. IPC 2014.

Adding Exploration to Greedy Best-First Search. Richard Valenzano, Nathan R. Sturtevant, and Jonathan Schaeffer. University of Alberta Technical Report TR13-06, 2013.

ArvandHerd: Parallel Planning with a Portfolio. Richard Valenzano, Hootan Nakhost, Martin Müller, Jonathan Schaeffer, and Nathan Sturtevant. IPC 2011.

Arvand: the Art of Random Walks. Hootan Nakhost, Martin Müller, Richard Valenzano, and Fan Xie. IPC 2011.


Theses

Design Decisions on Suboptimal Heuristic Search-Based Systems. Richard Valenzano. Ph.D. Thesis. University of Alberta, 2014.

Simultaneously Searching with Multiple Algorithm Settings: An Alternative to Parameter Tuning for Suboptimal Single-Agent Search. Richard Valenzano. M.Sc. Thesis. University of Alberta, 2009.