Accepted Papers

Counterfactual Learning from Human Proofreading Feedback for Semantic Parsing. pdf

Carolin Lawrence and Stefan Riezler

Teacher-Student Adaptation for Video Segmentation via Human Robot Interaction (HRI). pdf

Mennatullah Siam, Chen Jiang, Steven Lu, Laura Petrich, Mahmoud Gamal, Mohamed ElHoseiny and Martin Jagersand

What Would pi* Do?: Imitation Learning via Off-Policy Reinforcement Learning. pdf

Siddharth Reddy, Anca Dragan and Sergey Levine

Modelling User's Theory of AI's Mind in Interactive Intelligent Systems. pdf

Tomi Peltola, Mustafa Mert Çelikok, Pedram Daee and Samuel Kaski

Using Natural Language Descriptions to Guide Zero-Shot Image Classification: A Meta-Learning Approach.

R. Lily Hu, Caiming Xiong and Richard Socher

Teaching with IMPACT. pdf

Carl Trimbach and Michael Littman


From Language to Goals: Inverse Reinforcement Learning for Vision-Based Instruction Following. pdf

Justin Fu, Anoop Korattikara, Sergey Levine and Sergio Guadarrama


Meta-Learning Language-Guided Policy Learning. pdf

John Co-Reyes, Abhishek Gupta, Suvansh Sanjeev, Nick Altieri, John DeNero, Pieter Abbeel and Sergey Levine


Training an Interactive Helper. pdf

Mark Woodward, Chelsea Finn and Karol Hausman

Assisted Inverse Reinforcement Learning. pdf

Parameswaran Kamalaruban, Rati Devidze, Teresa Yeo, Trisha Mittal, Volkan Cevher and Adish Singla


Learning a Multi-Modal Policy via Imitating Demonstrations with Mixed Behaviors. pdf

Fang-I Hsiao, Jui-Hsuan Kuo and Min Sun


Advice-Based Exploration in Model-Based Reinforcement Learning. pdf

Rodrigo Toro Icarte, Toryn Q. Klassen, Richard Valenzano and Sheila McIlraith


Compositional Imitation Learning: Explaining and executing one task at a time. pdf

Thomas Kipf, Yujia Li, Hanjun Dai, Vinicius Zambaldi, Edward Grefenstette, Pushmeet Kohli and Peter Battaglia


Teaching Multiple Tasks to an RL Agent using LTL (Abridged Report). pdf

Rodrigo Toro Icarte, Toryn Q. Klassen, Richard Valenzano and Sheila McIlraith


Reward-adjusted diameters of a Markov decision process and their conditioning by potential-based reward shaping. pdf

Falcon Z. Dai and Matthew Walter


Generating Diverse Programs with Instruction Conditioned Reinforced Adversarial Learning. pdf

Aishwarya Agrawal, Mateusz Malinowski, Felix Hill, Ali Eslami, Oriol Vinyals and Tejas Kulkarni


The Implicit Preference Information in an Initial State. pdf

Rohin Shah, Dmitrii Krasheninnikov, Jordan Alexander, Pieter Abbeel and Anca Dragan


Towards an IDE for agent design. pdf

Matthew Rahtz, James Fang, Anca Dragan and Dylan Hadfield-Menell


Learning to Learn from Imperfect Demonstrations. pdf

Ge Yang and Chelsea Finn


Investigating Machine-Learning Interaction with Wizard-of-Oz Experiments. pdf

Rob Sheline and Chris MacLellan


One-shot Semantic Parsing. pdf

Brian Lu, Igor Labutov, Bishan Yang and Tom Mitchell