Accepted Papers
Counterfactual Learning from Human Proofreading Feedback for Semantic Parsing. pdf
Carolin Lawrence and Stefan Riezler.Teacher-Student Adaptation for Video Segmentation via Human Robot Interaction (HRI). pdf
Mennatullah Siam, Chen Jiang, Steven Lu, Laura Petrich, Mahmoud Gamal, Mohamed ElHoseiny, Martin JagersandWhat Would pi* Do?: Imitation Learning via Off-Policy Reinforcement Learning. pdf
Siddharth Reddy, Anca Dragan and Sergey LevineModelling User's Theory of AI's Mind in Interactive Intelligent Systems. pdf
Tomi Peltola, Mustafa Mert Çelikok, Pedram Daee and Samuel KaskiUsing Natural Language Descriptions to Guide Zero-Shot Image Classification: A Meta-Learning Approach.
R. Lily Hu, Caiming Xiong and Richard SocherTeaching with IMPACT . pdf
Carl Trimbach and Michael LittmanFrom Language to Goals: Inverse Reinforcement Learning for Vision-Based Instruction Following. pdf
Justin Fu, Anoop Korattikara, Sergey Levine and Sergio GuadarramaMeta-Learning Language-Guided Policy Learning. pdf
John Co-Reyes, Abhishek Gupta, Suvansh Sanjeev, Nick Altieri, John DeNero, Pieter Abbeel and Sergey LevineTraining an Interactive Helper. pdf
Mark Woodward, Chelsea Finn and Karol HausmanAssisted Inverse Reinforcement Learning. pdf
Parameswaran Kamalaruban, Rati Devidze, Teresa Yeo, Trisha Mittal, Volkan Cevher and Adish SinglaLearning a Multi-Modal Policy via Imitating Demonstrations with Mixed Behaviors. pdf
Fang-I Hsiao, Jui-Hsuan Kuo and Min SunAdvice-Based Exploration in Model-Based Reinforcement Learning. pdf
Rodrigo Toro Icarte, Toryn Q. Klassen, Richard Valenzano and Sheila McIlraithCompositional Imitation Learning: Explaining and executing one task at a time. pdf
Thomas Kipf, Yujia Li, Hanjun Dai, Vinicius Zambaldi, Edward Grefenstette, Pushmeet Kohli and Peter BattagliaTeaching Multiple Tasks to an RL Agent using LTL (Abridged Report). pdf
Rodrigo Toro Icarte, Toryn Q. Klassen, Richard Valenzano and Sheila McIlraithReward-adjusted diameters of a Markov decision process and their conditioning by potential-based reward shaping. pdf
Falcon Z. Dai and Matthew WalterGenerating Diverse Programs with Instruction Conditioned Reinforced Adversarial Learning. pdf
Aishwarya Agrawal, Mateusz Malinowski, Felix Hill, Ali Eslami, Oriol Vinyals and Tejas KulkarniThe Implicit Preference Information in an Initial State. pdf
Rohin Shah, Dmitrii Krasheninnikov, Jordan Alexander, Pieter Abbeel and Anca DraganTowards an IDE for agent design. pdf
Matthew Rahtz, James Fang, Anca Dragan and Dylan Hadfield-MenellLearning to Learn from Imperfect Demonstrations. pdf
Ge Yang and Chelsea FinnInvestigating Machine-Learning Interaction with Wizard-of-Oz Experiments. pdf
Rob Sheline and Chris MacLellanOne-shot Semantic Parsing. pdf
Brian Lu, Igor Labutov, Bishan Yang, Tom Mitchell