Imitation, Intent, and Interaction (I3)

Accepted Papers

Contributed talks

"A Narration-based Reward Shaping Approach using Grounded Natural Language Command"

Nicholas R Waytowich (US Army Research Laboratory)*; Sean Barton (US Army Research Laboratory); Vernon Lawhern (US Army Research Laboratory); Garrett Warnell (US Army Research Laboratory)

[pdf]


"Self-Enhanced Inverse Reinforcement Learning for Text Generation"

Ping Yu (University at Buffalo); Ruiyi Zhang (Duke University); Chunyuan Li (Microsoft Research); Yizhe Zhang (Microsoft); Changyou Chen (University at Buffalo)*

[pdf]


"Generative Adversarial Imitation from Observation"

Faraz Torabi (The University of Texas at Austin)*; Garrett Warnell (US Army Research Laboratory); Peter Stone ((organization))

[pdf]


"TarMAC: Targeted Multi-Agent Communication"

Abhishek Das (Georgia Tech)*; Théophile Gervet (Carnegie Mellon University); Joshua Romoff (McGill University); Dhruv Batra (Georgia Tech & Facebook AI Research); Devi Parikh (Georgia Tech & Facebook AI Research); Mike Rabbat (Facebook FAIR); Joelle Pineau (Facebook)

[pdf]


"SMILe: Scalable Meta Inverse Reinforcement Learning through Context-Conditional Policies"

Seyed Kamyar Seyed Ghasemipour (University of Toronto, Vector Institute)*; Shixiang Gu (Google Brain); Richard Zemel (University of Toronto)

[pdf]


"Nested Reasoning About Autonomous Agents Using Probabilistic Programs"

Iris R Seaman (Northeastern University)*; Jan-Willem van de Meent (Northeastern); David Wingate (Brigham Young University)

[pdf]

Contributed posters

"Input Estimation in Linear Dynamical Systems with Applications to Learning-from-Observations"

Sebastian Curi (ETH)*; Kfir Yehuda Levy (ETH); Andreas Krause (ETH Zürich)

[pdf]


"End-to-End Robotic Reinforcement Learning without Reward Engineering"

Avi Singh (UC Berkeley)*; Larry Yang (UC Berkeley); Kristian Hartikainen (UC Berkeley); Chelsea Finn (UC Berkeley); Sergey Levine (UC Berkeley)

[pdf]


"On Multi-Agent Learning in Team Sports Games"

Yunqi Zhao (Electronic Arts); Igor Borovikov (Electronic Arts); Jason Rupert (Electronic Arts); Caedmon Somers (Electronic Arts); Ahmad Beirami (Electronic Arts)*

[pdf]


"Discriminatively Learning Inverse Optimal Control Models for Predicting Human Intentions"

Sanket Gaurav (University of Illinois at Chicago)*; Brian Ziebart (UIC)

[pdf]


"RIDM: Reinforced Inverse Dynamics Modeling for Learning from a Single Observed Demonstration"

Brahma S Pavse (University of Texas at Austin); Faraz Torabi (The University of Texas at Austin)*; Josiah Hanna (UT Austin); Garrett Warnell (US Army Research Laboratory); Peter Stone ((organization))

[pdf]


"Compositional Plan Vectors"

Coline M Devin (University of California, Berkeley)*

[pdf]


"On the Utility of Learning about Humans for Human-AI Coordination"

Micah D Carroll (UC Berkeley)*; Rohin Shah (UC Berkeley); Mark Ho (Princeton); Thomas Griffiths (Princeton University); Sanjit Seshia (UC Berkeley); Pieter Abbeel (UC Berkeley); Anca Dragan (EECS Department, University of California, Berkeley)

[pdf]


"Unsupervised Visuomotor Control through Distributional Planning Networks"

Tianhe Yu (Stanford University)*

[pdf]


"Sample-efficient Adversarial Imitation Learning from Observation"

Faraz Torabi (The University of Texas at Austin)*; Sean Geiger (UT Austin); Garrett Warnell (US Army Research Laboratory); Peter Stone ((organization))

[pdf]


"Hierarchical Soft Actor-Critic: Adversarial Exploration via Mutual Information Optimization"

Ari Azarafrooz (Cylance)*; John Brock (Cylance)

[pdf]


"Understanding the Relation of Behavioral Cloning and Inverse Reinforcement Learning through Divergence Minimization"

Seyed Kamyar Seyed Ghasemipour (University of Toronto, Vector Institute)*; Richard Zemel (University of Toronto); Shixiang Gu (Google Brain)

[pdf]