Spring 2019 Paper List

Below is the list of papers that we covered in Spring 2019:

Course instructions:

Below is the list of papers that we will be covering in this course, along with some related background reading and some videos that can be interesting to show in your presentations. Note - you do not need to cover all of the material in the background reading; you should use it only if it will assist you in explaining the concepts contained in the assigned paper.

You can sign up for paper presentations here.

Outline:

Schedule

Review of Reinforcement Learning

1/15, 1/17

Robot Locomotion

1/22

- - Robotics paper: Experiments in Balance with a 3D One-Legged Hopping Machine

Videos:

Robots from MIT's Leg Lab

- - Deep RL paper: Continuous Control With Deep Reinforcement Learning

Video:

Example DDPG policies

Background:

Deterministic Policy Gradient Algorithms

1/24

- - Robotics paper: A Three-Dimensional Passive-Dynamic Walking Robot with Two Legs and Knees

Video:

Collins robot compilation 3

Background:

Efficient Bipedal Robots Based on Passive-Dynamic Walkers

Passive Dynamic Walking

Russ Tedrake. Underactuated Robotics: Algorithms for Walking, Running, Swimming, Flying, and Manipulation (Course Notes for MIT 6.832) - Ch. 4: Walking and Running

Lecture by Russ Tedrake: Simple models of walking/running

- - Deep RL paper: Emergence of Locomotion Behaviours in Rich Environments

Background: Proximal Policy Optimization Algorithms

1/29

- - Robotics paper: Stochastic Policy Gradient Reinforcement Learning on a Simple 3D Biped

Background:

Learning to Walk in 20 Minutes

Actuating a Simple 3D Passive Dynamic Walker

- - Deep RL paper: Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor

Videos:

https://sites.google.com/view/soft-actor-critic

https://sites.google.com/view/minitaur-locomotion

https://sites.google.com/view/sac-and-applications

Background / Related Work:

Learning to Walk via Deep Reinforcement Learning

Soft Actor-Critic Algorithms and Applications

Tuomas Haarnoja's PhD Thesis

Robot locomotion over rough terrain

2/5

- - Robotics paper: A Control Architecture for Quadruped Locomotion Over Rough Terrain

Background: Hierarchical Apprenticeship Learning, with Application to Quadruped Locomotion

- - Deep RL paper: DeepLoco: Dynamic Locomotion Skills Using Hierarchical Deep Reinforcement Learning

Robot jumping

2/7

- - Robotics paper: Policy Search via the Signed Derivative
  - Deep RL paper: Deep Reinforcement Learning from Human Preferences

Robotic Acrobatics

2/12

- - Robotics paper: Control of a Biped Somersault in 3D
  - Deep RL paper: SFV: Reinforcement Learning of Physical Skills from Videos

Multi-robot coordination

2/14

- - Robotics paper: Coordinated Multi-Robot Exploration
  - Deep RL paper: Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments

Robot Grasping

2/19

- - Robotics paper: A Positive Pressure Universal Gripper Based on the Jamming of Granular Material

Background: Universal robotic gripper based on the jamming of granular material

- - Deep RL paper: Learning 6-DoF Grasping and Pick-Place Using Attention Focus

2/21

- - Robotics paper: A novel type of compliant and underactuated robotic hand for dexterous grasping
  - Deep RL paper: Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations

2/26

- - Robotics paper: The Highly Adaptive SDM Hand: Design and Performance Evaluation

Followup work: A compliant, underactuated hand for robust manipulation

- - Deep RL paper: Learning Synergies between Pushing and Grasping with Self-supervised Deep Reinforcement Learning

2/28

- - Robotics paper: Autonomous manipulation with a general-purpose simple hand
  - Deep RL paper: QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation

3/5

- - Robotics paper: Open-Loop Precision Grasping With Underactuated Hands Inspired by a Human Manipulation Strategy

Background:

Previous conference paper: Precision Grasping and Manipulation of Small Objects from Flat Surfaces using Underactuated Fingers

- - Deep RL paper: Learning task-oriented grasping for tool manipulation from simulated self-supervision

3/7

- - Robotics paper: Shape Completion Enabled Robotic Grasping

Background: Graspit! a versatile simulator for robotic grasping

- - Deep RL paper: More Than a Feeling: Learning to Grasp and Regrasp using Vision and Touch

In-hand manipulation

3/19

- - Robotics paper: In-Hand Manipulation Primitives for a Minimal, Underactuated Gripper With Active Surfaces

- - Deep RL paper: Learning Dexterous In-Hand Manipulation

https://blog.openai.com/learning-dexterity/

Robot manipulation

3/21

- - Robotics paper: Dynamic Regrasping using a High-speed Multifingered Hand and a High-speed vision System

- - Deep RL paper: Visual Foresight: Model-Based Deep Reinforcement Learning for Vision-Based Robotic Control (unpublished)

Blog post: https://bair.berkeley.edu/blog/2018/11/30/visual-rl/

3/26

- - Robotics paper: Extrinsic dexterity: In-hand manipulation with external forces
  - Deep RL paper: Time-Contrastive Networks: Self-Supervised Learning from Video

3/28

- - Robotics paper: Prehensile Pushing: In-hand Manipulation with Push- Primitives
  - Deep RL paper: Hindsight Experience Replay

4/2

- - Robotics paper: Learning Robot In-Hand Manipulation with Tactile Features

- - Deep RL paper: Zero-Shot Visual Imitation

Background:

Combining Self-Supervised Learning and Imitation for Vision based Rope Manipulation

[webpage] [video]

Learning to Poke by Poking: Experiential Learning of Intuitive Physics

4/4

- - Robotics paper: Learning Task-Specific Models for Dexterous, In-Hand Manipulation with Simple, Adaptive Robot Hands

- - Deep RL paper: Learning an Embedding Space for Transferable Robot Skills

4/9

- - Robotics paper: Open Loop Stable Control Strategies for Robot Juggling

Videos

Overview of blind and non-blind robot juggling by Chris Atkeson

3 Ball Blind Robot Paddle Juggling

Feedback Control - Swinging Blind Juggler - IDSC ETH Zurich

The Circular Cloverleaf Blind Juggler - IDSC ETH Zurich

The Blind Juggler - IDSC - ETH Zurich

Shannon (robot) juggling: 3 and 5 balls - blind juggling

Other related work;

Bouncing an Unconstrained Ball in Three Dimensions with a Blind Juggling Robot

Design and Analysis of a Blind Juggling Robot

Rhythmic Feedback Control of a Blind Planar Juggler

https://www.blindjuggler.org/

History of juggling robots

- - Deep RL paper: Reinforcement and Imitation Learning for Diverse Visuomotor Skills

Hierarchical Planning:

4/16

- Robotics paper: Hierarchical Task and Motion Planning in the Now

- Deep RL paper: Data-Efficient Hierarchical Reinforcement Learning

Aerial vehicles

4/18

Robotics paper:

- - An Application of Reinforcement Learning to Aerobatic Helicopter Flight

Prior work:

Learning vehicular dynamics with application to modeling helicopters.

Apprenticeship learning via inverse reinforcement learning.

Exploration and apprenticeship learning in reinforcement learning.

Followup work:

Autonomous Helicopter Aerobatics through Apprenticeship Learning

Related work:

- - - - Autonomous Helicopter Control using Reinforcement Learning Policy Search Methods

- - Deep RL paper: Deep Drone Racing: Learning Agile Flight in Dynamic Environments

4/23

- - Robotics paper: Experiments in Fixed-Wing UAV Perching

Video

Perching with a Simple Fixed-Wing Glider

Followup work:

Robust Post-Stall Perching with a Simple Fixed-Wing Glider using LQR-Trees

Related work:

Landing, perching and taking off from vertical surfaces

- - Deep RL paper: CAD2RL: Real Single-Image Flight Without a Single Real Image

Guest Lecture

4/25

Final Project Presentations

4/30, 5/2

Google Sites

Report abuse