Traditional autonomous vehicle pipelines that follow a modular approach have seen considerable success in both academia and industry, leading to autonomy deployed on public roads. While this approach offers ease of interpretation, it generalizes poorly to unseen environments and requires hand-engineering of numerous parameters, especially in the prediction and planning systems. Recently, deep reinforcement learning has been shown to master complex strategic games and perform challenging robotic tasks, which makes it an appealing framework for learning to drive.
In this thesis, we propose two works that formulate urban driving tasks using reinforcement learning and learn optimal control policies primarily from waypoints and low-dimensional representations, also known as affordances. We demonstrate that our agents, trained from scratch, learn the tasks of lane-following and driving through intersections, and also learn to stop for other actors and traffic lights, even in dense traffic.
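As a rough illustration of the affordance-based setup described above, the sketch below maps a low-dimensional observation vector to continuous control commands with a small randomly initialized policy network. The affordance names, network sizes, and action layout are illustrative assumptions, not the thesis's actual interface.

```python
import numpy as np

# Hypothetical affordance vector (names are illustrative only):
# [lane_centre_offset_m, heading_error_rad, dist_to_lead_vehicle_m, light_is_red]
affordances = np.array([0.3, -0.05, 12.0, 1.0])

rng = np.random.default_rng(0)

def make_mlp(sizes, rng):
    """Randomly initialize weight/bias pairs for a small MLP policy."""
    return [(rng.standard_normal((m, n)) * 0.1, np.zeros(n))
            for m, n in zip(sizes[:-1], sizes[1:])]

def policy(obs, params):
    """Map a low-dimensional affordance vector to [steer, throttle] in [-1, 1]."""
    h = obs
    for w, b in params[:-1]:
        h = np.tanh(h @ w + b)          # hidden layers with tanh activations
    w, b = params[-1]
    return np.tanh(h @ w + b)           # bounded continuous actions

params = make_mlp([4, 32, 32, 2], rng)  # 4 affordances in, 2 control outputs
action = policy(affordances, params)
```

In an RL training loop, the parameters of such a policy would be updated from reward signals (e.g. progress along waypoints, penalties for collisions or running red lights) rather than left at their random initialization.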
If you have any questions, problems, or suggestions for improvement, or if you need access to our code, please feel free to reach us via email.