Navigating Autonomous Vehicle at the Road Intersection with Reinforcement Learning
Michael Martinson, Alexey Skrynnik, Aleksandr I. Panov
Abstract
In this paper, we consider the problem of controlling an intelligent agent that simulates the behavior of an unmanned car passing through a road intersection together with other vehicles. We consider the case where smart city systems give the agent full information about what is happening at the intersection in the form of video frames from surveillance cameras. The paper proposes a control system based on a trainable behavior generation module. The agent's model is implemented using reinforcement learning (RL) methods. We analyze several RL methods (PPO, Rainbow, TD3) and variants of the agent's computer vision subsystem. We also present the results of the best agent implementation when driving together with other traffic participants in compliance with traffic rules.
This work
In this project, we train an agent that simulates a self-driving car passing through a road intersection. As the basic problem statement, we consider a realistic scenario that uses data from the agent's sensors (images from cameras within the field of view, laser rangefinders, etc.) together with data from video surveillance cameras located in complex, heavily loaded transport areas, in particular at road intersections.
Env CarIntersect
Our environment, CarIntersect, simulates a four-way crossroad with some physics and bot cars. A technical description is available in the env folder.
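To illustrate how an agent typically interacts with an environment of this kind, here is a minimal sketch of a reset/step episode loop. It does not use the CarIntersect API: `StubIntersectionEnv`, its one-dimensional dynamics, the `(steer, gas, brake)` action layout, and the reward are all hypothetical stand-ins chosen only to make the loop runnable. See the env folder for the real interface.

```python
class StubIntersectionEnv:
    """Hypothetical stand-in for an intersection environment (NOT the CarIntersect API)."""

    def __init__(self, track_length=20.0, max_steps=200):
        self.track_length = track_length  # distance the car must cover to finish
        self.max_steps = max_steps        # episode time limit

    def reset(self):
        self.position = 0.0  # progress of the agent car along its track
        self.speed = 0.0
        self.steps = 0
        return (self.position, self.speed)

    def step(self, action):
        # Assumed action layout: (steer, gas, brake). Steering is ignored in this 1-D toy.
        _steer, gas, brake = action
        self.speed = max(0.0, self.speed + 0.5 * gas - 1.0 * brake)
        delta = 0.1 * self.speed
        self.position += delta
        self.steps += 1
        reward = delta  # toy reward: progress made along the track on this step
        done = self.position >= self.track_length or self.steps >= self.max_steps
        return (self.position, self.speed), reward, done, {}


def run_episode(env, policy):
    """Roll out one episode and return the sum of rewards."""
    state = env.reset()
    total, done = 0.0, False
    while not done:
        state, reward, done, _info = env.step(policy(state))
        total += reward
    return total


def full_gas_policy(state):
    """Trivial policy: drive straight with full throttle."""
    return (0.0, 1.0, 0.0)


env = StubIntersectionEnv()
episode_return = run_episode(env, full_gas_policy)
```

A trained PPO, TD3, or Rainbow policy would take the place of `full_gas_policy` in the same loop.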
Methods examples
TD3 on track Full rotate | Rainbow on track Full rotate | PPO on track Small rotation with bots |
TD3 on track Line | Rainbow with image as state on track Line | PPO with image state on track Line |
Convergence
State as a vector
Small rotation | Medium rotation | Line | Full rotation |
---|---|---|---|
State as an image
Small rotation | Medium rotation | Line | Full rotation |
---|---|---|---|
With bot cars
Vector state | Image state |
---|---|
Beyond the article scope
We provide some additional experiments with our environment, in particular with DRQ and RND. We have also changed and simplified how experiments are launched. See additional.
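For readers unfamiliar with RND, the following is a minimal sketch of the idea, assuming RND here refers to Random Network Distillation: a predictor is trained to imitate a fixed random target network, and the prediction error serves as an exploration bonus that shrinks for states seen often. The class name `RND`, the tiny linear predictor, and all hyperparameters are illustrative assumptions, not the code used in the additional experiments.

```python
import math
import random


def make_matrix(rows, cols, rng):
    """Random weight matrix with entries in [-1, 1]."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


class RND:
    """Toy Random Network Distillation with a linear predictor (illustrative only)."""

    def __init__(self, state_dim, feat_dim=4, lr=0.05, seed=0):
        rng = random.Random(seed)
        self.target = make_matrix(feat_dim, state_dim, rng)     # fixed, never trained
        self.predictor = make_matrix(feat_dim, state_dim, rng)  # trained to match target
        self.lr = lr

    def _target_features(self, state):
        return [math.tanh(z) for z in matvec(self.target, state)]

    def intrinsic_reward(self, state):
        """Squared prediction error: large for novel states, small for familiar ones."""
        t = self._target_features(state)
        p = matvec(self.predictor, state)
        return sum((pi - ti) ** 2 for pi, ti in zip(p, t))

    def update(self, state):
        """One SGD step on the squared error for a visited state."""
        t = self._target_features(state)
        p = matvec(self.predictor, state)
        for i, row in enumerate(self.predictor):
            err = p[i] - t[i]
            for j in range(len(row)):
                row[j] -= self.lr * 2.0 * err * state[j]


rnd = RND(state_dim=2)
familiar, novel = [1.0, 0.0], [0.0, 1.0]
before = rnd.intrinsic_reward(familiar)
for _ in range(100):
    rnd.update(familiar)  # the agent keeps visiting the same state
after = rnd.intrinsic_reward(familiar)
novel_bonus = rnd.intrinsic_reward(novel)
```

In this toy setup the bonus for the repeatedly visited state decays toward zero, while the unvisited state keeps its initial bonus. In practice the bonus is added to the environment reward, and both networks are deep networks over image or vector observations.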