Papers
Topics
Authors
Recent
2000 character limit reached

Courteous Behavior of Automated Vehicles at Unsignalized Intersections via Reinforcement Learning (2106.06369v1)

Published 11 Jun 2021 in cs.LG and cs.RO

Abstract: The transition from today's mostly human-driven traffic to a purely automated one will be a gradual evolution, with the effect that we will likely experience mixed traffic in the near future. Connected and automated vehicles can benefit human-driven ones and the whole traffic system in different ways, for example by improving collision avoidance and reducing traffic waves. Many studies have been carried out to improve intersection management, a significant bottleneck in traffic, with intelligent traffic signals or exclusively automated vehicles. However, the problem of how to improve mixed traffic at unsignalized intersections has received less attention. In this paper, we propose a novel approach to optimizing traffic flow at intersections in mixed traffic situations using deep reinforcement learning. Our reinforcement learning agent learns a policy for a centralized controller to let connected autonomous vehicles at unsignalized intersections give up their right of way and yield to other vehicles to optimize traffic flow. We implemented our approach and tested it in the traffic simulator SUMO based on simulated and real traffic data. The experimental evaluation demonstrates that our method significantly improves traffic flow through unsignalized intersections in mixed traffic settings and also provides better performance on a wide range of traffic situations compared to the state-of-the-art traffic signal controller for the corresponding signalized intersection.

Citations (15)

Summary

  • The paper proposes a novel Courteous Virtual Traffic Signal Control framework that leverages deep reinforcement learning to manage unsignalized intersections.
  • It models intersection management as a Markov Decision Process and employs proximal policy optimization with return scaling to stabilize learning.
  • Experimental results demonstrate significant improvements in throughput and reduced travel times in both simulated environments and real-world scenarios.

Courteous Behavior of Automated Vehicles at Unsignalized Intersections via Reinforcement Learning

Introduction

The gradual integration of Connected Autonomous Vehicles (CAVs) into road traffic presents a unique opportunity to enhance traffic management systems, particularly at unsignalized intersections. The research presented in "Courteous Behavior of Automated Vehicles at Unsignalized Intersections via Reinforcement Learning" (2106.06369) addresses the challenges associated with managing mixed traffic systems, where both human-driven vehicles (HVs) and CAVs coexist. The paper proposes a deep reinforcement learning approach to optimize traffic flow at unsignalized intersections by allowing CAVs to yield their right of way, thereby improving overall traffic efficiency and safety.

Methodology

The authors introduce a centralized intersection management method termed Courteous Virtual Traffic Signal Control (CVTSC), utilizing deep reinforcement learning to achieve cooperative behavior between CAVs and HVs. By modeling the intersection management task as a Markov Decision Process (MDP), the CVTSC learns policies to control CAVs, effectively acting as virtual traffic signals that prioritize yielding at intersections.

The solution involves defining a discrete action space for CAVs, specifying actions that allow or restrict passage on certain routes within the intersection (Figure 1). A critical aspect of the approach is return scaling, which addresses the imbalance of cumulative rewards at different states, ensuring stable learning dynamics across varying traffic demands. The paper also employs proximal policy optimization for learning the policy and value functions, benefiting from adaptive discounting to manage the variability in step lengths during training. Figure 1

Figure 1

Figure 1: A typical three-way intersection illustrating priorities for lane management.

Results

Experimental evaluations conducted using the traffic simulator SUMO, both with simulated and real-world traffic data, exhibit significant improvements in traffic flow at unsignalized intersections managed by CVTSC. The approach demonstrates superiority over traditional road sign-based methods and adaptive traffic signal controllers across various CAV penetration rates.

The experiments reveal that CVTSC markedly improves both the efficiency and equity of the intersection management. At simulated traffic densities ranging from 500 to 3000 vehicles per hour, CVTSC consistently achieved higher throughput and reduced travel times compared to baseline methods (Figure 2). Figure 2

Figure 2

Figure 2

Figure 2

Figure 2: Performance of CVTSC compared to baselines in terms of vehicle throughput and travel time under varying traffic densities.

Real-world Application and Implications

The paper further evaluates CVTSC on real-world traffic data, specifically targeting an intersection in Freiburg, Germany (Figure 3). The results corroborate the simulation findings, with CVTSC effecting reduced travel times and increased throughput across different CAV penetration rates. Figure 3

Figure 3: Real-world intersection layout used in the paper.

These findings illustrate the potential of reinforcement learning methods to effectively manage mixed traffic scenarios, highlighting practical applications of AI in traffic systems. The implications for urban planning include reduced environmental impact, enhanced safety, and improved public acceptance for CAVs.

Conclusion

The research outlines a robust framework for the management of unsignalized intersections in mixed traffic settings, providing a roadmap for future developments in AI-driven traffic systems. The integration of return scaling and deep reinforcement learning presents tangible benefits in optimizing traffic flow, paving the way for wider applications as the proportion of CAVs on roads increases. Future work could explore adaptation to more complex intersection layouts and integration with higher levels of vehicular automation. Additionally, ongoing research could focus on resolving communication challenges and increasing public trust in automated traffic systems. Figure 4

Figure 4: Box plot illustrating travel times in simulations using the real-world intersection.

In summary, the paper underscores the potential of reinforcement learning in facilitating courteous traffic management, aligning the multifaceted goals of efficiency, safety, and equity in modern vehicular networks.

Slide Deck Streamline Icon: https://streamlinehq.com

Whiteboard

Dice Question Streamline Icon: https://streamlinehq.com

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.