Papers
Topics
Authors
Recent
Assistant
AI Research Assistant
Well-researched responses based on relevant abstracts and paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses.
Gemini 2.5 Flash
Gemini 2.5 Flash 161 tok/s
Gemini 2.5 Pro 56 tok/s Pro
GPT-5 Medium 31 tok/s Pro
GPT-5 High 34 tok/s Pro
GPT-4o 40 tok/s Pro
Kimi K2 200 tok/s Pro
GPT OSS 120B 427 tok/s Pro
Claude Sonnet 4.5 38 tok/s Pro
2000 character limit reached

Intelligent Autonomous Intersection Management (2202.04224v1)

Published 9 Feb 2022 in cs.MA and cs.LG

Abstract: Connected Autonomous Vehicles will make autonomous intersection management a reality replacing traditional traffic signal control. Autonomous intersection management requires time and speed adjustment of vehicles arriving at an intersection for collision-free passing through the intersection. Due to its computational complexity, this problem has been studied only when vehicle arrival times towards the vicinity of the intersection are known beforehand, which limits the applicability of these solutions for real-time deployment. To solve the real-time autonomous traffic intersection management problem, we propose a reinforcement learning (RL) based multiagent architecture and a novel RL algorithm coined multi-discount Q-learning. In multi-discount Q-learning, we introduce a simple yet effective way to solve a Markov Decision Process by preserving both short-term and long-term goals, which is crucial for collision-free speed control. Our empirical results show that our RL-based multiagent solution can achieve near-optimal performance efficiently when minimizing the travel time through an intersection.

Citations (3)

Summary

  • The paper presents a novel multiagent approach combining a polling-based coordinating agent with RL-based trajectory control to schedule vehicles dynamically.
  • The methodology leverages multi-discount Q-learning to balance short-term collision avoidance with long-term path optimization, significantly reducing computational complexity.
  • Empirical results demonstrate near-optimal performance and reduced travel times in multi-lane intersections, validating the framework's real-time feasibility and safety.

Intelligent Autonomous Intersection Management

The paper "Intelligent Autonomous Intersection Management" addresses the real-time management of connected autonomous vehicles (CAVs) through intersections, replacing traditional traffic signals. It proposes an efficient, deployable, multiagent architecture based on reinforcement learning (RL) and a novel algorithm named multi-discount Q-learning. This approach tackles the complexity of managing real-time autonomous traffic, accounting for both long-term trajectory planning and short-term dynamic adjustments necessary for collision-free operation.

Autonomous Intersection Management Challenges

Autonomous Intersection Management (AIM) aims to exploit the capabilities of CAVs to improve traffic flow through intersections better than traditional methods. Most existing AIM strategies rely on knowing vehicles' arrival times in advance, which limits their real-world applicability as real-time conditions are often stochastic. The computational complexity inherent in solving these scheduling and trajectory optimization problems has traditionally necessitated expensive linear programming (LP) approaches, which are impractical for real-time deployment.

Proposed Solution

Multiagent System Design

The proposed system consists of two key agent types:

  1. Polling-based Coordinating Agent: This agent formulates a time schedule for each vehicle entering the intersection's control region. It applies a novel polling algorithm that adapts to the complexity of multi-lane intersections with multiple turning directions by managing scheduling in a computationally efficient manner.
  2. RL-based Trajectory Control Agents: Assigned to individual vehicles, these agents determine the optimal trajectory to reach the intersection at the scheduled time while maximizing velocity. The distributed architecture allows these agents to operate independently while adhering to the overall AIM policy set by the coordinating agent.

Multi-Lane Polling System

The multi-lane polling system extends traditional single-lane approaches by representing each lane as a queue and employing queue-dependent transition functions to manage service and switch-over times. This method generalizes to real-world intersections with multiple directions and ensures non-conflicting time schedules for vehicles.

Novel Multi-Discount Q-learning

To tackle challenges posed by tasks with both short-term safety and long-term scheduling objectives, the authors introduce multi-discount Q-learning. This algorithm uses a discount vector to differentiate between short and long temporal horizons, enabling simultaneous optimization of different objectives. The adaptive discounting is managed via a reward-dependent discount function, which is crucial for preserving the temporal integrity of each sub-task in the decision-making process. Figure 1

Figure 1

Figure 1: A four legged intersection with vehicles represented with black rectangles. The control region for one road segment is shown with length L. The intersection region is shown in gray and the intersection starting point for road segments are shown in blue lines.

Empirical Evaluation

Experimental Setup

  • VE-setup: Focuses on testing individual vehicle trajectories against baselines using an RL-based agent and optimal LP models.
  • IE-setup: Tests full intersection management with stochastically arriving vehicles, simulating real-world traffic patterns.

Results

  • Trajectory Optimization: Multi-discount Q-learning achieves near-optimal trajectory performance comparable to LP solutions but with significantly lower computational time and complexity, ensuring feasibility for real-time application.
  • Intersection Management Performance: The proposed I-AIM framework demonstrates reduced average travel times compared to traditional methods, even under varying traffic conditions. Most notably, the RL-based solution incurs no scheduling deviations, underscoring its safety and reliability advantages over heuristic or LP-based approaches.

Conclusions

The integration of RL with novel algorithmic enhancements presents a feasible, real-time solution to AIM, achieving efficiency across typical urban traffic conditions. The proposed multi-discount approach shows potential applications in diverse areas where tasks comprise objectives with differing temporal scopes, notably in robotics and broader autonomous systems. Future work may extend this framework to coordinate multiple intersections concurrently and evaluate broader urban networks. Such advancements could further harness the potential of AIM to meet evolving urban mobility demands.

Dice Question Streamline Icon: https://streamlinehq.com

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.