MASP: Scalable GNN-based Planning for Multi-Agent Navigation (2312.02522v2)

Published 5 Dec 2023 in cs.LG, cs.AI, and cs.RO

Abstract: We investigate multi-agent navigation tasks, where multiple agents need to reach initially unassigned goals in a limited time. Classical planning-based methods suffer from expensive computation overhead at each step and offer limited expressiveness for complex cooperation strategies. In contrast, reinforcement learning (RL) has recently become a popular approach for addressing this issue. However, RL struggles with low data efficiency and cooperation when directly exploring (nearly) optimal policies in a large exploration space, especially with an increased number of agents(e.g., 10+ agents) or in complex environments (e.g., 3-D simulators). In this paper, we propose the Multi-Agent Scalable Graph-based Planner (MASP), a goal-conditioned hierarchical planner for navigation tasks with a substantial number of agents in the decentralized setting. MASP employs a hierarchical framework to reduce space complexity by decomposing a large exploration space into multiple goal-conditioned subspaces, where a high-level policy assigns agents goals, and a low-level policy navigates agents toward designated goals. For agent cooperation and the adaptation to varying team sizes, we model agents and goals as graphs to better capture their relationship. The high-level policy, the Goal Matcher, leverages a graph-based Self-Encoder and Cross-Encoder to optimize goal assignment by updating the agent and the goal graphs. The low-level policy, the Coordinated Action Executor, introduces the Group Information Fusion to facilitate group division and extract agent relationships across groups, enhancing training efficiency for agent cooperation. The results demonstrate that MASP outperforms RL and planning-based baselines in task efficiency.

References (38)

Summary

The paper introduces a hierarchical framework integrating GNNs to reduce search space and accelerate training for multi-agent navigation.
It details a decentralized Multi-Goal Matcher and a Coordinated Action Executor to enhance agent cooperation and efficient goal assignment.
Empirical results demonstrate near-perfect success rates and strong generalization across diverse, dynamic simulation scenarios.

Introduction

Within the field of multi-agent systems, efficiently navigating autonomous agents toward specific goals, particularly in contexts where multiple entities operate independently, presents an intricate challenge. Classical methods, based in planning, come with their limitations when it comes to computation overhead and flexibility. Reinforcement learning (RL) alternatives offer promise in this area, providing robust representation capabilities; however, these models encounter difficulties with data efficiency and cooperation.

Hierarchical Framework and GNN

The Multi-Agent Scalable GNN-based Planner (MASP) is built around a hierarchical framework that effectively reduces the high-dimensional search space involved in navigation tasks through its division into smaller manageable regions. This structure significantly accelerates the convergence of training and boosts data efficiency. To better facilitate cooperation and goal attainment among agents, MASP integrates Graph Neural Networks (GNN), which enable a deep understanding of the inter-agent relationships and interactions with goals.

MASP is comprised of two key components:

Multi-Goal Matcher (MGM): It employs a decentralized graph matching strategy that assigns the most appropriate goals to agents at each global step.
Coordinated Action Executor (CAE): With a Graph Merger and Goal Encoder, this component captures the essential correlation between agents and their assigned goals, promoting synergistic cooperation.

Experimental Performance

Empirically, MASP demonstrates superior performance compared to existing planning-based methods and RL competitors. In environments like MPE and Omnidrones that accommodate large groups of agents, MASP achieves nearly perfect success rates with minimal steps taken. Notably, in challenging 3D simulations involving up to 20 agents, MASP displays striking generalization abilities, as it performs effectively even in scenarios composed of unseen team sizes.

Conclusion

MASP validates its efficiency in establishing cooperative strategies and its adaptability to complex and dynamic environmental conditions. It does so while also demonstrating strong generalization capabilities and impressive performance in scenarios with large numbers of agents. This makes MASP a compelling approach for decentralized multi-agent navigation tasks and opens avenues for broader applications in multi-agent systems.

PDF Markdown