View a PDF of the paper titled MAPF-GPT: Imitation Learning for Multi-Agent Pathfinding at Scale, by Anton Andreychuk and 3 other authors
Abstract:Multi-agent pathfinding (MAPF) is a challenging computational problem that typically requires to find collision-free paths for multiple agents in a shared environment. Solving MAPF optimally is NP-hard, yet efficient solutions are critical for numerous applications, including automated warehouses and transportation systems. Recently, learning-based approaches to MAPF have gained attention, particularly those leveraging deep reinforcement learning. Following current trends in machine learning, we have created a foundation model for the MAPF problems called MAPF-GPT. Using imitation learning, we have trained a policy on a set of pre-collected sub-optimal expert trajectories that can generate actions in conditions of partial observability without additional heuristics, reward functions, or communication with other agents. The resulting MAPF-GPT model demonstrates zero-shot learning abilities when solving the MAPF problem instances that were not present in the training dataset. We show that MAPF-GPT notably outperforms the current best-performing learnable-MAPF solvers on a diverse range of problem instances and is efficient in terms of computation (in the inference mode).
Submission history
From: Alexey Skrynnik [view email]
[v1]
Thu, 29 Aug 2024 12:55:10 UTC (859 KB)
[v2]
Thu, 12 Sep 2024 13:49:00 UTC (1,037 KB)
[v3]
Wed, 25 Sep 2024 13:09:35 UTC (1,037 KB)
Source link
lol