A Data Efficient Framework for Learning Local Heuristics (2404.06728v2)
Abstract: With the advent of machine learning, there have been several recent attempts to learn effective and generalizable heuristics. Local Heuristic A* (LoHA*) is one recent method that, instead of learning the entire heuristic estimate, learns a "local" residual heuristic that estimates the cost to escape a region (Veerapaneni et al. 2023). LoHA*, like other supervised learning methods, collects a dataset of target values by querying an oracle on many planning problems (in this case, local planning problems). This data collection process can become slow as the size of the local region increases or if the domain requires expensive collision checks. Our main insight is that when an A* search solves a start-goal planning problem, it inherently ends up solving multiple local planning problems. We exploit this observation to propose an efficient data collection framework that does less than 1/10th of the work (measured by expansions) that baselines need to collect the same amount of data. This idea also enables us to run LoHA* in an online manner, where we iteratively collect data and improve our model while solving relevant start-goal tasks. We demonstrate the performance of our data collection and online framework on a 4D $(x, y, \theta, v)$ navigation domain.
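The observation that a single start-goal A* search already contains many local problems can be illustrated with a toy sketch. Everything here is an assumption made for illustration: the 2D 4-connected grid with unit costs, the square window of radius `radius`, and the helper names `astar` and `local_escape_samples`. It is not the paper's 4D domain, and it is not necessarily the extraction rule or training target the authors use. For each state on the returned path, the sketch records the cost accrued along that path before it first leaves the window centred on that state.

```python
import heapq


def astar(grid, start, goal):
    """Unit-cost, 4-connected A* with a Manhattan heuristic.

    grid[r][c] is truthy for blocked cells. Returns the path as a list
    of (row, col) cells, or None if the goal is unreachable.
    """
    rows, cols = len(grid), len(grid[0])

    def h(s):
        return abs(s[0] - goal[0]) + abs(s[1] - goal[1])

    g = {start: 0}
    parent = {start: None}
    open_list = [(h(start), 0, start)]
    closed = set()
    while open_list:
        _, g_s, s = heapq.heappop(open_list)
        if s in closed:
            continue  # stale queue entry
        closed.add(s)
        if s == goal:
            break
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            n = (s[0] + dr, s[1] + dc)
            if not (0 <= n[0] < rows and 0 <= n[1] < cols):
                continue
            if grid[n[0]][n[1]]:
                continue
            if n not in g or g_s + 1 < g[n]:
                g[n] = g_s + 1
                parent[n] = s
                heapq.heappush(open_list, (g[n] + h(n), g[n], n))
    if goal not in closed:
        return None
    path, s = [], goal
    while s is not None:
        path.append(s)
        s = parent[s]
    return path[::-1]


def local_escape_samples(path, radius):
    """Return (state, exit_state, cost) triples taken from one solved path.

    exit_state is the first later path state outside the square window of
    the given radius around state; cost is the path cost between them.
    States whose window is never left (near the goal) yield no sample.
    """
    samples = []
    for i, s in enumerate(path):
        for j in range(i + 1, len(path)):
            t = path[j]
            if max(abs(t[0] - s[0]), abs(t[1] - s[1])) > radius:
                samples.append((s, t, j - i))  # unit costs: cost == steps
                break
    return samples


if __name__ == "__main__":
    grid = [
        [0, 0, 0, 0, 0],
        [0, 1, 1, 1, 0],
        [0, 0, 0, 0, 0],
    ]
    path = astar(grid, (0, 0), (2, 4))
    for sample in local_escape_samples(path, radius=1):
        print(sample)
```

One search call yields several local samples here, whereas an oracle-based pipeline would issue a separate local planning query for each of them. Note that the recorded value is the cost along the global path, which is only one feasible way to leave the window.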
- Multi-Heuristic A*. In Fox, D.; Kavraki, L. E.; and Kurniawati, H., eds., Robotics: Science and Systems X, University of California, Berkeley, USA, July 12-16, 2014.
- Hindsight Experience Replay. In Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, December 4-9, 2017, Long Beach, CA, USA, 5048–5058.
- Learning Heuristic Search via Imitation. CoRR, abs/1707.03034.
- The Closed List is an Obstacle Too. In Ma, H.; and Serina, I., eds., Proceedings of the Fourteenth International Symposium on Combinatorial Search, SOCS 2021, Virtual Conference [Jinan, China], July 26-30, 2021, 121–125. AAAI Press.
- Learning heuristic functions for large state spaces. Artificial Intelligence, 175(16): 2075–2098.
- Speeding Up Search-Based Motion Planning using Expansion Delay Heuristics. Proceedings of the International Conference on Automated Planning and Scheduling, 31(1): 528–532.
- Learning Heuristic A*: Efficient Graph Search using Neural Network. In 2020 IEEE International Conference on Robotics and Automation (ICRA), 9542–9547.
- Korf, R. E. 1990. Real-time heuristic search. Artificial Intelligence, 42(2): 189–211.
- Studies in Semi-Admissible Heuristics. IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI-4(4): 392–399.
- Learning Heuristic Functions for Mobile Robot Path Planning Using Deep Neural Networks. Proceedings of the International Conference on Automated Planning and Scheduling, 29(1): 764–772.
- Learning Local Heuristics for Search-Based Navigation Planning. In Proceedings of the Thirty-Third International Conference on Automated Planning and Scheduling, July 8-13, 2023, Prague, Czech Republic, 634–638. AAAI Press.
Authors: Rishi Veerapaneni, Jonathan Park, Muhammad Suhail Saleem, Maxim Likhachev