Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Surge Routing: Event-informed Multiagent Reinforcement Learning for Autonomous Rideshare (2307.02637v2)

Published 5 Jul 2023 in cs.AI, cs.MA, and cs.RO

Abstract: Large events such as conferences, concerts and sports games, often cause surges in demand for ride services that are not captured in average demand patterns, posing unique challenges for routing algorithms. We propose a learning framework for an autonomous fleet of taxis that leverages event data from the internet to predict demand surges and generate cooperative routing policies. We achieve this through a combination of two major components: (i) a demand prediction framework that uses textual event information in the form of events' descriptions and reviews to predict event-driven demand surges over street intersections, and (ii) a scalable multiagent reinforcement learning framework that leverages demand predictions and uses one-agent-at-a-time rollout combined with limited sampling certainty equivalence to learn intersection-level routing policies. For our experimental results we consider real NYC ride share data for the year 2022 and information for more than 2000 events across 300 unique venues in Manhattan. We test our approach with a fleet of 100 taxis on a map with 2235 street intersections. Our experimental results demonstrate that our method learns routing policies that reduce wait time overhead per serviced request by 25% to 75%, while picking up 1% to 4% more requests than other model-based RL frameworks and classical methods in operations research.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (62)
  1. Deep Reinforcement Learning for Crowdsourced Urban Delivery. Transportation Research Part B: Methodological 152 (2021), 227–257. https://doi.org/10.1016/j.trb.2021.08.015
  2. ASHRAE. 2013. ANSI/ASHRAE/IES Standard 90.1-2013: Energy Standard for Buildings Except Low-Rise Residential Buildings.
  3. Scalable Attentive Sentence-Pair Modeling via Distilled Sentence Embedding. arXiv:1908.05161 [cs.LG]
  4. Russell Bent and Pascal Van Hentenryck. 2004. The Value of Consensus in Online Stochastic Scheduling.. In Proceedings of the 14th International Conference on Automated Planning and Scheduling, ICAPS 2004. American Association for Artificial Intelligence, Providence, RI, 219–226.
  5. D.P. Bertsekas. 1979. A Distributed Algorithm for the Assignment Problem. Lab for Information and Decision Systems Report (05 1979).
  6. Dimitri Bertsekas. 2020a. Multiagent value iteration algorithms in dynamic programming and reinforcement learning. Results in Control and Optimization 1 (2020), 100003. https://doi.org/10.1016/j.rico.2020.100003
  7. D. Bertsekas. 2020b. Rollout, Policy Iteration, and Distributed Reinforcement Learning. Athena Scientific. https://books.google.com/books?id=Hbo-EAAAQBAJ
  8. Dimitri Bertsekas. 2021. Multiagent Reinforcement Learning: Rollout and Policy Iteration. IEEE/CAA Journal of Automatica Sinica 8, 2 (2021), 249–272. https://doi.org/10.1109/JAS.2021.1003814
  9. Dimitri Bertsekas. 2022. Lessons from AlphaZero for Optimal, Model Predictive, and Adaptive Control. Athena Scientific, Nashua, NH, USA.
  10. D.P. Bertsekas and D.A. Castanon. 1998. Rollout algorithms for stochastic scheduling problems. In Proceedings of the 37th IEEE Conference on Decision and Control (Cat. No.98CH36171), Vol. 2. 2143–2148 vol.2. https://doi.org/10.1109/CDC.1998.758655
  11. Online Vehicle Routing: The Edge of Optimization in Large-Scale Applications. Oper. Res. 67 (2019), 143–162.
  12. Multiagent Rollout and Policy Iteration for POMDP with Application to Multi-Robot Repair Problems. In Proceedings of the 2020 Conference on Robot Learning (Proceedings of Machine Learning Research, Vol. 155), Jens Kober, Fabio Ramos, and Claire Tomlin (Eds.). PMLR, 1814–1828. https://proceedings.mlr.press/v155/bhattacharya21a.html
  13. BERT-Based Deep Spatial-Temporal Network for Taxi Demand Prediction. IEEE Transactions on Intelligent Transportation Systems 23, 7 (2022), 9442–9454. https://doi.org/10.1109/TITS.2021.3122114
  14. Universal Sentence Encoder. arXiv:1803.11175 [cs.CL]
  15. Short-Term Prediction of Demand for Ride-Hailing Services: A Deep Learning Approach. Journal of Big Data Analytics in Transportation 3 (08 2021). https://doi.org/10.1007/s42421-021-00041-4
  16. Solving the first-mile ridesharing problem using autonomous vehicles. Computer-Aided Civil and Infrastructure Engineering 35, 1 (2020), 45–60. https://doi.org/10.1111/mice.12461 arXiv:https://onlinelibrary.wiley.com/doi/pdf/10.1111/mice.12461
  17. Understanding Shared Autonomous Vehicle Preferences: A Comparison between Shuttles, Buses, Ridesharing and Taxis. Sustainability 14, 20 (Oct 2022), 13656. https://doi.org/10.3390/su142013656
  18. NYC Taxi & Limousine Commission. 2020-2022. TLC Trip Record Data. https://www.nyc.gov/site/tlc/about/tlc-trip-record-data.page
  19. COMNET. 2016. Factsheet COMNET Overview - Appendix C: Schedules. https://www.comnet.org/appendix-c-schedules
  20. Supervised Learning of Universal Sentence Representations from Natural Language Inference Data. arXiv:1705.02364 [cs.CL]
  21. G. A. Croes. 1958. A Method for Solving Traveling-Salesman Problems. Operations Research 6, 6 (1958), 791–812. http://www.jstor.org/stable/167074
  22. Simple, direct and efficient multi-way spectral clustering. Information and Inference: A Journal of the IMA 8, 1 (06 2018), 181–203. https://doi.org/10.1093/imaiai/iay008 arXiv:https://academic.oup.com/imaiai/article-pdf/8/1/181/28053156/iay008.pdf
  23. Reinforcement Learning with Combinatorial Actions: An Application to Vehicle Routing. In Advances in Neural Information Processing Systems, H. Larochelle, M. Ranzato, R. Hadsell, M.F. Balcan, and H. Lin (Eds.), Vol. 33. Curran Associates, Inc., 609–620.
  24. Ran Duan and Seth Pettie. 2014. Linear-Time Approximation for Maximum Weight Matching. J. ACM 61, 1, Article 1 (2014), 23 pages. https://doi.org/10.1145/2529989
  25. A SPECTRAL CLUSTERING-BASED APPROACH FOR SENTIMENT CLASSIFICATION IN MODERN STANDARD ARABIC. In Proceedings of the International Conferences Big Data Analytics, Data Mining and Computational Intelligence 2018.
  26. Hybrid multi-agent deep reinforcement learning for autonomous mobility on demand systems. In Learning for Dynamics and Control Conference. PMLR, 1284–1296.
  27. Günes Erkan and Dragomir R Radev. 2004. The university of michigan at duc 2004. In Proceedings of the Document Understanding Conferences Boston, MA. Citeseer.
  28. J. Farhan and T. Donna Chen. 2018. Impact of ridesharing on operational efficiency of shared autonomous electric vehicle fleet. Transportation Research Part C: Emerging Technologies 93 (08 2018), 310–321. https://doi.org/10.1016/j.trc.2018.04.022
  29. Graph meta-reinforcement learning for transferable autonomous mobility-on-demand. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2913–2923.
  30. Multiagent Reinforcement Learning for Autonomous Routing and Pickup Problem with Adaptation to Variable Demand. International Conference on Robotics and Automation (2023).
  31. Data-driven occupancy schedules for commercial buildings. https://doi.org/10.3929/ethz-b-000341619
  32. ERA5 hourly data on single levels from 1959 to present. https://doi.org/10.24381/cds.adbb2d47
  33. An Optimal Algorithm for On-Line Bipartite Matching. In Proceedings of the Twenty-Second Annual ACM Symposium on Theory of Computing (Baltimore, Maryland, USA) (STOC ’90). Association for Computing Machinery, New York, NY, USA, 352–358. https://doi.org/10.1145/100216.100262
  34. Prediction of urban human mobility using large-scale taxi traces and its applications. Frontiers of Computer Science in China 6 (02 2012), 111–121. https://doi.org/10.1007/s11704-011-1192-6
  35. Contextualized Spatial–Temporal Network for Taxi Origin-Destination Demand Prediction. IEEE Transactions on Intelligent Transportation Systems 20, 10 (2019), 3875–3887. https://doi.org/10.1109/TITS.2019.2915525
  36. RoBERTa: A Robustly Optimized BERT Pretraining Approach. ArXiv abs/1907.11692 (2019).
  37. Online spatio-temporal matching in stochastic and dynamic domains. Artificial Intelligence 261 (2018), 71–112. https://doi.org/10.1016/j.artint.2018.04.005
  38. Predicting taxi demand hotspots using automated Internet Search Queries. Transportation Research Part C: Emerging Technologies 102 (2019), 73–86. https://doi.org/10.1016/j.trc.2019.03.001
  39. Is Travel Demand Actually Deep? An Application in Event Areas Using Semantic Information. IEEE Transactions on Intelligent Transportation Systems 21, 2 (2020), 641–652. https://doi.org/10.1109/TITS.2019.2897341
  40. Data-Driven Robust Taxi Dispatch under Demand Uncertainties. arXiv:1603.06263 [cs.SY]
  41. Melanie Mitchell. 1998. An Introduction to Genetic Algorithms. MIT Press, Cambridge, MA, USA.
  42. Predicting Taxi–Passenger Demand Using Streaming Data. IEEE Transactions on Intelligent Transportation Systems 14, 3 (2013), 1393–1402. https://doi.org/10.1109/TITS.2013.2262376
  43. J. Muñoz Sabater. 2019. ERA5-Land hourly data from 1981 to present. https://doi.org/10.24381/cds.e2161bac
  44. Reinforcement Learning for Solving the Vehicle Routing Problem. In Advances in Neural Information Processing Systems, S. Bengio, H. Wallach, H. Larochelle, K. Grauman, N. Cesa-Bianchi, and R. Garnett (Eds.), Vol. 31. Curran Associates, Inc. https://proceedings.neurips.cc/paper_files/paper/2018/file/9fb4651c05b2ed70fba5afe0b039a550-Paper.pdf
  45. Short-term Demand Forecasting for Online Car-hailing Services Using Recurrent Neural Networks. Applied Artificial Intelligence 34 (2019), 674 – 689.
  46. Open-Meteo. 2022. Historical Weather API. https://open-meteo.com/en/docs/historical-weather-api
  47. Data driven occupancy information for energy simulation and energy use assessment in residential buildings. Energy 218 (2021), 119539. https://doi.org/10.1016/j.energy.2020.119539
  48. Deep reinforcement learning in transportation research: A review. Transportation Research Interdisciplinary Perspectives 11 (2021), 100425. https://doi.org/10.1016/j.trip.2021.100425
  49. Google Maps Platform. 2022. Nearby Search API. https://developers.google.com/maps/documentation/places/web-service/search-nearby
  50. PredictHQ. 2022. Attended events API. https://www.predicthq.com/apis/event-api
  51. Combining time-series and textual data for taxi demand prediction in event areas: A deep learning approach. Information Fusion 49 (2019), 120–129. https://doi.org/10.1016/j.inffus.2018.07.007
  52. SerAPI. 2022. Google Maps API. https://serpapi.com/google-maps-api
  53. The Riemannian Geometry of Deep Generative Models. arXiv:1711.08014 [cs.LG]
  54. David Silver and Joel Veness. 2010. Monte-Carlo Planning in Large POMDPs. In Proc. 23rd International Conf. on NeurIPS (Vancouver, British Columbia, Canada). Red Hook, NY, USA, 2164–2172.
  55. Learning General Purpose Distributed Sentence Representations via Large Scale Multi-task Learning. arXiv:1804.00079 [cs.CL]
  56. Offline–Online Approximate Dynamic Programming for Dynamic Vehicle Routing with Stochastic Requests. Transportation Science 53, 1 (2019), 185–202. https://doi.org/10.1287/trsc.2017.0767 arXiv:https://doi.org/10.1287/trsc.2017.0767
  57. Barry Brian Werger and Maja J Matarić. 2000. Broadcast of local eligibility for multi-target observation. In Distributed Autonomous Robotic Systems 4. Springer, 347–356.
  58. Universal Sentence Representation Learning with Conditional Masked Language Model. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, Online and Punta Cana, Dominican Republic, 6216–6228. https://doi.org/10.18653/v1/2021.emnlp-main.502
  59. Local Search in Combinatorial Optimization. Princeton University Press. http://www.jstor.org/stable/j.ctv346t9c
  60. Tripartite graph clustering for dynamic sentiment analysis on social media. In Proceedings of the 2014 ACM SIGMOD international conference on Management of data. 1531–1542.
  61. Graph-based informative-sentence selection for opinion summarization. In Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining. 408–412.
  62. The State-of-the-Art in Twitter Sentiment Analysis: A Review and Benchmark Evaluation. ACM Trans. Manage. Inf. Syst. 9, 2, Article 5 (aug 2018), 29 pages. https://doi.org/10.1145/3185045
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (2)
  1. Daniel Garces (7 papers)
  2. Stephanie Gil (35 papers)

Summary

We haven't generated a summary for this paper yet.