ProAgent: From Robotic Process Automation to Agentic Process Automation (2311.10751v2)
Abstract: From ancient water wheels to robotic process automation (RPA), automation technology has evolved throughout history to liberate human beings from arduous tasks. Yet, RPA struggles with tasks needing human-like intelligence, especially in elaborate design of workflow construction and dynamic decision-making in workflow execution. As LLMs have emerged human-like intelligence, this paper introduces Agentic Process Automation (APA), a groundbreaking automation paradigm using LLM-based agents for advanced automation by offloading the human labor to agents associated with construction and execution. We then instantiate ProAgent, an LLM-based agent designed to craft workflows from human instructions and make intricate decisions by coordinating specialized agents. Empirical experiments are conducted to detail its construction and execution procedure of workflow, showcasing the feasibility of APA, unveiling the possibility of a new paradigm of automation driven by agents. Our code is public at https://github.com/OpenBMB/ProAgent.
- Towards intelligent robotic process automation for bpmers. arXiv preprint arXiv:2001.00804, 2020.
- Do as i can, not as i say: Grounding language in robotic affordances. ArXiv preprint, abs/2204.01691, 2022.
- Graph of thoughts: Solving elaborate problems with large language models. arXiv preprint arXiv:2308.09687, 2023.
- Large language models as tool makers. arXiv preprint arXiv:2305.17126, 2023.
- D3ba: a tool for optimizing business processes using non-deterministic planning. In Business Process Management Workshops: BPM 2020 International Workshops, Seville, Spain, September 13–18, 2020, Revised Selected Papers 18, pp. 181–193. Springer, 2020a.
- From robotic process automation to intelligent process automation: –emerging trends–. In Business Process Management: Blockchain and Robotic Process Automation Forum: BPM 2020 Blockchain and RPA Forum, Seville, Spain, September 13–18, 2020, Proceedings 18, pp. 215–228. Springer, 2020b.
- Agentverse: Facilitating multi-agent collaboration and exploring emergent behaviors in agents. arXiv preprint arXiv:2308.10848, 2023.
- Yiru Chen. Monte carlo tree search for generating interactive data analysis interfaces. In Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data, pp. 2837–2839, 2020.
- Mary Cummings. Automation bias in intelligent time critical decision support systems. In AIAA 1st intelligent systems technical conference, pp. 6313, 2004.
- On the evaluation of intelligent process automation. arXiv preprint arXiv:2001.02639, 2020.
- Automation bias: a systematic review of frequency, effect mediators, and mitigators. Journal of the American Medical Informatics Association, 19(1):121–127, 2012.
- Automatic business process structure discovery using ordered neurons lstm: a preliminary study. arXiv preprint arXiv:2001.01243, 2020.
- Reasoning with language model is planning with world model. arXiv preprint arXiv:2305.14992, 2023.
- Robotic process automation. Electronic markets, 30(1):99–106, 2020.
- Language models as zero-shot planners: Extracting actionable knowledge for embodied agents. In Kamalika Chaudhuri, Stefanie Jegelka, Le Song, Csaba Szepesvári, Gang Niu, and Sivan Sabato (eds.), International Conference on Machine Learning, ICML 2022, 17-23 July 2022, Baltimore, Maryland, USA, volume 162 of Proceedings of Machine Learning Research, pp. 9118–9147. PMLR, 2022.
- Robotic process automation: systematic literature review. In Business Process Management: Blockchain and Central and Eastern Europe Forum: BPM 2019 Blockchain and CEE Forum, Vienna, Austria, September 1–6, 2019, Proceedings 17, pp. 280–295. Springer, 2019.
- Survey of hallucination in natural language generation. ACM Computing Surveys, 55(12):1–38, 2023.
- Robotic process and cognitive automation: the next phase. SB Publishing, 2018.
- Automated discovery of data transformations for robotic process automation. arXiv preprint arXiv:2001.01007, 2020.
- Interactive task and concept learning from natural language instructions and gui demonstrations. arXiv preprint arXiv:1909.00031, 2019.
- On faithfulness and factuality in abstractive summarization. arXiv preprint arXiv:2005.00661, 2020.
- Multipurpose intelligent process automation via conversational assistant. arXiv preprint arXiv:2001.02284, 2020.
- n8n. n8n.io - a powerful workflow automation tool. URL https://n8n.io/.
- Webgpt: Browser-assisted question-answering with human feedback. ArXiv preprint, abs/2112.09332, 2021.
- OpenAI. OpenAI: Introducing ChatGPT, 2022. URL https://openai.com/blog/chatgpt.
- OpenAI. Gpt-4 technical report, 2023.
- Generative agents: Interactive simulacra of human behavior. arXiv preprint arXiv:2304.03442, 2023.
- Gorilla: Large language model connected with massive apis. arXiv preprint arXiv:2305.15334, 2023.
- Communicative agents for software development. arXiv preprint arXiv:2307.07924, 2023a.
- Creator: Disentangling abstract and concrete reasonings of large language models through tool creation. arXiv preprint arXiv:2305.14318, 2023b.
- Webcpm: Interactive web search for chinese long-form question answering. arXiv preprint arXiv:2305.06849, 2023a.
- Tool learning with foundation models. arXiv preprint arXiv:2304.08354, 2023b.
- Toolllm: Facilitating large language models to master 16000+ real-world apis. arXiv preprint arXiv:2307.16789, 2023c.
- Business process automation. ARIS in practice, 2004.
- Toolformer: Language models can teach themselves to use tools. ArXiv preprint, abs/2302.04761, 2023.
- Algorithm of thoughts: Enhancing exploration of ideas in large language models. arXiv preprint arXiv:2308.10379, 2023.
- Reflexion: Language agents with verbal reinforcement learning, 2023.
- Cognitive architectures for language agents. arXiv preprint arXiv:2309.02427, 2023.
- A review of business process mining: state-of-the-art and future trends. Business Process Management Journal, 14(1):5–22, 2008.
- Process mining: from theory to practice. Business Process Management Journal, 18(3):493–512, 2012.
- unipath. The uipath business automation platform. URL https://www.uipath.com/.
- Wil Van Der Aalst. Process mining: Overview and opportunities. ACM Transactions on Management Information Systems (TMIS), 3(2):1–17, 2012.
- Voyager: An open-ended embodied agent with large language models. arXiv preprint arXiv:2305.16291, 2023a.
- A survey on large language model based autonomous agents. arXiv preprint arXiv:2308.11432, 2023b.
- Emergent abilities of large language models. arXiv preprint arXiv:2206.07682, 2022.
- Robotic process automation–a systematic literature review and assessment framework. arXiv preprint arXiv:2012.11951, 2020.
- The rise and potential of large language model based agents: A survey. arXiv preprint arXiv:2309.07864, 2023.
- Webshop: Towards scalable real-world web interaction with grounded language agents. Advances in Neural Information Processing Systems, 35:20744–20757, 2022a.
- React: Synergizing reasoning and acting in language models. ArXiv preprint, abs/2210.03629, 2022b.
- Tree of thoughts: Deliberate problem solving with large language models. arXiv preprint arXiv:2305.10601, 2023.
- Large language model as autonomous decision maker. arXiv preprint arXiv:2308.12519, 2023.
- Zapier. Zapier — automation makes you move forward. URL https://zapier.com/.
- Siren’s song in the ai ocean: A survey on hallucination in large language models. arXiv preprint arXiv:2309.01219, 2023.