FlowMind: Automatic Workflow Generation with LLMs (2404.13050v1)
Abstract: The rapidly evolving field of Robotic Process Automation (RPA) has made significant strides in automating repetitive processes, yet its effectiveness diminishes when users demand spontaneous or unpredictable tasks. This paper introduces FlowMind, a novel approach that leverages large language models (LLMs) such as the Generative Pretrained Transformer (GPT) to address this limitation and build an automatic workflow generation system. In FlowMind, we propose a generic prompt recipe for a lecture that helps ground LLM reasoning with reliable Application Programming Interfaces (APIs). With this, FlowMind not only mitigates the common issue of hallucinations in LLMs, but also eliminates direct interaction between LLMs and proprietary data or code, thus ensuring the integrity and confidentiality of information, a cornerstone in financial services. FlowMind further simplifies user interaction by presenting high-level descriptions of auto-generated workflows, enabling users to inspect them and provide feedback effectively. We also introduce NCEN-QA, a new dataset in finance for benchmarking question-answering tasks from N-CEN reports on funds. We use NCEN-QA to evaluate the performance of workflows generated by FlowMind against baseline and ablation variants of FlowMind. We demonstrate the success of FlowMind, the importance of each component in the proposed lecture recipe, and the effectiveness of user interaction and feedback in FlowMind.
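The idea of grounding the LLM with API descriptions, while keeping proprietary data and code away from it, can be illustrated with a minimal sketch. The abstract does not spell out the lecture recipe, so everything below is an assumption made for illustration: the three-part prompt layout (context, API list, task), the helper names `describe_api` and `build_lecture_prompt`, and the two fund-related functions are hypothetical and are not FlowMind's actual interface. The point shown is only that the prompt carries function names, signatures, and docstrings, never function bodies or data.

```python
import inspect


def get_fund_names(query: str) -> list:
    """Return names of funds that match the query string."""
    # Hypothetical API: the body would run on the trusted side only.
    raise NotImplementedError


def get_report(fund_name: str) -> dict:
    """Return the N-CEN report fields for one fund."""
    # Hypothetical API: the body would run on the trusted side only.
    raise NotImplementedError


def describe_api(func) -> str:
    """Render one API as 'name(signature): docstring', without its body."""
    signature = inspect.signature(func)
    doc = inspect.getdoc(func) or ""
    return f"- {func.__name__}{signature}: {doc}"


def build_lecture_prompt(context: str, apis: list, task: str) -> str:
    """Assemble a prompt from a context, API descriptions, and a task.

    Assumed layout for illustration; not the paper's exact recipe.
    """
    api_lines = "\n".join(describe_api(f) for f in apis)
    return (
        f"Context: {context}\n"
        f"You may call only these functions:\n{api_lines}\n"
        f"Write Python code that uses them to answer: {task}\n"
    )


if __name__ == "__main__":
    print(
        build_lecture_prompt(
            context="You answer questions about funds from N-CEN reports.",
            apis=[get_fund_names, get_report],
            task="Who is the custodian of a given fund?",
        )
    )
```

In a setup like this, the generated code would be executed locally against the real functions, so the model sees only their descriptions.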