Open Assistant Toolkit -- version 2 (2403.00586v1)
Abstract: We present the second version of the Open Assistant Toolkit (OAT-v2), an open-source task-oriented conversational system for composing generative neural models. OAT-v2 is a scalable and flexible assistant platform supporting multiple domains and modalities of user interaction. It splits processing a user utterance into modular system components, including submodules such as action code generation, multimodal content retrieval, and knowledge-augmented response generation. Developed over multiple years of the Alexa TaskBot challenge, OAT-v2 is a proven system that enables scalable and robust experimentation in experimental and real-world deployment. OAT-v2 provides open models and software for research and commercial applications to enable the future of multimodal virtual assistants across diverse applications and types of rich interaction.
- Alexa, let’s work together: Introducing the second alexa prize taskbot challenge. Alexa Prize TaskBot Challenge, 2.
- Genie: A generator of natural language semantic parsers for virtual assistant commands. In Proceedings of the 40th ACM SIGPLAN Conference on Programming Language Design and Implementation, pages 394–410.
- Twiz: The wizard of multimodal conversational-stimulus. In Alexa Prize TaskBot Challenge 2 Proceedings.
- Vilt: Video instructions linking for complex tasks. In Proceedings of the 2nd International Workshop on Interactive Multimedia Retrieval, pages 41–47.
- Grillbot in practice: Lessons and tradeoffs deploying large language models for adaptable conversational task assistants.
- Grillbot-v2: Generative models for multi-modal task-oriented assistance. Alexa Prize TaskBot Challenge, 2.
- Carlos Gemmell and Jeffrey Dalton. 2023. Generate, transform, answer: Question specific tool synthesis for tabular data. arXiv preprint arXiv:2303.10138.
- Grillbot: A flexible conversational agent for solving complex real-world tasks. Alexa Prize TaskBot Challenge, 1.
- Alexa, let’s work together: Introducing the first alexa prize taskbot challenge on conversational task assistance. Alexa Prize TaskBot Challenge, 1.
- OpenAI. 2022. Chatgpt: Optimizing language models for dialogue.
- Learning transferable visual models from natural language supervision. In International conference on machine learning, pages 8748–8763. PMLR.
- Nils Reimers and Iryna Gurevych. 2019. Sentence-bert: Sentence embeddings using siamese bert-networks. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics.
- High-resolution image synthesis with latent diffusion models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 10684–10695.
- Stanford alpaca: An instruction-following llama model. https://github.com/tatsu-lab/stanford_alpaca.
- Llama: Open and efficient foundation language models. arXiv preprint arXiv:2302.13971.
- Pydial: A multi-domain statistical dialogue system toolkit. In Proceedings of ACL 2017, System Demonstrations, pages 73–78.
- Deeppavlov dream: platform for building generative ai assistants. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), pages 599–607.
Collections
Sign up for free to add this paper to one or more collections.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.