Towards a World-English Language Model for On-Device Virtual Assistants (2403.18783v1)

Published 27 Mar 2024 in cs.CL

Abstract: Neural Network LLMs (NNLMs) for Virtual Assistants (VAs) are generally language-, region-, and in some cases, device-dependent, which increases the effort to scale and maintain them. Combining NNLMs for one or more of the categories is one way to improve scalability. In this work, we combine regional variants of English to build a ``World English'' NNLM for on-device VAs. In particular, we investigate the application of adapter bottlenecks to model dialect-specific characteristics in our existing production NNLMs {and enhance the multi-dialect baselines}. We find that adapter modules are more effective in modeling dialects than specializing entire sub-networks. Based on this insight and leveraging the design of our production models, we introduce a new architecture for World English NNLM that meets the accuracy, latency, and memory constraints of our single-dialect models.

References (17)

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Tweets

https://twitter.com/javaeeeee1/status/1774435396866175432

https://twitter.com/knishimae0531/status/1773322424798384580

Towards a World-English Language Model for On-Device Virtual Assistants (2403.18783v1)

Summary

Related Papers

Tweets