Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Word-Free Spoken Language Understanding for Mandarin-Chinese (2107.00186v1)

Published 1 Jul 2021 in cs.CL, cs.SD, and eess.AS

Abstract: Spoken dialogue systems such as Siri and Alexa provide great convenience to people's everyday life. However, current spoken language understanding (SLU) pipelines largely depend on automatic speech recognition (ASR) modules, which require a large amount of language-specific training data. In this paper, we propose a Transformer-based SLU system that works directly on phones. This acoustic-based SLU system consists of only two blocks and does not require the presence of ASR module. The first block is a universal phone recognition system, and the second block is a Transformer-based LLM for phones. We verify the effectiveness of the system on an intent classification dataset in Mandarin Chinese.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Zhiyuan Guo (13 papers)
  2. Yuexin Li (8 papers)
  3. Guo Chen (107 papers)
  4. Xingyu Chen (98 papers)
  5. Akshat Gupta (41 papers)
Citations (2)

Summary

We haven't generated a summary for this paper yet.