On the Use of Semantically-Aligned Speech Representations for Spoken Language Understanding (2210.05291v1)
Abstract: In this paper we examine the use of semantically aligned speech representations for end-to-end spoken language understanding (SLU). We employ the recently introduced SAMU-XLSR model, which is designed to generate a single utterance-level embedding that captures semantics and is aligned across different languages. This model combines the acoustic frame-level speech representation learning model (XLS-R) with the Language Agnostic BERT Sentence Embedding (LaBSE) model. We show that substituting the SAMU-XLSR model for the initial XLS-R model significantly improves performance in the end-to-end SLU framework. Finally, we present the benefits of using this model for language portability in SLU.
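To make the architecture described above concrete, here is a minimal sketch of the SAMU-XLSR training idea: pool XLS-R frame-level features into a single utterance embedding and train it to match the frozen LaBSE embedding of the transcript. The model checkpoints, mean pooling, projection layer, and cosine loss below are illustrative assumptions, not the authors' exact recipe.

```python
import torch
import torch.nn as nn
from transformers import Wav2Vec2Model, AutoTokenizer, AutoModel

class SamuXlsrSketch(nn.Module):
    """Hypothetical sketch: speech encoder pooled into LaBSE's sentence space."""

    def __init__(self):
        super().__init__()
        # XLS-R speech encoder producing frame-level representations
        self.speech_encoder = Wav2Vec2Model.from_pretrained(
            "facebook/wav2vec2-xls-r-300m"
        )
        # Project pooled speech features to LaBSE's 768-dim sentence space
        self.proj = nn.Linear(self.speech_encoder.config.hidden_size, 768)

    def forward(self, waveform: torch.Tensor) -> torch.Tensor:
        frames = self.speech_encoder(waveform).last_hidden_state  # (B, T, H)
        # Simple mean pooling stands in here for the paper's pooling mechanism
        pooled = frames.mean(dim=1)
        return nn.functional.normalize(self.proj(pooled), dim=-1)

# Frozen LaBSE text encoder provides the semantic targets
tokenizer = AutoTokenizer.from_pretrained("sentence-transformers/LaBSE")
labse = AutoModel.from_pretrained("sentence-transformers/LaBSE").eval()

def labse_embedding(texts):
    batch = tokenizer(texts, padding=True, return_tensors="pt")
    with torch.no_grad():
        out = labse(**batch)
    # LaBSE exposes its sentence embedding via the pooled [CLS] output
    return nn.functional.normalize(out.pooler_output, dim=-1)

model = SamuXlsrSketch()
speech_emb = model(torch.randn(2, 16000))  # two 1-second dummy waveforms
text_emb = labse_embedding(["hello world", "bonjour le monde"])
# Pull each utterance embedding toward its transcript's LaBSE embedding
loss = 1 - nn.functional.cosine_similarity(speech_emb, text_emb).mean()
```

Because the text targets come from a multilingual sentence encoder, transcripts in any LaBSE-supported language map to the same semantic space, which is what enables the language portability the paper reports for SLU.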