
Speech Signal Improvement Using Causal Generative Diffusion Models (2303.08674v1)

Published 15 Mar 2023 in eess.AS and cs.SD

Abstract: In this paper, we present a causal speech signal improvement system that is designed to handle different types of distortions. The method is based on a generative diffusion model which has been shown to work well in scenarios with missing data and non-linear corruptions. To guarantee causal processing, we modify the network architecture of our previous work and replace global normalization with causal adaptive gain control. We generate diverse training data containing a broad range of distortions. This work was performed in the context of an "ICASSP Signal Processing Grand Challenge" and submitted to the non-real-time track of the "Speech Signal Improvement Challenge 2023", where it was ranked fifth.
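The abstract's key architectural point is that global normalization requires the whole signal, so it cannot be used in a causal system. The following is a minimal sketch of that distinction only, not the authors' implementation: the function names, the exponential power tracker, and the parameters `target_rms`, `alpha`, and `eps` are all illustrative assumptions, since the abstract does not specify how the adaptive gain control is computed.

```python
import math


def global_normalize(x):
    """Non-causal baseline: scales by the peak of the WHOLE signal,
    so every output sample depends on future input samples."""
    peak = max(abs(v) for v in x) or 1.0
    return [v / peak for v in x]


def causal_agc(x, target_rms=0.1, alpha=0.99, eps=1e-8):
    """Hypothetical causal gain control: the gain at sample n is derived
    from a running power estimate over samples 0..n only."""
    power = 0.0
    out = []
    for n, v in enumerate(x):
        # Exponential moving average of the instantaneous power.
        power = alpha * power + (1.0 - alpha) * v * v
        # Bias correction so the estimate is not too small at start-up.
        estimate = power / (1.0 - alpha ** (n + 1))
        gain = target_rms / math.sqrt(estimate + eps)
        out.append(v * gain)
    return out


# Two signals that share the same past but differ in the future.
quiet_future = [0.5] * 10 + [0.0] * 10
loud_future = [0.5] * 10 + [1.0] * 10

# Causal processing: the first 10 outputs are identical for both signals.
same_past = causal_agc(quiet_future)[:10] == causal_agc(loud_future)[:10]

# Global normalization: the first 10 outputs change with the future samples.
differs = global_normalize(quiet_future)[:10] != global_normalize(loud_future)[:10]
```

In this sketch, the causal variant produces the same output for a shared signal prefix regardless of what follows, which is the property a streaming system needs; the global variant does not.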

Authors (6)
  1. Julius Richter (20 papers)
  2. Simon Welker (22 papers)
  3. Jean-Marie Lemercier (19 papers)
  4. Bunlong Lay (9 papers)
  5. Tal Peer (11 papers)
  6. Timo Gerkmann (70 papers)
Citations (4)
