Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
167 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Overview Of The 2023 Icassp Sp Clarity Challenge: Speech Enhancement For Hearing Aids (2311.14490v1)

Published 24 Nov 2023 in cs.SD and eess.AS

Abstract: This paper reports on the design and outcomes of the ICASSP SP Clarity Challenge: Speech Enhancement for Hearing Aids. The scenario was a listener attending to a target speaker in a noisy, domestic environment. There were multiple interferers and head rotation by the listener. The challenge extended the second Clarity Enhancement Challenge (CEC2) by fixing the amplification stage of the hearing aid; evaluating with a combined metric for speech intelligibility and quality; and providing two evaluation sets, one based on simulation and the other on real-room measurements. Five teams improved on the baseline system for the simulated evaluation set, but the performance on the measured evaluation set was much poorer. Investigations are on-going to determine the exact cause of the mismatch between the simulated and measured data sets. The presence of transducer noise in the measurements, lower order Ambisonics harming the ability for systems to exploit binaural cues and the differences between real and simulated room impulse responses are suggested causes

Definition Search Book Streamline Icon: https://streamlinehq.com
References (7)
  1. Fei Ge, “Brief review of recent researches in speech enhancement from filters to neural networks,” in Proc. Inter. Conf. on Computing and Data Science (CDS), 2020, pp. 260–264.
  2. “L3DAS22 challenge: Machine learning for 3D audio signal processing,” in ICASSP. IEEE, 2022.
  3. “The 2nd clarity enhancement challenge for hearing aid speech intelligibility enhancement: overview and outcomes,” in ICASSP. IEEE, 2023.
  4. “The hearing-aid speech perception index (HASPI),” Speech Communication, vol. 65, pp. 75–93, 2014.
  5. “The hearing-aid speech quality index (HASQI) version 2,” Journal of the Audio Engineering Society, vol. 62, pp. 99–117, 2014.
  6. “The National Acoustic Laboratories’(NAL) new procedure for selecting the gain and frequency response of a hearing aid,” Ear and hearing, vol. 7, no. 4, pp. 257–265, 1986.
  7. “Dataset of British English speech recordings for psychoacoustics and speech processing research: The Clarity speech corpus,” Data in Brief, vol. 41, no. 107951, Apr. 2022.
Citations (8)

Summary

We haven't generated a summary for this paper yet.