Papers
Topics
Authors
Recent
Detailed Answer
Quick Answer
Concise responses based on abstracts only
Detailed Answer
Well-researched responses based on abstracts and relevant paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses
Gemini 2.5 Flash
Gemini 2.5 Flash 99 tok/s
Gemini 2.5 Pro 43 tok/s Pro
GPT-5 Medium 33 tok/s Pro
GPT-5 High 30 tok/s Pro
GPT-4o 110 tok/s Pro
Kimi K2 207 tok/s Pro
GPT OSS 120B 467 tok/s Pro
Claude Sonnet 4 36 tok/s Pro
2000 character limit reached

Remixing Music for Hearing Aids Using Ensemble of Fine-Tuned Source Separators (2401.06203v2)

Published 11 Jan 2024 in eess.AS, cs.LG, cs.SD, and eess.SP

Abstract: This paper introduces our system submission for the Cadenza ICASSP 2024 Grand Challenge, which presents the problem of remixing and enhancing music for hearing aid users. Our system placed first in the challenge, achieving the best average Hearing-Aid Audio Quality Index (HAAQI) score on the evaluation data set. We describe the system, which uses an ensemble of deep learning music source separators that are fine tuned on the challenge data. We demonstrate the effectiveness of our system through the challenge results and analyze the importance of different system aspects through ablation studies.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (8)
  1. “Overview and results of the ICASSP SP cadenza challenge: Music demixing/remixing for hearing aids,” in 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2024.
  2. “The hearing-aid audio quality index (HAAQI),” IEEE/ACM transactions on audio, speech, and language processing, vol. 24, no. 2, pp. 354–365, 2015.
  3. Alexandre Défossez, “Hybrid spectrogram and waveform source separation,” arXiv preprint arXiv:2111.03600, 2021.
  4. “MUSDB18-HQ - an uncompressed version of musdb18,” Dec. 2019.
  5. “The national acoustic laboratories’(NAL) new procedure for selecting the gain and frequency response of a hearing aid,” Ear and hearing, vol. 7, no. 4, pp. 257–265, 1986.
  6. “Music source separation based on a lightweight deep learning framework (DTTNET: Dual-path TFC-TDF UNet),” 2023.
  7. “KUIELab-MDX-Net: A two-stream neural network for music demixing,” 2021.
  8. Yi Luo and Jianwei Yu, “Music source separation with band-split rnn,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 31, pp. 1893–1901, 2023.
Citations (2)
List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.

Summary

We haven't generated a summary for this paper yet.

Dice Question Streamline Icon: https://streamlinehq.com

Follow-Up Questions

We haven't generated follow-up questions for this paper yet.

Authors (1)

X Twitter Logo Streamline Icon: https://streamlinehq.com

Don't miss out on important new AI/ML research

See which papers are being discussed right now on X, Reddit, and more:

“Emergent Mind helps me see which AI papers have caught fire online.”

Philip

Philip

Creator, AI Explained on YouTube