From Weak to Strong Sound Event Labels using Adaptive Change-Point Detection and Active Learning (2403.08525v2)
Abstract: We propose an adaptive change-point detection method (A-CPD) for machine-guided weak-label annotation of audio recording segments. The goal is to maximize the amount of information gained about the temporal activations of the target sounds. For each unlabeled audio recording, we use a prediction model to derive a probability curve that guides annotation. The prediction model is initially pre-trained on available annotated sound event data with classes that are disjoint from the classes in the unlabeled dataset, and then gradually adapts to the annotations provided by the annotator in an active learning loop. Using change-point detection on these probabilities, we derive query segments that guide the weak-label annotator towards strong labels. We show that it is possible to derive strong labels of high quality with a limited annotation budget, and report favorable results for A-CPD compared to two baseline query segment strategies.
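To make the query construction concrete, below is a minimal sketch of how query segments could be derived from the model's probability curve via change-point detection. It is an illustrative approximation under stated assumptions, not the paper's implementation: the function and parameter names (`derive_query_segments`, `num_segments`, `min_separation`) are invented here, and the paper's change-point criterion and budget handling may differ.

```python
import numpy as np

def derive_query_segments(probs, times, num_segments=7, min_separation=5):
    """Sketch of change-point-based query segment derivation.

    probs: per-frame target-class probabilities from the prediction model.
    times: frame timestamps in seconds (same length as probs).
    num_segments: annotation budget, i.e. number of query segments.
    min_separation: minimum distance (in frames) between change points.
    """
    # Change-point strength: absolute first difference of the curve.
    delta = np.abs(np.diff(probs))

    # Greedily pick the most prominent change points, suppressing close
    # neighbours so boundaries do not cluster around a single event edge.
    change_points = []
    for idx in np.argsort(delta)[::-1]:
        if len(change_points) == num_segments - 1:
            break
        if all(abs(idx - cp) >= min_separation for cp in change_points):
            change_points.append(idx)

    # Consecutive boundaries (plus recording start/end) define the segments
    # that are presented to the annotator for weak (presence/absence) labels.
    edges = [0] + [cp + 1 for cp in sorted(change_points)] + [len(probs) - 1]
    return [(times[a], times[b]) for a, b in zip(edges[:-1], edges[1:])]
```

Because the segment boundaries are aligned with predicted changes in target-sound activity, the annotator's weak presence/absence labels per segment approximate strong onset/offset annotations; the prediction model can then be fine-tuned on these annotations and the loop repeated on the remaining unlabeled recordings.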
Authors: John Martinsson, Olof Mogren, Maria Sandsten, Tuomas Virtanen