Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

UWB-NTIS Speaker Diarization System for the DIHARD II 2019 Challenge (1905.11276v1)

Published 27 May 2019 in eess.AS and cs.SD

Abstract: In this paper, we present our system developed by the team from the New Technologies for the Information Society (NTIS) research center of the University of West Bohemia in Pilsen, for the Second DIHARD Speech Diarization Challenge. The base of our system follows the currently-standard approach of segmentation, i/x-vector extraction, clustering, and resegmentation. The hyperparameters for each of the subsystems were selected according to the domain classifier trained on the development set of DIHARD II. We compared our system with results from the Kaldi diarization (with i/x-vectors) and combined these systems. At the time of writing of this abstract, our best submission achieved a DER of 23.47% and a JER of 48.99% on the evaluation set (in Track 1 using reference SAD).

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Zbyněk Zajíc (2 papers)
  2. Marie Kunešová (6 papers)
  3. Marek Hrúz (4 papers)
  4. Jan Vaněk (1 paper)
Citations (8)

Summary

We haven't generated a summary for this paper yet.