You are currently viewing a beta version of our website. If you spot anything unusual, kindly let us know.

Preprint
Article

Comparative Analysis of Audio Features for Unsupervised Speaker Change Detection

Altmetrics

Downloads

12

Views

12

Comments

0

Submitted:

14 October 2024

Posted:

16 October 2024

You are already at the latest version

Alerts
Abstract
This study analyzes how various audio features influence speaker change detection in different unsupervised approaches. Ten different audio features, including MFCC, mel-spectrogram, spectral centroid, and pitch, etc. were chosen for analysis. Two distinct unsupervised methods were selected for comparing the effectiveness of these feature: the Bayesian information criterion with Gaussian mixture model (BIC-GMM), a model-based approach, and the Kullback-Leibler divergence with Gaussian (KL-GMM), a metric-based approach. In order to gain a prior insight into which features have a significant impact on the SCD problem, we calculate statistic for two probabilities: the probability of speaker change given there is a feature change, and vice versa. To experimentally check the prior insights, a set of experiments carried out to analyze the influence of various audio feature for SCD. The results showed that in both methods, MFCC demonstrates a robust performance, and ranks at the top. Zero crossing rate, chroma and spectral contrast gives a higher performance in BIC-GMM. Mel-spectrogram consistently ranks at the bottom, suggesting it has the least impact on performance across both methods.
Keywords: 
Subject: Computer Science and Mathematics  -   Artificial Intelligence and Machine Learning
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

© 2024 MDPI (Basel, Switzerland) unless otherwise stated