Speaker weight estimation from speech signals using a fusion of the i-vector and NFA frameworks

In this paper, a novel approach for automatic speaker weight estimation from spontaneous telephone speech signals is proposed. In this method, each utterance is modeled using the i-vector framework which is based on the factor analysis on Gaussian Mixture Model (GMM) mean supervectors, and the Non-n...

Full description

Saved in:

Bibliographic Details
Published in	2015 The International Symposium on Artificial Intelligence and Signal Processing (AISP) pp. 118 - 123
Main Authors	Poorjam, Amir Hossein, Bahari, Mohamad Hasan, Van Hamme, Hugo
Format	Conference Proceeding
Language	English
Published	IEEE 01.03.2015
Subjects	Correlation Estimation i-vector Kernel Least-Squares Support Vector Regression Non-negative Factor Analysis Speaker Weight Estimation Speech Support vector machines Testing Training
Online Access	Get full text

Cover

Loading…

More Information
Summary:	In this paper, a novel approach for automatic speaker weight estimation from spontaneous telephone speech signals is proposed. In this method, each utterance is modeled using the i-vector framework which is based on the factor analysis on Gaussian Mixture Model (GMM) mean supervectors, and the Non-negative Factor Analysis (NFA) framework which is based on a constrained factor analysis on GMM weights. Then, the available information in both Gaussian means and Gaussian weights is exploited through a feature-level fusion of the i-vectors and the NFA vectors. Finally, a least-squares support vector regression (LS-SVR) is employed to estimate the weight of speakers from given utterances. The proposed approach is evaluated on the telephone speech signals of National Institute of Standards and Technology (NIST) 2008 and 2010 Speaker Recognition Evaluation (SRE) corpora. Experimental results over 2339 utterances show that the correlation coefficients between actual and estimated weights of male and female speakers are 0.56 and 0.49, respectively, which indicate the effectiveness of the proposed method in speaker weight estimation.
DOI:	10.1109/AISP.2015.7123494