Efficient Speaker Recognition Using Approximated Cross Entropy (ACE)
Techniques for efficient speaker recognition are presented. These techniques are based on approximating Gaussian mixture modeling (GMM) likelihood scoring using approximated cross entropy (ACE). Gaussian mixture modeling is used for representing both training and test sessions and is shown to perfor...
Saved in:
Published in | IEEE transactions on audio, speech, and language processing Vol. 15; no. 7; pp. 2033 - 2043 |
---|---|
Main Authors | , |
Format | Journal Article |
Language | English |
Published |
IEEE
01.09.2007
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Techniques for efficient speaker recognition are presented. These techniques are based on approximating Gaussian mixture modeling (GMM) likelihood scoring using approximated cross entropy (ACE). Gaussian mixture modeling is used for representing both training and test sessions and is shown to perform speaker recognition and retrieval extremely efficiently without any notable degradation in accuracy compared to classic GMM-based recognition. In addition, a GMM compression algorithm is presented. This algorithm decreases considerably the storage needed for speaker retrieval. |
---|---|
Bibliography: | ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 23 |
ISSN: | 1558-7916 1558-7924 |
DOI: | 10.1109/TASL.2007.902059 |