Comparing performance of acoustic-to-articulatory inversion for mandarin accented english and american english speakers
This paper compares the performance of acoustic-to-articulatory inversion for both L1 and L2 speakers of English, as a function of the number of Gaussian Mixtures used in the inversion model. The inversion system is based on an HMM-GMM approach and is implemented on the Marquette Electromagnetic Art...
Saved in:
Published in | 2018 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT) pp. 1 - 5 |
---|---|
Main Authors | , |
Format | Conference Proceeding |
Language | English |
Published |
IEEE
01.12.2018
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | This paper compares the performance of acoustic-to-articulatory inversion for both L1 and L2 speakers of English, as a function of the number of Gaussian Mixtures used in the inversion model. The inversion system is based on an HMM-GMM approach and is implemented on the Marquette Electromagnetic Articulography corpus of Mandarin Accented English (EMAMAE) including 20 native English speakers and 19 native Mandarin speakers of English. Results indicate that for Mandarin speakers 12 Gaussian mixtures and for L1 American English speakers 11 Gaussian mixtures give the lowest Root-Mean-Squared error (RMSE) and highest correlation between the estimated and actual articulatory pattern. |
---|---|
DOI: | 10.1109/ISSPIT.2018.8705093 |