A study on the consistency analysis of energy parameter for Mandarin speech

Bibliographic Details
Published in EURASIP Journal on Audio, Speech, and Music Processing Vol. 2012; no. 1; pp. 1-9
Main Authors Shen, Li-Te; Yeh, Cheng-Yu; Hwang, Shaw-Hwa
Format Journal Article
Language English
Published Cham Springer International Publishing 17.12.2012
Springer Nature B.V
BioMed Central Ltd
Abstract This study presents a consistency analysis of the energy parameter for Mandarin speech. Identified through inspection of the human pronunciation process, the consistency can be interpreted as a high correlation of the warping curve between the spectrum and the prosody within a syllable. The analysis proceeds in three steps. First, the hidden Markov model (HMM) algorithm is used to decode the HMM-state sequence within a syllable and, at the same time, to divide it into three segments. Second, for a designated syllable, vector quantization (VQ) with the Linde–Buzo–Gray algorithm is used to train a VQ codebook for each segment. Third, the energy vector of each segment is encoded as an index by the VQ codebooks, and the probability of each possible path is then evaluated as a prerequisite for analyzing the consistency. Experiments demonstrate that consistency is clearly obtained when the syllable is located in exactly the same word. These results point to a research direction: the energy warping process within a syllable must be considered in a text-to-speech system to improve the quality of the synthesized speech.
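The second and third steps of the abstract can be illustrated with a minimal sketch. It assumes the HMM decoding has already split each syllable token into three segments, each represented by a fixed-length energy vector. The function names (`nearest`, `lbg`, `path_probabilities`), the toy data, the codebook size of 2, and the use of relative frequency as the path probability are all assumptions made for illustration; the abstract does not give the authors' actual features, codebook sizes, or probability definition.

```python
from collections import Counter


def nearest(codebook, vec):
    """Return the index of the codeword closest to vec (squared Euclidean distance)."""
    return min(
        range(len(codebook)),
        key=lambda i: sum((c - v) ** 2 for c, v in zip(codebook[i], vec)),
    )


def lbg(vectors, size, eps=0.01, iters=20):
    """Train a VQ codebook by LBG splitting; size is assumed to be a power of two."""
    dim = len(vectors[0])
    # Start from the global centroid of all training vectors.
    codebook = [[sum(v[d] for v in vectors) / len(vectors) for d in range(dim)]]
    while len(codebook) < size:
        # Split every codeword into two slightly perturbed copies.
        codebook = [[c + s * eps for c in cw] for cw in codebook for s in (1, -1)]
        for _ in range(iters):
            # Assign each vector to its nearest codeword, then recompute centroids.
            cells = [[] for _ in codebook]
            for v in vectors:
                cells[nearest(codebook, v)].append(v)
            codebook = [
                [sum(v[d] for v in cell) / len(cell) for d in range(dim)] if cell else cw
                for cw, cell in zip(codebook, cells)
            ]
    return codebook


def path_probabilities(tokens, codebooks):
    """Encode each token as a path of three VQ indices and return path frequencies."""
    paths = Counter(
        tuple(nearest(cb, seg) for cb, seg in zip(codebooks, token))
        for token in tokens
    )
    total = sum(paths.values())
    return {path: count / total for path, count in paths.items()}


# Hypothetical energy vectors: four tokens of one syllable, three segments each.
tokens = [
    ([1.0, 1.1], [2.0, 2.1], [1.0, 0.9]),
    ([1.1, 1.0], [2.1, 2.0], [0.9, 1.0]),
    ([3.0, 3.1], [4.0, 4.1], [3.0, 2.9]),
    ([3.1, 3.0], [4.1, 4.0], [2.9, 3.0]),
]
# One codebook per segment position, trained on that segment's vectors only.
codebooks = [lbg([token[k] for token in tokens], size=2) for k in range(3)]
probs = path_probabilities(tokens, codebooks)
print(probs)
```

In this toy setting, probability mass concentrated on a few index paths would correspond to what the abstract calls consistency, while mass spread across many paths would indicate its absence.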
ArticleNumber 28
Author Shen, Li-Te (Department of Electrical Engineering, National Taipei University of Technology)
Yeh, Cheng-Yu (Department of Electrical Engineering, National Chin-Yi University of Technology; cy.yeh@ncut.edu.tw)
Hwang, Shaw-Hwa (Department of Electrical Engineering, National Taipei University of Technology)
Copyright Shen et al.; licensee Springer. 2012. This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
DOI 10.1186/1687-4722-2012-28
Discipline Engineering
EISSN 1687-4722
ISSN 1687-4722
1687-4714
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 1
Keywords Hidden Markov Model (HMM)
Vector quantization (VQ)
Speech synthesis
Text-to-speech (TTS)
Consistency analysis
SubjectTerms Acoustics
Algorithms
Consistency
Engineering
Engineering Acoustics
Mandarins
Mathematical models
Mathematics in Music
Segments
Signal, Image and Speech Processing
Speech
Syllables
Warping