Vowel, Digit and Continuous Speech Recognition Based on Statistical, Neural and Hybrid Modelling by Using ASRSRL
In the first part of this paper a recognizer based on hidden Markov models ( HMMs ) is compared in the simple task of vowel recognition with a recognizer based on the multilayer perceptron ( MLP ). In this situation, we have obtained better results for the last recognizer, fact which highlights the...
Saved in:
Published in | EUROCON 2007 - The International Conference on "Computer as a Tool" pp. 856 - 863 |
---|---|
Main Authors | , |
Format | Conference Proceeding |
Language | English |
Published |
IEEE
01.09.2007
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | In the first part of this paper a recognizer based on hidden Markov models ( HMMs ) is compared in the simple task of vowel recognition with a recognizer based on the multilayer perceptron ( MLP ). In this situation, we have obtained better results for the last recognizer, fact which highlights the advantage of the discriminative training of the perceptron versus the maximum likelihood training of the HMM. Because MLPs have problems with accommodating time sequences like speech, a combination of a HMM with a MLP could be a good idea. In the second part of the paper, the hybrid structure HMMMLP is compared with the simple HMM in a digit recognition task. The hybrid structure has recognition rates improved with around 2%. In the last part of the paper are describes the continuous speech recognition experiments for Romanian language, by using HMM modelling. The progresses concern enhancement of modelling by taking into account the context in form of triphones, improvement of speaker independence by applying a gender specific training and enlargement of the feature categories used to describe speech sequences. In order to easier handling the recognition experiments an Automatic Speech Recognition System for Romanian Language ( ASRS_RL ) was designed. |
---|---|
AbstractList | In the first part of this paper a recognizer based on hidden Markov models ( HMMs ) is compared in the simple task of vowel recognition with a recognizer based on the multilayer perceptron ( MLP ). In this situation, we have obtained better results for the last recognizer, fact which highlights the advantage of the discriminative training of the perceptron versus the maximum likelihood training of the HMM. Because MLPs have problems with accommodating time sequences like speech, a combination of a HMM with a MLP could be a good idea. In the second part of the paper, the hybrid structure HMMMLP is compared with the simple HMM in a digit recognition task. The hybrid structure has recognition rates improved with around 2%. In the last part of the paper are describes the continuous speech recognition experiments for Romanian language, by using HMM modelling. The progresses concern enhancement of modelling by taking into account the context in form of triphones, improvement of speaker independence by applying a gender specific training and enlargement of the feature categories used to describe speech sequences. In order to easier handling the recognition experiments an Automatic Speech Recognition System for Romanian Language ( ASRS_RL ) was designed. |
Author | Gavat, I. Dumitru, C.O. |
Author_xml | – sequence: 1 givenname: C.O. surname: Dumitru fullname: Dumitru, C.O. organization: Univ. Politehnica Bucharest, Bucharest – sequence: 2 givenname: I. surname: Gavat fullname: Gavat, I. organization: Univ. Politehnica Bucharest, Bucharest |
BookMark | eNp9j8FKw0AURUdU0Gq_oJv3AZq-yYTGLDVWurAVEivuyjR5xifjTMhMkPy9UQruXN1zuZzFnYgT6ywJMZMYSYnZfLkt8qdNFCOmUZIgKrU4EhOZxGO5ker1-K_E8kxMvf9ARJkuVJal56J9cV9kruCeGw6gbQ25s4Ft73oPZUtUvUNBlWssB3YW7rSnGkYogw7sA1d6tDfUd9r86qth33ENa1eTMWwb2A-w9T9wWxZl8XgpTt-08TQ95IWYPSyf89U1E9Gu7fhTd8Pu8ET9v34DL2pNdA |
ContentType | Conference Proceeding |
DBID | 6IE 6IL CBEJK RIE RIL |
DOI | 10.1109/EURCON.2007.4400336 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
EISBN | 142440813X 9781424408139 |
EndPage | 863 |
ExternalDocumentID | 4400336 |
Genre | orig-research |
GroupedDBID | 6IE 6IF 6IK 6IL 6IN AAJGR AARBI ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK IERZE OCL RIE RIL |
ID | FETCH-ieee_primary_44003363 |
IEDL.DBID | RIE |
ISBN | 1424408121 9781424408122 |
IngestDate | Wed Jun 26 19:43:32 EDT 2024 |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-ieee_primary_44003363 |
ParticipantIDs | ieee_primary_4400336 |
PublicationCentury | 2000 |
PublicationDate | 2007-Sept. |
PublicationDateYYYYMMDD | 2007-09-01 |
PublicationDate_xml | – month: 09 year: 2007 text: 2007-Sept. |
PublicationDecade | 2000 |
PublicationTitle | EUROCON 2007 - The International Conference on "Computer as a Tool" |
PublicationTitleAbbrev | EURCON |
PublicationYear | 2007 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
SSID | ssj0001763997 |
Score | 2.8281794 |
Snippet | In the first part of this paper a recognizer based on hidden Markov models ( HMMs ) is compared in the simple task of vowel recognition with a recognizer based... |
SourceID | ieee |
SourceType | Publisher |
StartPage | 856 |
SubjectTerms | HMM Hybrid LPC MFCC MLP PLP Speech recognition |
Title | Vowel, Digit and Continuous Speech Recognition Based on Statistical, Neural and Hybrid Modelling by Using ASRSRL |
URI | https://ieeexplore.ieee.org/document/4400336 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjZ1BT8JAEIUnwMmTGjAqavbgkRZoC7VHRQgxgqYVw43sdgclkJZoG4O_3p1tC9Fw8LbdZDeT7OFNpvO-AbhWqsOFfUO2X882iEhlCCHQENhFy-J2Z26Rd3g07g4nzsO0My1BY-uFQUTdfIYmLfW_fBmHKZXKmo5Do8e6ZSi7npd5tXb1FJe01i28W0rprHaBdMq_rZw61G55zf7E7z2NM4Rhfu2v-SpaXgaHMCoCy7pKlmaaCDP8_sNs_G_kR1DbGfnY81aijqGEURXWr_EXrhrsfvG2SBiPJCNE1SJK4_STBWvE8J35RV9RHLE7JXSSqQUlpprrzNVponrwlT4-3JDti9FYNU34ZmLDdCsCuw38wH-sQX3Qf-kNDYp4ts74FrM8WPsEKlEc4SkwlJ2wJeWcy_bcse3QE16IDlfpk8ouXMQzqO674Xz_dh0OsgopdWpdQCX5SPFSSXsirvSb_gDdeqRK |
link.rule.ids | 310,311,783,787,792,793,799,27939,55088 |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjZ3LT8JAEMYniAc9qQGj4mMPHimPPsAeFSFVAU0Bw410uwMSSUu0jcG_3p1tC9Fw8LbdZDeT7OGbTOf7DcC1VB2PGzdk-7UNjYhUGuccNY4N1HXPsKY6eYd7_YYzMh_H1jgH5bUXBhFV8xlWaKn-5YvQj6lUVjVNGj3W2IFdi_KKxK21qag0SW2bmXtLap1ez6BO6beecofqNbvaHrmt534CMUwv_jVhRQlM5wB6WWhJX8l7JY54xf_-Q238b-yHUNxY-djLWqSOIIdBAZav4Rcuyux-PptHzAsEI0jVPIjD-JMNloj-G3OzzqIwYHdS6gSTC0pNFdnZk6eJ6-Et1HFnRcYvRoPVFOOb8RVTzQjsduAO3G4RSp32sOVoFPFkmRAuJmmwxjHkgzDAE2AoLL8mxNQT9alpGL7NbR9NTyZQMr9oIp5CYdsNZ9u3r2DPGfa6k-5D_6kE-0m9lPq2ziEffcR4IYU-4pfqfX8Ap7anlw |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=EUROCON+2007+-+The+International+Conference+on+%22Computer+as+a+Tool%22&rft.atitle=Vowel%2C+Digit+and+Continuous+Speech+Recognition+Based+on+Statistical%2C+Neural+and+Hybrid+Modelling+by+Using+ASRSRL&rft.au=Dumitru%2C+C.O.&rft.au=Gavat%2C+I.&rft.date=2007-09-01&rft.pub=IEEE&rft.isbn=9781424408122&rft.spage=856&rft.epage=863&rft_id=info:doi/10.1109%2FEURCON.2007.4400336&rft.externalDocID=4400336 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781424408122/lc.gif&client=summon&freeimage=true |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781424408122/mc.gif&client=summon&freeimage=true |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781424408122/sc.gif&client=summon&freeimage=true |