Vowel, Digit and Continuous Speech Recognition Based on Statistical, Neural and Hybrid Modelling by Using ASRSRL

In the first part of this paper a recognizer based on hidden Markov models ( HMMs ) is compared in the simple task of vowel recognition with a recognizer based on the multilayer perceptron ( MLP ). In this situation, we have obtained better results for the last recognizer, fact which highlights the...

Full description

Saved in:
Bibliographic Details
Published inEUROCON 2007 - The International Conference on "Computer as a Tool" pp. 856 - 863
Main Authors Dumitru, C.O., Gavat, I.
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.09.2007
Subjects
Online AccessGet full text

Cover

Loading…
Abstract In the first part of this paper a recognizer based on hidden Markov models ( HMMs ) is compared in the simple task of vowel recognition with a recognizer based on the multilayer perceptron ( MLP ). In this situation, we have obtained better results for the last recognizer, fact which highlights the advantage of the discriminative training of the perceptron versus the maximum likelihood training of the HMM. Because MLPs have problems with accommodating time sequences like speech, a combination of a HMM with a MLP could be a good idea. In the second part of the paper, the hybrid structure HMMMLP is compared with the simple HMM in a digit recognition task. The hybrid structure has recognition rates improved with around 2%. In the last part of the paper are describes the continuous speech recognition experiments for Romanian language, by using HMM modelling. The progresses concern enhancement of modelling by taking into account the context in form of triphones, improvement of speaker independence by applying a gender specific training and enlargement of the feature categories used to describe speech sequences. In order to easier handling the recognition experiments an Automatic Speech Recognition System for Romanian Language ( ASRS_RL ) was designed.
AbstractList In the first part of this paper a recognizer based on hidden Markov models ( HMMs ) is compared in the simple task of vowel recognition with a recognizer based on the multilayer perceptron ( MLP ). In this situation, we have obtained better results for the last recognizer, fact which highlights the advantage of the discriminative training of the perceptron versus the maximum likelihood training of the HMM. Because MLPs have problems with accommodating time sequences like speech, a combination of a HMM with a MLP could be a good idea. In the second part of the paper, the hybrid structure HMMMLP is compared with the simple HMM in a digit recognition task. The hybrid structure has recognition rates improved with around 2%. In the last part of the paper are describes the continuous speech recognition experiments for Romanian language, by using HMM modelling. The progresses concern enhancement of modelling by taking into account the context in form of triphones, improvement of speaker independence by applying a gender specific training and enlargement of the feature categories used to describe speech sequences. In order to easier handling the recognition experiments an Automatic Speech Recognition System for Romanian Language ( ASRS_RL ) was designed.
Author Gavat, I.
Dumitru, C.O.
Author_xml – sequence: 1
  givenname: C.O.
  surname: Dumitru
  fullname: Dumitru, C.O.
  organization: Univ. Politehnica Bucharest, Bucharest
– sequence: 2
  givenname: I.
  surname: Gavat
  fullname: Gavat, I.
  organization: Univ. Politehnica Bucharest, Bucharest
BookMark eNp9j8FKw0AURUdU0Gq_oJv3AZq-yYTGLDVWurAVEivuyjR5xifjTMhMkPy9UQruXN1zuZzFnYgT6ywJMZMYSYnZfLkt8qdNFCOmUZIgKrU4EhOZxGO5ker1-K_E8kxMvf9ARJkuVJal56J9cV9kruCeGw6gbQ25s4Ft73oPZUtUvUNBlWssB3YW7rSnGkYogw7sA1d6tDfUd9r86qth33ENa1eTMWwb2A-w9T9wWxZl8XgpTt-08TQ95IWYPSyf89U1E9Gu7fhTd8Pu8ET9v34DL2pNdA
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/EURCON.2007.4400336
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 142440813X
9781424408139
EndPage 863
ExternalDocumentID 4400336
Genre orig-research
GroupedDBID 6IE
6IF
6IK
6IL
6IN
AAJGR
AARBI
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
IERZE
OCL
RIE
RIL
ID FETCH-ieee_primary_44003363
IEDL.DBID RIE
ISBN 1424408121
9781424408122
IngestDate Wed Jun 26 19:43:32 EDT 2024
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-ieee_primary_44003363
ParticipantIDs ieee_primary_4400336
PublicationCentury 2000
PublicationDate 2007-Sept.
PublicationDateYYYYMMDD 2007-09-01
PublicationDate_xml – month: 09
  year: 2007
  text: 2007-Sept.
PublicationDecade 2000
PublicationTitle EUROCON 2007 - The International Conference on "Computer as a Tool"
PublicationTitleAbbrev EURCON
PublicationYear 2007
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0001763997
Score 2.8281794
Snippet In the first part of this paper a recognizer based on hidden Markov models ( HMMs ) is compared in the simple task of vowel recognition with a recognizer based...
SourceID ieee
SourceType Publisher
StartPage 856
SubjectTerms HMM
Hybrid
LPC
MFCC
MLP
PLP
Speech recognition
Title Vowel, Digit and Continuous Speech Recognition Based on Statistical, Neural and Hybrid Modelling by Using ASRSRL
URI https://ieeexplore.ieee.org/document/4400336
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjZ1BT8JAEIUnwMmTGjAqavbgkRZoC7VHRQgxgqYVw43sdgclkJZoG4O_3p1tC9Fw8LbdZDeT7OFNpvO-AbhWqsOFfUO2X882iEhlCCHQENhFy-J2Z26Rd3g07g4nzsO0My1BY-uFQUTdfIYmLfW_fBmHKZXKmo5Do8e6ZSi7npd5tXb1FJe01i28W0rprHaBdMq_rZw61G55zf7E7z2NM4Rhfu2v-SpaXgaHMCoCy7pKlmaaCDP8_sNs_G_kR1DbGfnY81aijqGEURXWr_EXrhrsfvG2SBiPJCNE1SJK4_STBWvE8J35RV9RHLE7JXSSqQUlpprrzNVponrwlT4-3JDti9FYNU34ZmLDdCsCuw38wH-sQX3Qf-kNDYp4ts74FrM8WPsEKlEc4SkwlJ2wJeWcy_bcse3QE16IDlfpk8ouXMQzqO674Xz_dh0OsgopdWpdQCX5SPFSSXsirvSb_gDdeqRK
link.rule.ids 310,311,783,787,792,793,799,27939,55088
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjZ3LT8JAEMYniAc9qQGj4mMPHimPPsAeFSFVAU0Bw410uwMSSUu0jcG_3p1tC9Fw8LbdZDeT7OGbTOf7DcC1VB2PGzdk-7UNjYhUGuccNY4N1HXPsKY6eYd7_YYzMh_H1jgH5bUXBhFV8xlWaKn-5YvQj6lUVjVNGj3W2IFdi_KKxK21qag0SW2bmXtLap1ez6BO6beecofqNbvaHrmt534CMUwv_jVhRQlM5wB6WWhJX8l7JY54xf_-Q238b-yHUNxY-djLWqSOIIdBAZav4Rcuyux-PptHzAsEI0jVPIjD-JMNloj-G3OzzqIwYHdS6gSTC0pNFdnZk6eJ6-Et1HFnRcYvRoPVFOOb8RVTzQjsduAO3G4RSp32sOVoFPFkmRAuJmmwxjHkgzDAE2AoLL8mxNQT9alpGL7NbR9NTyZQMr9oIp5CYdsNZ9u3r2DPGfa6k-5D_6kE-0m9lPq2ziEffcR4IYU-4pfqfX8Ap7anlw
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=EUROCON+2007+-+The+International+Conference+on+%22Computer+as+a+Tool%22&rft.atitle=Vowel%2C+Digit+and+Continuous+Speech+Recognition+Based+on+Statistical%2C+Neural+and+Hybrid+Modelling+by+Using+ASRSRL&rft.au=Dumitru%2C+C.O.&rft.au=Gavat%2C+I.&rft.date=2007-09-01&rft.pub=IEEE&rft.isbn=9781424408122&rft.spage=856&rft.epage=863&rft_id=info:doi/10.1109%2FEURCON.2007.4400336&rft.externalDocID=4400336
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781424408122/lc.gif&client=summon&freeimage=true
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781424408122/mc.gif&client=summon&freeimage=true
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781424408122/sc.gif&client=summon&freeimage=true