MEL FREQUENCY CEPSTRAL COEFFICIENTS (MFCC) FEATURE EXTRACTION ENHANCEMENT IN THE APPLICATION OF SPEECH RECOGNITION: A COMPARISON STUDY
Mel Frequency Cepstral Coefficients (MFCCs) are the most widely used features in the majority of the speaker and speech recognition applications. Since 1980s, remarkable efforts have been undertaken for the development of these features. Issues such as use suitable spectral estimation methods, desig...
Saved in:
Published in | Journal of Theoretical and Applied Information Technology Vol. 79; no. 1; p. 38 |
---|---|
Main Authors | , , , |
Format | Journal Article |
Language | English |
Published |
Islamabad
Journal of Theoretical and Applied Information
01.09.2015
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | Mel Frequency Cepstral Coefficients (MFCCs) are the most widely used features in the majority of the speaker and speech recognition applications. Since 1980s, remarkable efforts have been undertaken for the development of these features. Issues such as use suitable spectral estimation methods, design of effective filter banks, and the number of chosen features all play an important role in the performance and robustness of the speech recognition systems. This paper provides an overview of MFCC's enhancement techniques that are applied in speech recognition systems. The details such as accuracy, types of environments, the nature of data, and the number of features are investigated and summarized in the table combined with the corresponding key references. Benefits and drawbacks of these MFCC's enhancement techniques have been discussed. This study will hopefully contribute to raising initiatives towards the enhancement of MFCC in terms of robustness features, high accuracy, and less complexity. |
---|---|
AbstractList | Mel Frequency Cepstral Coefficients (MFCCs) are the most widely used features in the majority of the speaker and speech recognition applications. Since 1980s, remarkable efforts have been undertaken for the development of these features. Issues such as use suitable spectral estimation methods, design of effective filter banks, and the number of chosen features all play an important role in the performance and robustness of the speech recognition systems. This paper provides an overview of MFCC's enhancement techniques that are applied in speech recognition systems. The details such as accuracy, types of environments, the nature of data, and the number of features are investigated and summarized in the table combined with the corresponding key references. Benefits and drawbacks of these MFCC's enhancement techniques have been discussed. This study will hopefully contribute to raising initiatives towards the enhancement of MFCC in terms of robustness features, high accuracy, and less complexity. |
Author | Idbeaa, Tariq F Husain, Hafizah Majeed, Sayf A Samad, Salina Abdul |
Author_xml | – sequence: 1 givenname: Sayf surname: Majeed middlename: A fullname: Majeed, Sayf A – sequence: 2 givenname: Hafizah surname: Husain fullname: Husain, Hafizah – sequence: 3 givenname: Salina surname: Samad middlename: Abdul fullname: Samad, Salina Abdul – sequence: 4 givenname: Tariq surname: Idbeaa middlename: F fullname: Idbeaa, Tariq F |
BookMark | eNpdzslqwzAQgGEfUmiS9h0EvaQHg2TJltybUMexwVu9QHMKtiNDQ2qncfwKfe6qy6mngZmPn1lZi2Ec9MJaEkG4TYnv3lqraTpi7DnMd5fWZwIxCgp4qSFVO6QgL6tCxkhlEASRiiCtSrRJAqUeUQCyqgtA8GqIqqIsRZCGMlWQGIaiFFUhIJnncaTkzzkLUJkDqBAVoLJtGn1vn5A0-SSXRVQaU1b18-7Ouumb06Tv_-baqgOoVGjH2dbEYvtMKLnaB6GZcFnjdQ5ze62p2zFHYOJ4tOHk4AjS-J4grUNa3HPmtz3mDicMd62rqUfo2tr8ds-X8WPW03X__jZ1-nRqBj3O054ILDD3KeOGPvyjx3G-DOa7PeHc96npuvQL-6Rfcw |
ContentType | Journal Article |
Copyright | Copyright Journal of Theoretical and Applied Information Sep 2015 |
Copyright_xml | – notice: Copyright Journal of Theoretical and Applied Information Sep 2015 |
DBID | 7SC 8FD JQ2 L7M L~C L~D |
DatabaseName | Computer and Information Systems Abstracts Technology Research Database ProQuest Computer Science Collection Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional |
DatabaseTitle | Computer and Information Systems Abstracts Technology Research Database Computer and Information Systems Abstracts – Academic Advanced Technologies Database with Aerospace ProQuest Computer Science Collection Computer and Information Systems Abstracts Professional |
DatabaseTitleList | Computer and Information Systems Abstracts Computer and Information Systems Abstracts |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Engineering |
EndPage | 38 |
ExternalDocumentID | 4017422371 |
Genre | Feature |
GroupedDBID | .DC 29L 2WC 5GY 5VS 7SC 8FD ALMA_UNASSIGNED_HOLDINGS E3Z GROUPED_DOAJ JQ2 KQ8 L7M L~C L~D M~E OK1 P2P RNS TR2 |
ID | FETCH-LOGICAL-p131t-d8e4854a6c245fee35c42801263a71d281a9681b21b0f749bf0727140cb5e3613 |
ISSN | 1817-3195 |
IngestDate | Sat Aug 17 01:29:08 EDT 2024 Fri Sep 13 04:28:15 EDT 2024 |
IsPeerReviewed | false |
IsScholarly | true |
Issue | 1 |
Language | English |
LinkModel | OpenURL |
MergedId | FETCHMERGED-LOGICAL-p131t-d8e4854a6c245fee35c42801263a71d281a9681b21b0f749bf0727140cb5e3613 |
Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
PQID | 1779937145 |
PQPubID | 2040122 |
PageCount | 1 |
ParticipantIDs | proquest_miscellaneous_1808079347 proquest_journals_1779937145 |
PublicationCentury | 2000 |
PublicationDate | 20150901 |
PublicationDateYYYYMMDD | 2015-09-01 |
PublicationDate_xml | – month: 09 year: 2015 text: 20150901 day: 01 |
PublicationDecade | 2010 |
PublicationPlace | Islamabad |
PublicationPlace_xml | – name: Islamabad |
PublicationTitle | Journal of Theoretical and Applied Information Technology |
PublicationYear | 2015 |
Publisher | Journal of Theoretical and Applied Information |
Publisher_xml | – name: Journal of Theoretical and Applied Information |
SSID | ssj0062495 |
Score | 2.264686 |
Snippet | Mel Frequency Cepstral Coefficients (MFCCs) are the most widely used features in the majority of the speaker and speech recognition applications. Since 1980s,... |
SourceID | proquest |
SourceType | Aggregation Database |
StartPage | 38 |
SubjectTerms | Design engineering Feature extraction Filter banks Information technology Robustness Spectra Speech recognition Tables (data) |
Title | MEL FREQUENCY CEPSTRAL COEFFICIENTS (MFCC) FEATURE EXTRACTION ENHANCEMENT IN THE APPLICATION OF SPEECH RECOGNITION: A COMPARISON STUDY |
URI | https://www.proquest.com/docview/1779937145/abstract/ https://search.proquest.com/docview/1808079347 |
Volume | 79 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Lj9MwELbKnuCAeIpdFmQkDqAoaN04L25V5JBAm3bbVCqnykkcaZHovtoD-wP43cwkzqNaQCyXKHKsKMl8GX8zngchb32mSqcsS9NSHscWZr4JUOYmYMezgZAUuY-JwpPEiZb888peDQb9CsG7bfYhv_ltXsn_SBXGQK6YJXsHybY3hQE4B_nCESQMx3-S8USMjXAuTpciCb4agZhVHZCNYArsM0bvUYpWujcJgwCt_1DUQQ5iBdOq4BFDJBF2p8GK_kacVPE_o9msSS_GkKDFTIggMuYimH5KYhzVyezTyWw0jxcwC1nln1hu2kuURB99w3p1GlSFvtvu_Yn8ppT2WP8oO39rtLuWddGDSJZnN50veyG_Sz0fM4yNUVZ0EY9xkSlZkeRUXp1d6mBm7epgdhvLpZehOz18T6l7DH2xdTPPRuvXLWz20F2r8LrYzH5l7mS6Dpfj8ToFAe1frZgA2KguB56FtQzuWQwjS7-ctptZDrb3Rru_eYxby37FZdJH5KF-QzqqEfWYDNTmCXnQK035lPwEbNEWW7TBFu1ji75DZL2nGle0wxXt4YrGCQVc0R6u6DSkNa5oD1cf6Yh2qKIVqp6RZSjSIDJ1zw7zgllsaxae4p7NpZMPuV0qZdk5GLjAghxLuqwYekz6DphKQ5adlC73s_IEGDRY-XlmKwu45XNysDnfqBeEutIZZkNQIUCKuedkwKN8mTOrUJZVZL5zSI6bz7jWP-X1mrkuMm7G7UPypr0MKhP3weRGne9gDtZShXWJu0d_v8VLcr-D4TE52F7t1CvgoNvsdSXhX16jee0 |
link.rule.ids | 315,786,790 |
linkProvider | Colorado Alliance of Research Libraries |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=MEL+FREQUENCY+CEPSTRAL+COEFFICIENTS+%28MFCC%29+FEATURE+EXTRACTION+ENHANCEMENT+IN+THE+APPLICATION+OF+SPEECH+RECOGNITION%3A+A+COMPARISON+STUDY&rft.jtitle=Journal+of+Theoretical+and+Applied+Information+Technology&rft.au=Majeed%2C+Sayf+A&rft.au=Husain%2C+Hafizah&rft.au=Samad%2C+Salina+Abdul&rft.au=Idbeaa%2C+Tariq+F&rft.date=2015-09-01&rft.pub=Journal+of+Theoretical+and+Applied+Information&rft.issn=1817-3195&rft.volume=79&rft.issue=1&rft.spage=38&rft.externalDBID=NO_FULL_TEXT&rft.externalDocID=4017422371 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1817-3195&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1817-3195&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1817-3195&client=summon |