A Study of Acoustic Features in Arabic Speaker Identification under Noisy Environmental Conditions
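Main Authors | Benhafid, Zhor; Zergat, Kawthar Yasmine; Amrouche, Abderrahmane |
---|---|
Format | Journal Article |
Language | English |
Published | 23.10.2021 |
Subjects | Computer Science - Sound |
Online Access | https://arxiv.org/abs/2110.12304 |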
Abstract | A central problem in voice recognition is the choice of acoustic features,
which must be robust to the variability of the speech signal, mismatched
conditions, and noisy environments. Different speech feature extraction
techniques have been developed to address this. In this paper, we
investigate the robustness of several front-end techniques in Arabic speaker
identification. We evaluate five different features in babble, factory, and
subway noise conditions at various signal-to-noise ratios (SNRs). The
results show that two auditory features, gammatone frequency cepstral
coefficients (GFCC) and power-normalized cepstral coefficients (PNCC), unlike
their combination, perform substantially better than the conventional speaker
features, Mel-frequency cepstral coefficients (MFCC). |
---|---|
Author | Benhafid, Zhor; Zergat, Kawthar Yasmine; Amrouche, Abderrahmane |
BackLink | https://doi.org/10.48550/arXiv.2110.12304 (View paper in arXiv) |
ContentType | Journal Article |
Copyright | http://creativecommons.org/licenses/by/4.0 |
DOI | 10.48550/arxiv.2110.12304 |
IsDoiOpenAccess | true |
IsOpenAccess | true |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
OpenAccessLink | https://arxiv.org/abs/2110.12304 |
PublicationDate | 2021-10-23 |
SecondaryResourceType | preprint |
SourceType | Open Access Repository |
SubjectTerms | Computer Science - Sound |
Title | A Study of Acoustic Features in Arabic Speaker Identification under Noisy Environmental Conditions |
URI | https://arxiv.org/abs/2110.12304 |
linkProvider | Cornell University |
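
The abstract in this record refers to evaluating features in babble, factory and subway noise at various signal-to-noise ratios. As a rough illustration only, the sketch below shows one common way a noisy test utterance at a target SNR can be produced, assuming simple additive mixing with power-based scaling. The record does not describe the authors' actual procedure, and the function names `mix_at_snr` and `measured_snr_db` are hypothetical.

```python
import numpy as np


def mix_at_snr(speech, noise, snr_db):
    """Add noise to speech so the mixture has the requested SNR in dB.

    Hypothetical helper: assumes additive noise and non-silent inputs
    (zero-power speech or noise is not handled).
    """
    speech = np.asarray(speech, dtype=np.float64)
    noise = np.asarray(noise, dtype=np.float64)
    # Repeat or trim the noise so it covers the whole utterance.
    reps = int(np.ceil(len(speech) / len(noise)))
    noise = np.tile(noise, reps)[: len(speech)]
    speech_power = np.mean(speech ** 2)
    noise_power = np.mean(noise ** 2)
    # SNR_dB = 10 * log10(P_speech / (g^2 * P_noise)); solve for the gain g.
    gain = np.sqrt(speech_power / (noise_power * 10.0 ** (snr_db / 10.0)))
    return speech + gain * noise


def measured_snr_db(speech, noisy):
    """Recover the SNR of a mixture when the clean speech is known."""
    speech = np.asarray(speech, dtype=np.float64)
    residual = np.asarray(noisy, dtype=np.float64) - speech
    return 10.0 * np.log10(np.mean(speech ** 2) / np.mean(residual ** 2))
```

In an evaluation of this kind, one would typically call `mix_at_snr` once per noise type and SNR level, then extract features (for example MFCC, GFCC or PNCC) from each mixture with a separate front end.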