A Study of Acoustic Features in Arabic Speaker Identification under Noisy Environmental Conditions

One of the major parts of the voice recognition field is the choice of acoustic features which have to be robust against the variability of the speech signal, mismatched conditions, and noisy environments. Thus, different speech feature extraction techniques have been developed. In this paper, we in...

Full description

Saved in:

Bibliographic Details
Main Authors	Benhafid, Zhor, Zergat, Kawthar Yasmine, Amrouche, Abderrahmane
Format	Journal Article
Language	English
Published	23.10.2021
Subjects	Computer Science - Sound
Online Access	Get full text

Cover

Loading…

Abstract	One of the major parts of the voice recognition field is the choice of acoustic features which have to be robust against the variability of the speech signal, mismatched conditions, and noisy environments. Thus, different speech feature extraction techniques have been developed. In this paper, we investigate the robustness of several front-end techniques in Arabic speaker identification. We evaluate five different features in babble, factory and subway conditions at the various signal to noise ratios (SNR). The obtained results showed that two of the auditory feature i.e. gammatone frequency cepstral coefficient (GFCC) and power normalization cepstral coefficients (PNCC), unlike their combination performs substantially better than a conventional speaker features i.e. Mel-frequency cepstral coefficients (MFCC).
AbstractList	One of the major parts of the voice recognition field is the choice of acoustic features which have to be robust against the variability of the speech signal, mismatched conditions, and noisy environments. Thus, different speech feature extraction techniques have been developed. In this paper, we investigate the robustness of several front-end techniques in Arabic speaker identification. We evaluate five different features in babble, factory and subway conditions at the various signal to noise ratios (SNR). The obtained results showed that two of the auditory feature i.e. gammatone frequency cepstral coefficient (GFCC) and power normalization cepstral coefficients (PNCC), unlike their combination performs substantially better than a conventional speaker features i.e. Mel-frequency cepstral coefficients (MFCC).
Author	Benhafid, Zhor Amrouche, Abderrahmane Zergat, Kawthar Yasmine
Author_xml	– sequence: 1 givenname: Zhor surname: Benhafid fullname: Benhafid, Zhor – sequence: 2 givenname: Kawthar Yasmine surname: Zergat fullname: Zergat, Kawthar Yasmine – sequence: 3 givenname: Abderrahmane surname: Amrouche fullname: Amrouche, Abderrahmane
BackLink	https://doi.org/10.48550/arXiv.2110.12304$$DView paper in arXiv
BookMark	eNotj0FrwyAYhj1sh67bD9ip_oF0Go2aYwjtVijbob2HL2pA1mrRpCz_frbb6YWHhxeeJ_Tgg7cIvVKy5qqqyBvEH3ddlzQDWjLCF6hv8GGczIzDgBsdpjQ6jbcWxinahJ3HTYQ-o8PFwreNeGesH93gNIwueDx5k-FncGnGG391MfhzFuCE2-CNuznpGT0OcEr25X-X6LjdHNuPYv_1vmubfQFC8qLnrGdcElCV1MZCpbSkxjBChaLCaAAGrFZcGFGzGgaeERGaayVLMEyzJVr93d4ju0t0Z4hzd4vt7rHsF1W1UkE
ContentType	Journal Article
Copyright	http://creativecommons.org/licenses/by/4.0
Copyright_xml	– notice: http://creativecommons.org/licenses/by/4.0
DBID	AKY GOX
DOI	10.48550/arxiv.2110.12304
DatabaseName	arXiv Computer Science arXiv.org
DatabaseTitleList
Database_xml	– sequence: 1 dbid: GOX name: arXiv.org url: http://arxiv.org/find sourceTypes: Open Access Repository
DeliveryMethod	fulltext_linktorsrc
ExternalDocumentID	2110_12304
GroupedDBID	AKY GOX
ID	FETCH-LOGICAL-a674-b43b3470a857cdea58c71dd3016816dcaa3a39846d6939af4dca06c4c872ad3c3
IEDL.DBID	GOX
IngestDate	Mon Jan 08 05:41:03 EST 2024
IsDoiOpenAccess	true
IsOpenAccess	true
IsPeerReviewed	false
IsScholarly	false
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-a674-b43b3470a857cdea58c71dd3016816dcaa3a39846d6939af4dca06c4c872ad3c3
OpenAccessLink	https://arxiv.org/abs/2110.12304
ParticipantIDs	arxiv_primary_2110_12304
PublicationCentury	2000
PublicationDate	2021-10-23
PublicationDateYYYYMMDD	2021-10-23
PublicationDate_xml	– month: 10 year: 2021 text: 2021-10-23 day: 23
PublicationDecade	2020
PublicationYear	2021
Score	1.8259196
SecondaryResourceType	preprint
Snippet	One of the major parts of the voice recognition field is the choice of acoustic features which have to be robust against the variability of the speech signal,...
SourceID	arxiv
SourceType	Open Access Repository
SubjectTerms	Computer Science - Sound
Title	A Study of Acoustic Features in Arabic Speaker Identification under Noisy Environmental Conditions
URI	https://arxiv.org/abs/2110.12304
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV1LSwMxEB7anryIolKf5OA1urvJJrvHpbQWwXqwQm9l8lgowm7ZPrD_3iS7Ui9eJ0MCk8dM5vENwGOAbIljpFGmOOVGKprrUlIhEyyt5Foa74d8m4npJ39dpIsekN9aGGy-V_sWH1htnv3v5Cn2fss-9JPEp2y9vC_a4GSA4ur4j3zOxgykP0picgannXVHinY7zqFnqwtQBfHZegdSl6TQdWifRbzptXNfXbKqHDsqR_pYW_yyDWlrZ8vOmUZ8lVdDZvVqcyDjY1maW2ZU-3izPzeXMJ-M56Mp7VobUBSSU8WZYlxGmKVSG4tppmVsjLtsIouF0YgMWe5MAyNylmPJHSkSmuvMidAwza5gUNWVHQKRzOhUMB4x66HiS_QRWjcvs9Jp-1RdwzAIZLlu0SuWXlbLIKub_4du4STxyRvukU7YHQy2zc7eO-27VQ9hC34AnJiFAg
link.rule.ids	228,230,786,891
linkProvider	Cornell University
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=A+Study+of+Acoustic+Features+in+Arabic+Speaker+Identification+under+Noisy+Environmental+Conditions&rft.au=Benhafid%2C+Zhor&rft.au=Zergat%2C+Kawthar+Yasmine&rft.au=Amrouche%2C+Abderrahmane&rft.date=2021-10-23&rft_id=info:doi/10.48550%2Farxiv.2110.12304&rft.externalDocID=2110_12304