Modified Cepstral Feature for Speech Anti-spoofing

TN912.3; The hidden danger of the automatic speaker verification(ASV)system is various spoofed speeches.These threats can be classified into two categories,namely logical access(LA)and physical access(PA).To improve identification capability of spoofed speech detection,this paper considers the resea...

Full description

Saved in:
Bibliographic Details
Published in东华大学学报(英文版) Vol. 40; no. 2; pp. 193 - 201
Main Authors HE Mingrui, ZAIDI Syed Faham Ali, TIAN Mianxin, SHAN Zhiyong, JIANG Zhengru, XU Longting
Format Journal Article
LanguageEnglish
Published College of Information Science and Technology,Donghua University,Shanghai 200051,China%SPIC Jiangsu Offshore Wind Power Co.,Ltd.,Yancheng 224001,China 01.04.2023
Subjects
Online AccessGet full text

Cover

Loading…
Abstract TN912.3; The hidden danger of the automatic speaker verification(ASV)system is various spoofed speeches.These threats can be classified into two categories,namely logical access(LA)and physical access(PA).To improve identification capability of spoofed speech detection,this paper considers the research on features.Firstly,following the idea of modifying the constant-Q-based features,this work considered adding variance or mean to the constant-Q-based cepstral domain to obtain good performance.Secondly,linear frequency cepstral coefficients(LFCCs)performed comparably with constant-Q-based features.Finally,we proposed linear frequency variance-based cepstral coefficients(LVCCs)and linear frequency mean-based cepstral coefficients(LMCCs)for identification of speech spoofing.LVCCs and LMCCs could be attained by adding the frame variance or the mean to the log magnitude spectrum based on LFCC features.The proposed novel features were evaluated on ASVspoof 2019 datase.The experimental results show that compared with known hand-crafted features,LVCCs and LMCCs are more effective in resisting spoofed speech attack.
AbstractList TN912.3; The hidden danger of the automatic speaker verification(ASV)system is various spoofed speeches.These threats can be classified into two categories,namely logical access(LA)and physical access(PA).To improve identification capability of spoofed speech detection,this paper considers the research on features.Firstly,following the idea of modifying the constant-Q-based features,this work considered adding variance or mean to the constant-Q-based cepstral domain to obtain good performance.Secondly,linear frequency cepstral coefficients(LFCCs)performed comparably with constant-Q-based features.Finally,we proposed linear frequency variance-based cepstral coefficients(LVCCs)and linear frequency mean-based cepstral coefficients(LMCCs)for identification of speech spoofing.LVCCs and LMCCs could be attained by adding the frame variance or the mean to the log magnitude spectrum based on LFCC features.The proposed novel features were evaluated on ASVspoof 2019 datase.The experimental results show that compared with known hand-crafted features,LVCCs and LMCCs are more effective in resisting spoofed speech attack.
Author ZAIDI Syed Faham Ali
TIAN Mianxin
JIANG Zhengru
HE Mingrui
XU Longting
SHAN Zhiyong
AuthorAffiliation College of Information Science and Technology,Donghua University,Shanghai 200051,China%SPIC Jiangsu Offshore Wind Power Co.,Ltd.,Yancheng 224001,China
AuthorAffiliation_xml – name: College of Information Science and Technology,Donghua University,Shanghai 200051,China%SPIC Jiangsu Offshore Wind Power Co.,Ltd.,Yancheng 224001,China
Author_xml – sequence: 1
  fullname: HE Mingrui
– sequence: 2
  fullname: ZAIDI Syed Faham Ali
– sequence: 3
  fullname: TIAN Mianxin
– sequence: 4
  fullname: SHAN Zhiyong
– sequence: 5
  fullname: JIANG Zhengru
– sequence: 6
  fullname: XU Longting
BookMark eNo9j01Lw0AYhPdQwVr7FyRXD4nvfmQ3eyzFqlDxoJ7DfrzbJpTdkE2xP9-A0jnMwHOYYe7IIqaIhDxQqKhuGvHUV1QqVtaMQcVg9hpALcjySm_JOuceZkmmBOglYe_Jd6FDX2xxyNNoTsUOzXQesQhpLD4HRHcsNnHqyjykFLp4uCc3wZwyrv9zRb53z1_b13L_8fK23ezLTEHo0lMBYK3lirpgoFHScNTeaS6xCShccJ5yLrUXdVA2WBoQqQQbnBXaA1-Rx7_eHxODiYe2T-cxzoutP_rLxbY4f-TAADT_Ba12Sss
ClassificationCodes TN912.3
ContentType Journal Article
Copyright Copyright © Wanfang Data Co. Ltd. All Rights Reserved.
Copyright_xml – notice: Copyright © Wanfang Data Co. Ltd. All Rights Reserved.
DBID 2B.
4A8
92I
93N
PSX
TCJ
DOI 10.19884/j.1672-5220.202205007
DatabaseName Wanfang Data Journals - Hong Kong
WANFANG Data Centre
Wanfang Data Journals
万方数据期刊 - 香港版
China Online Journals (COJ)
China Online Journals (COJ)
DatabaseTitleList
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
EndPage 201
ExternalDocumentID dhdxxb_e202302009
GroupedDBID -02
-0B
-SB
-S~
188
2B.
4A8
5VR
5XA
5XC
8RM
92D
92I
92M
93N
9D9
9DB
ABJNI
ACGFS
ADMLS
AFUIB
ALMA_UNASSIGNED_HOLDINGS
CAJEB
CCEZO
CDRFL
CHBEP
CW9
FA0
JUIAU
PSX
Q--
R-B
RT2
S..
T8R
TCJ
TGH
TTC
U1F
U1G
U5B
U5L
UGNYK
UZ2
UZ4
ID FETCH-LOGICAL-s1049-d1400bbb371cfa0876a3e9dc936e8fe4cfcd13369d45f7bfb1fee160bfcb49d03
ISSN 1672-5220
IngestDate Thu May 29 03:59:43 EDT 2025
IsPeerReviewed true
IsScholarly true
Issue 2
Keywords linear frequency cepstral coefficient(LFCC)
hand-crafted feature
log magnitude spectrum
spoofed speech detection
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-s1049-d1400bbb371cfa0876a3e9dc936e8fe4cfcd13369d45f7bfb1fee160bfcb49d03
PageCount 9
ParticipantIDs wanfang_journals_dhdxxb_e202302009
PublicationCentury 2000
PublicationDate 2023-04-01
PublicationDateYYYYMMDD 2023-04-01
PublicationDate_xml – month: 04
  year: 2023
  text: 2023-04-01
  day: 01
PublicationDecade 2020
PublicationTitle 东华大学学报(英文版)
PublicationTitle_FL Journal of Donghua University(English Edition)
PublicationYear 2023
Publisher College of Information Science and Technology,Donghua University,Shanghai 200051,China%SPIC Jiangsu Offshore Wind Power Co.,Ltd.,Yancheng 224001,China
Publisher_xml – name: College of Information Science and Technology,Donghua University,Shanghai 200051,China%SPIC Jiangsu Offshore Wind Power Co.,Ltd.,Yancheng 224001,China
SSID ssj0000627409
Score 2.2348895
Snippet TN912.3; The hidden danger of the automatic speaker verification(ASV)system is various spoofed speeches.These threats can be classified into two...
SourceID wanfang
SourceType Aggregation Database
StartPage 193
Title Modified Cepstral Feature for Speech Anti-spoofing
URI https://d.wanfangdata.com.cn/periodical/dhdxxb-e202302009
Volume 40
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnR1db5sw0OrSl-1h2qf2LTTNT4iODwPmkSRESdV0k0i1bi8VBrsgTaRqEindr98dOIStfej2QizbsbHvuC_fnQn5xKPQy30_tCJRBBYLGQM66GcWqkAo4duuQkVxfhpMz9jxuX9-8ID2vJY2a3GU_7ozruR_oAp1AFeMkv0HyHaDQgWUAb7wBAjD814wni-LSlWNjfaqMVmYKNHhkQA6D6ZXUuYlpgeoLFBeYSbNpbQsShNGh5xGI5r4lI8pT7AQMxqHTWFM46BXCCiPaezTZEKHI8o5TTjlQzp0sCmCVvhXSHnUNLV9OiPDNDHnMPv1puoM1fFsPDPTG3j5CcZqm_HPrm0xi0-hf1Zvqw510ynU_Sirm6VehLZUuF7PwaVxGNpbQnSoVUsTNQ3Dc4L9cQIg2BgGLDdZz0MFKlM0o5dZhQkXgZ5ATXPPOHX99Cug7TG82-VqY35RalUuYbu_VRhzgRfOAYFFenuyLvDnO3xTpawvzcaJthunxwmCELV01-6zijazlP4k3B7dd9prHrUI4barvsWdIs5Zy552gx-5TaSz3d78-1fm76IstltxIXEv7TZM9dAFbcgdkMN4PD9JO2Mi5ppmjTtTN7KOhscpP985YROlVivYr55AtXhCHmtNyIhbtH5KDmT9jDzq5cd8Ttwdghs7BDc0ghsAWaNFcOMPBH9BzibJYjS19CUf1soB7dQqQMO3hRBe6OQqwwSJmSejIo-8QHIlWa7ywvG8ICqYr0KhhKOkdAJbqFywqLC9l2RQL2v5ihgecMwMVooJjpjtFjzzbRUEwM5FEGW2_5p81Cu-0B_x6uLWHr-5T6e35OEewd-Rwfp6I9-DcLoWHzRofgMzrH4P
linkProvider EBSCOhost
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Modified+Cepstral+Feature+for+Speech+Anti-spoofing&rft.jtitle=%E4%B8%9C%E5%8D%8E%E5%A4%A7%E5%AD%A6%E5%AD%A6%E6%8A%A5%EF%BC%88%E8%8B%B1%E6%96%87%E7%89%88%EF%BC%89&rft.au=HE+Mingrui&rft.au=ZAIDI+Syed+Faham+Ali&rft.au=TIAN+Mianxin&rft.au=SHAN+Zhiyong&rft.date=2023-04-01&rft.pub=College+of+Information+Science+and+Technology%2CDonghua+University%2CShanghai+200051%2CChina%25SPIC+Jiangsu+Offshore+Wind+Power+Co.%2CLtd.%2CYancheng+224001%2CChina&rft.issn=1672-5220&rft.volume=40&rft.issue=2&rft.spage=193&rft.epage=201&rft_id=info:doi/10.19884%2Fj.1672-5220.202205007&rft.externalDocID=dhdxxb_e202302009
thumbnail_s http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=http%3A%2F%2Fwww.wanfangdata.com.cn%2Fimages%2FPeriodicalImages%2Fdhdxxb-e%2Fdhdxxb-e.jpg