基于相位谱的翻录语音攻击检测算法

因与原始语音具有高度相似性,经高保真设备回放的翻录语音常被不法分子用于对说话人认证(ASV)系统进行攻击,以达到非法认证的目的。为提高系统抵抗翻录语音攻击的顽健性,通过研究原始语音与翻录语音产生的实际过程,发现两者在频率域相位上有明显差异,并在此基础上提出了一种基于相位谱的翻录语音检测方法。分析讨论了FFT和不同偷录、回放设备对翻录语音检测率的影响。实验结果表明,该方法能够准确地判断待测语音是否为翻录语音,其检测率达到了99.04%。并且,将该算法加载到说话人识别系统中,使系统的等错误概率(EER)降低了约22%,有效提高了系统抵抗翻录语音攻击的性能。...

Full description

Saved in:
Bibliographic Details
Published in电信科学 Vol. 33; no. 8; pp. 145 - 154
Main Author 李璨 王让定 严迪群 陈亚楠
Format Journal Article
LanguageChinese
Published 中国通信学会 01.08.2017
人民邮电出版社有限公司
宁波大学信息科学与工程学院,浙江宁波,315211
Subjects
Online AccessGet full text
ISSN1000-0801
DOI10.11959/j.issn.1000-0801.2017126

Cover

More Information
Summary:因与原始语音具有高度相似性,经高保真设备回放的翻录语音常被不法分子用于对说话人认证(ASV)系统进行攻击,以达到非法认证的目的。为提高系统抵抗翻录语音攻击的顽健性,通过研究原始语音与翻录语音产生的实际过程,发现两者在频率域相位上有明显差异,并在此基础上提出了一种基于相位谱的翻录语音检测方法。分析讨论了FFT和不同偷录、回放设备对翻录语音检测率的影响。实验结果表明,该方法能够准确地判断待测语音是否为翻录语音,其检测率达到了99.04%。并且,将该算法加载到说话人识别系统中,使系统的等错误概率(EER)降低了约22%,有效提高了系统抵抗翻录语音攻击的性能。
Bibliography:ASV system; recaptured voice detection; phase spectrum
LI Can, WANG Rangding, YAN Diqun, CHEN Yanan( College of Information Science and Engineering, Ningbo University, Ningbo 315211, China)
11-2103/TN
Due to a high similarity between the recaptured voice recorded by high-fidelity ripping equipment and the original voice, the automatic speaker verification(ASV)system used to be attacked illegally by the recaptured voice. In order to improve the ability of resisting the attack, a recaptured voice detection method was proposed based on the difference of phase spectrum between original and recaptured voices for the ASV system. In addition, the effects of different recording and replay devices, the FFT were discussed. Experimental results show that the proposed method can accurately recognize the recording voice, of which detection rate is 99.04%.Meanwhile, the equal error rate(EER) of the ASV system has dropped about 22% with this method being integrated, which indicates that the system's ability of resisting playba
ISSN:1000-0801
DOI:10.11959/j.issn.1000-0801.2017126