Content-Dependent Watermarking Scheme in Compressed Speech With Identifying Manner and Location of Attacks

Bibliographic Details
Published in	IEEE transactions on audio, speech, and language processing Vol. 15; no. 5; pp. 1605 - 1616
Main Authors	Chen, O.T.-C.; Liu, Chia-Hsiung
Format	Journal Article
Language	English
Published	Piscataway, NJ: IEEE (Institute of Electrical and Electronics Engineers), 01.07.2007
Subjects	Applied sciences; Authentication; Codec; Coding, codes; Compressed; Content-dependent watermark; Cryptography; Data mining; Deletion; Digital recording; Exact sciences and technology; Frames; Frequency; Image processing; Information, signal and communications theory; Insertion; Position (location); Protection; Robustness; Signal and communications theory; Signal processing; Speech; Speech analysis; Speech codec; Speech codecs; Speech processing; Speech watermark; Telecommunications and information theory; Watermark attack; Watermarking; Sound quality; Accuracy; Linear prediction; Codebook; Speech coding; Information protection; Advanced technology; Localization; Digital watermarking; Spectral method; Acoustic signal; Record format; Line spectrum; Quality control; Predictive coding; A priori estimation; Pitch (acoustics)

Summary:	As speech compression technologies have advanced, digital recording devices have become increasingly popular. However, the data formats used in popular speech codecs are known a priori, so compressed data can easily be modified by insertion, deletion, and replacement. This work proposes a content-dependent watermarking scheme, suitable for codebook-excited linear prediction (CELP)-based speech codecs, that ensures the integrity of compressed speech data. Speech data are first partitioned into groups, each of which contains multiple speech frames. The watermark embedded in each frame is then generated from the line spectrum frequency (LSF) feature of the current frame, the pitch extracted from the succeeding frame, the watermark embedded in the preceding frame, and the group index, which is determined by the location of the current frame. Finally, some of the least significant bits (LSBs) of the indices indicating the excitation pulse positions or excitation vectors are replaced with the watermark bits. Conventional watermarking schemes can only detect whether compressed speech data are intact; they cannot determine where the data have been altered by insertion, deletion, or replacement, whereas the proposed scheme can. Experiments established that the proposed scheme, applied to the G.723.1 6.3 kb/s speech codec, embeds 12 bits in each 189-bit compressed speech frame and decreases the perceptual evaluation of speech quality (PESQ) score by only 0.11. Additionally, its accuracy in detecting the locations of attacked frames is very high, with only two normal frames mistaken for attacked frames. Therefore, the proposed watermarking scheme effectively ensures the integrity of compressed speech data.
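
The chained construction described in the summary can be illustrated with a minimal Python sketch. It is not the authors' implementation: the record does not give the generation function, so SHA-256 is used here as a stand-in, and the frame fields (`lsf`, `pitch`, `excitation`), the split of 12 bits over four indices, and the group size are all hypothetical.

```python
import hashlib

WATERMARK_BITS = 12   # bits per frame, as reported for G.723.1 at 6.3 kb/s
BITS_PER_INDEX = 3    # assumption: 12 bits spread over 4 excitation indices


def make_watermark(lsf, next_pitch, prev_wm, group_index):
    """Mix the four inputs named in the summary into a 12-bit value.

    SHA-256 is an assumed stand-in for the paper's generation function.
    """
    payload = f"{lsf}|{next_pitch}|{prev_wm}|{group_index}".encode()
    digest = hashlib.sha256(payload).digest()
    return int.from_bytes(digest[:2], "big") & ((1 << WATERMARK_BITS) - 1)


def embed_lsbs(indices, watermark, bits_per_index=BITS_PER_INDEX):
    """Replace the low bits of each excitation index with watermark bits."""
    mask = (1 << bits_per_index) - 1
    return [(idx & ~mask) | ((watermark >> (k * bits_per_index)) & mask)
            for k, idx in enumerate(indices)]


def extract_lsbs(indices, bits_per_index=BITS_PER_INDEX):
    """Reassemble the watermark from the low bits of the indices."""
    mask = (1 << bits_per_index) - 1
    wm = 0
    for k, idx in enumerate(indices):
        wm |= (idx & mask) << (k * bits_per_index)
    return wm


def expected_watermark(frames, i, prev_wm, group_size):
    """Watermark that frame i should carry, given its neighbours."""
    next_pitch = frames[i + 1]["pitch"] if i + 1 < len(frames) else 0
    return make_watermark(frames[i]["lsf"], next_pitch, prev_wm, i // group_size)


def watermark_stream(frames, group_size=2):
    """Return a watermarked copy of the frames; only excitation LSBs change."""
    out, prev_wm = [], 0
    for i, frame in enumerate(frames):
        wm = expected_watermark(frames, i, prev_wm, group_size)
        out.append({**frame, "excitation": embed_lsbs(frame["excitation"], wm)})
        prev_wm = wm
    return out


def find_suspect_frames(frames, group_size=2):
    """Indices of frames whose embedded watermark does not match."""
    suspects, prev_wm = [], 0
    for i, frame in enumerate(frames):
        found = extract_lsbs(frame["excitation"])
        if found != expected_watermark(frames, i, prev_wm, group_size):
            suspects.append(i)
        prev_wm = found  # chain on what is actually embedded
    return suspects


# Toy data: six frames with made-up parameter values.
clean = watermark_stream([
    {"lsf": 1000 + 13 * i, "pitch": 40 + 7 * i,
     "excitation": [100 + i, 200 + i, 300 + i, 400 + i]}
    for i in range(6)
])
attacked = clean[:2] + clean[3:]   # deletion attack: drop one frame
```

Because each watermark depends on the succeeding frame's pitch, the preceding frame's watermark, and the group index, removing a frame disturbs the check for its neighbours, which is what allows an attack to be localized rather than merely detected. The embedding cost reported in the summary is 12 of 189 bits per frame, roughly 6.3% of the payload.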
ISSN:	1558-7916
DOI:	10.1109/TASL.2007.896658