Streamlined Genome Sequence Compression using Distributed Source Coding

We aim at developing a streamlined genome sequence compression algorithm to support alternative miniaturized sequencing devices, which have limited communication, storage, and computation power. Existing techniques that require heavy client (encoder side) cannot be applied. To tackle this challenge,...

Full description

Saved in:
Bibliographic Details
Published inCancer informatics Vol. 2014; no. Suppl. 1; pp. 123 - 131
Main Authors Wang, Shuang, Jiang, Xiaoqian, Chen, Feng, Cui, Lijuan, Cheng, Samuel
Format Journal Article
LanguageEnglish
Published London, England SAGE Publishing 01.01.2014
SAGE Publications
Libertas Academica
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:We aim at developing a streamlined genome sequence compression algorithm to support alternative miniaturized sequencing devices, which have limited communication, storage, and computation power. Existing techniques that require heavy client (encoder side) cannot be applied. To tackle this challenge, we carefully examined distributed source coding theory and developed a customized reference-based genome compression protocol to meet the low-complexity need at the client side. Based on the variation between source and reference, our protocol will pick adaptively either syndrome coding or hash coding to compress subsequences of changing code length. Our experimental results showed promising performance of the proposed method when compared with the state-of-the-art algorithm (GRS).
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1176-9351
1176-9351
DOI:10.4137/CIN.S13879