Structural visualization of sequential DNA data

To date, comparing and visualizing genome sequences remain challenging due to the large genome size. Existing approaches take advantage of the stable property of oligonucleotides and exhibit the main characteristics of the whole genome, yet they commonly fail to show progression patterns of the geno...

Full description

Saved in:

Bibliographic Details
Published in	Frontiers of information technology & electronic engineering Vol. 12; no. 4; pp. 263 - 272
Main Authors	Mao, Xiao-hong, Fu, Jing-hua, Chen, Wei, You, Qian, Fang, Shiao-fen, Peng, Qun-sheng
Format	Journal Article
Language	English
Published	Heidelberg SP Zhejiang University Press 01.04.2011 Springer Nature B.V
Subjects	Communications Engineering Computer Hardware Computer Science Computer Systems Organization and Communication Networks Deoxyribonucleic acid DNA Electrical Engineering Electronics and Microelectronics Gene sequencing Instrumentation Networks Nucleotides Oligonucleotides Sequential analysis Bio-information visualization TP391.1 Genome sequence Sequential visualization R394.3
Online Access	Get full text

Cover

Loading…

More Information
Summary:	To date, comparing and visualizing genome sequences remain challenging due to the large genome size. Existing approaches take advantage of the stable property of oligonucleotides and exhibit the main characteristics of the whole genome, yet they commonly fail to show progression patterns of the genome adjustably. This paper presents a novel visual encoding technique, which not only supports the binning process （phylogenetic analysis）, but also allows the sequential analysis of the genome. The key idea is to regard the combination of each k-nucleotide and its reverse complement as a visual word, and to represent a long genome sequence with a list of local statistical feature vectors derived from the local frequency of the visual words. Experimental results on a variety of examples demonstrate that the presented approach has the ability to quickly and intuitively visualize DNA sequences, and to help the user identify regions of differences among multiple datasets.
Bibliography:	Genome sequence, Sequential visualization, Bio-information visualization TP391.1 R394.3 ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 23
ISSN:	1869-1951 2095-9184 1869-196X 2095-9230
DOI:	10.1631/jzus.C1000091