Extensive genomic and transcriptional diversity identified through massively parallel DNA and RNA sequencing of eighteen Korean individuals

Jeong Sun-Seo and colleagues report whole-genome sequencing of ten Korean individuals and exome sequencing on an additional eight Korean individuals. They also performed transcriptome sequencing on 17 of these individuals. The authors identified approximately 1.83 million previously unidentified SNP...

Full description

Saved in:
Bibliographic Details
Published inNature genetics Vol. 43; no. 8; pp. 745 - 752
Main Authors Seo, Jeong-Sun, Ju, Young Seok, Kim, Jong-Il, Kim, Sheehyun, Hong, Dongwan, Park, Hansoo, Shin, Jong-Yeon, Lee, Seungbok, Lee, Won-Chul, Kim, Sujung, Yu, Saet-Byeol, Park, Sung-Soo, Seo, Seung-Hyun, Yun, Ji-Young, Kim, Hyun-Jin, Lee, Dong-Sung, Yavartanoo, Maryam, Kang, Hyunseok Peter, Gokcumen, Omer, Govindaraju, Diddahally R, Jung, Jung Hee, Chong, Hyonyong, Yang, Kap-Seok, Kim, Hyungtae, Lee, Charles
Format Journal Article
LanguageEnglish
Published New York Nature Publishing Group US 01.08.2011
Nature Publishing Group
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Jeong Sun-Seo and colleagues report whole-genome sequencing of ten Korean individuals and exome sequencing on an additional eight Korean individuals. They also performed transcriptome sequencing on 17 of these individuals. The authors identified approximately 1.83 million previously unidentified SNPs. Massively parallel sequencing technologies have identified a broad spectrum of human genome diversity. Here we deep sequenced and correlated 18 genomes and 17 transcriptomes of unrelated Korean individuals. This has allowed us to construct a genome-wide map of common and rare variants and also identify variants formed during DNA-RNA transcription. We identified 9.56 million genomic variants, 23.2% of which appear to be previously unidentified. From transcriptome sequencing, we discovered 4,414 transcripts not previously annotated. Finally, we revealed 1,809 sites of transcriptional base modification, where the transcriptional landscape is different from the corresponding genomic sequences, and 580 sites of allele-specific expression. Our findings suggest that a considerable number of unexplored genomic variants still remain to be identified in the human genome, and that the integrated analysis of genome and transcriptome sequencing is powerful for understanding the diversity and functional aspects of human genomic variants.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ObjectType-Article-2
ObjectType-Feature-1
ISSN:1061-4036
1546-1718
DOI:10.1038/ng.872