The Assembled Genome of the Stroke-Prone Spontaneously Hypertensive Rat

We report the creation and evaluation of a de novo assembly of the genome of the spontaneously hypertensive rat, the most widely used model of human cardiovascular disease. The genome is assembled from long read sequencing (PacBio HiFi and continuous long read data [CLR]) and scaffolded with long-ra...

Full description

Saved in:
Bibliographic Details
Published inHypertension (Dallas, Tex. 1979) Vol. 80; no. 1; pp. 138 - 146
Main Authors Kalbfleisch, Theodore S., Hussien AbouEl Ela, Nahla A., Li, Kai, Brashear, Wesley A., Kochan, Kelli J., Hillhouse, Andrew E., Zhu, Yaming, Dhande, Isha S., Kline, Eric J., Hudson, Elizabeth A., Murphy, Terence D., Thibaud-Nissen, Françoise, Smith, Melissa L., Doris, Peter A.
Format Journal Article
LanguageEnglish
Published United States Lippincott Williams & Wilkins 01.01.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:We report the creation and evaluation of a de novo assembly of the genome of the spontaneously hypertensive rat, the most widely used model of human cardiovascular disease. The genome is assembled from long read sequencing (PacBio HiFi and continuous long read data [CLR]) and scaffolded with long-range structural information obtained from Bionano optical maps and proximity ligation sequencing proximity analysis of the genome. The genome assembly was polished with Illumina short reads. Completeness of the assembly was investigated using Benchmarking Universal Single Copy Orthologs analysis. The genome assembly was also evaluated with the rat reference gene set, using NCBI automated protocols. We also generated orthogonal single molecule transcript sequence reads (Iso-Seq) from 8 tissues and used them to validate the coding assembly, to annotate the assembly with RNA transcripts representing unique full length transcript isoforms for each gene and to determine whether divergences between RefSeq sequences and the assembly were attributable to assembly errors or polymorphisms. The assembly analysis indicates that this assembly is comparable in contiguity and completeness to the current rat reference assembly, while the use of HiFi sequencing yields an assembly that is more correct at the single base level. Synteny analysis was performed to uncover the extent of synteny and the presence and distribution of chromosomal rearrangements between the reference and this assembly. The resulting genome assembly is reference quality and captures significant structural variation.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
denotes equal contribution
Software: TSK, FTN, TM
Funding acquisition: PAD, TSK, MLS
Investigation: ISD, YZ, WAB, NAH, KL
Writing – review & editing: PAD, TSK, MLS, AEH, TM, FTN
Project administration: PAD, TSK, MLS
Data curation: PAD, TSK, MLS
Resources: PAD, TSK, MLS, AEH
Lead Contact: Peter A Doris, Ph.D., Center for Human Genetics, Brown Foundation Institute of Molecular Medicine, University of Texas McGovern School of Medicine, 1825 Pressler St, Houston, TX 77030
Conceptualization: PAD, TSK, MLS
Writing – original draft: PAD, TSK, MLS
Author Contributions
Methodology: KK, AEH, MLS, EJK, EAH, ISD
Formal analysis: PAD, TSK, MLS, FTN, TM
ISSN:0194-911X
1524-4563
1524-4563
DOI:10.1161/HYPERTENSIONAHA.122.20140