Exploring the transcriptome space of a recombinant BHK cell line through next generation sequencing

Baby Hamster Kidney (BHK) cell lines are used in the production of veterinary vaccines and recombinant proteins. To facilitate transcriptome analysis of BHK cell lines, we embarked on an effort to sequence, assemble, and annotate transcript sequences from a recombinant BHK cell line and Syrian hamst...

Full description

Saved in:
Bibliographic Details
Published inBiotechnology and bioengineering Vol. 111; no. 4; pp. 770 - 781
Main Authors Johnson, Kathryn C., Yongky, Andrew, Vishwanathan, Nandita, Jacob, Nitya M., Jayapal, Karthik P., Goudar, Chetan T., Karypis, George, Hu, Wei-Shou
Format Journal Article
LanguageEnglish
Published United States Blackwell Publishing Ltd 01.04.2014
Wiley Subscription Services, Inc
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Baby Hamster Kidney (BHK) cell lines are used in the production of veterinary vaccines and recombinant proteins. To facilitate transcriptome analysis of BHK cell lines, we embarked on an effort to sequence, assemble, and annotate transcript sequences from a recombinant BHK cell line and Syrian hamster liver and brain. RNA‐seq data were supplemented with 6,170 Sanger ESTs from parental and recombinant BHK lines to generate 221,583 contigs. Annotation by homology to other species, primarily mouse, yielded more than 15,000 unique Ensembl mouse gene IDs with high coverage of KEGG canonical pathways. High coverage of enzymes and isoforms was seen for cell metabolism and N‐glycosylation pathways, areas of highest interest for biopharmaceutical production. With the high sequencing depth in RNA‐seq data, we set out to identify single‐nucleotide variants in the transcripts. A majority of the high‐confidence variants detected in both hamster tissue libraries occurred at a frequency of 50%, indicating their origin as heterozygous germline variants. In contrast, the cell line libraries' variants showed a wide range of occurrence frequency, indicating the presence of a heterogeneous population in cultured cells. The extremely high coverage of transcripts of highly abundant genes in RNA‐seq enabled us to identify low‐frequency variants. Experimental verification through Sanger sequencing confirmed the presence of two variants in the cDNA of a highly expressed gene in the BHK cell line. Furthermore, we detected seven potential missense mutations in the genes of the growth signaling pathways that may have arisen during the cell line derivation process. The development and characterization of a BHK reference transcriptome will facilitate future efforts to understand, monitor, and manipulate BHK cells. Our study on sequencing variants is crucial for improved understanding of the errors inherent in high‐throughput sequencing and to increase the accuracy of variant calling in BHK or other systems. Biotechnol. Bioeng. 2014;111: 770–781. © 2013 Wiley Periodicals, Inc.
Bibliography:ark:/67375/WNG-NXN297HL-8
istex:4683499A6732826AE51BDD0686CDCE0302010886
Bayer HealthCare
NSF Graduate Research Fellowship
ArticleID:BIT25135
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0006-3592
1097-0290
DOI:10.1002/bit.25135