Abstract 3571: Precise somatic mutation prediction in the absence of matching normal DNA

Accurate identification of somatic mutations is an essential first step for many cancer studies. It is usually done by comparing the genome of the tumour to the reference genome sequence derived from a normal tissue taken from the same donor. However, there are a number of possible scenarios in whic...

Full description

Saved in:
Bibliographic Details
Published inCancer research (Chicago, Ill.) Vol. 77; no. 13_Supplement; p. 3571
Main Authors Kalatskaya, Irina, Trinh, Quang, Spears, Melanie, Bartlett, John, Stein, Lincoln
Format Journal Article
LanguageEnglish
Published 01.07.2017
Online AccessGet full text

Cover

Loading…
Abstract Accurate identification of somatic mutations is an essential first step for many cancer studies. It is usually done by comparing the genome of the tumour to the reference genome sequence derived from a normal tissue taken from the same donor. However, there are a number of possible scenarios in which matched normal tissues might be not available for comparisons. It is most commonly encountered when performing analysis on retrospective studies with human tissues from clinical trials or pathology archives when normal samples were not collected in the first place or patient consent precludes examination of normal tissue or germline variants. Another common scenario is the use of a cancer cell line as an experimental model, many of which have no information on the donor’s normal genome. In this work, we describe an algorithm to identify somatic single nucleotide variants (SNVs) in Next Generation Sequencing (NGS) data in the absence of normal samples using a machine learning approach. Our algorithm was evaluated using a family of supervised learning classifications across six different cancer types and ~1,600 samples, including cell lines, fresh frozen tissues, and formalin-fixed paraffin-embedded (FFPE) tissues. In addition, our algorithm was tested with both deep targeted sequencing and whole exome sequencing strategies. The algorithm correctly classified between 95% and 98% of somatic mutations with F1-measure ranges from 75.9% to 98.6% depending on the tumour type. We have released the algorithm as a software package called ISOWN (Identification of SOmatic mutations Without matching Normal tissues), which is available as Open Source under Apache License 2.0 from https://github.com/ikalatskaya/ISOWN. Citation Format: Irina Kalatskaya, Quang Trinh, Melanie Spears, John Bartlett, Lincoln Stein. Precise somatic mutation prediction in the absence of matching normal DNA [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2017; 2017 Apr 1-5; Washington, DC. Philadelphia (PA): AACR; Cancer Res 2017;77(13 Suppl):Abstract nr 3571. doi:10.1158/1538-7445.AM2017-3571
AbstractList Accurate identification of somatic mutations is an essential first step for many cancer studies. It is usually done by comparing the genome of the tumour to the reference genome sequence derived from a normal tissue taken from the same donor. However, there are a number of possible scenarios in which matched normal tissues might be not available for comparisons. It is most commonly encountered when performing analysis on retrospective studies with human tissues from clinical trials or pathology archives when normal samples were not collected in the first place or patient consent precludes examination of normal tissue or germline variants. Another common scenario is the use of a cancer cell line as an experimental model, many of which have no information on the donor’s normal genome. In this work, we describe an algorithm to identify somatic single nucleotide variants (SNVs) in Next Generation Sequencing (NGS) data in the absence of normal samples using a machine learning approach. Our algorithm was evaluated using a family of supervised learning classifications across six different cancer types and ~1,600 samples, including cell lines, fresh frozen tissues, and formalin-fixed paraffin-embedded (FFPE) tissues. In addition, our algorithm was tested with both deep targeted sequencing and whole exome sequencing strategies. The algorithm correctly classified between 95% and 98% of somatic mutations with F1-measure ranges from 75.9% to 98.6% depending on the tumour type. We have released the algorithm as a software package called ISOWN (Identification of SOmatic mutations Without matching Normal tissues), which is available as Open Source under Apache License 2.0 from https://github.com/ikalatskaya/ISOWN. Citation Format: Irina Kalatskaya, Quang Trinh, Melanie Spears, John Bartlett, Lincoln Stein. Precise somatic mutation prediction in the absence of matching normal DNA [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2017; 2017 Apr 1-5; Washington, DC. Philadelphia (PA): AACR; Cancer Res 2017;77(13 Suppl):Abstract nr 3571. doi:10.1158/1538-7445.AM2017-3571
Author Trinh, Quang
Stein, Lincoln
Bartlett, John
Kalatskaya, Irina
Spears, Melanie
Author_xml – sequence: 1
  givenname: Irina
  surname: Kalatskaya
  fullname: Kalatskaya, Irina
– sequence: 2
  givenname: Quang
  surname: Trinh
  fullname: Trinh, Quang
– sequence: 3
  givenname: Melanie
  surname: Spears
  fullname: Spears, Melanie
– sequence: 4
  givenname: John
  surname: Bartlett
  fullname: Bartlett, John
– sequence: 5
  givenname: Lincoln
  surname: Stein
  fullname: Stein, Lincoln
BookMark eNqdj8FOwzAQRFeolUhbPgFpfyCt3cRKxC2iIC4gDj30Zrnuhho1duV1D_w9MSA-oKeZWe1Ib2Yw8cETwL0USylVu5KqasumrtWye10L2ZSVauQNFP_3CRRCiLZUdbO-hRnz5xiVFKqAXbfnFI1NmEsP-B7JOibkMJjkLA6XNGrweI50cPbHOo_pSGj2TN4Shh7HX3t0_gN9iIM54eatW8C0Nyemuz-dg3p-2j6-lDYG5ki9Pkc3mPilpdB5hs64OuPq3xk6E1XX9r4BBwNSXA
ContentType Journal Article
DBID AAYXX
CITATION
DOI 10.1158/1538-7445.AM2017-3571
DatabaseName CrossRef
DatabaseTitle CrossRef
DatabaseTitleList CrossRef
DeliveryMethod fulltext_linktorsrc
Discipline Medicine
EISSN 1538-7445
EndPage 3571
ExternalDocumentID 10_1158_1538_7445_AM2017_3571
GroupedDBID ---
-ET
18M
29B
2WC
34G
39C
476
53G
5GY
5RE
5VS
6J9
AAYXX
ABOCM
ACGFO
ACIWK
ACPRK
ACSVP
ADBBV
ADCOW
ADNWM
AENEX
AFHIN
AFOSN
AFRAH
ALMA_UNASSIGNED_HOLDINGS
BAWUL
BTFSW
CITATION
CS3
DIK
DU5
EBS
EJD
F5P
FRP
GX1
H13
IH2
KQ8
L7B
LSO
OK1
P0W
P2P
PQQKQ
RCR
RHF
RHI
RNS
SJN
TR2
W2D
W8F
WH7
WOQ
YKV
YZZ
ID FETCH-crossref_primary_10_1158_1538_7445_AM2017_35713
ISSN 0008-5472
IngestDate Thu Nov 21 23:23:13 EST 2024
IsPeerReviewed true
IsScholarly true
Issue 13_Supplement
Language English
LinkModel OpenURL
MergedId FETCHMERGED-crossref_primary_10_1158_1538_7445_AM2017_35713
ParticipantIDs crossref_primary_10_1158_1538_7445_AM2017_3571
PublicationCentury 2000
PublicationDate 2017-07-01
PublicationDateYYYYMMDD 2017-07-01
PublicationDate_xml – month: 07
  year: 2017
  text: 2017-07-01
  day: 01
PublicationDecade 2010
PublicationTitle Cancer research (Chicago, Ill.)
PublicationYear 2017
SSID ssj0005105
Score 4.5531297
Snippet Accurate identification of somatic mutations is an essential first step for many cancer studies. It is usually done by comparing the genome of the tumour to...
SourceID crossref
SourceType Aggregation Database
StartPage 3571
Title Abstract 3571: Precise somatic mutation prediction in the absence of matching normal DNA
Volume 77
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1LS8NAEF5qBfEiPvHNHryVRNvstqm3UJRqabFQobeweYhiTEuSHvRn-Qud2d080CLWS2gCmWwyX2dm50nIRctnbStotwxP8LbBRNc37C6zDe_qCbS_sHjgob9jOGr3H9n9lE9rtc9K1tIi80z_Y2ldyX-4CteAr1gluwJnC6JwAX4Df-EIHIbjn3jseOio8LOGxTtN3Ns_YK-KNGykM9WJ9W2hkwnnCQZkqomNwkvlnxrj6yCOpSMqRgMWxODIqdqsPQRG0tBtgZ5l3FclcEgBE0VmxZ0wEJHI0lfxLo3Su0RP55bOATiRTpzxQmh9KcP2QFW5vMMI691L32oCT9exqzxhWPsnmp0il7WUubbBmRrQY4almO0w1Ugyl8N6nIvGm-XKsaZl_o8SsPg9K8o6P_2pCDgWNxTPMZ2hXFt5e7Xx9jeFWKQpyg0St10k4yIZV5FxkcwaWcfmizivYTAuO9RznTabv7WuGgMyl0tXU7GHKobNZJts6R0JdRS8dkgtjHfJxlDnXOyRaY4yioSuqcYY1RijOcZoiTH6ElPAGNUYo7MnmmOMKoxRwNg-4bc3k17fyFfmzlXrE_fXL2IdkHo8i8NDQr0W8xmWKQc4byGETXVg-7bfETZYxLzLjoi5Gu3jVW84IZslGk9JPUsW4RnYj5l3Lhn2BV8uaa8
link.rule.ids 314,780,784,27924,27925
linkProvider Colorado Alliance of Research Libraries
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Abstract+3571%3A+Precise+somatic+mutation+prediction+in+the+absence+of+matching+normal+DNA&rft.jtitle=Cancer+research+%28Chicago%2C+Ill.%29&rft.au=Kalatskaya%2C+Irina&rft.au=Trinh%2C+Quang&rft.au=Spears%2C+Melanie&rft.au=Bartlett%2C+John&rft.date=2017-07-01&rft.issn=0008-5472&rft.eissn=1538-7445&rft.volume=77&rft.issue=13_Supplement&rft.spage=3571&rft.epage=3571&rft_id=info:doi/10.1158%2F1538-7445.AM2017-3571&rft.externalDBID=n%2Fa&rft.externalDocID=10_1158_1538_7445_AM2017_3571
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0008-5472&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0008-5472&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0008-5472&client=summon