Development of methylation-based biomarkers for breast cancer detection by model training and validation in synthetic cell-free DNA

Circulating tumour-derived DNA (ctDNA) carries the genetic and epigenetic characteristics of the tumour from which it is derived and can give information about the biology and tissue origins of the underlying tumour. DNA methylation is an epigenetic mark that is specific to individual tissues and, a...

Full description

Saved in:
Bibliographic Details
Published inbioRxiv
Main Authors De Procé, Sophie Marion, Adamowicz, Martyna, Dutta, Prasun, Warlow, Sophie J, Moss, Joshua, Shemer, Ruth, Dor, Yuval, Christelle, Robert, Aitman, Timothy J
Format Paper
LanguageEnglish
Published Cold Spring Harbor Cold Spring Harbor Laboratory Press 14.02.2022
Subjects
Online AccessGet full text

Cover

Loading…
Abstract Circulating tumour-derived DNA (ctDNA) carries the genetic and epigenetic characteristics of the tumour from which it is derived and can give information about the biology and tissue origins of the underlying tumour. DNA methylation is an epigenetic mark that is specific to individual tissues and, as methylation profiles are disrupted in tumours, they can indicate the tissue of origin and cancer type of ctDNA. We have developed a set of methylation biomarkers for detecting breast cancer in plasma cell-free DNA (cfDNA). First, we mined publicly available methylation datasets to create synthetic methylation profiles that were modelled to reflect cfDNA from healthy subjects and cancer patients. These profiles were restricted to the most differentially methylated CpGs between breast tumour samples and haematopoietic cells. Regularised logistic regression models were trained using 10-fold cross-validation on synthetic cfDNA datasets with distinct fractions of breast tumour DNA spiked in silico into healthy cfDNA with the addition of 10% of a mix of different tissues. Initial validation with synthetic cfDNA permitted detection of breast cancer-derived DNA with as little as 0.25% tumour DNA spiked in silico into healthy subject cfDNA with an area under ROC curve (AUC) of 0.63. Performances of classifiers increased with increased fractions of spike-in tumour DNA (AUCs 0.77 and 0.93 at tumour DNA fractions 0.5% and 1% respectively). We then combined the most discriminative CpG markers from our models with methylation markers of breast cancer that had already been published to obtain a single marker set. In vitro testing of MCF-7 breast cancer cell line DNA spiked into leukocyte DNA showed highly significant correlation for individual markers between laboratory-measured and published methylation data for MCF-7 and leukocytes (R > 0.89, P < 2.2 x 10-16). These preliminary data indicate promising results for detection of breast cancer cell line DNA using this methylation marker set, which now require testing in cfDNA from breast cancer patients and healthy controls. Competing Interest Statement TA is Co-founder and Director of the company BioCaptiva Ltd. JM, RS and YD have filed for patents on methylation-based cfDNA analysis. YD received funding from Grail, a liquid biopsy company.
AbstractList Circulating tumour-derived DNA (ctDNA) carries the genetic and epigenetic characteristics of the tumour from which it is derived and can give information about the biology and tissue origins of the underlying tumour. DNA methylation is an epigenetic mark that is specific to individual tissues and, as methylation profiles are disrupted in tumours, they can indicate the tissue of origin and cancer type of ctDNA. We have developed a set of methylation biomarkers for detecting breast cancer in plasma cell-free DNA (cfDNA). First, we mined publicly available methylation datasets to create synthetic methylation profiles that were modelled to reflect cfDNA from healthy subjects and cancer patients. These profiles were restricted to the most differentially methylated CpGs between breast tumour samples and haematopoietic cells. Regularised logistic regression models were trained using 10-fold cross-validation on synthetic cfDNA datasets with distinct fractions of breast tumour DNA spiked in silico into healthy cfDNA with the addition of 10% of a mix of different tissues. Initial validation with synthetic cfDNA permitted detection of breast cancer-derived DNA with as little as 0.25% tumour DNA spiked in silico into healthy subject cfDNA with an area under ROC curve (AUC) of 0.63. Performances of classifiers increased with increased fractions of spike-in tumour DNA (AUCs 0.77 and 0.93 at tumour DNA fractions 0.5% and 1% respectively). We then combined the most discriminative CpG markers from our models with methylation markers of breast cancer that had already been published to obtain a single marker set. In vitro testing of MCF-7 breast cancer cell line DNA spiked into leukocyte DNA showed highly significant correlation for individual markers between laboratory-measured and published methylation data for MCF-7 and leukocytes (R > 0.89, P < 2.2 x 10-16). These preliminary data indicate promising results for detection of breast cancer cell line DNA using this methylation marker set, which now require testing in cfDNA from breast cancer patients and healthy controls. Competing Interest Statement TA is Co-founder and Director of the company BioCaptiva Ltd. JM, RS and YD have filed for patents on methylation-based cfDNA analysis. YD received funding from Grail, a liquid biopsy company.
Author Dor, Yuval
Dutta, Prasun
Christelle, Robert
Warlow, Sophie J
De Procé, Sophie Marion
Aitman, Timothy J
Adamowicz, Martyna
Moss, Joshua
Shemer, Ruth
Author_xml – sequence: 1
  givenname: Sophie
  surname: De Procé
  middlename: Marion
  fullname: De Procé, Sophie Marion
– sequence: 2
  givenname: Martyna
  surname: Adamowicz
  fullname: Adamowicz, Martyna
– sequence: 3
  givenname: Prasun
  surname: Dutta
  fullname: Dutta, Prasun
– sequence: 4
  givenname: Sophie
  surname: Warlow
  middlename: J
  fullname: Warlow, Sophie J
– sequence: 5
  givenname: Joshua
  surname: Moss
  fullname: Moss, Joshua
– sequence: 6
  givenname: Ruth
  surname: Shemer
  fullname: Shemer, Ruth
– sequence: 7
  givenname: Yuval
  surname: Dor
  fullname: Dor, Yuval
– sequence: 8
  givenname: Robert
  surname: Christelle
  fullname: Christelle, Robert
– sequence: 9
  givenname: Timothy
  surname: Aitman
  middlename: J
  fullname: Aitman, Timothy J
BookMark eNotjTtPwzAUhT3AAIUfwHYl5hS_44xVy0uqYOle2ckNNTh2cNxKnfnjlMd0dKTzfeeSnMUUkZAbRueMUXbHKedzyk9lLg2lRl2QrxUeMKRxwFgg9TBg2R2DLT7FytkJO3A-DTZ_YJ6gTxlcRjsVaG1sMUOHBdufMbgjDKnDACVbH318Axs7ONjgu18b-AjTMZYdFt9CiyFUfUaE1cviipz3Nkx4_Z8zsnm43yyfqvXr4_Nysa7GmqlKa2msMpzpRtS6FrJjoheoqHFSUIO1kbRhthYNVX1bN9xoJSjjLXVKoNNiRm7_tGNOn3ucyvY97XM8PW655idaSsXFN72TXRM
ContentType Paper
Copyright 2022. This article is published under http://creativecommons.org/licenses/by-nc-nd/4.0/ (“the License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Copyright_xml – notice: 2022. This article is published under http://creativecommons.org/licenses/by-nc-nd/4.0/ (“the License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
DBID 8FE
8FH
AAFGM
AAMXL
ABOIG
ABUWG
ADZZV
AFKRA
AFLLJ
AFOLM
AGAJT
AQTIP
AZQEC
BBNVY
BENPR
BHPHI
CCPQU
DWQXO
GNUQQ
HCIFZ
LK8
M7P
PIMPY
PQCXX
PQEST
PQQKQ
PQUKI
PRINS
DOI 10.1101/2022.02.11.480085
DatabaseName ProQuest SciTech Collection
ProQuest Natural Science Collection
ProQuest Central Korea - hybrid linking
Natural Science Collection - hybrid linking
Biological Science Collection - hybrid linking
ProQuest Central (Alumni)
ProQuest Central (Alumni) - hybrid linking
ProQuest Central
SciTech Premium Collection - hybrid linking
ProQuest Central Student - hybrid linking
ProQuest Central Essentials - hybrid linking
ProQuest Women's & Gender Studies - hybrid linking
ProQuest Central Essentials
Biological Science Collection
ProQuest Central
Natural Science Collection
ProQuest One Community College
ProQuest Central Korea
ProQuest Central Student
SciTech Premium Collection
ProQuest Biological Science Collection
Biological Science Database
Publicly Available Content (ProQuest)
ProQuest Central - hybrid linking
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Academic
ProQuest One Academic UKI Edition
ProQuest Central China
DatabaseTitle Publicly Available Content Database
ProQuest Central Student
ProQuest Biological Science Collection
ProQuest Central Essentials
ProQuest One Academic Eastern Edition
ProQuest Central (Alumni Edition)
SciTech Premium Collection
ProQuest One Community College
ProQuest Natural Science Collection
Biological Science Database
ProQuest SciTech Collection
ProQuest Central China
ProQuest Central
ProQuest One Academic UKI Edition
Natural Science Collection
ProQuest Central Korea
Biological Science Collection
ProQuest One Academic
DatabaseTitleList Publicly Available Content Database
Database_xml – sequence: 1
  dbid: BENPR
  name: ProQuest Central
  url: https://www.proquest.com/central
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Genre Working Paper/Pre-Print
GroupedDBID 8FE
8FH
ABUWG
AFKRA
AZQEC
BBNVY
BENPR
BHPHI
CCPQU
DWQXO
GNUQQ
HCIFZ
LK8
M7P
PIMPY
PQEST
PQQKQ
PQUKI
PRINS
ID FETCH-LOGICAL-p715-6648a582169376734d13f3e508b4308e784091a73905fc7928653012c0b53eb63
IEDL.DBID BENPR
IngestDate Thu Oct 10 18:06:19 EDT 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-p715-6648a582169376734d13f3e508b4308e784091a73905fc7928653012c0b53eb63
OpenAccessLink https://www.proquest.com/docview/2628404452?pq-origsite=%requestingapplication%
PQID 2628404452
PQPubID 2050091
ParticipantIDs proquest_journals_2628404452
PublicationCentury 2000
PublicationDate 20220214
PublicationDateYYYYMMDD 2022-02-14
PublicationDate_xml – month: 02
  year: 2022
  text: 20220214
  day: 14
PublicationDecade 2020
PublicationPlace Cold Spring Harbor
PublicationPlace_xml – name: Cold Spring Harbor
PublicationTitle bioRxiv
PublicationYear 2022
Publisher Cold Spring Harbor Laboratory Press
Publisher_xml – name: Cold Spring Harbor Laboratory Press
Score 1.6909148
Snippet Circulating tumour-derived DNA (ctDNA) carries the genetic and epigenetic characteristics of the tumour from which it is derived and can give information about...
SourceID proquest
SourceType Aggregation Database
SubjectTerms Biomarkers
Biopsy
Breast cancer
Cell culture
CpG islands
Deoxyribonucleic acid
DNA
DNA methylation
Epigenetics
Leukocytes
Regression analysis
Tumors
Title Development of methylation-based biomarkers for breast cancer detection by model training and validation in synthetic cell-free DNA
URI https://www.proquest.com/docview/2628404452
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1LS8NAEF60vXhTVHxUmYPX1Wz2keQkPlqKYClSobeym91AL2lN4qFn_7gzaYqC4HlvO--Zb-Zj7CZzkfBphoaEwudKF5ZnSmTcS0zvhdfattf1Xydm_K5e5nreNdzqDla584mto_arnHrkd7FBRxoppeP79Qcn1iiarnYUGvusHwuZpj3WfxxOpm_d-BLVjYr7mK5yCnGrUkow_jjdNpKMDll_atehOmJ7oTxmX78wO7AqgPicN1t0Gqf44oHW4wlBU9WA6SU4wpA3kJOsKvChaZFUJbgNtJw2sKN8AFt6QC1abjmTYFlCvSkx20NFAerW86IKAZ4nDydsNhrOnsa8o0Xg60RoboxKLa23mowusUjlhSxkwETLKRmlIaGSTdhEZpEu8iSj1VO04jiPnJbBGXnKeuWqDGcM4iSEGIWUW-lVbrx1aOGJwbhWYJkUinM22H3VolPtevEjiIv_ny_ZAX0-QZyFGrBeU32GK4zgjbvuxPQNh5mdHA
link.rule.ids 780,784,21388,27925,33744,43805
linkProvider ProQuest
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV07T8MwELagHWADAeJR4AZWQxw_kkyIR6sCbVShInWr7NiRuqQlCUNn_ji-kAokJOZs-e7t7-4j5CoxAbNx4h3Jg0-FzDVNBEuo5b68Z1ZK3VzXH6dq-CaeZ3LWDtyqlla5iYlNoLbLDGfkN6HygTQQQoa3q3eKqlH4utpKaGyTLl5Olx3Sve-nk9f2-dKbGzb3IV7lZOxaxFhg_Am6TSYZ7JHuRK9cuU-2XHFAPn9xdmCZA-o5r7_ZaRTziwVcj0cGTVmBLy_BIIe8hgyxKsG6umFSFWDW0GjawEbyAXRhwVvR4lszCRYFVOvCV3veUACn9TQvnYPH9O6QTAf96cOQtrIIdBUxSZUSscb1VpXgJRYuLOM5d77QMoIHsYuwZWM64kkg8yxKcPXUe3GYBUZyZxQ_Ip1iWbhjAmHkXOhByjS3IlNWG-_hkfJ5LfdtkstPSG_zq-ataVfzHyBO__98SXaG0_FoPnpKX87ILgKBdGcmeqRTlx_u3Gfz2ly0kH0BAWegBA
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Development+of+methylation-based+biomarkers+for+breast+cancer+detection+by+model+training+and+validation+in+synthetic+cell-free+DNA&rft.jtitle=bioRxiv&rft.au=De+Proc%C3%A9%2C+Sophie+Marion&rft.au=Adamowicz%2C+Martyna&rft.au=Dutta%2C+Prasun&rft.au=Warlow%2C+Sophie+J&rft.date=2022-02-14&rft.pub=Cold+Spring+Harbor+Laboratory+Press&rft_id=info:doi/10.1101%2F2022.02.11.480085