Predicting molecular mechanisms of hereditary diseases by using their tissue‐selective manifestation
How do aberrations in widely expressed genes lead to tissue‐selective hereditary diseases? Previous attempts to answer this question were limited to testing a few candidate mechanisms. To answer this question at a larger scale, we developed “Tissue Risk Assessment of Causality by Expression” (TRACE)...
Saved in:
Published in | Molecular systems biology Vol. 19; no. 8; pp. e11407 - n/a |
---|---|
Main Authors | , , , , , , , , , , , , , , , |
Format | Journal Article |
Language | English |
Published |
London
Nature Publishing Group UK
08.08.2023
EMBO Press John Wiley and Sons Inc Springer Nature |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | How do aberrations in widely expressed genes lead to tissue‐selective hereditary diseases? Previous attempts to answer this question were limited to testing a few candidate mechanisms. To answer this question at a larger scale, we developed “Tissue Risk Assessment of Causality by Expression” (TRACE), a machine learning approach to predict genes that underlie tissue‐selective diseases and selectivity‐related features. TRACE utilized 4,744 biologically interpretable tissue‐specific gene features that were inferred from heterogeneous omics datasets. Application of TRACE to 1,031 disease genes uncovered known and novel selectivity‐related features, the most common of which was previously overlooked. Next, we created a catalog of tissue‐associated risks for 18,927 protein‐coding genes (
https://netbio.bgu.ac.il/trace/
). As proof‐of‐concept, we prioritized candidate disease genes identified in 48 rare‐disease patients. TRACE ranked the verified disease gene among the patient's candidate genes significantly better than gene prioritization methods that rank by gene constraint or tissue expression. Thus, tissue selectivity combined with machine learning enhances genetic and clinical understanding of hereditary diseases.
Synopsis
An interpretable machine‐learning framework predicts disease genes for tissue‐selective hereditary diseases. The framework highlights known and novel tissue‐selectivity features and enhances genetic diagnosis of rare‐disease patients.
An interpretable machine‐learning (ML) framework uses thousands of gene features to predict tissue‐associated disease genes.
ML models highlight known and novel tissue‐selectivity mechanisms.
An online catalogue of tissue‐associated risks for 18,927 protein‐coding genes in eight tissues is presented.
The framework and catalogue enhance genetic diagnosis of rare‐disease patients.
Graphical Abstract
An interpretable machine‐learning framework predicts disease genes for tissue‐selective hereditary diseases. The framework highlights known and novel tissue‐selectivity features and enhances genetic diagnosis of rare‐disease patients. |
---|---|
Bibliography: | Membership of the GTEx Consortium appears in the Appendix ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
ISSN: | 1744-4292 1744-4292 |
DOI: | 10.15252/msb.202211407 |