A taxonomy of tools and approaches for distributed genomic analyses
The amount of biomedical data collected and stored has grown significantly. Analyzing these extensive amounts of data cannot be done by individuals or single organizations anymore. Thus, the scientific community is creating global collaborative efforts to analyze these data. However, biomedical data...
Saved in:
Published in | Informatics in medicine unlocked Vol. 32; pp. 101024 - 17 |
---|---|
Main Authors | , , , , |
Format | Journal Article |
Language | English |
Published |
Elsevier Ltd
2022
Elsevier |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | The amount of biomedical data collected and stored has grown significantly. Analyzing these extensive amounts of data cannot be done by individuals or single organizations anymore. Thus, the scientific community is creating global collaborative efforts to analyze these data. However, biomedical data is subject to several legal and socio-economic restrictions hindering the possibilities for research collaboration. In this paper, we argue that researchers require new tools and techniques to address the restrictions and needs of global scientific collaborations over geo-distributed biomedical data. These tools and techniques must support what we call Fully Distributed Collaborations (FDC), which are research endeavors that harness means to exploit and analyze massive biomedical information collaboratively while respecting legal and socio-economical restrictions. This paper first motivates and discusses the requirements of FDCs in the context of a research collaboration on the development of diagnostic and predictive tools for the risk of intracranial aneurysm formation and rupture (the ICAN project). The paper then presents a taxonomy classifying the current tools and techniques for biomedical analysis with respect to the proposed requirements. The taxonomy considers three key architectural features to support FDC scenarios: data and computation placement, Privacy and Security, and Performance and Scalability. The review reveals new research opportunities to design tools and techniques for multi-site analyses encouraging scientific collaborations while mitigating technical and legal constraints.
•Taxonomy of existing Biomedical Analytical Tools and Techniques.•Classification into three features: biomedical problems around genomic data, tool support provided for biomedical analyses, and support for distributed cooperation.•Fully distributed collaborations promise to enable more powerful biomedical analyses using distributed workflows analysis workflows.•Fully distributed collaborations operate over large volumes of shared data.•Fully distributed collaborations alleviate restrictions typically relying on their legal framework and security and technical constraints over local processes and workflows. |
---|---|
ISSN: | 2352-9148 2352-9148 |
DOI: | 10.1016/j.imu.2022.101024 |