AVOCADO: Visualization of Workflow-Derived Data Provenance for Reproducible Biomedical Research

A major challenge in data‐driven biomedical research lies in the collection and representation of data provenance information to ensure that findings are reproducibile. In order to communicate and reproduce multi‐step analysis workflows executed on datasets that contain data for dozens or hundreds o...

Full description

Saved in:
Bibliographic Details
Published inComputer graphics forum Vol. 35; no. 3; pp. 481 - 490
Main Authors Stitz, H., Luger, S., Streit, M., Gehlenborg, N.
Format Journal Article
LanguageEnglish
Published England Blackwell Publishing Ltd 01.06.2016
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:A major challenge in data‐driven biomedical research lies in the collection and representation of data provenance information to ensure that findings are reproducibile. In order to communicate and reproduce multi‐step analysis workflows executed on datasets that contain data for dozens or hundreds of samples, it is crucial to be able to visualize the provenance graph at different levels of aggregation. Most existing approaches are based on node‐link diagrams, which do not scale to the complexity of typical data provenance graphs. In our proposed approach, we reduce the complexity of the graph using hierarchical and motif‐based aggregation. Based on user action and graph attributes, a modular degree‐of‐interest (DoI) function is applied to expand parts of the graph that are relevant to the user. This interest‐driven adaptive approach to provenance visualization allows users to review and communicate complex multi‐step analyses, which can be based on hundreds of files that are processed by numerous workflows. We have integrated our approach into an analysis platform that captures extensive data provenance information, and demonstrate its effectiveness by means of a biomedical usage scenario.
Bibliography:istex:59A92BF1996CA7F10DF1F6E01BDC850574840C1D
ArticleID:CGF12924
Supporting InformationSupporting Information
ark:/67375/WNG-KW702M8G-D
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 14
ObjectType-Article-1
ObjectType-Feature-2
content type line 23
Equal contribution
ISSN:0167-7055
1467-8659
DOI:10.1111/cgf.12924