nf-core/airrflow: An adaptive immune receptor repertoire analysis workflow employing the Immcantation framework

Adaptive Immune Receptor Repertoire sequencing (AIRR-seq) is a valuable experimental tool to study the immune state in health and following immune challenges such as infectious diseases, (auto)immune diseases, and cancer. Several tools have been developed to reconstruct B cell and T cell receptor se...

Full description

Saved in:
Bibliographic Details
Published inPLoS computational biology Vol. 20; no. 7; p. e1012265
Main Authors Gabernet, Gisela, Marquez, Susanna, Bjornson, Robert, Peltzer, Alexander, Meng, Hailong, Aron, Edel, Lee, Noah Y., Jensen, Cole G., Ladd, David, Polster, Mark, Hanssen, Friederike, Heumos, Simon, Yaari, Gur, Kowarik, Markus C., Nahnsen, Sven, Kleinstein, Steven H.
Format Journal Article
LanguageEnglish
Published United States Public Library of Science 26.07.2024
Public Library of Science (PLoS)
Subjects
Online AccessGet full text
ISSN1553-7358
1553-734X
1553-7358
DOI10.1371/journal.pcbi.1012265

Cover

More Information
Summary:Adaptive Immune Receptor Repertoire sequencing (AIRR-seq) is a valuable experimental tool to study the immune state in health and following immune challenges such as infectious diseases, (auto)immune diseases, and cancer. Several tools have been developed to reconstruct B cell and T cell receptor sequences from AIRR-seq data and infer B and T cell clonal relationships. However, currently available tools offer limited parallelization across samples, scalability or portability to high-performance computing infrastructures. To address this need, we developed nf-core/airrflow, an end-to-end bulk and single-cell AIRR-seq processing workflow which integrates the Immcantation Framework following BCR and TCR sequencing data analysis best practices. The Immcantation Framework is a comprehensive toolset, which allows the processing of bulk and single-cell AIRR-seq data from raw read processing to clonal inference. nf-core/airrflow is written in Nextflow and is part of the nf-core project, which collects community contributed and curated Nextflow workflows for a wide variety of analysis tasks. We assessed the performance of nf-core/airrflow on simulated sequencing data with sequencing errors and show example results with real datasets. To demonstrate the applicability of nf-core/airrflow to the high-throughput processing of large AIRR-seq datasets, we validated and extended previously reported findings of convergent antibody responses to SARS-CoV-2 by analyzing 97 COVID-19 infected individuals and 99 healthy controls, including a mixture of bulk and single-cell sequencing datasets. Using this dataset, we extended the convergence findings to 20 additional subjects, highlighting the applicability of nf-core/airrflow to validate findings in small in-house cohorts with reanalysis of large publicly available AIRR datasets.
Bibliography:new_version
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
I have read the journal’s policy and the authors of this manuscript have the following competing interests: SHK receives consulting fees from Peraton. AP is an employee of Boehringer Ingelheim Pharma GmbH & Co KG and declares no conflict of interest. DL is an employee of oNKo-innate Pty Ltd and declares no conflict of interest. MCK has served on advisory boards and received speaker fees / travel grants from Merck, Sanofi-Genzyme, Novartis, Biogen, Janssen, Alexion, Celgene / Bristol-Myers Squibb and Roche. He has received research grants from Merck, Roche, Novartis, Sanofi-Genzyme and Celgene / Bristol-Myers Squibb. All other authors declare no conflicts of interest.
Membership of ‘nf-core community’ is provided in the Acknowledgements.
These authors are joint senior authors on this work.
ISSN:1553-7358
1553-734X
1553-7358
DOI:10.1371/journal.pcbi.1012265