Abordagem probabil\'istica para an\'alise de confiabilidade de dados gerados em sequenciamentos multiplex na plataforma ABI SOLiD

The next-generation sequencers such as Illumina and SOLiD platforms generate a large amount of data, commonly above 10 Gigabytes of text files. Particularly, the SOLiD platform allows the sequencing of multiple samples in a single run, called multiplex run, through a tagging system called Barcode. T...

Full description

Saved in:
Bibliographic Details
Main Authors Lobato, Fabio M. F, Damasceno, Carlos D. N, Machado, Péricles L, Vijaykumar, Nandamudi L, Santos, André R. dos, Darnet, Sylvain H, Gonçalves, André N. A, de Alencar, Dayse O, de Santana, Ádamo L
Format Journal Article
LanguageEnglish
Published 27.07.2021
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The next-generation sequencers such as Illumina and SOLiD platforms generate a large amount of data, commonly above 10 Gigabytes of text files. Particularly, the SOLiD platform allows the sequencing of multiple samples in a single run, called multiplex run, through a tagging system called Barcode. This feature requires a computational process for separation of the data sample because the sequencer provides a mixture of all samples in a single output. This process must be secure to avoid any harm that may scramble further analysis. In this context, realized the need to develop a probabilistic model capable of assigning a degree of confidence in the marking system used in multiplex sequencing. The results confirmed the adequacy of the model obtained, which allows, among other things, to guide a process of filtering the data and evaluation of the sequencing protocol used.
DOI:10.48550/arxiv.2107.13537