Abordagem probabilística para análise de confiabilidade de dados gerados em sequenciamentos multiplex na plataforma ABI SOLiD

The next-generation sequencers such as Illumina and SOLiD platforms generate a large amount of data, commonly above 10 Gigabytes of text files. Particularly, the SOLiD platform allows the sequencing of multiple samples in a single run, called multiplex run, through a tagging system called Barcode. T...

Full description

Saved in:
Bibliographic Details
Published inarXiv.org
Main Authors Lobato, Fabio M F, Damasceno, Carlos D N, Machado, Péricles L, Vijaykumar, Nandamudi L, dos Santos, André R, Darnet, Sylvain H, Gonçalves, André N A, de Alencar, Dayse O, de Santana, Ádamo L
Format Paper
LanguageEnglish
Published Ithaca Cornell University Library, arXiv.org 11.08.2021
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The next-generation sequencers such as Illumina and SOLiD platforms generate a large amount of data, commonly above 10 Gigabytes of text files. Particularly, the SOLiD platform allows the sequencing of multiple samples in a single run, called multiplex run, through a tagging system called Barcode. This feature requires a computational process for separation of the data sample because the sequencer provides a mixture of all samples in a single output. This process must be secure to avoid any harm that may scramble further analysis. In this context, realized the need to develop a probabilistic model capable of assigning a degree of confidence in the marking system used in multiplex sequencing. The results confirmed the adequacy of the model obtained, which allows, among other things, to guide a process of filtering the data and evaluation of the sequencing protocol used.
ISSN:2331-8422