Modeling the minus two base pair stutter ratio of the D1S1656 locus: A sequence-based mixture distribution model


Bibliographic Details
Published in: Forensic Science International: Genetics, Vol. 51, p. 102450
Main Authors: Inokuchi, Shota; Fujii, Koji; Nakanishi, Hiroaki; Takada, Aya; Saito, Kazuyuki; Mizuno, Natsuko
Format: Journal Article
Language: English
Published: Netherlands, Elsevier B.V., 01.03.2021

Summary:
• Minus two base pair stutter ratio of D1S1656 was investigated.
• Alleles were classified based on the copy number of the two base pair repeat motif.
• Minus two base pair stutter ratio differed significantly between the two sequences.
• We propose a mixture distribution model based on the sequences observed.

In this study, we propose a model of the stutter ratio for minus two base pair stutter (-2bpSR) at the D1S1656 locus in capillary electrophoresis (CE)-based short tandem repeat (STR) typing. DNA from 108 Japanese individuals was analyzed via massively parallel sequencing to investigate the length of the longest uninterrupted stretch of the two base repeat motif (2bpLUS value) within repetitive structures involving the flanking region. Additionally, -2bpSR data were collected using the GlobalFiler Kit on a 3500xL Genetic Analyzer. Sequencing analysis showed that all alleles could be classified into two types by their 2bpLUS values, and the -2bpSR differed significantly between the types. We therefore modeled the -2bpSR with a mixture log-normal distribution using the classification of alleles based on the 2bpLUS values. Furthermore, the probabilities of each sequence type within each repeat number in the mixture log-normal distribution model were estimated using logistic regression for each of the five major detected populations. This study is expected to enable interpretation of STR typing while considering minus two base pair stutter at the D1S1656 locus.
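The model structure described in the summary (a two-component log-normal mixture whose mixing weight comes from a logistic regression on repeat number) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration: the function names, the "type A" and "type B" labels, and every numeric value in `EXAMPLE_PARAMS` are hypothetical and are not taken from the article. The article's fitted parameters and exact parameterization are not given in this record.

```python
import math


def logistic(x):
    """Standard logistic function, mapping any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def lognormal_pdf(x, mu, sigma):
    """Density of a log-normal distribution with log-scale mean mu and sd sigma."""
    if x <= 0:
        return 0.0
    z = (math.log(x) - mu) / sigma
    return math.exp(-0.5 * z * z) / (x * sigma * math.sqrt(2.0 * math.pi))


def type_probability(repeat_number, intercept, slope):
    """Probability that an allele with this repeat number is of type A.

    Assumption: a logistic regression with repeat number as the only predictor,
    fitted separately for each population.
    """
    return logistic(intercept + slope * repeat_number)


def mixture_pdf(stutter_ratio, repeat_number, params):
    """Two-component log-normal mixture density for the stutter ratio."""
    p_a = type_probability(repeat_number, params["intercept"], params["slope"])
    dens_a = lognormal_pdf(stutter_ratio, params["mu_a"], params["sigma_a"])
    dens_b = lognormal_pdf(stutter_ratio, params["mu_b"], params["sigma_b"])
    return p_a * dens_a + (1.0 - p_a) * dens_b


# Hypothetical parameter values for one population (NOT from the article).
EXAMPLE_PARAMS = {
    "intercept": -8.0,
    "slope": 0.5,
    "mu_a": math.log(0.02),
    "sigma_a": 0.4,
    "mu_b": math.log(0.06),
    "sigma_b": 0.4,
}

# Density of observing a stutter ratio of 0.03 for a parent allele
# with repeat number 16, under the hypothetical parameters above.
example_density = mixture_pdf(0.03, 16, EXAMPLE_PARAMS)
```

In this sketch the sequence type is treated as unobserved at interpretation time, which is why the two component densities are weighted by the regression-based type probability rather than one being selected outright.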
ISSN: 1872-4973, 1878-0326
DOI: 10.1016/j.fsigen.2020.102450