Amino acid and nucleotide recurrence in aligned sequences: synonymous substitution patterns in association with global and local base compositions

The tendency for repetitiveness of nucleotides in DNA sequences has been reported for a variety of organisms. We show that the tendency for repetitive use of amino acids is widespread and is observed even for segments conserved between human and Drosophila melanogaster at the level of >50% amino...

Full description

Saved in:
Bibliographic Details
Published inNucleic acids research Vol. 28; no. 19; pp. 3801 - 3810
Main Authors Nishizawa, M, Nishizawa, K
Format Journal Article
LanguageEnglish
Published England Oxford Publishing Limited (England) 01.10.2000
Oxford University Press
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The tendency for repetitiveness of nucleotides in DNA sequences has been reported for a variety of organisms. We show that the tendency for repetitive use of amino acids is widespread and is observed even for segments conserved between human and Drosophila melanogaster at the level of >50% amino acid identity. This indicates that repetitiveness influences not only the weakly constrained segments but also those sequence segments conserved among phyla. Not only glutamine (Q) but also many of the 20 amino acids show a comparable level of repetitiveness. Repetitiveness in bases at codon position 3 is stronger for human than for D.melanogaster, whereas local repetitiveness in intron sequences is similar between the two organisms. While genes for immune system-specific proteins, but not ancient human genes (i.e. human homologs of Escherichia coli genes), have repetitiveness at codon bases 1 and 2, repetitiveness at codon base 3 for these groups is similar, suggesting that the human genome has at least two mechanisms generating local repetitiveness. Neither amino acid nor nucleotide repetitiveness is observed beyond the exon boundary, denying the possibility that such repetitiveness could mainly stem from natural selection on mRNA or protein sequences. Analyses of mammalian sequence alignments show that while the 'between gene' GC content heterogeneity, which is linked to 'isochores', is a principal factor associated with the bias in substitution patterns in human, 'within gene' heterogeneity in nucleotide composition is also associated with such bias on a more local scale. The relationship amongst the various types of repetitiveness is discussed.
Bibliography:ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 23
To whom correspondence should be addressed. Tel: +81 3 3964 1211; Fax: +81 3 5375 6366; Email: kazunet@med.teikyo-u.ac.jp
ISSN:1362-4962
0305-1048
1362-4962
DOI:10.1093/nar/28.19.3801