Large-Scale Sequencing of the CD33-Related Siglec Gene Cluster in Five Mammalian Species Reveals Rapid Evolution by Multiple Mechanisms
Siglecs are a recently discovered family of animal lectins that belong to the Ig superfamily and recognize sialic acids (Sias). CD33-related Siglecs (CD33rSiglecs) are a subgroup with as-yet-unknown functions, characterized by sequence homology, expression on innate immune cells, conserved cytosolic...
Saved in:
Published in | Proceedings of the National Academy of Sciences - PNAS Vol. 101; no. 36; pp. 13251 - 13256 |
---|---|
Main Authors | , , , , |
Format | Journal Article |
Language | English |
Published |
United States
National Academy of Sciences
07.09.2004
National Acad Sciences |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Siglecs are a recently discovered family of animal lectins that belong to the Ig superfamily and recognize sialic acids (Sias). CD33-related Siglecs (CD33rSiglecs) are a subgroup with as-yet-unknown functions, characterized by sequence homology, expression on innate immune cells, conserved cytosolic tyrosine-based signaling motifs, and a clustered localization of their genes. To better understand the biology and evolution of CD33rSiglecs, we sequenced and compared the CD33rSiglec gene cluster from multiple mammalian species. Within the sequenced region, the segments containing CD33rSiglec genes showed a lower degree of sequence conservation. In contrast to the adjacent conserved kallikrein-like genes, the CD33rSiglec genes showed extensive species differences, including expansions of gene subsets; gene deletions, including one human-specific loss of a novel functional primate Siglec (Siglec-13); exon shuffling, generating hybrid genes; accelerated accumulation of nonsynonymous substitutions in the Sia-recognition domain; and multiple instances of mutations of an arginine residue essential for Sia recognition in otherwise intact Siglecs. Nonsynonymous differences between human and chimpanzee orthologs showed uneven distribution between the two β sheets of the Sia-recognition domain, suggesting biased mutation accumulation. These data indicate that CD33rSiglec genes are undergoing rapid evolution via multiple genetic mechanisms, possibly due to an evolutionary "arms race" between hosts and pathogens involving Sia recognition. These studies, which reflect one of the most complete comparative sequence analyses of a rapidly evolving gene cluster, provide a clearer picture of the ortholog status of CD33rSiglecs among primates and rodents and also facilitate rational recommendations regarding their nomenclature. |
---|---|
Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 Present address: Research Center for Glycoscience, National Institute of Advanced Industrial Science and Technology, 1-1-1 Higashi, Tsukuba, Ibaraki 305-8562, Japan. T.A. and E.H.M. contributed equally to this work. Abbreviations: Ign, nth Ig-like domain of a Siglec; BAC, bacterial artificial chromosome; KLK, kallikrein; MCSs, multispecies conserved sequences; N, no. of nonsynonymous sites; Nd, no. of nonsynonymous differences; pN, nonsynonymous substitution rate (Nd/N); pS, synonymous substitution rate (Sd/S); S, no. of synonymous sites; Sd, no. of synonymous differences; Sia, sialic acid; Neu5Gc, N-glycolylneuraminic acid; CD33rSiglecs, CD33-related Siglecs. Data deposition: The sequences reported in this paper have been deposited in the GenBank database [accession nos. AC130782, AC130783, AC132069, AC133392, AC134953, and AC140096 (chimpanzee); AC130188, AC130272, AC130273, AC130786, and AC137058 (baboon); and AC134313, AC135541, and AC137057 (rat)]. Communicated by Morris Goodman, Wayne State University School of Medicine, Detroit, MI, July 9, 2004 To whom correspondence should be addressed. E-mail: varkiadmin@ucsd.edu. |
ISSN: | 0027-8424 1091-6490 |
DOI: | 10.1073/pnas.0404833101 |