Ancestry and diversity of the HMG box superfamily.
Abstract
The HMG box is a novel type of DNA-binding domain found in a diverse group of proteins. The HMG box superfamily comprises a.o. the High Mobility Group proteins HMG1 and HMG2, the nucleolar transcription factor UBF, the lymphoid transcription factors TCF-1 and LEF-1, the fungal mating-type genes mat-Mc and MATA1, and the mammalian sex-determining gene SRY. The superfamily dates back to at least 1,000 million years ago, as its members appear in animals, plants and yeast. Alignment of all known HMG boxes defined an unusually loose consensus sequence. We constructed phylogenetic trees connecting the members of the HMG box superfamily in order to understand their evolution. This analysis led us to distinguish two subfamilies: one comprising proteins with a single sequence-specific HMG box, the other encompassing relatively non sequence-specific DNA-binding proteins with multiple HMG boxes. By studying the extent of diversification of the superfamily, we found that the speed of evolution was very different within the various groups of HMG-box containing factors. Comparison of the evolution of the two boxes of ABF2 and of mtTF1 implied different diversification models for these two proteins. Finally, we provide a tree for the highly complex group of SRY-like ('Sox' genes), clustering at least 40 different loci that rapidly diverged in various animal lineages.
Full text
Full text is available as a scanned copy of the original print version. Get a printable copy (PDF file) of the complete article (1.6M), or click on a page image below to browse page by page. Links to PubMed are also available for Selected References.
Selected References
These references are in PubMed. This may not be the complete list of references from this article.
- Landschulz WH, Johnson PF, McKnight SL. The leucine zipper: a hypothetical structure common to a new class of DNA binding proteins. Science. 1988 Jun 24;240(4860):1759–1764. [PubMed] [Google Scholar]
- Murre C, McCaw PS, Baltimore D. A new DNA binding and dimerization motif in immunoglobulin enhancer binding, daughterless, MyoD, and myc proteins. Cell. 1989 Mar 10;56(5):777–783. [PubMed] [Google Scholar]
- Jantzen HM, Admon A, Bell SP, Tjian R. Nucleolar transcription factor hUBF contains a DNA-binding motif with homology to HMG proteins. Nature. 1990 Apr 26;344(6269):830–836. [PubMed] [Google Scholar]
- Kelly M, Burke J, Smith M, Klar A, Beach D. Four mating-type genes control sexual differentiation in the fission yeast. EMBO J. 1988 May;7(5):1537–1547.[PMC free article] [PubMed] [Google Scholar]
- Staben C, Yanofsky C. Neurospora crassa a mating-type region. Proc Natl Acad Sci U S A. 1990 Jul;87(13):4917–4921.[PMC free article] [PubMed] [Google Scholar]
- Gubbay J, Collignon J, Koopman P, Capel B, Economou A, Münsterberg A, Vivian N, Goodfellow P, Lovell-Badge R. A gene mapping to the sex-determining region of the mouse Y chromosome is a member of a novel family of embryonically expressed genes. Nature. 1990 Jul 19;346(6281):245–250. [PubMed] [Google Scholar]
- Sinclair AH, Berta P, Palmer MS, Hawkins JR, Griffiths BL, Smith MJ, Foster JW, Frischauf AM, Lovell-Badge R, Goodfellow PN. A gene from the human sex-determining region encodes a protein with homology to a conserved DNA-binding motif. Nature. 1990 Jul 19;346(6281):240–244. [PubMed] [Google Scholar]
- Travis A, Amsterdam A, Belanger C, Grosschedl R. LEF-1, a gene encoding a lymphoid-specific protein with an HMG domain, regulates T-cell receptor alpha enhancer function [corrected]. Genes Dev. 1991 May;5(5):880–894. [PubMed] [Google Scholar]
- Waterman ML, Fischer WH, Jones KA. A thymus-specific member of the HMG protein family regulates the human T cell receptor C alpha enhancer. Genes Dev. 1991 Apr;5(4):656–669. [PubMed] [Google Scholar]
- Parisi MA, Clayton DA. Similarity of human mitochondrial transcription factor 1 to high mobility group proteins. Science. 1991 May 17;252(5008):965–969. [PubMed] [Google Scholar]
- Giese K, Cox J, Grosschedl R. The HMG domain of lymphoid enhancer factor 1 bends DNA and facilitates assembly of functional nucleoprotein structures. Cell. 1992 Apr 3;69(1):185–195. [PubMed] [Google Scholar]
- Diffley JF, Stillman B. A close relative of the nuclear, chromosomal high-mobility group protein HMG1 in yeast mitochondria. Proc Natl Acad Sci U S A. 1991 Sep 1;88(17):7864–7868.[PMC free article] [PubMed] [Google Scholar]
- Ferrari S, Harley VR, Pontiggia A, Goodfellow PN, Lovell-Badge R, Bianchi ME. SRY, like HMG1, recognizes sharp angles in DNA. EMBO J. 1992 Dec;11(12):4497–4506.[PMC free article] [PubMed] [Google Scholar]
- Bianchi ME, Falciola L, Ferrari S, Lilley DM. The DNA binding site of HMG1 protein is composed of two similar segments (HMG boxes), both of which have counterparts in other eukaryotic regulatory proteins. EMBO J. 1992 Mar;11(3):1055–1063.[PMC free article] [PubMed] [Google Scholar]
- Harley VR, Jackson DI, Hextall PJ, Hawkins JR, Berkovitz GD, Sockanathan S, Lovell-Badge R, Goodfellow PN. DNA binding activity of recombinant SRY from normal males and XY females. Science. 1992 Jan 24;255(5043):453–456. [PubMed] [Google Scholar]
- Sugimoto A, Iino Y, Maeda T, Watanabe Y, Yamamoto M. Schizosaccharomyces pombe ste11+ encodes a transcription factor with an HMG motif that is a critical regulator of sexual development. Genes Dev. 1991 Nov;5(11):1990–1999. [PubMed] [Google Scholar]
- Denny P, Swift S, Brand N, Dabhade N, Barton P, Ashworth A. A conserved family of genes related to the testis determining gene, SRY. Nucleic Acids Res. 1992 Jun 11;20(11):2887–2887.[PMC free article] [PubMed] [Google Scholar]
- Higgins DG, Sharp PM. CLUSTAL: a package for performing multiple sequence alignment on a microcomputer. Gene. 1988 Dec 15;73(1):237–244. [PubMed] [Google Scholar]
- Dessen P, Fondrat C, Valencien C, Mugnier C. BISANCE: a French service for access to biomolecular sequence databases. Comput Appl Biosci. 1990 Oct;6(4):355–356. [PubMed] [Google Scholar]
- Laudet V, Hänni C, Coll J, Catzeflis F, Stéhelin D. Evolution of the nuclear receptor gene superfamily. EMBO J. 1992 Mar;11(3):1003–1013.[PMC free article] [PubMed] [Google Scholar]
- Fitch WM. A non-sequential method for constructing trees and hierarchical classifications. J Mol Evol. 1981;18(1):30–37. [PubMed] [Google Scholar]
- Saitou N, Nei M. The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol. 1987 Jul;4(4):406–425. [PubMed] [Google Scholar]
- Haggren W, Kolodrubetz D. The Saccharomyces cerevisiae ACP2 gene encodes an essential HMG1-like protein. Mol Cell Biol. 1988 Mar;8(3):1282–1289.[PMC free article] [PubMed] [Google Scholar]
- Mosrin C, Riva M, Beltrame M, Cassar E, Sentenac A, Thuriaux P. The RPC31 gene of Saccharomyces cerevisiae encodes a subunit of RNA polymerase C (III) with an acidic tail. Mol Cell Biol. 1990 Sep;10(9):4737–4743.[PMC free article] [PubMed] [Google Scholar]
- Wu M, Allis CD, Richman R, Cook RG, Gorovsky MA. An intervening sequence in an unusual histone H1 gene of Tetrahymena thermophila. Proc Natl Acad Sci U S A. 1986 Nov;83(22):8674–8678.[PMC free article] [PubMed] [Google Scholar]
- Schulman IG, Wang T, Wu M, Bowen J, Cook RG, Gorovsky MA, Allis CD. Macronuclei and micronuclei in Tetrahymena thermophila contain high-mobility-group-like chromosomal proteins containing a highly conserved eleven-amino-acid putative DNA-binding sequence. Mol Cell Biol. 1991 Jan;11(1):166–174.[PMC free article] [PubMed] [Google Scholar]
- Ner SS. HMGs everywhere. Curr Biol. 1992 Apr;2(4):208–210. [PubMed] [Google Scholar]
- Lautenberger JA, Burdett LA, Gunnell MA, Qi S, Watson DK, O'Brien SJ, Papas TS. Genomic dispersal of the ets gene family during metazoan evolution. Oncogene. 1992 Sep;7(9):1713–1719. [PubMed] [Google Scholar]
- Laudet V, Niel C, Duterque-Coquillaud M, Leprince D, Stehelin D. Evolution of the ets gene family. Biochem Biophys Res Commun. 1993 Jan 15;190(1):8–14. [PubMed] [Google Scholar]
- Hayashi T, Hayashi H, Iwai K. Tetrahymena HMG nonhistone chromosomal protein. Isolation and amino acid sequence lacking the N- and C-terminal domains of vertebrate HMG 1. J Biochem. 1989 Apr;105(4):577–581. [PubMed] [Google Scholar]
- Foster JW, Brennan FE, Hampikian GK, Goodfellow PN, Sinclair AH, Lovell-Badge R, Selwood L, Renfree MB, Cooper DW, Graves JA. Evolution of sex determination and the Y chromosome: SRY-related sequences in marsupials. Nature. 1992 Oct 8;359(6395):531–533. [PubMed] [Google Scholar]
- Denny P, Swift S, Connor F, Ashworth A. An SRY-related gene expressed during spermatogenesis in the mouse encodes a sequence-specific DNA-binding protein. EMBO J. 1992 Oct;11(10):3705–3712.[PMC free article] [PubMed] [Google Scholar]
- Knoll AH. The early evolution of eukaryotes: a geological perspective. Science. 1992 May 1;256(5057):622–627. [PubMed] [Google Scholar]
- Amero SA, Kretsinger RH, Moncrief ND, Yamamoto KR, Pearson WR. The origin of nuclear receptor proteins: a single precursor distinct from other transcription factors. Mol Endocrinol. 1992 Jan;6(1):3–7. [PubMed] [Google Scholar]
- Kappen C, Schughart K, Ruddle FH. Two steps in the evolution of Antennapedia-class vertebrate homeobox genes. Proc Natl Acad Sci U S A. 1989 Jul;86(14):5459–5463.[PMC free article] [PubMed] [Google Scholar]
- Schughart K, Kappen C, Ruddle FH. Duplication of large genomic regions during the evolution of vertebrate homeobox genes. Proc Natl Acad Sci U S A. 1989 Sep;86(18):7067–7071.[PMC free article] [PubMed] [Google Scholar]
- Murtha MT, Leckman JF, Ruddle FH. Detection of homeobox genes in development and evolution. Proc Natl Acad Sci U S A. 1991 Dec 1;88(23):10711–10715.[PMC free article] [PubMed] [Google Scholar]
- Novacek MJ. Mammalian phylogeny: shaking the tree. Nature. 1992 Mar 12;356(6365):121–125. [PubMed] [Google Scholar]
- Keese PK, Gibbs A. Origins of genes: "big bang" or continuous creation? Proc Natl Acad Sci U S A. 1992 Oct 15;89(20):9489–9493.[PMC free article] [PubMed] [Google Scholar]
- Shirakata M, Hüppi K, Usuda S, Okazaki K, Yoshida K, Sakano H. HMG1-related DNA-binding protein isolated with V-(D)-J recombination signal probes. Mol Cell Biol. 1991 Sep;11(9):4528–4536.[PMC free article] [PubMed] [Google Scholar]
- Bruhn SL, Pil PM, Essigmann JM, Housman DE, Lippard SJ. Isolation and characterization of human cDNA clones encoding a high mobility group box protein that recognizes structural distortions to DNA caused by binding of the anticancer agent cisplatin. Proc Natl Acad Sci U S A. 1992 Mar 15;89(6):2307–2311.[PMC free article] [PubMed] [Google Scholar]
- Wagner CR, Hamana K, Elgin SC. A high-mobility-group protein and its cDNAs from Drosophila melanogaster. Mol Cell Biol. 1992 May;12(5):1915–1923.[PMC free article] [PubMed] [Google Scholar]
- Kolodrubetz D, Burgum A. Duplicated NHP6 genes of Saccharomyces cerevisiae encode proteins homologous to bovine high mobility group protein 1. J Biol Chem. 1990 Feb 25;265(6):3234–3239. [PubMed] [Google Scholar]
- Pentecost BT, Wright JM, Dixon GH. Isolation and sequence of cDNA clones coding for a member of the family of high mobility group proteins (HMG-T) in trout and analysis of HMG-T-mRNA's in trout tissues. Nucleic Acids Res. 1985 Jul 11;13(13):4871–4888.[PMC free article] [PubMed] [Google Scholar]
- Grasser KD, Feix G. Isolation and characterization of maize cDNAs encoding a high mobility group protein displaying a HMG-box. Nucleic Acids Res. 1991 May 25;19(10):2573–2577.[PMC free article] [PubMed] [Google Scholar]
- Laux T, Goldberg RB. A plant DNA binding protein shares highly conserved sequence motifs with HMG-box proteins. Nucleic Acids Res. 1991 Sep 11;19(17):4769–4769.[PMC free article] [PubMed] [Google Scholar]
- Bachvarov D, Moss T. The RNA polymerase I transcription factor xUBF contains 5 tandemly repeated HMG homology boxes. Nucleic Acids Res. 1991 May 11;19(9):2331–2335.[PMC free article] [PubMed] [Google Scholar]
- van de Wetering M, Clevers H. Sequence-specific interaction of the HMG box proteins TCF-1 and SRY occurs within the minor groove of a Watson-Crick double helix. EMBO J. 1992 Aug;11(8):3039–3044.[PMC free article] [PubMed] [Google Scholar]
- Haqq CM, King CY, Donahoe PK, Weiss MA. SRY recognizes conserved DNA sites in sex-specific promoters. Proc Natl Acad Sci U S A. 1993 Feb 1;90(3):1097–1101.[PMC free article] [PubMed] [Google Scholar]
- Hisatake K, Hasegawa S, Takada R, Nakatani Y, Horikoshi M, Roeder RG. The p250 subunit of native TATA box-binding factor TFIID is the cell-cycle regulatory protein CCG1. Nature. 1993 Mar 11;362(6416):179–181. [PubMed] [Google Scholar]
- Sekiguchi T, Nohiro Y, Nakamura Y, Hisamoto N, Nishimoto T. The human CCG1 gene, essential for progression of the G1 phase, encodes a 210-kilodalton nuclear DNA-binding protein. Mol Cell Biol. 1991 Jun;11(6):3317–3325.[PMC free article] [PubMed] [Google Scholar]
- Yamaguchi-Shinozaki K, Shinozaki K. A novel Arabidopsis DNA binding protein contains the conserved motif of HMG-box proteins. Nucleic Acids Res. 1992 Dec 25;20(24):6737–6737.[PMC free article] [PubMed] [Google Scholar]
- Coriat AM, Müller U, Harry JL, Uwanogho D, Sharpe PT. PCR amplification of SRY-related gene sequences reveals evolutionary conservation of the SRY-box motif. PCR Methods Appl. 1993 Feb;2(3):218–222. [PubMed] [Google Scholar]
- Weir HM, Kraulis PJ, Hill CS, Raine AR, Laue ED, Thomas JO. Structure of the HMG box motif in the B-domain of HMG1. EMBO J. 1993 Apr;12(4):1311–1319.[PMC free article] [PubMed] [Google Scholar]
- Gastrop J, Hoevenagel R, Young JR, Clevers HC. A common ancestor of the mammalian transcription factors TCF-1 and TCF-1 alpha/LEF-1 expressed in chicken T cells. Eur J Immunol. 1992 May;22(5):1327–1330. [PubMed] [Google Scholar]
- Wright EM, Snopek B, Koopman P. Seven new members of the Sox gene family expressed during mouse development. Nucleic Acids Res. 1993 Feb 11;21(3):744–744.[PMC free article] [PubMed] [Google Scholar]
