Comparative genomics of biotechnologically important yeasts.
Journal: 2017/February - Proceedings of the National Academy of Sciences of the United States of America
ISSN: 1091-6490
Abstract:
Ascomycete yeasts are metabolically diverse, with great potential for biotechnology. Here, we report the comparative genome analysis of 29 taxonomically and biotechnologically important yeasts, including 16 newly sequenced. We identify a genetic code change, CUG-Ala, in Pachysolen tannophilus in the clade sister to the known CUG-Ser clade. Our well-resolved yeast phylogeny shows that some traits, such as methylotrophy, are restricted to single clades, whereas others, such as l-rhamnose utilization, have patchy phylogenetic distributions. Gene clusters, with variable organization and distribution, encode many pathways of interest. Genomics can predict some biochemical traits precisely, but the genomic basis of others, such as xylose utilization, remains unresolved. Our data also provide insight into early evolution of ascomycetes. We document the loss of H3K9me2/3 heterochromatin, the origin of ascomycete mating-type switching, and panascomycete synteny at the MAT locus. These data and analyses will facilitate the engineering of efficient biosynthetic and degradative pathways and gateways for genomic manipulation.
Relations:
Content
Citations
(46)
References
(43)
Affiliates
(14)
Similar articles
Articles by the same authors
Discussion board
Proc Natl Acad Sci U S A 113(35): 9882-9887

Comparative genomics of biotechnologically important yeasts

+30 authors

Supplementary Material

Supplementary File

Supplementary File

Supplementary File

Supplementary File

Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598;
University College Dublin Conway Institute, School of Medicine, University College Dublin, Dublin 4, Ireland;
Laboratory of Genetics, Genetics/Biotechnology Center, University of Wisconsin–Madison, Madison, WI, 53706;
Departamento de Microbiologia, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, MG, 31270-901, Brazil;
Department of Energy Great Lakes Bioenergy Research Center, University of Wisconsin–Madison, Madison, WI, 53726;
Deutsche Sammlung von Mikroorganismen und Zellkulturen German Collection of Microorganisms and Cell Cultures, Leibniz Institute, 38124 Braunschweig, Germany;
Department of Biological Sciences, Vanderbilt University, Nashville, TN, 37235;
Department of Bacteriology, University of Wisconsin–Madison, Madison, WI, 53706;
US Department of Agriculture Forest Products Laboratory, Madison, WI, 53726;
Xylome Corporation, Madison, WI, 53719;
School of Biology, Newcastle University, Newcastle upon Tyne NE1 7RU, United Kingdom;
Department of Molecular Genetics and Biotechnology, Institute of Cell Biology, National Academy of Sciences of Ukraine, Lviv 79005, Ukraine;
Department of Biotechnology and Microbiology, University of Rzeszow, Rzeszow 35-601, Poland;
Department of Plant Pathology, Ohio State University, Columbus, OH, 43210;
Centraalbureau voor Schimmelcultures Fungal Biodiversity Centre, Royal Netherlands Academy of Arts and Sciences, 3508 AD, Utrecht, The Netherlands;
Agricultural Research Service, National Center for Agricultural Utilization Research, US Department of Agriculture, Peoria, IL, 61604;
Department of Biological Sciences, Louisiana State University, Baton Rouge, LA, 70803;
Department of Biological Sciences, University of South Carolina, Columbia, SC, 29208
To whom correspondence may be addressed. Email: vog.lbl@veirogirGVI or ude.csiw@irffejwt.
Edited by Chris R. Somerville, University of California, Berkeley, CA, and approved July 11, 2016 (received for review March 10, 2016)

Author contributions: A.L., C.P.K., M.B., I.V.G., and T.W.J. designed research; T.M.L., C.H.C., C.C., A.Y.C., S.D., S.J.H., H.-P.K., Y.P., A.A. Sibirny, J.B.S., C.P.K., and T.W.J. performed research; C.H.C. contributed new reagents/analytic tools; R.R., S.H., K.H.W., M.R.L., C.T.H., M.G., A.A. Salamov, J.H.W., A.L.A., K.W.B., A.C., A.P.D., K.M.L., A.L., E.A.L., A.M.L., J.P.M.-K., R.A.O., R.P.O., J.L.P., A.R., C.A.R., C.S., J.C.S., H.S., C.P.K., I.V.G., and T.W.J. analyzed data; R.R., S.H., K.H.W., C.T.H., M.G., C.P.K., M.B., I.V.G., and T.W.J. wrote the paper; and K.W.B., C.P.K., M.B., I.V.G., and T.W.J. coordinated the project.

Present address: Center for Algorithmic Biotechnology, St. Petersburg State University, St. Petersburg 199004, Russia.
Present address: Microbiology, Department of Biology, Utrecht University, 3508, Utrecht, The Netherlands.
Edited by Chris R. Somerville, University of California, Berkeley, CA, and approved July 11, 2016 (received for review March 10, 2016)

Significance

The highly diverse Ascomycete yeasts have enormous biotechnological potential. Collectively, these yeasts convert a broad range of substrates into useful compounds, such as ethanol, lipids, and vitamins, and can grow in extremes of temperature, salinity, and pH. We compared 29 yeast genomes with the goal of correlating genetics to useful traits. In one rare species, we discovered a genetic code that translates CUG codons to alanine rather than canonical leucine. Genome comparison enabled correlation of genes to useful metabolic properties and showed the synteny of the mating-type locus to be conserved over a billion years of evolution. Our study provides a roadmap for future biotechnological exploitations.

Keywords: genomics, bioenergy, biotechnological yeasts, genetic code, microbiology
Significance

Abstract

Ascomycete yeasts are metabolically diverse, with great potential for biotechnology. Here, we report the comparative genome analysis of 29 taxonomically and biotechnologically important yeasts, including 16 newly sequenced. We identify a genetic code change, CUG-Ala, in Pachysolen tannophilus in the clade sister to the known CUG-Ser clade. Our well-resolved yeast phylogeny shows that some traits, such as methylotrophy, are restricted to single clades, whereas others, such as l-rhamnose utilization, have patchy phylogenetic distributions. Gene clusters, with variable organization and distribution, encode many pathways of interest. Genomics can predict some biochemical traits precisely, but the genomic basis of others, such as xylose utilization, remains unresolved. Our data also provide insight into early evolution of ascomycetes. We document the loss of H3K9me2/3 heterochromatin, the origin of ascomycete mating-type switching, and panascomycete synteny at the MAT locus. These data and analyses will facilitate the engineering of efficient biosynthetic and degradative pathways and gateways for genomic manipulation.

Abstract

Yeasts are fungi that reproduce asexually by budding or fission and sexually without multicellular fruiting bodies (1, 2). Their unicellular, largely free-living lifestyle has evolved several times (3). Despite morphological similarities, yeasts constitute over 1,500 known species that inhabit many specialized environmental niches and associations, including virtually all varieties of fruits and flowers, plant surfaces and exudates, insects and other invertebrates, birds, mammals, and highly diverse soils (4). Biochemical and genomic studies of the model yeast Saccharomyces cerevisiae—essential for making bread, beer, and wine—have established much of our understanding of eukaryotic biology. However, in many ways, S. cerevisiae is an oddity among the yeasts, and many important biotechnological applications and highly divergent physiological capabilities of lesser-known yeast species have not been fully exploited (5). Various species can grow on methanol or n-alkanes as sole carbon and energy sources, overproduce vitamins and lipids, thrive under acidic conditions, and ferment unconventional carbon sources. Many features of yeasts make them ideal platforms for biotechnological processes. Their thick cell walls help them survive osmotic shock, and in contrast to bacteria, they are resistant to viruses. Their unicellular form is easy to cultivate, scale up, and harvest. The objective of this study was, therefore, to put yeasts with diverse biotechnological applications in a phylogenomic context and relate their physiologies to genomic features, so that their useful properties may be developed through genetic techniques. Backgrounds on 16 yeasts are given in SI Appendix.

Acknowledgments

We thank Marco A. Soares for computational advice. K.H.W. thanks G. Cagney, E. Dillon, and K. Wynne (University College Dublin Conway Institute Proteomics Core Facility) for help with MS. M.B. thanks Drs. S. O. Suh, H. Urbina, and N. H. Nguyen and numerous Louisiana State University undergraduates for their assistance. The work conducted by the US Department of Energy (DOE) Joint Genome Institute, a DOE Office of Science User Facility, is supported by Office of Science of the US DOE Contract DE-AC02-05CH11231. This material is based on work supported by National Science Foundation Grant DEB-1442148 (to C.T.H. and C.P.K.) and supported in part by DOE Great Lakes Bioenergy Research Center, DOE Office of Science Grant BER DE-FC02-07ER64494, and US Department of Agriculture (USDA) National Institute of Food and Agriculture Hatch Project 1003258. K.H.W. acknowledges European Research Council Grant 268893, Science Foundation Ireland Grant 13/IA/1910, and the Wellcome Trust. M.R.L. acknowledges a fellowship from the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (process no. 7371/13-6). C.T.H. is a Pew Scholar in the Biomedical Sciences and an Alfred Toepfer Faculty Fellow, which are supported by the Pew Charitable Trusts and the Alexander von Humboldt Foundation, respectively. C.A.R. acknowledges support from the Conselho Nacional de Desenvolvimento Científico e Tecnológico-CNPq. Funding from National Science Foundation Grants DEB-0072741 (to M.B.) and 0417180 (to M.B.) supported discovery and study of many new yeast strains that contributed to this study. T.W.J. acknowledges DOE Great Lakes Bioenergy Research Center DOE Office of Science Grant BER DE-FC02-07ER64494 and the USDA, Forest Products Laboratory for financial support.

Acknowledgments

Footnotes

Conflict of interest statement: C.H.C. and T.W.J. are employees of Xylome Corporation, which is developing nonconventional yeasts for biotechnological applications.

This article is a PNAS Direct Submission.

Data deposition: The sequences reported in this paper have been deposited in the GenBank database (accession nos. {"type":"entrez-nucleotide","attrs":{"text":"LWKO00000000","term_id":"1035967617","term_text":"LWKO00000000"}}LWKO00000000, {"type":"entrez-nucleotide","attrs":{"text":"LYME00000000","term_id":"1035969124","term_text":"LYME00000000"}}LYME00000000, {"type":"entrez-nucleotide","attrs":{"text":"LXTC00000000","term_id":"1037345587","term_text":"LXTC00000000"}}LXTC00000000, {"type":"entrez-nucleotide","attrs":{"text":"LYBQ00000000","term_id":"1035968374","term_text":"LYBQ00000000"}}LYBQ00000000, {"type":"entrez-nucleotide","attrs":{"text":"LYBR00000000","term_id":"1035968589","term_text":"LYBR00000000"}}LYBR00000000, {"type":"entrez-nucleotide","attrs":{"text":"LWUO00000000","term_id":"1035969854","term_text":"LWUO00000000"}}LWUO00000000, {"type":"entrez-nucleotide","attrs":{"text":"LSKT00000000","term_id":"1035969338","term_text":"LSKT00000000"}}LSKT00000000, {"type":"entrez-nucleotide","attrs":{"text":"LTAD00000000","term_id":"1035969463","term_text":"LTAD00000000"}}LTAD00000000, {"type":"entrez-nucleotide","attrs":{"text":"LXPE00000000","term_id":"1037359334","term_text":"LXPE00000000"}}LXPE00000000, {"type":"entrez-nucleotide","attrs":{"text":"AECK00000000","term_id":"1037333912","term_text":"AECK00000000"}}AECK00000000, {"type":"entrez-nucleotide","attrs":{"text":"LSGR00000000","term_id":"1035968351","term_text":"LSGR00000000"}}LSGR00000000, {"type":"entrez-nucleotide","attrs":{"text":"LXPB00000000","term_id":"1035966961","term_text":"LXPB00000000"}}LXPB00000000, {"type":"entrez-nucleotide","attrs":{"text":"LZCH00000000","term_id":"1035967641","term_text":"LZCH00000000"}}LZCH00000000, {"type":"entrez-nucleotide","attrs":{"text":"AEHA00000000","term_id":"1035966346","term_text":"AEHA00000000"}}AEHA00000000, {"type":"entrez-nucleotide","attrs":{"text":"AEUO00000000","term_id":"1035966231","term_text":"AEUO00000000"}}AEUO00000000, and {"type":"entrez-nucleotide","attrs":{"text":"LWUN00000000","term_id":"1035966906","term_text":"LWUN00000000"}}LWUN00000000).

This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1603941113/-/DCSupplemental.

Footnotes

References

  • 1. Dujon BYeast evolutionary genomics. Nat Rev Genet. 2010;11(7):512–524.[PubMed][Google Scholar]
  • 2. Kurtzman CP, Fell JW, Boekhout T, editors. The Yeasts, a Taxonomic Study. Elsevier; Amsterdam: 2011. [PubMed]
  • 3. Nagy LG, et al Latent homology and convergent regulatory evolution underlies the repeated emergence of yeasts. Nat Commun. 2014;5:4471.[PubMed][Google Scholar]
  • 4. Sylvester K, et al Temperature and host preferences drive the diversification of Saccharomyces and other yeasts: A survey and the discovery of eight new yeast species. FEMS Yeast Res. 2015;15(3):fov002.[PubMed][Google Scholar]
  • 5. Hittinger CT, et al Genomics and the making of yeast biodiversity. Curr Opin Genet Dev. 2015;35:100–109.[Google Scholar]
  • 6. de Queiroz A, Gatesy JThe supermatrix approach to systematics. Trends Ecol Evol. 2007;22(1):34–41.[PubMed][Google Scholar]
  • 7. DeGiorgio M, Degnan JHFast and consistent estimation of species trees using supermatrix rooted triples. Mol Biol Evol. 2010;27(3):552–569.[Google Scholar]
  • 8. Kawaguchi Y, Honda H, Taniguchi-Morimura J, Iwasaki SThe codon CUG is read as serine in an asporogenic yeast Candida cylindracea. Nature. 1989;341(6238):164–166.[PubMed][Google Scholar]
  • 9. Santos MA, Gomes AC, Santos MC, Carreto LC, Moura GRThe genetic code of the fungal CTG clade. C R Biol. 2011;334(8-9):607–611.[PubMed][Google Scholar]
  • 10. Santos MA, Ueda T, Watanabe K, Tuite MFThe non-standard genetic code of Candida spp.: An evolving genetic code or a novel mechanism for adaptation? Mol Microbiol. 1997;26(3):423–431.[PubMed][Google Scholar]
  • 11. Mühlhausen S, Kollmar MMolecular phylogeny of sequenced Saccharomycetes reveals polyphyly of the alternative yeast codon usage. Genome Biol Evol. 2014;6(12):3222–3237.[Google Scholar]
  • 12. Jeffries TW, et al Genome sequence of the lignocellulose-bioconverting and xylose-fermenting yeast Pichia stipitis. Nat Biotechnol. 2007;25(3):319–326.[PubMed][Google Scholar]
  • 13. Wohlbach DJ, et al Comparative genomics of xylose-fermenting fungi for enhanced biofuel production. Proc Natl Acad Sci USA. 2011;108(32):13212–13217.[Google Scholar]
  • 14. Ryu S, Hipp J, Trinh CTActivating and elucidating metabolism of complex sugars in Yarrowia lipolytica. Appl Environ Microbiol. 2015;82(4):1334–1345.[Google Scholar]
  • 15. Tsigie YA, Wang CY, Truong CT, Ju YHLipid production from Yarrowia lipolytica Po1g grown in sugarcane bagasse hydrolysate. Bioresour Technol. 2011;102(19):9216–9222.[PubMed][Google Scholar]
  • 16. Urbina H, Blackwell MMultilocus phylogenetic study of the Scheffersomyces yeast clade and characterization of the N-terminal region of xylose reductase gene. PLoS One. 2012;7(6):e39128.[Google Scholar]
  • 17. Hittinger CT, Rokas A, Carroll SBParallel inactivation of multiple GAL pathway genes and ecological diversification in yeasts. Proc Natl Acad Sci USA. 2004;101(39):14144–14149.[Google Scholar]
  • 18. Wolfe KH, et al Clade- and species-specific features of genome evolution in the Saccharomycetaceae. FEMS Yeast Res. 2015;15(5):fov035.[Google Scholar]
  • 19. Johnston MA model fungal gene regulatory mechanism: The GAL genes of Saccharomyces cerevisiae. Microbiol Rev. 1987;51(4):458–476.[Google Scholar]
  • 20. Wegner GHEmerging applications of the methylotrophic yeasts. FEMS Microbiol Rev. 1990;7(3-4):279–283.[PubMed][Google Scholar]
  • 21. Koivistoinen OM, et al Characterisation of the gene cluster for l-rhamnose catabolism in the yeast Scheffersomyces (Pichia) stipitis. Gene. 2012;492(1):177–185.[PubMed][Google Scholar]
  • 22. Gojković Z, et al Horizontal gene transfer promoted evolution of the ability to propagate under anaerobic conditions in yeasts. Mol Genet Genomics. 2004;271(4):387–393.[PubMed][Google Scholar]
  • 23. Hall C, Dietrich FSThe reacquisition of biotin prototrophy in Saccharomyces cerevisiae involved horizontal gene transfer, gene duplication and gene clustering. Genetics. 2007;177(4):2293–2307.[Google Scholar]
  • 24. Hedges SB, Blair JE, Venturi ML, Shoe JLA molecular timescale of eukaryote evolution and the rise of complex multicellular life. BMC Evol Biol. 2004;4:2.[Google Scholar]
  • 25. Hanson SJ, Byrne KP, Wolfe KHMating-type switching by chromosomal inversion in methylotrophic yeasts suggests an origin for the three-locus Saccharomyces cerevisiae system. Proc Natl Acad Sci USA. 2014;111(45):E4851–E4858.[Google Scholar]
  • 26. Maekawa H, Kaneko YInversion of the chromosomal region between two mating type loci switches the mating type in Hansenula polymorpha. PLoS Genet. 2014;10(11):e1004796.[Google Scholar]
  • 27. Hickman MA, Froyd CA, Rusche LNReinventing heterochromatin in budding yeasts: Sir2 and the origin recognition complex take center stage. Eukaryot Cell. 2011;10(9):1183–1192.[Google Scholar]
  • 28. Kurtzman CP, Robnett CJRelationships among genera of the Saccharomycotina (Ascomycota) from multigene phylogenetic analysis of type species. FEMS Yeast Res. 2013;13(1):23–33.[PubMed][Google Scholar]
  • 29. Liu X. 2012. Conversion of the biodiesel by-product glycerol by the non-conventional yeast Pachysolen tannophilus. PhD thesis (Technical University of Denmark, Kongens Lyngby, Denmark)
  • 30. Wedlock DN, Thornton RJTransformation of a glucose negative mutant of Pachysolen tannophilus with a plasmid carrying the cloned hexokinase-PII gene from Saccharomyces cerevisiae. Biotechnol Lett. 1989;11(9):601–604.[PubMed][Google Scholar]
  • 31. Mühlhausen S, Findeisen P, Plessmann U, Urlaub H, Kollmar MA novel nuclear genetic code alteration in yeasts and the evolution of codon reassignment in eukaryotes. Genome Res. 2016;26(7):945–955.[Google Scholar]
  • 32. Bain JD, Switzer C, Chamberlin AR, Benner SARibosome-mediated incorporation of a non-standard amino acid into a peptide through expansion of the genetic code. Nature. 1992;356(6369):537–539.[PubMed][Google Scholar]
  • 33. Matsuzawa T, et al New insights into galactose metabolism by Schizosaccharomyces pombe: Isolation and characterization of a galactose-assimilating mutant. J Biosci Bioeng. 2011;111(2):158–166.[PubMed][Google Scholar]
  • 34. Toivari MH, Salusjärvi L, Ruohonen L, Penttilä MEndogenous xylose pathway in Saccharomyces cerevisiae. Appl Environ Microbiol. 2004;70(6):3681–3686.[Google Scholar]
  • 35. Gnerre S, et al High-quality draft assemblies of mammalian genomes from massively parallel sequence data. Proc Natl Acad Sci USA. 2011;108(4):1513–1518.[Google Scholar]
  • 36. Zerbino DR, Birney EVelvet: Algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 2008;18(5):821–829.[Google Scholar]
  • 37. LaButti K, Foster B, Han C, Brettin T, Lapidus A. 2009Gap Resolution: A Software Package for Improving Newbler Genome Assemblies, Rep No. LBNL-1899E Abs. Available at . Accessed May 27, 2009.[PubMed]
  • 38. English AC, et al Mind the gap: Upgrading genomes with Pacific Biosciences RS long-read sequencing technology. PLoS One. 2012;7(11):e47768.[Google Scholar]
  • 39. Grigoriev IV, et al MycoCosm portal: Gearing up for 1000 fungal genomes. Nucleic Acids Res. 2014;42(Database issue):D699–D704.[Google Scholar]
  • 40. Göker M, Scheuner C, Klenk HP, Stielow JB, Menzel WCodivergence of mycoviruses with their hosts. PLoS One. 2011;6(7):e22252.[Google Scholar]
  • 41. Scheuner C, et al Complete genome sequence of Planctomyces brasiliensis type strain (DSM 5305(T)), phylogenomic analysis and reclassification of Planctomycetes including the descriptions of Gimesia gen. nov., Planctopirus gen. nov. and Rubinisphaera gen. nov. and emended descriptions of the order Planctomycetales and the family Planctomycetaceae. Stand Genomic Sci. 2014;9:10.[Google Scholar]
  • 42. Spring S, et al The genome sequence of Methanohalophilus mahii SLP(T) reveals differences in the energy metabolism among members of the Methanosarcinaceae inhabiting freshwater and saline environments. Archaea. 2010;2010:690737.[Google Scholar]
  • 43. Lowe TM, Eddy SRtRNAscan-SE: A program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997;25(5):955–964.[Google Scholar]
  • 44. Di Tommaso P, et al T-Coffee: A web server for the multiple sequence alignment of protein and RNA sequences using structural information and homology extension. Nucleic Acids Res. 2011;39(Web server issue):W13–W17.[Google Scholar]
  • 45. Claudel-Renard C, Chevalet C, Faraut T, Kahn DEnzyme-specific profiles for genome annotation: PRIAM. Nucleic Acids Res. 2003;31(22):6633–6639.[Google Scholar]
  • 46. Enright AJ, Van Dongen S, Ouzounis CAAn efficient algorithm for large-scale detection of protein families. Nucleic Acids Res. 2002;30(7):1575–1584.[Google Scholar]
  • 47. Wisecaver JH, Slot JC, Rokas AThe evolution of fungal metabolic pathways. PLoS Genet. 2014;10(12):e1004816.[Google Scholar]
Collaboration tool especially designed for Life Science professionals.Drag-and-drop any entity to your messages.