Evolutionary Genomics of Lactic Acid Bacteria<sup><a href="#fn1" rid="fn1" class=" fn">▿</a></sup>
The lactic acid bacteria (LAB) might be the most numerous group of bacteria linked to humans. They are naturally associated with mucosal surfaces, particularly the gastrointestinal tract, and are also indigenous to food-related habitats, including plant (fruits, vegetables, and cereal grains), wine, milk, and meat environments (60, 61). The LAB include both important pathogens, e.g., several Streptococcus species, and extremely valuable nonpathogenic species that are used for industrial fermentation of dairy products, meats, and vegetables, and they are also critical for the production of wine, coffee, silage, cocoa, and sourdough (13, 60, 61). In addition, the LAB are a priceless source of antimicrobial agents, the bacteriocins (reference 10 and references therein).
The term LAB mainly refers to the defining feature of the basal metabolism of these bacteria, the fermentation of hexose sugars yielding, primarily, lactic acid. Various aspects of LAB biology and application are thoroughly covered in several books by Wood and Holzapfel and Wood and Warner (60, 61) and numerous reviews, including those in a recent specialized issue of FEMS Microbiology Reviews (12, 14, 18, 20, 23, 24, 31, 39, 42, 49, 58). The definition of LAB is biological rather than taxonomical, i.e., the LAB do not comprise a monophyletic group of bacteria. Most of the LAB belong to the order Lactobacillales, a group of nonsporulating, gram-positive bacteria, but a few LAB species belong to the Actinobacteria (60).
The early sequencing of LAB genomes involved mostly bacteria of the genus Streptococcus, which encompasses most of the pathogenic LABs (50, 60). Currently, 19 complete genomes of streptococci are available, covering different strains of five species. A program aimed at extensive sequencing of the genomes of nonpathogenic LAB was announced in 2002 by the Lactic Acid Bacteria Genome Sequencing Consortium (19), but the actual breakthrough occurred only in the last 2 years (2005 and 2006). At the time of writing (August 2006), 18 complete genome sequences of the nonpathogenic LAB representing 14 species from the order Lactobacillales were available (Table (Table1)1) . The Lactobacillales have relatively small genomes for nonobligatory bacterial parasites or symbionts (characteristic genome size, ∼2 megabases, with ∼2,000 genes), with the number of genes in different species spanning the range from ∼1,600 to ∼3,000. This variation in the number of genes suggests that the evolution of LAB involved active processes of gene loss, duplication, and acquisition. The current collection of LAB genomes is a unique data set that includes multiple related genomes with a gradient of divergence in sequences and genome organizations. This set of related genomes is amenable to detailed reconstruction of genome evolution, which is not yet attainable with other groups of bacteria.
TABLE 1.
General features of the sequenced genomes of Lactobacillales
| Species/strain | Genome length (bp) | No. of plasmids (no. of genes) | No. of proteins | Reference | Association |
|---|---|---|---|---|---|
| Lactobacillus plantarum WCFS1a | 3,308,274 | 3 (50) | 3,009 | 22 | Plant/meat material |
| Lactobacillus johnsonii NCC 533a | 1,992,676 | 0 | 1,821 | 43 | GITb |
| Lactobacillus acidophilus NCFM | 1,993,564 | 0 | 1,864 | 2 | GIT |
| Lactobacillus sakei subsp. sakei 23K | 1,884,661 | 0 | 1,879 | 8 | Meat material |
| Lactobacillus salivarius subsp. salivarius UCC118 | 1,827,111 | 3 (320) | 1,717 | 9 | GIT |
| Lactobacillus delbrueckii subsp. bulgaricus ATCC 11842 | 1,864,998 | 0 | 1,562 | 57 | Yogurt, cheese |
| Lactobacillus delbrueckii subsp. bulgaricusa | 1,856,951 | 0 | 1,725 | 28 | |
| Lactobacillus gasseria | 1,894,360 | 0 | 1,763 | 28 | GIT |
| Lactobacillus brevisa | 2,340,228 | 2 (37) | 2,221 | 28 | Plant/meat material |
| Pediococcus pentosaceusa | 1,832,387 | 0 | 1,757 | 28 | Plant/meat material |
| Oenococcus oenia | 1,780,517 | 0 | 1,701 | 28 | Wine |
| Leuconostoc mesenteroidesa | 2,075,763 | 1 (34) | 2,009 | 28 | Plant material |
| Lactobacillus caseia | 2,924,325 | 1 (20) | 2,776 | 28 | Dairy products |
| Lactococcus lactis subsp. lactisa | 2,365,589 | 0 | 2,321 | 7 | Dairy products, bacteriocins |
| Lactococcus lactis subsp. cremorisa | 2,641,635 | 5 (129) | 2,509 | 28 | |
| Streptococcus thermophilus CNRZ1066 | 1,796,226 | 0 | 1,915 | 5 | Yogurt, cheese |
| Streptococcus thermophilus LMG 18311 | 1,796,846 | 0 | 1,889 | 5 | |
| Streptococcus thermophilus LMD-9a | 1,864,178 | 2 (6) | 1,718 | 28 |
This review is largely based on recent work on comparative genomic analysis of 12 nonpathogenic LAB from the order Lactobacillales (Table (Table1)1) (28). We briefly discuss the customized comparative-genomic framework that was developed for the LAB and its application to evolutionary reconstruction and functional genomics.
Acknowledgments
We thank the members of the Lactic Acid Bacteria Genome Sequencing Consortium and, personally, David Mills and Sergay Kozyavkin for extensive collaboration and numerous helpful discussions in the course of the comparative-genomic analysis of LAB.
Footnotes
Published ahead of print on 3 November 2006.
REFERENCES
References
- 1. Adachi, J., and MHasegawa. 1992. MOLPHY: programs for molecular phylogenetics. Computer Science Monographs 27. Institute of Statistical Mathematics, Tokyo, Japan.
- 2. Altermann, E., W. M. Russell, M. A. Azcarate-Peril, R. Barrangou, B. L. Buck, O. McAuliffe, N. Souther, A. Dobson, T. Duong, M. Callanan, S. Lick, A. Hamrick, R. Cano, and T. R. Klaenhammer. 2005. Complete genome sequence of the probiotic lactic acid bacterium Lactobacillus acidophilus NCFM. Proc. Natl. Acad. Sci. USA102:3906-3912.
- 3. Barynin, V. V., M. M. Whittaker, S. V. Antonyuk, V. S. Lamzin, P. M. Harrison, P. J. Artymiuk, and J. W. Whittaker. 2001. Crystal structure of manganese catalase from Lactobacillus plantarum. Structure9:725-738. [[PubMed]
- 4. Blewett, A. M., A. J. Lloyd, A. Echalier, V. Fulop, C. G. Dowson, T. D. Bugg, and D. I. Roper. 2004. Expression, purification, crystallization and preliminary characterization of uridine 5′-diphospho-N-acetylmuramoyl l-alanyl-d-glutamate:lysine ligase (MurE) from Streptococcus pneumoniae 110K/70. Acta Crystallogr. D60:359-361. [[PubMed]
- 5. Bolotin, A., B. Quinquis, P. Renault, A. Sorokin, S. D. Ehrlich, S. Kulakauskas, A. Lapidus, E. Goltsman, M. Mazur, G. D. Pusch, M. Fonstein, R. Overbeek, N. Kyprides, B. Purnelle, D. Prozzi, K. Ngui, D. Masuy, F. Hancy, S. Burteau, M. Boutry, J. Delcour, A. Goffeau, and P. Hols. 2004. Complete sequence and comparative genome analysis of the dairy bacterium Streptococcus thermophilus. Nat. Biotechnol.22:1554-1558. [[PubMed]
- 6. Bolotin, A., B. Quinquis, A. Sorokin, and S. D. Ehrlich. 2005. Clustered regularly interspaced short palindrome repeats (CRISPRs) have spacers of extrachromosomal origin. Microbiology151:2551-2561. [[PubMed]
- 7. Bolotin, A., P. Wincker, S. Mauger, O. Jaillon, K. Malarme, J. Weissenbach, S. D. Ehrlich, and A. Sorokin. 2001. The complete genome sequence of the lactic acid bacterium Lactococcus lactis ssp. lactis IL1403. Genome Res.11:731-753.
- 8. Chaillou, S., M. C. Champomier-Verges, M. Cornet, A. M. Crutz-Le Coq, A. M. Dudez, V. Martin, S. Beaufils, E. Darbon-Rongere, R. Bossy, V. Loux, and M. Zagorec. 2005. The complete genome sequence of the meat-borne lactic acid bacterium Lactobacillus sakei 23K. Nat. Biotechnol.23:1527-1533. [[PubMed]
- 9. Claesson, M. J., Y. Li, S. Leahy, C. Canchaya, J. P. van Pijkeren, A. M. Cerdeno-Tarraga, J. Parkhill, S. Flynn, G. C. O'Sullivan, J. K. Collins, D. Higgins, F. Shanahan, G. F. Fitzgerald, D. van Sinderen, and P. W. O'Toole. 2006. Multireplicon genome architecture of Lactobacillus salivarius. Proc. Natl. Acad. Sci. USA103:6718-6723.
- 10. Cotter, P. D., C. Hill, and R. P. Ross. 2005. Bacteriocins: developing innate immunity for food. Nat. Rev. Microbiol.3:777-788. [[PubMed]
- 11. de Jong, A., S. A. van Hijum, J. J. Bijlsma, J. Kok, and O. P. Kuipers. 2006. BAGEL: a web-based bacteriocin genome mining tool. Nucleic Acids Res.34:W273-W279.
- 12. de Vos, W. M., M. Kleerebezem, and O. P. Kuipers. 2005. Lactic acid bacteria—genetics, metabolism and application. FEMS Microbiol. Rev.29:391. [[PubMed]
- 13. Dunny, G. M., and P. P. Cleary. 1991. Genetics and molecular biology of streptococci, lactococci, and enterococci. American Society for Microbiology, Washington, DC.
- 14. Hols, P., F. Hancy, L. Fontaine, B. Grossiord, D. Prozzi, N. Leblond-Bourget, B. Decaris, A. Bolotin, C. Delorme, S. Dusko Ehrlich, E. Guedon, V. Monnet, P. Renault, and M. Kleerebezem. 2005. New insights in the molecular biology and physiology of Streptococcus thermophilus revealed by comparative genomics. FEMS Microbiol. Rev.29:435-463. [[PubMed]
- 15. Jamet, E., S. D. Ehrlich, F. Duperray, and P. Renault. 2001. Étude des gènes dupliqués de la glycolyse chez Lactococcus lactis il1403. Lait81:115-129. [PubMed]
- 16. Jordan, I. K., K. S. Makarova, J. L. Spouge, Y. I. Wolf, and E. V. Koonin. 2001. Lineage-specific gene expansions in bacterial and archaeal genomes. Genome Res.11:555-565.
- 17. Katla, A. K., H. Kruse, G. Johnsen, and H. Herikstad. 2001. Antimicrobial susceptibility of starter culture bacteria used in Norwegian dairy products. Int. J. Food Microbiol.67:147-152. [[PubMed]
- 18. Kilstrup, M., K. Hammer, P. Ruhdal Jensen, and J. Martinussen. 2005. Nucleotide metabolism and its control in lactic acid bacteria. FEMS Microbiol. Rev.29:555-590. [[PubMed]
- 19. Klaenhammer, T., E. Altermann, F. Arigoni, A. Bolotin, F. Breidt, J. Broadbent, R. Cano, S. Chaillou, J. Deutscher, M. Gasson, M. van de Guchte, J. Guzzo, A. Hartke, T. Hawkins, P. Hols, R. Hutkins, M. Kleerebezem, J. Kok, O. Kuipers, M. Lubbers, E. Maguin, L. McKay, D. Mills, A. Nauta, R. Overbeek, H. Pel, D. Pridmore, M. Saier, D. van Sinderen, A. Sorokin, J. Steele, D. O'Sullivan, W. de Vos, B. Weimer, M. Zagorec, and R. Siezen. 2002. Discovering lactic acid bacteria by genomics. Antonie Leeuwenhoek82:29-58. [[PubMed]
- 20. Klaenhammer, T. R., R. Barrangou, B. L. Buck, M. A. Azcarate-Peril, and E. Altermann. 2005. Genomic features of lactic acid bacteria effecting bioprocessing and health. FEMS Microbiol. Rev.29:393-409. [[PubMed]
- 21. Klasson, L., and S. G. Andersson. 2004. Evolution of minimal-gene-sets in host-dependent bacteria. Trends Microbiol.12:37-43. [[PubMed]
- 22. Kleerebezem, M., J. Boekhorst, R. van Kranenburg, D. Molenaar, O. P. Kuipers, R. Leer, R. Tarchini, S. A. Peters, H. M. Sandbrink, M. W. Fiers, W. Stiekema, R. M. Lankhorst, P. A. Bron, S. M. Hoffer, M. N. Groot, R. Kerkhoven, M. de Vries, B. Ursing, W. M. de Vos, and R. J. Siezen. 2003. Complete genome sequence of Lactobacillus plantarum WCFS1. Proc. Natl. Acad. Sci. USA100:1990-1995.
- 23. Klijn, A., A. Mercenier, and F. Arigoni. 2005. Lessons from the genomes of bifidobacteria. FEMS Microbiol. Rev.29:491-509. [[PubMed]
- 24. Kok, J., G. Buist, A. L. Zomer, S. A. van Hijum, and O. P. Kuipers. 2005. Comparative and functional genomics of lactococci. FEMS Microbiol. Rev.29:411-433. [[PubMed]
- 25. Koonin, EV. 2005. Orthologs, paralogs, and evolutionary genomics. Annu. Rev. Genet.39:309-338. [[PubMed][Google Scholar]
- 26. Koonin, E. V., K. S. Makarova, and L. Aravind. 2001. Horizontal gene transfer in prokaryotes—quantification and classification. Annu. Rev. Microbiol.55:709-742.
- 27. Leipuviene, R., Q. Qian, and G. R. Bjork. 2004. Formation of thiolated nucleosides present in tRNA from Salmonella enterica serovar Typhimurium occurs in two principally distinct pathways. J. Bacteriol.186:758-766.
- 28. Makarova, K., A. Slesarev, Y. Wolf, A. Sorokin, E. Koonin, A. Pavlov, N. Pavlova, V. Karamychev, N. Polouchin, V. Shakhova, I. Grigoriev, Y. Lou, D. Rohksar, S. Lucas, K. Huang, D. M. Goodstein, T. Hawkins, V. Plengvidhya, D. Welker, J. Hughes, Y. Goh, A. Benson, K. Baldwin, J.-H. Lee, I. Diaz-Muniz, B. Dosti, V. Smeianov, W. Wechter, R. Barabote, G. Lorca, E. Altermann, R. Barrangou, B. Ganesan, Y. Xie, H. Rawsthorne, D. Tamir, C. Parker, F. Breidt, J. Broadbent, R. Hutkins, D. O'Sullivan, J. Steele, G. Unlu, M. Saier, T. Klaenhammer, P. Richardson, S. Kozyavkin, B. Weimer, and D. Mills. 2006. Comparative genomics of the lactic acid bacteria. Proc. Natl. Acad. Sci. USA103:15611-15616.
- 29. Makarova, K. S., N. V. Grishin, S. A. Shabalina, Y. I. Wolf, and E. V. Koonin. 2006. A putative RNA-interference-based immune system in prokaryotes: computational analysis of the predicted enzymatic machinery, functional analogies with eukaryotic RNAi, and hypothetical mechanisms of action. Biol. Direct.1:7.
- 30. Metivier, A., M. F. Pilet, X. Dousset, O. Sorokine, P. Anglade, M. Zagorec, J. C. Piard, D. Marion, Y. Cenatiempo, and C. Fremaux. 1998. Divercin V41, a new bacteriocin with two disulphide bonds produced by Carnobacterium divergens V41: primary structure and genomic organization. Microbiology144:2837-2844. [[PubMed]
- 31. Mills, D. A., H. Rawsthorne, C. Parker, D. Tamir, and K. Makarova. 2005. Genomic analysis of Oenococcus oeni PSU-1 and its relevance to winemaking. FEMS Microbiol. Rev.29:465-475. [[PubMed]
- 32. Mirkin, B. G., T. I. Fenner, M. Y. Galperin, and E. V. Koonin. 2003. Algorithms for computing parsimonious evolutionary scenarios for genome evolution, the last universal common ancestor and dominance of horizontal gene transfer in the evolution of prokaryotes. BMC Evol. Biol.3:2.
- 33. Mojica, F. J., C. Diez-Villasenor, J. Garcia-Martinez, and E. Soria. 2005. Intervening sequences of regularly spaced prokaryotic repeats derive from foreign genetic elements. J. Mol. Evol.60:174-182. [[PubMed]
- 34. Mora, D., E. Maguin, M. Masiero, C. Parini, G. Ricci, P. L. Manachini, and D. Daffonchio. 2004. Characterization of urease genes cluster of Streptococcus thermophilus. J. Appl. Microbiol.96:209-219. [[PubMed]
- 35. Moran, NA. 2003. Tracing the evolution of gene loss in obligate bacterial symbionts. Curr. Opin. Microbiol.6:512-518. [[PubMed][Google Scholar]
- 36. Natale, D. A., M. Y. Galperin, R. L. Tatusov, and E. V. Koonin. 2000. Using the COG database to improve gene recognition in complete genomes. Genetica108:9-17. [[PubMed]
- 37. Natale, D. A., U. T. Shankavaram, M. Y. Galperin, Y. I. Wolf, L. Aravind, and E. V. Koonin. 2000. Towards understanding the first genome sequence of a crenarchaeon by genome annotation using clusters of orthologous groups of proteins (COGs). Genome Biol.1:RESEARCH0009.
- 38. Nes, I. F., and O. Johnsborg. 2004. Exploration of antimicrobial potential in LAB by genomics. Curr. Opin. Biotechnol.15:100-104. [[PubMed]
- 39. Neves, A. R., W. A. Pool, J. Kok, O. P. Kuipers, and H. Santos. 2005. Overview on sugar metabolism and its control in Lactococcus lactis—the input from in vivo NMR. FEMS Microbiol. Rev.29:531-554. [[PubMed]
- 40. Nishi, H., H. Komatsuzawa, T. Fujiwara, N. McCallum, and M. Sugai. 2004. Reduced content of lysyl-phosphatidylglycerol in the cytoplasmic membrane affects susceptibility to moenomycin, as well as vancomycin, gentamicin, and antimicrobial peptides, in Staphylococcus aureus. Antimicrob. Agents Chemother.48:4800-4807.
- 41. Novichkov, P. S., M. V. Omelchenko, M. S. Gelfand, A. A. Mironov, Y. I. Wolf, and E. V. Koonin. 2004. Genome-wide molecular clock and horizontal gene transfer in bacterial evolution. J. Bacteriol.186:6575-6585.
- 42. Pedersen, M. B., S. L. Iversen, K. I. Sorensen, and E. Johansen. 2005. The long and winding road from the research laboratory to industrial applications of lactic acid bacteria. FEMS Microbiol. Rev.29:611-624. [[PubMed]
- 43. Pridmore, R. D., B. Berger, F. Desiere, D. Vilanova, C. Barretto, A. C. Pittet, M. C. Zwahlen, M. Rouvet, E. Altermann, R. Barrangou, B. Mollet, A. Mercenier, T. Klaenhammer, F. Arigoni, and M. A. Schell. 2004. The genome sequence of the probiotic intestinal bacterium Lactobacillus johnsonii NCC 533. Proc. Natl. Acad. Sci. USA101:2512-2517.
- 44. Reichard, K., and MKaufmann. 2003. EPPS: mining the COG database by an extended phylogenetic patterns search. Bioinformatics19:784-785. [[PubMed][Google Scholar]
- 45. Sauvageot, N., C. Muller, A. Hartke, Y. Auffray, and J. M. Laplace. 2002. Characterisation of the diol dehydratase pdu operon of Lactobacillus collinoides. FEMS Microbiol. Lett.209:69-74. [[PubMed]
- 46. Schell, M. A., M. Karmirantzou, B. Snel, D. Vilanova, B. Berger, G. Pessi, M. C. Zwahlen, F. Desiere, P. Bork, M. Delley, R. D. Pridmore, and F. Arigoni. 2002. The genome sequence of Bifidobacterium longum reflects its adaptation to the human gastrointestinal tract. Proc. Natl. Acad. Sci. USA99:14422-14427.
- 47. Siezen, R. J., B. Renckens, I. van Swam, S. Peters, R. van Kranenburg, M. Kleerebezem, and W. M. de Vos. 2005. Complete sequences of four plasmids of Lactococcus lactis subsp. cremoris SK11 reveal extensive adaptation to the dairy environment. Appl. Environ. Microbiol.71:8371-8382.
- 48. Siezen, R. J., F. H. van Enckevort, M. Kleerebezem, and B. Teusink. 2004. Genome data mining of lactic acid bacteria: the impact of bioinformatics. Curr. Opin. Biotechnol.15:105-115. [[PubMed]
- 49. Smit, G., B. A. Smit, and W. J. Engels. 2005. Flavour formation by lactic acid bacteria and biochemical flavour profiling of cheese products. FEMS Microbiol. Rev.29:591-610. [[PubMed]
- 50. Stiles, M. E., and W. H. Holzapfel. 1997. Lactic acid bacteria of foods and their current taxonomy. Int. J. Food Microbiol.36:1-29. [[PubMed]
- 51. Takezaki, N., A. Rzhetsky, and M. Nei. 1995. Phylogenetic test of the molecular clock and linearized trees. Mol. Biol. Evol.12:823-833. [[PubMed]
- 52. Tatusov, R. L., N. D. Fedorova, J. D. Jackson, A. R. Jacobs, B. Kiryutin, E. V. Koonin, D. M. Krylov, R. Mazumder, S. L. Mekhedov, A. N. Nikolskaya, B. S. Rao, S. Smirnov, A. V. Sverdlov, S. Vasudevan, Y. I. Wolf, J. J. Yin, and D. A. Natale. 2003. The COG database: an updated version includes eukaryotes. BMC Bioinformatics4:41.
- 53. Tatusov, R. L., E. V. Koonin, and D. J. Lipman. 1997. A genomic perspective on protein families. Science278:631-637. [[PubMed]
- 54. Teuber, M., L. Meile, and F. Schwarz. 1999. Acquired antibiotic resistance in lactic acid bacteria from food. Antonie Leeuwenhoek76:115-137. [[PubMed]
- 55. Twomey, D., R. P. Ross, M. Ryan, B. Meaney, and C. Hill. 2002. Lantibiotics produced by lactic acid bacteria: structure, function and applications. Antonie Leeuwenhoek82:165-185. [[PubMed]
- 56. Vandamme, P., B. Pot, M. Gillis, P. de Vos, K. Kersters, and J. Swings. 1996. Polyphasic taxonomy, a consensus approach to bacterial systematics. Microbiol. Rev.60:407-438.
- 57. van de Guchte, M., S. Penaud, C. Grimaldi, V. Barbe, K. Bryson, P. Nicolas, C. Robert, S. Oztas, S. Mangenot, A. Couloux, V. Loux, R. Dervyn, R. Bossy, A. Bolotin, J. M. Batto, T. Walunas, J. F. Gibrat, P. Bessieres, J. Weissenbach, S. D. Ehrlich, and E. Maguin. 2006. The complete genome sequence of Lactobacillus bulgaricus reveals extensive and ongoing reductive evolution. Proc. Natl. Acad. Sci. USA103:9274-9279.
- 58. Vaughan, E. E., H. G. Heilig, K. Ben-Amor, and W. M. de Vos. 2005. Diversity, vitality and activities of intestinal lactic acid bacteria and bifidobacteria assessed by molecular approaches. FEMS Microbiol. Rev.29:477-490. [[PubMed]
- 59. Wolf, Y. I., I. B. Rogozin, N. V. Grishin, R. L. Tatusov, and E. V. Koonin. 2001. Genome trees constructed using five different approaches suggest new major bacterial clades. BMC Evol Biol.1:8.
- 60. Wood, B. J. B., and W. H. Holzapfel. 1995. The genera of lactic acid bacteria, 1st ed. Blackie Academic and Professional, Glasgow, United Kingdom.
- 61. Wood, B. J. B., and P. J. Warner. 2003. Genetics of lactic acid bacteria. Kluwer Academic/Plenum Publishers, New York, NY.
- 62. Yang, D., and C. R. Woese. 1989. A phylogenetic analysis of lactobacilli, Pediococcus pentosaceus and Leuconostoc mesenteroides. Syst. Appl. Microbiol.12:145-149. [[PubMed]
- 63. Yeats, C., R. D. Finn, and A. Bateman. 2002. The PASTA domain: a beta-lactam-binding domain. Trends Biochem. Sci.27:438. [[PubMed]






