Estimating genome conservation between crop and model legume species
Abstract
Legumes are simultaneously one of the largest families of crop plants and a cornerstone in the biological nitrogen cycle. We combined molecular and phylogenetic analyses to evaluate genome conservation both within and between the two major clades of crop legumes. Genetic mapping of orthologous genes identifies broad conservation of genome macrostructure, especially within the galegoid legumes, while also highlighting inferred chromosomal rearrangements that may underlie the variation in chromosome number between these species. As a complement to comparative genetic mapping, we compared sequenced regions of the model legume Medicago truncatula with those of the diploid Lotus japonicus and the polyploid Glycine max. High conservation was observed between the genomes of M. truncatula and L. japonicus, whereas lower levels of conservation were evident between M. truncatula and G. max. In all cases, conserved genome microstructure was punctuated by significant structural divergence, including frequent insertion/deletion of individual genes or groups of genes and lineage-specific expansion/contraction of gene families. These results suggest that comparative mapping may have considerable utility for basic and applied research in the legumes, although its predictive value is likely to be tempered by phylogenetic distance and genome duplication.
The Fabaceae, or legumes, are cultivated on 180 million hectares, involving ≈12% of Earth's arable land and accounting for ≈27% of the world's primary crop production (1). Their unusual capacity for symbiotic nitrogen fixation underlies their importance as a source of protein in the human diet and of nitrogen in both natural and agricultural ecosystems. Legumes are also increasingly recognized as a source of valuable secondary metabolites. These factors have fueled a significant increase in legume research over the past decade.
The ≈20,000 legume species are divided into three subfamilies: Mimosoideae, Caesalpinioideae, and the numerically and economically dominant Papilionoideae (2). With the notable exception of peanut, the important crop legumes occur in two Papilionoid clades, referred to here as the “phaseoloid” and “galegoid” legumes (Table 1). Despite their close phylogenetic affiliations (Fig. 1), the genetic systems represented within this group are diverse, ranging from simple autogamous diploids to complex out-crossing polyploids. Genome size also varies widely among legumes, with pea having a genome size 10 times that of some related diploid genera.
Taxonomic relationships within the two major clades of crop legumes, the prevailing view of phylogeny for the species under analysis, with divergence times estimated based on Penalised Likelihood analysis (2). Most crop legumes occur either within the galegoid clade, including tribes Viceae, Trifolieae, and Cicereae, or within the phaseoloid clade, which is synonymous with the tribe Phaseoleae. MYA, million years ago.
Table 1.
| Species | Common name | Genome size, Mbp | N | Tribe | Clade | SL | PL | Genotypes |
|---|---|---|---|---|---|---|---|---|
| M. truncatula | Barrel medic | 500 | 8 | Trifoleae | Galegoid | 183 | 130 | A17, A20, DZA |
| M. sativa | Alfalfa | 1,600 | 16 | Trifoleae | Galegoid | 70 | 68 | Mscw2, Msq93 |
| Pi. sativum | Pea | 5,000 | 7 | Viceae | Galegoid | 101 | 68 | JI15, JI281, JI399, JI194 |
| G. max | Soybean | 1,100 | 20 | Phaseoleae | Phaseoloid | 56 | 15 | PI209322, Evans |
| V. radiata | Mung bean | 520 | 11 | Phaseoleae | Phaseoloid | 62 | 31 | TC1966, VC3890 |
| Ph. vulgaris | Common bean | 620 | 11 | Phaseoleae | Phaseoloid | 37 | 22 | BAT93, Jalo |
| L. japonicus | Bird's foot trefoil | 500 | 6 | Loteae | 67 | 44 | Lotus filicaulis, L. japonicus Gifu |
SL, sequenced loci; PL, polymorphic loci; N, gametic chromosome number.
The large number of important legume species precludes their simultaneous in-depth characterization. Moreover, several crop legumes have one or more characters (e.g., medium to large genomes and/or polyploid nature) that limit their utility as experimental systems. Two legumes with favorable genetic attributes, namely Medicago truncatula and Lotus japonicus, have been selected as model species and are the focus of large multinational genome projects. The early fruits of working with these well characterized genomes are evident in the recent advances in our understanding of symbiotic nitrogen fixation in both M. truncatula and L. japonicus (3).
A pressing need in legume genomics is to integrate knowledge gained from the study of model legume genomes with the biological and agronomic questions of importance in the crop species. Comparative genetic mapping is well established in several plant families, most notably the Poaceae (4), where initial studies predicted that synteny would greatly facilitate gene discovery among related species (5, 6). However, even closely related grass species (7, 8), in some cases members of the same species (9), can exhibit significant divergence in genome organization. It is important to know whether similar features are prevalent in other plant families, in particular because the extent of such differences may define the limits of comparative structural genomics as a strategy for applied agriculture.
Here we combined genetic and phylogenetic analyses to map putatively orthologous genes across seven legume species. Complementing the genetic linkage analysis, we surveyed the conservation of genome microstructure between M. truncatula and L. japonicus and M. truncatula and Glycine max (soybean) by comparing fully sequenced bacterial artificial chromosome (BAC) clones. The combined genetic, phylogenetic, and genomic analyses demonstrate extensive conservation of gene order and orthology between the crop and model legumes and also identify features of structural divergence between these genomes.
SL, sequenced loci; PL, polymorphic loci; N, gametic chromosome number.
Click here to view.Acknowledgments
This work was supported by National Science Foundation Grant 0110206 (to D.R.C., N.D.Y., and D.-J.K.); the Samuel Roberts Noble Foundation (to B.R.); the European Union MEDICAGO project (QLG2-CT-2000-30676) and the Biotechnology and Biological Sciences Research Council (to N.E.); and Nemzeti Kutatás es Feilesztesi Program and the (HNDRP) [Országos Tudományos Kutatási Alap Programok (OTKA) and T038211, to G.B.K.].
Notes
This paper was submitted directly (Track II) to the PNAS office.
Abbreviations: BAC, bacterial artificial chromosome; NCBI, National Center for Biotechnology Information.
References
- 1. Graham, P. H. & Vance, C. P. (2003) Plant Physiol.131, 872-877.
- 2. Doyle, J. J. & Luckow, M. A. (2003) Plant Physiol.131, 900-910.
- 3. Riely, B., Ane, J.-M., Penmetsa, R. V. & Cook, D. R. (2004) Curr. Opin. Plant Biol. (2004) 7, 408-413. [[PubMed]
- 4. Delseny, M(2004) Curr. Opin. Plant Biol.7, 126-131. [[PubMed][Google Scholar]
- 5. Devos, K. M. & Gale, M. D. (2000) Plant Cell12, 637-646.
- 6. Bennetzen, J. L. & Freeling, M. (1997) Genome Res.7, 301-306. [[PubMed]
- 7. Bennetzen, J. L. & Ramakrishna, W. (2002) Plant Mol. Biol.48, 821-817. [[PubMed]
- 8. Song, R., Llaca, V. & Messing, J. (2002) Genome Res.12, 1549-1555.
- 9. Fu, H. & Dooner, H. K. (2002) Proc. Natl. Acad. Sci. USA99, 9573-9578.
- 10. Choi, H. K., Kim, D., Uhm, T., Limpens, E., Lim, H., Mun, J. H., Kalo, P., Penmetsa, R. V., Seres, A., Kulikova, O., et al. (2004) Genetics166, 1463-1502.
- 11. Vallejos, C. E., Sakiyama, N. S. & Chase, C. D. (1992) Genetics131, 733-740.
- 12. Boutin, S. R., Young, N. D., Olson, T. C., Yu, Z. H., Shoemaker, R. C. & Vallejos, C. E. (1995) Genome38, 928-937. [[PubMed]
- 13. Ellis, T. H. N. & Poyser, S. J. (2002) New Phytol.153, 17-25. [PubMed]
- 14. Humphry, E., Konduri, V., Lambrides, J., Magner, T., McIntyre, L., Aitken, B. & Liu, J. (2002) Theor. Appl. Genet.105, 160-166. [[PubMed]
- 15. Lambrides, C. J., Lawn, R. J., Godwin, I. D., Manners, J. & Imrie, B. C. (2000) Aust. J. Agric. Res.51, 415-425. [PubMed]
- 16. Vision, T. J., Brown, D. G., Shmoys, D. B., Durrett, R. T. & Tanksley, S. D. (2000) Genetics155, 407-420.
- 17. Swofford, D. L. (2003) paup, Phylogenetic Analyses Using Parsimony (and Other Methods) (Sinauer, Sunderland, MA), Version 4.0b10.
- 18. Sato, S., Kaneko, T., Nakamura, Y., Asamizu, E., Kato, T. & Tabata, S. (2001) DNA Res.8, 311-318. [[PubMed]
- 19. Wojciechowski, M. F. (2004) Advances in Legume Systematics, Part 10: Higher Level Systematics, eds. Klitgaard, B. B. & Bruneau, A. (Royal Botanic Gardens, Kew, U.K.), pp. 5-35.
- 20. Kulikova, O., Gualtieri, G., Geurts, R., Kim, D., Cook, D. R., Huguet, T., de Jong, J. H., Fransz, P. F. & Bisseling, T. (2001) Plant J.27, 49-58. [[PubMed]
- 21. Yan, H., Mudge, J., Kim, D., Shoemaker, R. C., Cook, D. R. & Young, N. D. (2003) Theor. Appl. Genet.106, 1256-1265. [[PubMed]
- 22. Yan, H. H., Mudge, J., Kim, D., Larsen, D., Denny, R., Shoemaker, R. C., Cook, D. R. & Young, N. D. (2004) Genome47, 141-155. [[PubMed]
- 23. Bowers, J. E., Chapman, B. A., Rong, J. & Paterson, A. H. (2003) Nature422, 233-438. [PubMed]
- 24. Ku, H. M., Vision, T., Liu, J. & Tanksley, S. D. (2000) Proc. Natl. Acad. Sci. USA97, 9121-9126.
- 25. Vision, T. J., Brown, D. G. & Tanksley, S. D. (2000) Science290, 2114-2117. [[PubMed]
- 26. Greilhuber, J. & Ebert, I. (1994) Genome37, 646-655. [[PubMed]
- 27. Vershinin, A. V., Allnutt, T. R., Knox, M. R., Ambrose, M. J. & Ellis, T. H. N. (2003) Mol. Biol. Evol.20, 2067-2075. [[PubMed]
- 28. Zhu, H. Y., Cannon, S., Young, N. D. & Cook, D. R. (2002) Mol. Plant–Microbe Interact.15, 529-539. [[PubMed]
- 29. Kaló, P., Seres, A., Taylor, S. A., Jakab, J., Kevei, Z., Kereszt, A., Endre, G., Ellis, T. H. N. & Kiss, G. B. (2004) Mol. Genet. Genomics, in press.
- 30. Mural, R. J., Adams, M. D., Myers, E. W., Smith, H. O., Miklos, G. L. G., Wides, R., Halpern, A., Li, P. W., Sutton, G. G., Nadeau, J., et al. (2002) Science296, 1661-1671. [[PubMed]
- 31. Cannon, S. B., McCombie, W. R., Sato, S., Tabata, S., Denny, R., Palmer, L., Katari, M. & Young, N. D. (2003) Mol. Genet. Genomics270, 347-361. [[PubMed]
- 32. Zhu, H. Y., Kim, D., Baek, J. M., Choi, H. K., Ellis, L., Kuester, H., McCombie, W. R., Peng, H. M. & Cook, D. R. (2003) Plant Physiol.131, 1018-1026.
- 33. Vandepoele, K., Simillion, C. & Van de Peer, Y. (2002) Trends Genet.18, 606-608. [[PubMed]
- 34. Doust, A. N., Devos, K. M., Gadberry, M. D., Gale, M. D. & Kellog, E. A. (2004) Proc. Natl. Acad. Sci. USA101, 9045-9050.




