Inference of population structure using multilocus genotype data.
Journal: 2000/September - Genetics
ISSN: 0016-6731
PUBMED: 10835412
Abstract:
We describe a model-based clustering method for using multilocus genotype data to infer population structure and assign individuals to populations. We assume a model in which there are K populations (where K may be unknown), each of which is characterized by a set of allele frequencies at each locus. Individuals in the sample are assigned (probabilistically) to populations, or jointly to two or more populations if their genotypes indicate that they are admixed. Our model does not assume a particular mutation process, and it can be applied to most of the commonly used genetic markers, provided that they are not closely linked. Applications of our method include demonstrating the presence of population structure, assigning individuals to populations, studying hybrid zones, and identifying migrants and admixed individuals. We show that the method can produce highly accurate assignments using modest numbers of loci-e.g. , seven microsatellite loci in an example using genotype data from an endangered bird species. The software used for this article is available from http://www.stats.ox.ac.uk/ approximately pritch/home. html.
Relations:
Content
Citations
(5K+)
References
(12)
Grants
(2)
Organisms
(1)
Processes
(2)
Affiliates
(1)
Similar articles
Articles by the same authors
Discussion board
Genetics 155(2): 945-959

Inference of population structure using multilocus genotype data.

Abstract

We describe a model-based clustering method for using multilocus genotype data to infer population structure and assign individuals to populations. We assume a model in which there are K populations (where K may be unknown), each of which is characterized by a set of allele frequencies at each locus. Individuals in the sample are assigned (probabilistically) to populations, or jointly to two or more populations if their genotypes indicate that they are admixed. Our model does not assume a particular mutation process, and it can be applied to most of the commonly used genetic markers, provided that they are not closely linked. Applications of our method include demonstrating the presence of population structure, assigning individuals to populations, studying hybrid zones, and identifying migrants and admixed individuals. We show that the method can produce highly accurate assignments using modest numbers of loci-e.g. , seven microsatellite loci in an example using genotype data from an endangered bird species. The software used for this article is available from http://www.stats.ox.ac.uk/ approximately pritch/home. html.

Full Text

The Full Text of this article is available as a PDF (245K).

Selected References

These references are in PubMed. This may not be the complete list of references from this article.
  • Balding DJ, Nichols RA. DNA profile match probability calculation: how to allow for population stratification, relatedness, database selection and single bands. Forensic Sci Int. 1994 Feb;64(2-3):125–140. [PubMed] [Google Scholar]
  • Balding DJ, Nichols RA. A method for quantifying differentiation between populations at multi-allelic loci and its implications for investigating identity and paternity. Genetica. 1995;96(1-2):3–12. [PubMed] [Google Scholar]
  • Bowcock AM, Ruiz-Linares A, Tomfohrde J, Minch E, Kidd JR, Cavalli-Sforza LL. High resolution of human evolutionary trees with polymorphic microsatellites. Nature. 1994 Mar 31;368(6470):455–457. [PubMed] [Google Scholar]
  • Davies N, Villablanca FX, Roderick GK. Determining the source of individuals: multilocus genotyping in nonequilibrium population genetics. Trends Ecol Evol. 1999 Jan;14(1):17–21. [PubMed] [Google Scholar]
  • Ewens WJ, Spielman RS. The transmission/disequilibrium test: history, subdivision, and admixture. Am J Hum Genet. 1995 Aug;57(2):455–464.[PMC free article] [PubMed] [Google Scholar]
  • Goldstein DB, Pollock DD. Launching microsatellites: a review of mutation processes and methods of phylogenetic interference. J Hered. 1997 Sep-Oct;88(5):335–342. [PubMed] [Google Scholar]
  • Jorde LB, Bamshad MJ, Watkins WS, Zenger R, Fraley AE, Krakowiak PA, Carpenter KD, Soodyall H, Jenkins T, Rogers AR. Origins and affinities of modern humans: a comparison of mitochondrial and nuclear genetic data. Am J Hum Genet. 1995 Sep;57(3):523–538.[PMC free article] [PubMed] [Google Scholar]
  • Mountain JL, Cavalli-Sforza LL. Multilocus genotypes, a tree of individuals, and human evolutionary history. Am J Hum Genet. 1997 Sep;61(3):705–718.[PMC free article] [PubMed] [Google Scholar]
  • Paetkau D, Calvert W, Stirling I, Strobeck C. Microsatellite analysis of population structure in Canadian polar bears. Mol Ecol. 1995 Jun;4(3):347–354. [PubMed] [Google Scholar]
  • Parra EJ, Marcini A, Akey J, Martinson J, Batzer MA, Cooper R, Forrester T, Allison DB, Deka R, Ferrell RE, et al. Estimating African American admixture proportions by use of population-specific alleles. Am J Hum Genet. 1998 Dec;63(6):1839–1851.[PMC free article] [PubMed] [Google Scholar]
  • Pritchard JK, Rosenberg NA. Use of unlinked genetic markers to detect population stratification in association studies. Am J Hum Genet. 1999 Jul;65(1):220–228.[PMC free article] [PubMed] [Google Scholar]
  • Rannala B, Mountain JL. Detecting immigration by using multilocus genotypes. Proc Natl Acad Sci U S A. 1997 Aug 19;94(17):9197–9201.[PMC free article] [PubMed] [Google Scholar]
Department of Statistics, University of Oxford, United Kingdom. pritch@tats.ox.ac.uk
Department of Statistics, University of Oxford, United Kingdom. pritch@tats.ox.ac.uk

Abstract

We describe a model-based clustering method for using multilocus genotype data to infer population structure and assign individuals to populations. We assume a model in which there are K populations (where K may be unknown), each of which is characterized by a set of allele frequencies at each locus. Individuals in the sample are assigned (probabilistically) to populations, or jointly to two or more populations if their genotypes indicate that they are admixed. Our model does not assume a particular mutation process, and it can be applied to most of the commonly used genetic markers, provided that they are not closely linked. Applications of our method include demonstrating the presence of population structure, assigning individuals to populations, studying hybrid zones, and identifying migrants and admixed individuals. We show that the method can produce highly accurate assignments using modest numbers of loci-e.g. , seven microsatellite loci in an example using genotype data from an endangered bird species. The software used for this article is available from http://www.stats.ox.ac.uk/ approximately pritch/home. html.

Abstract
Full Text
Selected References
Collaboration tool especially designed for Life Science professionals.Drag-and-drop any entity to your messages.