Global variation in copy number in the human genome
Abstract
Copy number variation (CNV) of DNA sequences is functionally significant but has yet to be fully ascertained. We have constructed a first-generation CNV map of the human genome through the study of 270 individuals from four populations with ancestry in Europe, Africa or Asia (the HapMap collection). DNA from these individuals was screened for CNV using two complementary technologies: single nucleotide polymorphism (SNP) genotyping arrays, and clone-based comparative genomic hybridization. 1,447 copy number variable regions covering 360 megabases (12% of the genome) were identified in these populations; these CNV regions contained hundreds of genes, disease loci, functional elements and segmental duplications. Strikingly, these CNVs encompassed more nucleotide content per genome than SNPs, underscoring the importance of CNV in genetic diversity and evolution. The data obtained delineate linkage disequilibrium patterns for many CNVs, and reveal dramatic variation in copy number among populations. We also demonstrate the utility of this resource for genetic disease studies.
References
- 1. Iafrate AJ, et al Detection of large-scale variation in the human genome. Nat Genet. 2004;36:949–51.[PubMed][Google Scholar]
- 2. Sebat J, et al Large-scale copy number polymorphism in the human genome. Science. 2004;305:525–8.[PubMed][Google Scholar]
- 3. Sharp AJ, et al Segmental duplications and copy-number variation in the human genome. Am J Hum Genet. 2005;77:78–88.[Google Scholar]
- 4. Tuzun E, et al Fine-scale structural variation of the human genome. Nat Genet. 2005;37:727–732.[PubMed][Google Scholar]
- 5. Conrad DF, Andrews TD, Carter NP, Hurles ME, Pritchard JKA high-resolution survey of deletion polymorphism in the human genome. Nat Genet. 2006;38:75–81.[PubMed][Google Scholar]
- 6. Hinds DA, Kloek AP, Jen M, Chen X, Frazer KACommon deletions and SNPs are in linkage disequilibrium in the human genome. Nat Genet. 2006;38:82–5.[PubMed][Google Scholar]
- 7. McCarroll SA, et al Common deletion polymorphisms in the human genome. Nat Genet. 2006;38:86–92.[PubMed][Google Scholar]
- 8. Locke DP, et al Linkage Disequilibrium and Heritability of CNPs within Duplicated Regions of the Human Genome. Am J Hum Genet. 2006;79:275–290.[Google Scholar]
- 9. Fredman D, et al Complex SNP-related sequence variation in segmental genome duplications. Nat Genet. 2004;36:861–6.[PubMed][Google Scholar]
- 10. Feuk L, Carson AR, Scherer SWStructural variation in the human genome. Nat Rev Genet. 2006;7:85–97.[PubMed][Google Scholar]
- 11. Freeman JL, et al Copy Number Variation: New Insights in Genome Diversity. Genome Research. 2006;16:949–961.[PubMed][Google Scholar]
- 12. Bridges CBThe Bar ‘gene’: a duplication. Science. 1936;83:210–211.[PubMed][Google Scholar]
- 13. Buckland PRPolymorphically duplicated genes: their relevance to phenotypic variation in humans. Ann Med. 2003;35:308–15.[PubMed][Google Scholar]
- 14. Nguyen DQ, Webber C, Ponting CPBias of Selection on Human Copy-Number Variants. PLoS Genet. 2006;2:e20.[Google Scholar]
- 15. Repping S, et al High mutation rates have driven extensive structural polymorphism among human Y chromosomes. Nat Genet. 2006;38:463–7.[PubMed][Google Scholar]
- 16. Inoue K, Lupski JRMolecular mechanisms for genomic disorders. Annu Rev Genomics Hum Genet. 2002;3:199–242.[PubMed][Google Scholar]
- 17. Lupski JR, Stankiewicz PGenomic disorders: molecular mechanisms for rearrangements and conveyed phenotypes. PLoS Genet. 2005;1:e49.[Google Scholar]
- 18. Shaw-Smith C, et al Microarray based comparative genomic hybridisation (array-CGH) detects submicroscopic chromosomal deletions and duplications in patients with learning disability/mental retardation and dysmorphic features. J Med Genet. 2004;41:241–8.[Google Scholar]
- 19. Gonzalez E, et al The influence of CCL3L1 gene-containing segmental duplications on HIV-1/AIDS susceptibility. Science. 2005;307:1434–40.[PubMed][Google Scholar]
- 20. Aitman TJ, et al Copy number polymorphism in Fcgr3 predisposes to glomerulonephritis in rats and humans. Nature. 2006;439:851–5.[PubMed][Google Scholar]
- 21. Jongmans MC, et al CHARGE syndrome: the phenotypic spectrum of mutations in the CHD7 gene. J Med Genet. 2006;43:306–14.[Google Scholar]
- 22. Singleton AB, et al alpha-Synuclein locus triplication causes Parkinson's disease. Science. 2003;302:841.[PubMed][Google Scholar]
- 23. Rovelet-Lecrux A, et al APP locus duplication causes autosomal dominant early-onset Alzheimer disease with cerebral amyloid angiopathy. Nat Genet. 2006;38:24–6.[PubMed][Google Scholar]
- 24. Feuk L, Marshall CR, Wintle RF, Scherer SWStructural variants: changing the landscape of chromosomes and design of disease studies. Hum Mol Genet. 2006;15(Suppl 1):R57–66.[PubMed][Google Scholar]
- 25. IHMC A haplotype map of the human genome. Nature. 2005;437:1299–320.
- 26. Fiegler H, et al Accurate and reliable high-throughput detection of copy number variation in the human genome. Genome Res. in press.
- 27. Ylstra B, van den Ijssel P, Carvalho B, Brakenhoff RH, Meijer GABAC to the future! or oligonucleotides: a perspective for micro array comparative genomic hybridization (array CGH) Nucleic Acids Res. 2006;34:445–50.[Google Scholar]
- 28. Komura D, et al Genome-Wide Detection of Human Copy Number Variants using High Density DNA Oligonucleotide Arrays. Genome Res. in press.
- 29. de Vries BB, et al Diagnostic genome profiling in mental retardation. Am J Hum Genet. 2005;77:606–16.[Google Scholar]
- 30. Wirtenberger M, Hemminki K, Chen B, Burwinkel BSNP microarray analysis for genome-wide detection of crossover regions. Hum Genet. 2005;117:389–97.[PubMed][Google Scholar]
- 31. Marth GT, Czabarka E, Murvai J, Sherry STThe allele frequency spectrum in genome-wide human variation data reveals signals of differential demographic history in three large world populations. Genetics. 2004;166:351–72.[Google Scholar]
- 32. Hurles MHow homologous recombination generates a mutable genome. Hum Genomics. 2005;2:179–86.[Google Scholar]
- 33. Stankiewicz P, Lupski JRGenome architecture, rearrangements and genomic disorders. Trends Genet. 2002;18:74–82.[PubMed][Google Scholar]
- 34. Bailey JA, et al Recent segmental duplications in the human genome. Science. 2002;297:1003–7.[PubMed][Google Scholar]
- 35. Cheung J, et al Genome-wide detection of segmental duplications and potential assembly errors in the human genome sequence. Genome Biol. 2003;4:R25.[Google Scholar]
- 36. Aradhya S, et al Multiple pathogenic and benign genomic rearrangements occur at a 35 kb duplication involving the NEMO and LAGE2 genes. Hum Mol Genet. 2001;10:2557–67.[PubMed][Google Scholar]
- 37. Neitz M, Neitz JNumbers and ratios of visual pigment genes for normal red-green color vision. Science. 1995;267:1013–6.[PubMed][Google Scholar]
- 38. Chance PF, et al 2 Autosomal-Dominant Neuropathies Result from Reciprocal DNA Duplication/Deletion of a Region on Chromosome-17. Hum Mol Genet. 1994;3:223–228.[PubMed][Google Scholar]
- 39. Lupski JR, Stankiewicz P, editors. Genomic Disorders: the genomic basis of disease. Humana Press; Totawa, New Jersey: 2006. [PubMed]
- 40. Deininger PL, Batzer MAAlu repeats and human disease. Mol Genet Metab. 1999;67:183–93.[PubMed][Google Scholar]
- 41. Kurtz S, et al REPuter: the manifold applications of repeat analysis on a genomic scale. Nucleic Acids Res. 2001;29:4633–42.[Google Scholar]
- 42. Hurles MGene duplication: the genomic trade in spare parts. PLoS Biol. 2004;2:E206.[Google Scholar]
- 43. Drake JA, et al Conserved noncoding sequences are selectively constrained and not mutation cold spots. Nat Genet. 2006;38:223–7.[PubMed][Google Scholar]
- 44. Bejerano G, et al Ultraconserved elements in the human genome. Science. 2004;304:1321–5.[PubMed][Google Scholar]
- 45. Brewer C, Holloway S, Zawalnyski P, Schinzel A, FitzPatrick DA chromosomal duplication map of malformations: regions of suspected haplo- and triplolethality–and tolerance of segmental aneuploidy–in humans. Am J Hum Genet. 1999;64:1702–8.[Google Scholar]
- 46. Sokal RR, Rohlf JF Biometry. W.H. Freeman and Co; 1994. [PubMed][Google Scholar]
- 47. Futreal PA, et al A census of human cancer genes. Nat Rev Cancer. 2004;4:177–83.[Google Scholar]
- 48. Millar JK, et al DISC1 and PDE4B are interacting genetic factors in schizophrenia that regulate cAMP signaling. Science. 2005;310:1187–91.[PubMed][Google Scholar]
- 49. Sasaki K, Sasaki H, Jonasson F, Kojima M, Cheng HMRacial differences of lens transparency properties with aging and prevalence of age-related cataract applying a WHO classification system. Ophthalmic Res. 2004;36:332–40.[PubMed][Google Scholar]
- 50. Valentonyte R, et al Sarcoidosis is associated with a truncating splice site mutation in BTNL2. Nat Genet. 2005;37:357–64.[PubMed][Google Scholar]
- 51. Orru S, Giuressi E, Carcassi C, Casula M, Contu LMapping of the major psoriasis-susceptibility locus (PSORS1) in a 70-Kb interval around the corneodesmosin gene (CDSN) Am J Hum Genet. 2005;76:164–71.[Google Scholar]
- 52. Christiansen J, et al Chromosome 1q21.1 contiguous gene deletion is associated with congenital heart disease. Circ Res. 2004;94:1429–35.[PubMed][Google Scholar]
- 53. Lupski JR, et al DNA duplication associated with Charcot-Marie-Tooth disease type 1A. Cell. 1991;66:219–32.[PubMed][Google Scholar]
- 54. Chance PF, et al DNA deletion associated with hereditary neuropathy with liability to pressure palsies. Cell. 1993;72:143–51.[PubMed][Google Scholar]
- 55. Matise TC, et al Detection of tandem duplications and implications for linkage analysis. Am J Hum Genet. 1994;54:1110–21.[Google Scholar]
- 56. Hirschhorn JN, Daly MJGenome-wide association studies for common diseases and complex traits. Nat Rev Genet. 2005;6:95–108.[PubMed][Google Scholar]
- 57. Wong Z, Royle NJ, Jeffreys AJA novel human DNA polymorphism resulting from transfer of DNA from chromosome 6 to chromosome 16. Genomics. 1990;7:222–34.[PubMed][Google Scholar]
- 58. Woodward KJ, et al Heterogeneous duplications in patients with Pelizaeus-Merzbacher disease suggest a mechanism of coupled homologous and nonhomologous recombination. Am J Hum Genet. 2005;77:966–87.[Google Scholar]
- 59. Rosenberg NA, et al Genetic structure of human populations. Science. 2002;298:2381–5.[PubMed][Google Scholar]
- 60. Bamshad MJ, et al Human population genetic structure and inference of group membership. Am J Hum Genet. 2003;72:578–89.[Google Scholar]
- 61. Pritchard JK, Stephens M, Donnelly PInference of population structure using multilocus genotype data. Genetics. 2000;155:945–59.[Google Scholar]
- 62. Weir BS, Cockerham CCEstimating F-statistics for the analysis of population structure. Evolution. 1984;38:1358–1370.[PubMed][Google Scholar]
- 63. Turgeon D, Carrier JS, Chouinard S, Belanger AGlucuronidation activity of the UGT2B17 enzyme toward xenobiotics. Drug Metab Dispos. 2003;31:670–6.[PubMed][Google Scholar]
- 64. Wilson W, 3rd, et al Characterization of a common deletion polymorphism of the UGT2B17 gene linked to UGT2B15. Genomics. 2004;84:707–14.[PubMed][Google Scholar]
- 65. Xue Y, et al Spread of an inactive form of caspase-12 in humans is due to recent positive selection. Am J Hum Genet. 2006;78:659–70.[Google Scholar]
- 66. Voight BF, Kudaravalli S, Wen X, Pritchard JKA map of recent positive selection in the human genome. PLoS Biol. 2006;4:e72.[Google Scholar]
- 67. Rademakers R, Cruts M, van Broeckhoven CThe role of tau (MAPT) in frontotemporal dementia and related tauopathies. Hum Mutat. 2004;24:277–95.[PubMed][Google Scholar]
- 68. Stefansson H, et al A common inversion under selection in Europeans. Nat Genet. 2005;37:129–37.[PubMed][Google Scholar]
- 69. Sabeti PC, et al Detecting recent positive selection in the human genome from haplotype structure. Nature. 2002;419:832–7.[PubMed][Google Scholar]
- 70. Khaja R, et al Genome assembly comparison to identify structural variants in the human genome. Nat Genet. submitted.
- 71. Bennett EA, Coleman LE, Tsui C, Pittard WS, Devine SENatural genetic variation caused by transposable elements in humans. Genetics. 2004;168:933–51.[Google Scholar]
- 72. Kaufman L, Rousseeuw PJ Finding groups in data: an introduction to cluster analysis. Wiley; New York: 1990. [PubMed][Google Scholar]
- 73. Barrett JC, Fry B, Maller J, Daly MJHaploview: analysis and visualization of LD and haplotype maps. Bioinformatics. 2005;21:263–5.[PubMed][Google Scholar]
- 74. Walsh EC, et al Searching for signals of evolutionary selection in 168 genes related to immune function. Hum Genet. 2006;119:92–102.[PubMed][Google Scholar]
- 75. Petes TDMeiotic recombination hot spots and cold spots. Nat Rev Genet. 2001;2:360–9.[PubMed][Google Scholar]
- 76. Myers S, Bottolo L, Freeman C, McVean G, Donnelly PA fine-scale map of recombination rates and hotspots across the human genome. Science. 2005;310:321–4.[PubMed][Google Scholar]
- 77. Dermitzakis ET, et al Comparison of human chromosome 21 conserved nongenic sequences (CNGs) with the mouse and dog genomes shows that their selective constraint is independent of their genic environment. Genome Res. 2004;14:852–9.[Google Scholar]
