Global variation in copy number in the human genome.
Journal: 2006/December - Nature
ISSN: 1476-4687
Abstract:
Copy number variation (CNV) of DNA sequences is functionally significant but has yet to be fully ascertained. We have constructed a first-generation CNV map of the human genome through the study of 270 individuals from four populations with ancestry in Europe, Africa or Asia (the HapMap collection). DNA from these individuals was screened for CNV using two complementary technologies: single-nucleotide polymorphism (SNP) genotyping arrays, and clone-based comparative genomic hybridization. A total of 1,447 copy number variable regions (CNVRs), which can encompass overlapping or adjacent gains or losses, covering 360 megabases (12% of the genome) were identified in these populations. These CNVRs contained hundreds of genes, disease loci, functional elements and segmental duplications. Notably, the CNVRs encompassed more nucleotide content per genome than SNPs, underscoring the importance of CNV in genetic diversity and evolution. The data obtained delineate linkage disequilibrium patterns for many CNVs, and reveal marked variation in copy number among populations. We also demonstrate the utility of this resource for genetic disease studies.
Relations:
Content
Citations
(1K+)
References
(73)
Organisms
(1)
Processes
(6)
Similar articles
Articles by the same authors
Discussion board
Nature 444(7118): 444-454

Global variation in copy number in the human genome

+34 authors
The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
Genome Science, Research Center for Advanced Science and Technology, University of Tokyo, 4-6-1 Komaba, Meguro, Tokyo 153-8904 Japan
Affymetrix, Inc., Santa Clara, CA, USA
The Centre for Applied Genomics and Program in Genetics and Genomic Biology, The Hospital for Sick Children, MaRS Centre- East Tower, 101 College Street, Rm. 14-701, Toronto, Ontario, M5G 1L7, Canada
Department of Molecular and Medical Genetics, University of Toronto
Department of Pathology, Brigham and Women's Hospital, Boston, MA, USA 02115
Genes and Disease Program, and Barcelona CeGen Unit, Center for Genomic Regulation, Barcelona, Catalonia, Spain
Dependable and High Performance Computing, Research Center for Advanced Science and Technology, University of Tokyo, 4-6-1 Komaba, Meguro, Tokyo 153-8904 Japan
Departments of Medical Genetics and Pediatrics, University of Alberta, Edmonton, Canada
Department of Human Genetics, University of Chicago, 920 East 58th Street, Chicago, Illinois, USA
Department of Life and Health Sciences, Pompeu Fabra University, Barcelona, Catalonia, Spain
Japan Science and Technology Agency Kawaguchi, Saitama, 332-0012, Japan
Harvard Medical School, Boston, MA, USA 02115
Senior Corresponding Authors: Matthew E. Hurles, The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK, E-mail: ku.ca.regnas@hem
Stephen W. Scherer, The Centre for Applied Genomics, The Hospital for Sick Children, MaRS Centre- East Tower, 101 College Street, Rm. 14-701, Toronto, Ontario, M5G 1L7, Canada, E-mail: ac.no.sdikkcis.teneg@evets
First author equal contribution
Principal Investigator equal contribution

Abstract

Copy number variation (CNV) of DNA sequences is functionally significant but has yet to be fully ascertained. We have constructed a first-generation CNV map of the human genome through the study of 270 individuals from four populations with ancestry in Europe, Africa or Asia (the HapMap collection). DNA from these individuals was screened for CNV using two complementary technologies: single nucleotide polymorphism (SNP) genotyping arrays, and clone-based comparative genomic hybridization. 1,447 copy number variable regions covering 360 megabases (12% of the genome) were identified in these populations; these CNV regions contained hundreds of genes, disease loci, functional elements and segmental duplications. Strikingly, these CNVs encompassed more nucleotide content per genome than SNPs, underscoring the importance of CNV in genetic diversity and evolution. The data obtained delineate linkage disequilibrium patterns for many CNVs, and reveal dramatic variation in copy number among populations. We also demonstrate the utility of this resource for genetic disease studies.

Abstract

References

  • 1. Iafrate AJ, et al Detection of large-scale variation in the human genome. Nat Genet. 2004;36:949–51.[PubMed][Google Scholar]
  • 2. Sebat J, et al Large-scale copy number polymorphism in the human genome. Science. 2004;305:525–8.[PubMed][Google Scholar]
  • 3. Sharp AJ, et al Segmental duplications and copy-number variation in the human genome. Am J Hum Genet. 2005;77:78–88.[Google Scholar]
  • 4. Tuzun E, et al Fine-scale structural variation of the human genome. Nat Genet. 2005;37:727–732.[PubMed][Google Scholar]
  • 5. Conrad DF, Andrews TD, Carter NP, Hurles ME, Pritchard JKA high-resolution survey of deletion polymorphism in the human genome. Nat Genet. 2006;38:75–81.[PubMed][Google Scholar]
  • 6. Hinds DA, Kloek AP, Jen M, Chen X, Frazer KACommon deletions and SNPs are in linkage disequilibrium in the human genome. Nat Genet. 2006;38:82–5.[PubMed][Google Scholar]
  • 7. McCarroll SA, et al Common deletion polymorphisms in the human genome. Nat Genet. 2006;38:86–92.[PubMed][Google Scholar]
  • 8. Locke DP, et al Linkage Disequilibrium and Heritability of CNPs within Duplicated Regions of the Human Genome. Am J Hum Genet. 2006;79:275–290.[Google Scholar]
  • 9. Fredman D, et al Complex SNP-related sequence variation in segmental genome duplications. Nat Genet. 2004;36:861–6.[PubMed][Google Scholar]
  • 10. Feuk L, Carson AR, Scherer SWStructural variation in the human genome. Nat Rev Genet. 2006;7:85–97.[PubMed][Google Scholar]
  • 11. Freeman JL, et al Copy Number Variation: New Insights in Genome Diversity. Genome Research. 2006;16:949–961.[PubMed][Google Scholar]
  • 12. Bridges CBThe Bar ‘gene’: a duplication. Science. 1936;83:210–211.[PubMed][Google Scholar]
  • 13. Buckland PRPolymorphically duplicated genes: their relevance to phenotypic variation in humans. Ann Med. 2003;35:308–15.[PubMed][Google Scholar]
  • 14. Nguyen DQ, Webber C, Ponting CPBias of Selection on Human Copy-Number Variants. PLoS Genet. 2006;2:e20.[Google Scholar]
  • 15. Repping S, et al High mutation rates have driven extensive structural polymorphism among human Y chromosomes. Nat Genet. 2006;38:463–7.[PubMed][Google Scholar]
  • 16. Inoue K, Lupski JRMolecular mechanisms for genomic disorders. Annu Rev Genomics Hum Genet. 2002;3:199–242.[PubMed][Google Scholar]
  • 17. Lupski JR, Stankiewicz PGenomic disorders: molecular mechanisms for rearrangements and conveyed phenotypes. PLoS Genet. 2005;1:e49.[Google Scholar]
  • 18. Shaw-Smith C, et al Microarray based comparative genomic hybridisation (array-CGH) detects submicroscopic chromosomal deletions and duplications in patients with learning disability/mental retardation and dysmorphic features. J Med Genet. 2004;41:241–8.[Google Scholar]
  • 19. Gonzalez E, et al The influence of CCL3L1 gene-containing segmental duplications on HIV-1/AIDS susceptibility. Science. 2005;307:1434–40.[PubMed][Google Scholar]
  • 20. Aitman TJ, et al Copy number polymorphism in Fcgr3 predisposes to glomerulonephritis in rats and humans. Nature. 2006;439:851–5.[PubMed][Google Scholar]
  • 21. Jongmans MC, et al CHARGE syndrome: the phenotypic spectrum of mutations in the CHD7 gene. J Med Genet. 2006;43:306–14.[Google Scholar]
  • 22. Singleton AB, et al alpha-Synuclein locus triplication causes Parkinson's disease. Science. 2003;302:841.[PubMed][Google Scholar]
  • 23. Rovelet-Lecrux A, et al APP locus duplication causes autosomal dominant early-onset Alzheimer disease with cerebral amyloid angiopathy. Nat Genet. 2006;38:24–6.[PubMed][Google Scholar]
  • 24. Feuk L, Marshall CR, Wintle RF, Scherer SWStructural variants: changing the landscape of chromosomes and design of disease studies. Hum Mol Genet. 2006;15(Suppl 1):R57–66.[PubMed][Google Scholar]
  • 25. IHMC A haplotype map of the human genome. Nature. 2005;437:1299–320.
  • 26. Fiegler H, et al Accurate and reliable high-throughput detection of copy number variation in the human genome. Genome Res. in press.
  • 27. Ylstra B, van den Ijssel P, Carvalho B, Brakenhoff RH, Meijer GABAC to the future! or oligonucleotides: a perspective for micro array comparative genomic hybridization (array CGH) Nucleic Acids Res. 2006;34:445–50.[Google Scholar]
  • 28. Komura D, et al Genome-Wide Detection of Human Copy Number Variants using High Density DNA Oligonucleotide Arrays. Genome Res. in press.
  • 29. de Vries BB, et al Diagnostic genome profiling in mental retardation. Am J Hum Genet. 2005;77:606–16.[Google Scholar]
  • 30. Wirtenberger M, Hemminki K, Chen B, Burwinkel BSNP microarray analysis for genome-wide detection of crossover regions. Hum Genet. 2005;117:389–97.[PubMed][Google Scholar]
  • 31. Marth GT, Czabarka E, Murvai J, Sherry STThe allele frequency spectrum in genome-wide human variation data reveals signals of differential demographic history in three large world populations. Genetics. 2004;166:351–72.[Google Scholar]
  • 32. Hurles MHow homologous recombination generates a mutable genome. Hum Genomics. 2005;2:179–86.[Google Scholar]
  • 33. Stankiewicz P, Lupski JRGenome architecture, rearrangements and genomic disorders. Trends Genet. 2002;18:74–82.[PubMed][Google Scholar]
  • 34. Bailey JA, et al Recent segmental duplications in the human genome. Science. 2002;297:1003–7.[PubMed][Google Scholar]
  • 35. Cheung J, et al Genome-wide detection of segmental duplications and potential assembly errors in the human genome sequence. Genome Biol. 2003;4:R25.[Google Scholar]
  • 36. Aradhya S, et al Multiple pathogenic and benign genomic rearrangements occur at a 35 kb duplication involving the NEMO and LAGE2 genes. Hum Mol Genet. 2001;10:2557–67.[PubMed][Google Scholar]
  • 37. Neitz M, Neitz JNumbers and ratios of visual pigment genes for normal red-green color vision. Science. 1995;267:1013–6.[PubMed][Google Scholar]
  • 38. Chance PF, et al 2 Autosomal-Dominant Neuropathies Result from Reciprocal DNA Duplication/Deletion of a Region on Chromosome-17. Hum Mol Genet. 1994;3:223–228.[PubMed][Google Scholar]
  • 39. Lupski JR, Stankiewicz P, editors. Genomic Disorders: the genomic basis of disease. Humana Press; Totawa, New Jersey: 2006. [PubMed]
  • 40. Deininger PL, Batzer MAAlu repeats and human disease. Mol Genet Metab. 1999;67:183–93.[PubMed][Google Scholar]
  • 41. Kurtz S, et al REPuter: the manifold applications of repeat analysis on a genomic scale. Nucleic Acids Res. 2001;29:4633–42.[Google Scholar]
  • 42. Hurles MGene duplication: the genomic trade in spare parts. PLoS Biol. 2004;2:E206.[Google Scholar]
  • 43. Drake JA, et al Conserved noncoding sequences are selectively constrained and not mutation cold spots. Nat Genet. 2006;38:223–7.[PubMed][Google Scholar]
  • 44. Bejerano G, et al Ultraconserved elements in the human genome. Science. 2004;304:1321–5.[PubMed][Google Scholar]
  • 45. Brewer C, Holloway S, Zawalnyski P, Schinzel A, FitzPatrick DA chromosomal duplication map of malformations: regions of suspected haplo- and triplolethality–and tolerance of segmental aneuploidy–in humans. Am J Hum Genet. 1999;64:1702–8.[Google Scholar]
  • 46. Sokal RR, Rohlf JF Biometry. W.H. Freeman and Co; 1994. [PubMed][Google Scholar]
  • 47. Futreal PA, et al A census of human cancer genes. Nat Rev Cancer. 2004;4:177–83.[Google Scholar]
  • 48. Millar JK, et al DISC1 and PDE4B are interacting genetic factors in schizophrenia that regulate cAMP signaling. Science. 2005;310:1187–91.[PubMed][Google Scholar]
  • 49. Sasaki K, Sasaki H, Jonasson F, Kojima M, Cheng HMRacial differences of lens transparency properties with aging and prevalence of age-related cataract applying a WHO classification system. Ophthalmic Res. 2004;36:332–40.[PubMed][Google Scholar]
  • 50. Valentonyte R, et al Sarcoidosis is associated with a truncating splice site mutation in BTNL2. Nat Genet. 2005;37:357–64.[PubMed][Google Scholar]
  • 51. Orru S, Giuressi E, Carcassi C, Casula M, Contu LMapping of the major psoriasis-susceptibility locus (PSORS1) in a 70-Kb interval around the corneodesmosin gene (CDSN) Am J Hum Genet. 2005;76:164–71.[Google Scholar]
  • 52. Christiansen J, et al Chromosome 1q21.1 contiguous gene deletion is associated with congenital heart disease. Circ Res. 2004;94:1429–35.[PubMed][Google Scholar]
  • 53. Lupski JR, et al DNA duplication associated with Charcot-Marie-Tooth disease type 1A. Cell. 1991;66:219–32.[PubMed][Google Scholar]
  • 54. Chance PF, et al DNA deletion associated with hereditary neuropathy with liability to pressure palsies. Cell. 1993;72:143–51.[PubMed][Google Scholar]
  • 55. Matise TC, et al Detection of tandem duplications and implications for linkage analysis. Am J Hum Genet. 1994;54:1110–21.[Google Scholar]
  • 56. Hirschhorn JN, Daly MJGenome-wide association studies for common diseases and complex traits. Nat Rev Genet. 2005;6:95–108.[PubMed][Google Scholar]
  • 57. Wong Z, Royle NJ, Jeffreys AJA novel human DNA polymorphism resulting from transfer of DNA from chromosome 6 to chromosome 16. Genomics. 1990;7:222–34.[PubMed][Google Scholar]
  • 58. Woodward KJ, et al Heterogeneous duplications in patients with Pelizaeus-Merzbacher disease suggest a mechanism of coupled homologous and nonhomologous recombination. Am J Hum Genet. 2005;77:966–87.[Google Scholar]
  • 59. Rosenberg NA, et al Genetic structure of human populations. Science. 2002;298:2381–5.[PubMed][Google Scholar]
  • 60. Bamshad MJ, et al Human population genetic structure and inference of group membership. Am J Hum Genet. 2003;72:578–89.[Google Scholar]
  • 61. Pritchard JK, Stephens M, Donnelly PInference of population structure using multilocus genotype data. Genetics. 2000;155:945–59.[Google Scholar]
  • 62. Weir BS, Cockerham CCEstimating F-statistics for the analysis of population structure. Evolution. 1984;38:1358–1370.[PubMed][Google Scholar]
  • 63. Turgeon D, Carrier JS, Chouinard S, Belanger AGlucuronidation activity of the UGT2B17 enzyme toward xenobiotics. Drug Metab Dispos. 2003;31:670–6.[PubMed][Google Scholar]
  • 64. Wilson W, 3rd, et al Characterization of a common deletion polymorphism of the UGT2B17 gene linked to UGT2B15. Genomics. 2004;84:707–14.[PubMed][Google Scholar]
  • 65. Xue Y, et al Spread of an inactive form of caspase-12 in humans is due to recent positive selection. Am J Hum Genet. 2006;78:659–70.[Google Scholar]
  • 66. Voight BF, Kudaravalli S, Wen X, Pritchard JKA map of recent positive selection in the human genome. PLoS Biol. 2006;4:e72.[Google Scholar]
  • 67. Rademakers R, Cruts M, van Broeckhoven CThe role of tau (MAPT) in frontotemporal dementia and related tauopathies. Hum Mutat. 2004;24:277–95.[PubMed][Google Scholar]
  • 68. Stefansson H, et al A common inversion under selection in Europeans. Nat Genet. 2005;37:129–37.[PubMed][Google Scholar]
  • 69. Sabeti PC, et al Detecting recent positive selection in the human genome from haplotype structure. Nature. 2002;419:832–7.[PubMed][Google Scholar]
  • 70. Khaja R, et al Genome assembly comparison to identify structural variants in the human genome. Nat Genet. submitted.
  • 71. Bennett EA, Coleman LE, Tsui C, Pittard WS, Devine SENatural genetic variation caused by transposable elements in humans. Genetics. 2004;168:933–51.[Google Scholar]
  • 72. Kaufman L, Rousseeuw PJ Finding groups in data: an introduction to cluster analysis. Wiley; New York: 1990. [PubMed][Google Scholar]
  • 73. Barrett JC, Fry B, Maller J, Daly MJHaploview: analysis and visualization of LD and haplotype maps. Bioinformatics. 2005;21:263–5.[PubMed][Google Scholar]
  • 74. Walsh EC, et al Searching for signals of evolutionary selection in 168 genes related to immune function. Hum Genet. 2006;119:92–102.[PubMed][Google Scholar]
  • 75. Petes TDMeiotic recombination hot spots and cold spots. Nat Rev Genet. 2001;2:360–9.[PubMed][Google Scholar]
  • 76. Myers S, Bottolo L, Freeman C, McVean G, Donnelly PA fine-scale map of recombination rates and hotspots across the human genome. Science. 2005;310:321–4.[PubMed][Google Scholar]
  • 77. Dermitzakis ET, et al Comparison of human chromosome 21 conserved nongenic sequences (CNGs) with the mouse and dog genomes shows that their selective constraint is independent of their genic environment. Genome Res. 2004;14:852–9.[Google Scholar]
Collaboration tool especially designed for Life Science professionals.Drag-and-drop any entity to your messages.