Summaries of Affymetrix GeneChip probe level data.
Journal: 2003/March - Nucleic Acids Research
ISSN: 1362-4962
PUBMED: 12582260
Abstract:
High density oligonucleotide array technology is widely used in many areas of biomedical research for quantitative and highly parallel measurements of gene expression. Affymetrix GeneChip arrays are the most popular. In this technology each gene is typically represented by a set of 11-20 pairs of probes. In order to obtain expression measures it is necessary to summarize the probe level data. Using two extensive spike-in studies and a dilution study, we developed a set of tools for assessing the effectiveness of expression measures. We found that the performance of the current version of the default expression measure provided by Affymetrix Microarray Suite can be significantly improved by the use of probe level summaries derived from empirically motivated statistical models. In particular, improvements in the ability to detect differentially expressed genes are demonstrated.
Relations:
Content
Citations
(2K+)
References
(13)
Chemicals
(1)
Organisms
(1)
Anatomy
(2)
Affiliates
(1)
Similar articles
Articles by the same authors
Discussion board
Nucleic Acids Res 31(4): e15

Summaries of Affymetrix GeneChip probe level data

Department of Biostatistics, Johns Hopkins University, Baltimore, MD 21205, USA, Biostatistics Group, University of California, Berkeley, CA, USA, Gene Logic Inc., Berkeley, CA, USA, Mathematical Sciences Department, Johns Hopkins University, Baltimore, MD, USA, Division of Genetics and Bioinformatics, WEHI, Melbourne, Australia and Department of Statistics, University of California, Berkeley, CA, USA
To whom correspondence should be addressed. Tel: +1 410 614 5157; Fax: +1 410 955 0958; Email: ude.uhj@afar
Received 2002 Oct 8; Revised 2002 Nov 25; Accepted 2002 Nov 25.

Abstract

High density oligonucleotide array technology is widely used in many areas of biomedical research for quantitative and highly parallel measurements of gene expression. Affymetrix GeneChip arrays are the most popular. In this technology each gene is typically represented by a set of 11–20 pairs of probes. In order to obtain expression measures it is necessary to summarize the probe level data. Using two extensive spike-in studies and a dilution study, we developed a set of tools for assessing the effectiveness of expression measures. We found that the performance of the current version of the default expression measure provided by Affymetrix Microarray Suite can be significantly improved by the use of probe level summaries derived from empirically motivated statistical models. In particular, improvements in the ability to detect differentially expressed genes are demonstrated.

Abstract

ACKNOWLEDGEMENTS

We would like to thank GeneLogic and Affymetrix for the data, in particular, Uwe Scherf, Yasmin D. Beazer-Barclay and Kristen J. Antonellis (GeneLogic). We would also like to thank the R core, Bioconductor and Laurent Gautier (Technical University of Denmark) for writing great code and Ron Brookmeyer, Thomas Cappola, Sabra Klein, Scott Zeger (Johns Hopkins University), Ken Simpson, Sam Wormald (Walter and Eliza Hall Institute), Cheng Li (Harvard University) and Earl Hubbell (Affymetrix) for their insightful comments. The work of R.I. is supported by the PGA U01 HL66583.

ACKNOWLEDGEMENTS

REFERENCES

REFERENCES

References

  • 1. Lockhart D., Dong,H., Byrne,M., Follettie,M., Gallo,M., Chee M., Mittmann,M., Wang,C., Kobayashi,M., Horton,H. et al. (1996) Expression monitoring by hybridization to high-density oligonucleotide arrays. Nat. Biotechnol., 14, 1675–1680. [[PubMed]
  • 2. Lipshutz R., Fodor,S., Gingeras,T. and Lockhart D. (1999) High density synthetic oligonucleotide arrays. Nature Genet., Suppl. 21, 20–24. [[PubMed]
  • 3. Affymetrix (1999) Microarray Suite User Guide, Version 4. Affymetrix, .[PubMed]
  • 4. Irizarry R., Hobbs,B., Collin,F., Beazer-Barclay,Y., Antonellis,K., Scherf,U. and Speed,T. (2003) Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics, in press. [[PubMed]
  • 5. Affymetrix (2001) Microarray Suite User Guide, Version 5. Affymetrix, .[PubMed]
  • 6. Naef F., Lim,D.A., Patil,N. and Magnasco,M. (2002) DNA hybridization to mismatched templates: a chip study. Phys. Rev. E Stat. Nonlin. Soft Matter Phys., 65, 040902. [[PubMed]
  • 7. Li C. and Wong,W. (2001) Model-based analysis of oligonucleotide arrays: expression index computation and outlier detection. Proc. Natl Acad. Sci. USA, 98, 31–36.
  • 8. Tukey J(1977) Exploratory Data Analysis. Addison-Wesley, Reading, MA.
  • 9. Bolstad B., Irizarry,R., Åstrand,M. and Speed,T. (2003) A comparison of normalization methods for high density oligonucleotide array data based on variance and bias. Bioinformatics, in press. [[PubMed]
  • 10. Naef F., Hacker,C., Patil N. and Magnasco,M. (2002) Empirical characterization of the expression ratio noise structure in high-density oligonucleotide arrays. Genome Biol., 3, RESEARCH0018.
  • 11. Hill A., Brown,E., Whitley,M., Tucker-Kellogg,G., Hunter G. and Slonim,D. (2001) Evaluation of normalization procedures for oligonucleotide array data based on spiked cRNA controls. Genome Biol., 2, RESEARCH0055.
  • 12. Chudin E., Walker,R., Kosaka,A., Wu,S., Rabert,D., Chang,T. and Kreder,D. (2001) Assessment of the relationship between signal intensities and transcript concentration for Affymetrix GeneChip® arrays. Genome Biol., 3, RESEARCH0005.
  • 13. Lemon W., Palatini,J., Krahe,R. and Wright,F. (2002) Theoretical and experimental comparisons of gene expression indices for oligonucleotide arrays. Bioinformatics, 18, 1470–1476. [[PubMed]
  • 14. Hill A., Hunter,C., Tsung,B., Tucker-Kellogg,G. and Brown,E. (2000) Genomic analysis of gene expression in C. elegans. Science, 290, 809–812. [[PubMed]
  • 15. Baugh L., Hill,A., Brown,E. and Hunter C. (2001) Quantitative analysis of mRNA amplification by in vitro transcription. Nucleic Acids Res., 29, 1–9.
  • 16. Ihaka R. and Gentleman,R. (1996) R: a language for data analysis and graphics. J. Comput. Graph. Stat., 5, 299–314. [PubMed]
  • 17. Irizarry R., Gautier,L. and Cope,L. (2003) An R package for analyses of Affymetrix oligonucleotide arrays. In Parmigiani,G., Garrett,E.S., Irizarry,R.A. and Zeger,S.L. (eds), The Analysis of Gene Expression Data: Methods and Software. Springer, in press.
  • 18. Dudoit S., Yang,Y., Luu,P., Lin,D., Peng,V., Ngai,J. and Speed,T. (2002) Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variation. Nucleic Acids Res., 30, e15.
Collaboration tool especially designed for Life Science professionals.Drag-and-drop any entity to your messages.