Microarray Data Analysis
An (Opinionated) Guide to Microarray Data Analysis
National Cancer Institute, and Karolinska Instutitute, Dept. of Biosciences
The aim of these pages is to set out the author’s opinions about the best ways to deal with the most common issues in microarray data analysis. The opinions are based on experience with dozens of collaborating lab scientists, and discussions with microarray statisticians. Contributions, disputes, and opinions are welcome (and may be posted!). These pages are intended primarily for people working in a microarray facility. The focus is on practical biological issues. There is no attempt to cover all the fancy data mining algorithms that have been developed. However I hope that statisticians and programmers will also find interesting material here.
This site is organized as follows
This section discusses how many replicates are needed, whether to pool samples, and designs for two color arrays.
This section discusses steps along the way from quantification of the image, through background correction, quality control, and normalization, to obtain reliable estimates of relative gene abundance in the samples.
The Affymetrix multiple-probe system offers a unique set of statistical challenges before one even gets to interpreting biological meaning. This section discusses image quantification and background, MAS5.0, normalization, multi-chip algorithms for estimation, and model-based quality control..
A preliminary examination of data confirms that groups are homogeneous. Many studies aim to find unknown co-regulated genes. This section discusses how to achieve these goals using multi-dimensional scaling, and clustering.
This section discusses methods and problems in determining which genes are differentially expressed between groups of samples, covering t-tests, multiple-testing corrections, false discovery rate, empirical Bayes methods, and analysis of variance.
For descriptions of the technology see the Nature Genetics Chipping Report supplements
This web site is a development of the Genomics and Pharmacology Facility, Developmental Therapeutics Branch (DTB), Center for Cancer Research (CCR), National Cancer Institute (NCI).