Microarrays Research - Experiments, Designs, Statistics, Analysis, Software

Microarrays Research Today is a free monthly online journal that collates and summarizes the latest research about Microarrays, including details on experiments, designs, statistics, analysis, software.


Microarrays Research Today

Home

View Latest Issue

Information About Microarrays

Books on Microarrays

Advertising in Research Today

View Other Research Today Publications



Statistical development and evaluation of microarray gene expression data filters.

Pounds S, Cheng C

Department of Biostatistics, St. Jude Children's Research Hospital, Memphis, TN 38105-2794, USA. stanley.pounds@stjude.org

Filtering is a common practice used to simplify the analysis of microarray data by removing from subsequent consideration probe sets believed to be unexpressed. The m/n filter, which is widely used in the analysis of Affymetrix data, removes all probe sets having fewer than m present calls among a set of n chips. The m/n filter has been widely used without considering its statistical properties. The level and power of the m/n filter are derived. Two alternative filters, the pooled p-value filter and the error-minimizing pooled p-value filter are proposed. The pooled p-value filter combines information from the present-absent p-values into a single summary p-value which is subsequently compared to a selected significance threshold. We show that pooled p-value filter is the uniformly most powerful statistical test under a reasonable beta model and that it exhibits greater power than the m/n filter in all scenarios considered in a simulation study. The error-minimizing pooled p-value filter compares the summary p-value with a threshold determined to minimize a total-error criterion based on a partition of the distribution of all probes' summary p-values. The pooled p-value and error-minimizing pooled p-value filters clearly perform better than the m/n filter in a case-study analysis. The case-study analysis also demonstrates a proposed method for estimating the number of differentially expressed probe sets excluded by filtering and subsequent impact on the final analysis. The filter impact analysis shows that the use of even the best filter may hinder, rather than enhance, the ability to discover interesting probe sets or genes. S-plus and R routines to implement the pooled p-value and error-minimizing pooled p-value filters have been developed and are available from www.stjuderesearch.org/depts/biostats/index.html.

Published 10 May 2005 in J Comput Biol, 12(4): 482-95.
Full-text of this article is available online (may require subscription).

Place a permanent text-link or advertisement here for just US$15.

© 2004-2008 Microarrays Research Today. All Rights Reserved.



Microarrays Research Today Archive:

Volume 1 (2004)
  Issue 1 (June)
  Issue 2 (July)
  Issue 3 (August)
  Issue 4 (September)
  Issue 5 (October)
  Issue 6 (November)
  Issue 7 (December)

Volume 2 (2005)
  Issue 1 (January)
  Issue 2 (February)
  Issue 3 (March)
  Issue 4 (April)
  Issue 5 (May)
  Issue 6 (June)
  Issue 7 (July)
  Issue 8 (August)
  Issue 9 (September)
  Issue 10 (October)
  Issue 11 (November)
  Issue 12 (December)

Volume 3 (2006)
  Issue 1 (January)
  Issue 2 (February)
  Issue 3 (March)
  Issue 4 (April)
  Issue 5 (May)
  Issue 6 (June)
  Issue 7 (July)
  Issue 8 (August)
  Issue 9 (September)
  Issue 10 (October)
  Issue 11 (November)
  Issue 12 (December)

Volume 4 (2007)
  Issue 1 (January)
  Issue 2 (February)
  Issue 3 (March)
  Issue 4 (April)
  Issue 5 (May)
  Issue 6 (June)
  Issue 7 (July)
  Issue 8 (August)
  Issue 9 (September)
  Issue 10 (October)
  Issue 11 (November)
  Issue 12 (December)

Volume 5 (2008)
  Issue 1 (January)
  Issue 2 (February)
  Issue 3 (March)
  Issue 4 (April)
  Issue 5 (May)
  Issue 6 (June)
  Issue 7 (July)
  Issue 8 (August)



Microarrays Books

Bioinformatics in Cancer and Cancer Therapy (Cancer Drug Discovery and Development)

Bioinformatics in Cancer and Cancer Therapy (Cancer Drug Discovery and Development)