Microarrays Research - Experiments, Designs, Statistics, Analysis, Software

Microarrays Research Today is a free monthly online journal that collates and summarizes the latest research about Microarrays, including details on experiments, designs, statistics, analysis, software.


Microarrays Research Today

Home

View Latest Issue

Information About Microarrays

Books on Microarrays

Advertising in Research Today

View Other Research Today Publications



Clustering threshold gradient descent regularization: with applications to microarray studies.

Ma S, Huang J

Department of Epidemiology and Public Health, Yale University, New Haven, CT, USA. shuangge.ma@yale.edu

MOTIVATION: An important goal of microarray studies is to discover genes that are associated with clinical outcomes, such as disease status and patient survival. While a typical experiment surveys gene expressions on a global scale, there may be only a small number of genes that have significant influence on a clinical outcome. Moreover, expression data have cluster structures and the genes within a cluster have correlated expressions and coordinated functions, but the effects of individual genes in the same cluster may be different. Accordingly, we seek to build statistical models with the following properties. First, the model is sparse in the sense that only a subset of the parameter vector is non-zero. Second, the cluster structures of gene expressions are properly accounted for. RESULTS: For gene expression data without pathway information, we divide genes into clusters using commonly used methods, such as K-means or hierarchical approaches. The optimal number of clusters is determined using the Gap statistic. We propose a clustering threshold gradient descent regularization (CTGDR) method, for simultaneous cluster selection and within cluster gene selection. We apply this method to binary classification and censored survival analysis. Compared to the standard TGDR and other regularization methods, the CTGDR takes into account the cluster structure and carries out feature selection at both the cluster level and within-cluster gene level. We demonstrate the CTGDR on two studies of cancer classification and two studies correlating survival of lymphoma patients with microarray expressions. AVAILABILITY: R code is available upon request. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Published 15 February 2007 in Bioinformatics, 23(4): 466-72.
Full-text of this article is available online (may require subscription).

Place a permanent text-link or advertisement here for just US$15.

© 2004-2008 Microarrays Research Today. All Rights Reserved.



Microarrays Research Today Archive:

Volume 1 (2004)
  Issue 1 (June)
  Issue 2 (July)
  Issue 3 (August)
  Issue 4 (September)
  Issue 5 (October)
  Issue 6 (November)
  Issue 7 (December)

Volume 2 (2005)
  Issue 1 (January)
  Issue 2 (February)
  Issue 3 (March)
  Issue 4 (April)
  Issue 5 (May)
  Issue 6 (June)
  Issue 7 (July)
  Issue 8 (August)
  Issue 9 (September)
  Issue 10 (October)
  Issue 11 (November)
  Issue 12 (December)

Volume 3 (2006)
  Issue 1 (January)
  Issue 2 (February)
  Issue 3 (March)
  Issue 4 (April)
  Issue 5 (May)
  Issue 6 (June)
  Issue 7 (July)
  Issue 8 (August)
  Issue 9 (September)
  Issue 10 (October)
  Issue 11 (November)
  Issue 12 (December)

Volume 4 (2007)
  Issue 1 (January)
  Issue 2 (February)
  Issue 3 (March)
  Issue 4 (April)
  Issue 5 (May)
  Issue 6 (June)
  Issue 7 (July)
  Issue 8 (August)
  Issue 9 (September)
  Issue 10 (October)
  Issue 11 (November)
  Issue 12 (December)

Volume 5 (2008)
  Issue 1 (January)
  Issue 2 (February)
  Issue 3 (March)
  Issue 4 (April)
  Issue 5 (May)
  Issue 6 (June)
  Issue 7 (July)
  Issue 8 (August)



Microarrays Books

The Analysis of Gene Expression Data

The Analysis of Gene Expression Data