Microarrays Research - Experiments, Designs, Statistics, Analysis, Software

Microarrays Research Today is a free monthly online journal that collates and summarizes the latest research about Microarrays, including details on experiments, designs, statistics, analysis, software.


Microarrays Research Today

Home

View Latest Issue

Information About Microarrays

Books on Microarrays

Advertising in Research Today

View Other Research Today Publications



Supervised group Lasso with applications to microarray data analysis.

Ma S, Song X, Huang J

Department of Epidemiology and Public Health, Yale University, New Haven, CT 06520, USA. shuangge.ma@yale.edu

BACKGROUND: A tremendous amount of efforts have been devoted to identifying genes for diagnosis and prognosis of diseases using microarray gene expression data. It has been demonstrated that gene expression data have cluster structure, where the clusters consist of co-regulated genes which tend to have coordinated functions. However, most available statistical methods for gene selection do not take into consideration the cluster structure. RESULTS: We propose a supervised group Lasso approach that takes into account the cluster structure in gene expression data for gene selection and predictive model building. For gene expression data without biological cluster information, we first divide genes into clusters using the K-means approach and determine the optimal number of clusters using the Gap method. The supervised group Lasso consists of two steps. In the first step, we identify important genes within each cluster using the Lasso method. In the second step, we select important clusters using the group Lasso. Tuning parameters are determined using V-fold cross validation at both steps to allow for further flexibility. Prediction performance is evaluated using leave-one-out cross validation. We apply the proposed method to disease classification and survival analysis with microarray data. CONCLUSION: We analyze four microarray data sets using the proposed approach: two cancer data sets with binary cancer occurrence as outcomes and two lymphoma data sets with survival outcomes. The results show that the proposed approach is capable of identifying a small number of influential gene clusters and important genes within those clusters, and has better prediction performance than existing methods.

Published 14 March 2007 in BMC Bioinformatics, 8: 60.
Full-text of this article is available online (may require subscription).

Place a permanent text-link or advertisement here for just US$15.

© 2004-2008 Microarrays Research Today. All Rights Reserved.



Microarrays Research Today Archive:

Volume 1 (2004)
  Issue 1 (June)
  Issue 2 (July)
  Issue 3 (August)
  Issue 4 (September)
  Issue 5 (October)
  Issue 6 (November)
  Issue 7 (December)

Volume 2 (2005)
  Issue 1 (January)
  Issue 2 (February)
  Issue 3 (March)
  Issue 4 (April)
  Issue 5 (May)
  Issue 6 (June)
  Issue 7 (July)
  Issue 8 (August)
  Issue 9 (September)
  Issue 10 (October)
  Issue 11 (November)
  Issue 12 (December)

Volume 3 (2006)
  Issue 1 (January)
  Issue 2 (February)
  Issue 3 (March)
  Issue 4 (April)
  Issue 5 (May)
  Issue 6 (June)
  Issue 7 (July)
  Issue 8 (August)
  Issue 9 (September)
  Issue 10 (October)
  Issue 11 (November)
  Issue 12 (December)

Volume 4 (2007)
  Issue 1 (January)
  Issue 2 (February)
  Issue 3 (March)
  Issue 4 (April)
  Issue 5 (May)
  Issue 6 (June)
  Issue 7 (July)
  Issue 8 (August)
  Issue 9 (September)
  Issue 10 (October)
  Issue 11 (November)
  Issue 12 (December)

Volume 5 (2008)
  Issue 1 (January)
  Issue 2 (February)
  Issue 3 (March)
  Issue 4 (April)
  Issue 5 (May)
  Issue 6 (June)
  Issue 7 (July)
  Issue 8 (August)
  Issue 9 (September)



Microarrays Books

Bioinformatics in Cancer and Cancer Therapy (Cancer Drug Discovery and Development)

Bioinformatics in Cancer and Cancer Therapy (Cancer Drug Discovery and Development)