Microarrays Research Today is a free monthly online journal that collates and summarizes the latest research about Microarrays, including details on experiments, designs, statistics, analysis, software. | ||||||||
|
Development of Two-Stage SVM-RFE Gene Selection Strategy for Microarray Expression Data Analysis.Tang Y, Zhang YQ, Huang Z
Extracting a subset of informative genes from microarray expression data is a critical data preparation step in cancer classification and other biological function analyses. Though many algorithms have been developed, the Support Vector Machine - Recursive Feature Elimination (SVM-RFE) algorithm is one of the best gene feature selection algorithms. It assumes that a smaller "filter-out" factor in the SVM-RFE, which results in a smaller number of gene features eliminated in each recursion, should lead to extraction of a better gene subset. Because the SVM-RFE is highly sensitive to the "filter-out" factor, our simulations have shown that this assumption is not always correct and that the SVM-RFE is an unstable algorithm. To select a set of key gene features for reliable prediction of cancer types or subtypes and other applications, a new two-stage SVM-RFE algorithm has been developed. It is designed to effectively eliminate most of the irrelevant, redundant and noisy genes while keeping information loss small at the first stage. A fine selection for the final gene subset is then performed at the second stage. The two-stage SVM-RFE overcomes the instability problem of the SVM-RFE to achieve better algorithm utility. We have demonstrated that the two-stage SVM-RFE is significantly more accurate and more reliable than the SVM-RFE and three correlation-based methods based on our analysis of three publicly available microarray expression datasets. Furthermore, the two-stage SVM-RFE is computationally efficient because its time complexity is $O(d * log{_2d})$, where $d$ is the size of the original gene set. Published 1 August 2007 in IEEE/ACM Trans Comput Biol Bioinform, 4(3): 365-81.
© 2004-2008 Microarrays Research Today. All Rights Reserved. |
| ||||||