The Open University

Missing Value Imputation Framework for Microarray Significant Gene Selection and Class Prediction

Sehgal, Shoaib; Gondal, Iqbal and Dooley, Laurence (2006). Missing Value Imputation Framework for Microarray Significant Gene Selection and Class Prediction. In: ed. Data Mining for Biomedical Applications. Lecture Notes in Computer Science, 3916. Berlin: Springer Verlag, pp. 131–142.

URL: http://www.springerlink.com/content/t37w704081mw38...
DOI (Digital Object Identifier) Link: http://dx.doi.org/10.1007/11691730_14

Abstract

Microarray data is used in a large number of applications ranging from diagnosis through to drug discovery. Such data, however, often contains multiple missing genetic expressions which are generally ignored, thus degrading the reliability of inferred results. This paper presents an innovative and robust imputation framework that more accurately estimates missing values, leading subsequently to better gene selection and class prediction. To prove this premise, several missing value techniques including the Collateral Missing Values Estimation (CMVE), Bayesian Principal Component Analysis (BPCA), Least Square Impute (LSImpute), k-Nearest Neighbour (KNN) and ZeroImpute are analysed. A combination of univariate and multiple gene selection methods, namely Between Group to within Group Sum of Squares and Weighted Partial Least Squares, is then performed before applying class prediction using the Ridge Partial Least Square method. Overall, CMVE imputation consistently provided superior missing values estimation accuracy compared with the other algorithms examined, by virtue of exploiting local and global as well as positive and negative correlations between genes, with all empirical results being corroborated by the two-sided Wilcoxon Rank sum statistical significance test.
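As a rough illustration of two of the baseline imputation methods and the univariate selection score named in the abstract, the following is a minimal sketch in Python with NumPy. It is not the chapter's implementation and does not cover CMVE, BPCA, or LSImpute. The function names (`zero_impute`, `knn_impute`, `bss_wss`), the genes-by-samples matrix layout, the distance measure, and the default `k` are all assumptions made for this example; the Between Group to within Group Sum of Squares score is written in its common per-gene ratio form.

```python
import numpy as np


def zero_impute(X):
    """ZeroImpute baseline: replace every missing entry (NaN) with 0."""
    X = np.asarray(X, dtype=float)
    return np.where(np.isnan(X), 0.0, X)


def knn_impute(X, k=2):
    """Fill each NaN with the mean of that column over the k nearest genes.

    Rows are genes, columns are samples (an assumed layout). Distance is the
    root mean squared difference over columns observed in both rows.
    """
    X = np.asarray(X, dtype=float)
    out = X.copy()
    for i in np.flatnonzero(np.isnan(X).any(axis=1)):
        neighbours = []
        for j in range(X.shape[0]):
            if j == i:
                continue
            shared = ~np.isnan(X[i]) & ~np.isnan(X[j])
            if shared.any():
                d = np.sqrt(np.mean((X[i, shared] - X[j, shared]) ** 2))
                neighbours.append((float(d), j))
        neighbours.sort()  # closest genes first
        for c in np.flatnonzero(np.isnan(X[i])):
            # Use the k closest genes that actually have a value in column c.
            donors = [X[j, c] for _, j in neighbours if not np.isnan(X[j, c])][:k]
            if donors:
                out[i, c] = np.mean(donors)
    return out


def bss_wss(X, y):
    """Per-gene ratio of between-group to within-group sum of squares.

    X is a complete (already imputed) genes-by-samples matrix and y holds one
    class label per sample. A gene with zero within-group spread would divide
    by zero; this sketch does not guard against that case.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    overall = X.mean(axis=1)
    bss = np.zeros(X.shape[0])
    wss = np.zeros(X.shape[0])
    for label in np.unique(y):
        group = X[:, y == label]
        group_mean = group.mean(axis=1)
        bss += group.shape[1] * (group_mean - overall) ** 2
        wss += ((group - group_mean[:, None]) ** 2).sum(axis=1)
    return bss / wss
```

In a pipeline of the kind the abstract outlines, a matrix would first be passed through an imputation function and the completed matrix then scored with `bss_wss`, keeping the highest-scoring genes for the classifier.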

Item Type: Book Chapter
ISBN: 3-540-33104-2, 978-3-540-33104-9
Academic Unit/Department: Mathematics, Computing and Technology > Computing & Communications
Interdisciplinary Research Centre: Centre for Research in Computing (CRC)
Item ID: 10551
Depositing User: Laurence Dooley
Date Deposited: 11 Apr 2008
Last Modified: 02 Dec 2010 20:07
URI: http://oro.open.ac.uk/id/eprint/10551