Comparison of Different Classifiers for Prediction of Breast Cancer Metastasis in Microarray Analysis

Document Type : Original Article (s)

Authors

1 PhD Student, Department of Bioelectric and Biomedical Engineering, School of Advanced Medical Technology, Isfahan University of Medical Sciences, Isfahan, Iran

2 Associate Professor, Department of Bioelectric and Biomedical Engineering, School of Advanced Medical Technology, Isfahan University of Medical Sciences, Isfahan, Iran

Abstract

Background: In this research, we investigated the performance of some different classifiers for prediction of metastasis in breast cancer.Methods: We used the DNA microarrays of primary breast tumors of 78 young patients. Among these patients, 34 had developed distant metastases within 5 years (poor prognosis group) and 44 formed good prognosis group. For analysis, we applied three different classifiers including support vector machine (SVM), stepwise linear discriminant analysis (SWLDA) and K-nearest neighbors (KNN) classifier. Each of these classifiers used 231 selected genes as an input feature vector and their performances were estimated via using leave one out (LOO) method to classify patients into two groups namely, good and poor prognosis.Findings: The best results were obtained by support vector machine with linear kernel. This classifier achieved a sensitivity and specificity of 84% and 82%, respectively, for metastasis prediction.Conclusion: Our findings provide a strategy to specify patients who would benefit from adjuvant therapy.

Keywords


  1. Weigelt B, Peterse JL, van't Veer LJ. Breast cancer metastasis: markers and models. Nature Reviews 2005; 5: 591-602.
  2. Lujambioa A, Calinc GA, Villanuevad A, Roperoa S, Sanchez-Cespedese M, Blancof D, et al. A microRNA DNA methylation signature for human cancer metastasis. PNAS 2008; 105(36): 13556-61.
  3. Mehridehnavi A, Ziaei L. Minimal gene selection for classification and diagnosis prediction based on gene expression profile. Adv Biomed Res 2013; 2: 26.
  4. van't Veer LJ, Dai H, van de Vijver MJ, He YD, Hart AA, Mao M, et al. Gene expression profiling predicts clinical outcome of breast cancer. Nature 2002; 415(6871): 530-6.
  5. Lotte F, Congedo M, Lecuyer A, Lamarche F, Arnaldi B. A review of classification algorithms for EEG-based brain-computer interfaces. J Neural Eng 2007; 4(2): R1-R13.
  6. Huerta EB, Duval B, Hao JK. Selection for microarray data by a LDA-based genetic algorithm. Lecture Notes in Computer Science 2008; 5265: 250-61.
  7. Sharmaa A, Paliwala KK. Cancer classification by gradient LDA technique using microarray gene expression data. Data and Knowledge Engineering 2008; 66(2): 338-47.
  8. Krusienski DJ, Sellers EW, McFarland DJ, Vaughan TM, Wolpaw JR. Toward enhanced P300 speller performance. J Neurosci Methods 2008; 167(1): 15-21.
  9. Nijboer F, Sellers EW, Mellinger J, Jordan MA, Matuz T, Furdea A, et al. A P300-based brain-computer interface for people with amyotrophic lateral sclerosis. Clin Neurophysiol 2008; 119(8): 1909-16.
  10. Furey TS, Cristianini N, Duffy N, Bednarski DW, Schummer M, Haussler D. Support vector machine classification and validation of cancer tissue samples using microarray expression data. Bioinformatics 2000; 16(10): 906-14.
  11. Hernandez JCh, Duval B, Hao JK. SVM-based local search for gene selection and classification of microarray data. Bioinformatics Research and Development Communications in Computer and Information Science 2008; 13: 499-508.
  12. Parry RM, Jones W, Stokes TH, Phan JH, Moffitt RA, Fang H, et al. k-Nearest neighbor models for microarray gene expression analysis and clinical outcome prediction. Pharmacogenomics J 2010; 10(4): 292-309.
  13. Jiangsheng Y. Method of k-nearest neighbors. Beijing, China: Institute of Computational Linguistics Peking University; 2002.
  14. Krusienski DJ, Sellers EW, Cabestaing F, Bayoudh S, McFarland DJ, Vaughan TM, et al. A comparison of classification techniques for the P300 Speller. J Neural Eng 2006; 3(4): 299-305.