Evaluating the Performance of Machine Learning Techniques in the Classification of Wisconsin Breast Cancer

  • Authors

    • Omar Ibrahim Obaid
    • Mazin Abed Mohammed
    • Mohd Khanapi Abd Ghani
    • Salama A. Mostafa
    • Fahad Taha AL-Dhief
    2018-12-09
    https://doi.org/10.14419/ijet.v7i4.36.23737
  • Breast Cancer, Machine Learning, Accuracy, Classification, Support Vector Machine, Decision Tree, k-Nearest Neighbors, Wisconsin Breast Cancer (Diagnostic) Dataset
  • Breast cancer is a considerable health problem among women and causes deaths around the world. The disease can be detected by distinguishing malignant from benign tumors, so doctors need a trustworthy diagnostic process, and automating that process can help them recognize tumors. Numerous studies have applied machine learning algorithms to breast cancer classification, and many researchers have reported that these algorithms perform well in diagnosis. In this paper, three machine learning algorithms (Support Vector Machine, k-Nearest Neighbors, and Decision Tree) are applied to the Wisconsin Breast Cancer (Diagnostic) dataset, and their performance is compared to determine which classifier works best for breast cancer classification. The main aim of this work is to compare several classifiers and identify the one that gives the highest accuracy. The results show that the quadratic support vector machine achieves the highest accuracy (98.1%) with the lowest false discovery rates. The experiments were carried out in Matlab, which provides a dedicated toolbox for machine learning algorithms.
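
    The abstract ranks classifiers by accuracy and false discovery rate. As a minimal sketch of how those two metrics can be computed from confusion-matrix counts, the Python below uses one common definition (false discovery rate = FP / (FP + TP), with malignant as the positive class). The names `ConfusionCounts`, `accuracy`, `false_discovery_rate`, and `best_by_accuracy` are hypothetical, and the counts are invented for illustration; they are not results from the paper, whose experiments were run in Matlab.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Binary confusion-matrix counts, with 'malignant' as the positive class."""
    tp: int  # malignant tumors predicted as malignant
    fp: int  # benign tumors predicted as malignant
    tn: int  # benign tumors predicted as benign
    fn: int  # malignant tumors predicted as benign


def accuracy(c: ConfusionCounts) -> float:
    """Fraction of all samples that were classified correctly."""
    total = c.tp + c.fp + c.tn + c.fn
    return (c.tp + c.tn) / total


def false_discovery_rate(c: ConfusionCounts) -> float:
    """Fraction of 'malignant' predictions that were actually benign."""
    predicted_positive = c.tp + c.fp
    # Assumption: report 0.0 when the classifier made no positive predictions.
    return c.fp / predicted_positive if predicted_positive else 0.0


def best_by_accuracy(results: dict) -> str:
    """Return the name of the classifier with the highest accuracy."""
    return max(results, key=lambda name: accuracy(results[name]))


# Hypothetical counts for illustration only (NOT taken from the paper).
example = {
    "svm": ConfusionCounts(tp=90, fp=2, tn=98, fn=10),
    "knn": ConfusionCounts(tp=85, fp=5, tn=95, fn=15),
    "tree": ConfusionCounts(tp=80, fp=10, tn=90, fn=20),
}

if __name__ == "__main__":
    for name, counts in example.items():
        print(name, accuracy(counts), false_discovery_rate(counts))
    print("best:", best_by_accuracy(example))
```

    Reporting the false discovery rate alongside accuracy matters in this setting because a classifier can score well on accuracy while still flagging many benign tumors as malignant.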

  • How to Cite

    Ibrahim Obaid, O., Abed Mohammed, M., Khanapi Abd Ghani, M., A. Mostafa, S., & Taha AL-Dhief, F. (2018). Evaluating the Performance of Machine Learning Techniques in the Classification of Wisconsin Breast Cancer. International Journal of Engineering & Technology, 7(4.36), 160-166. https://doi.org/10.14419/ijet.v7i4.36.23737