Big data management with machine learning inscribed by domain knowledge for health care

  • Authors

    • EPhzibah E.P. VIT university, Vellore-632014
    • Sujatha R VIT university, Vellore-632014
    2017-09-20
    https://doi.org/10.14419/ijet.v6i4.8214
  • Big Data, Classification, Disease Diagnosis, Domain Knowledge, Machine Learning.
  • Abstract

    In this work, a framework that helps in the disease diagnosis process with big-data management and machine learning using rule based, instance based, statistical, neural network and support vector method is given. Concerning this, big-data that contains the details of various diseases are collected, preprocessed and managed for classification. Diagnosis is a day-to-day activity for the medical practitioners and is also a decision-making task that requires domain knowledge and expertise in the specific field. This framework suggests different machine learning methods to aid the practitioner to diagnose disease based on the best classifier that is identified in the health care system. The framework has three main segments like big-data management, machine learning and input/output details of the patient. It has been already proved in the literature that the computing methods do help in disease diagnosis, provided the data about that particular disease is available in the data center. Thus this framework will provide a source of confidence and satisfaction to the doctors, as the model generated is based on the accuracy of the classifier compared to other classifiers.

  • References

    1. [1] K. Polat and S. Güne, 2007. An expert system approach based on principal component analysis and adaptive neuro-fuzzy inference system to diagnosis of diabetes disease. Digital Signal Processing, 17, 702–710. https://doi.org/10.1016/j.dsp.2006.09.005.

      [2] Pramanik, M.I., Lau, R.Y., Demirkan, H. and Azad, M.A.K., 2017. Smart Health: Big Data Enabled Health Paradigm within Smart Cities. Expert Systems with Applications. https://doi.org/10.1016/j.eswa.2017.06.027.

      [3] S. Sakr and A. Elgammal, 2016. Towards a Comprehensive Data Analytics Framework for Smart Healthcare Services. Big Data Research, 4, 44–58. https://doi.org/10.1016/j.bdr.2016.05.002.

      [4] L. Wang and C. A. Alexander, 2015. Big Data in Medical Applications and Health Care. American Medical Journal, 6 (1),1- 8 https://doi.org/10.3844/amjsp.2015.1.8.

      [5] W. Raghupathi and V. Raghupathi, 2014. Big data analytics in healthcare : promise and potential. Health Information and Science Systems, 2(1), 1–10. https://doi.org/10.1186/2047-2501-2-3.

      [6] D. Haluza and D. Jungwirth, 2015. ICT and the future of health care : aspects of health promotion. International Journal of Medical Informatics, 84(1), 48–57. https://doi.org/10.1016/j.ijmedinf.2014.09.005.

      [7] M. De Bruijne, 2016. Machine learning approaches in medical image analysis : From detection to diagnosis. Medical Image Analysis, 33, 94–97. https://doi.org/10.1016/j.media.2016.06.032.

      [8] Kavakiotis, O. Tsave, A. Salifoglou, N. Maglaveras, I. Vlahavas, and I. Chouvarda, 2017. Machine Learning and Data Mining Methods in Diabetes Research. Computational and Structural Biotechnology Journal, 15, 104–116. https://doi.org/10.1016/j.csbj.2016.12.005.

      [9] K. Greyson et al., 2013. Formulation Process of Knowledge for an Expert Healthcare System Unit. AASRI Procedia, 4, 190–195. https://doi.org/10.1016/j.aasri.2013.10.029.

      [10] Sohail Jabbar, Farhan Ullah, Shehzad Khalid, Murad Khan, and Kijun Han, 2017.Semantic Interoperability in Heterogeneous IoT Infrastructure for Healthcare. Wireless Communications and Mobile Computing, 2017, https://doi.org/10.1155/2017/9731806.

      [11] Nair, L.R., Shetty, S.D. and Shetty, S.D., 2017. Applying spark based machine learning model on streaming big data for health status prediction. Computers & Electrical Engineering. In Press. https://doi.org/10.1016/j.compeleceng.2017.03.009.

      [12] Fatima, M. and Pasha, M., 2017, Survey of Machine Learning Algorithms for Disease Diagnostic. Journal of Intelligent Learning Systems and Applications, 9(01), 1-16. https://doi.org/10.4236/jilsa.2017.91001.

      [13] Papakostas, G.A., Savio, A., Graña, M. and Kaburlasos, V.G., 2015, A lattice computing approach to Alzheimer’s disease computer assisted diagnosis based on MRI data. Neurocomputing, 150, 37-42. https://doi.org/10.1016/j.neucom.2014.02.076.

      [14] Huang, W., 2016. A novel disease severity prediction scheme via big pair-wise ranking and learning techniques using image-based personal clinical data. Signal Processing, 124, 233-245. https://doi.org/10.1016/j.sigpro.2015.08.004.

      [15] Lin, W., Dou, W., Zhou, Z. and Liu, C., 2015. A cloud-based framework for Home-diagnosis service over big medical data. Journal of Systems and Software, 102, 192-206. https://doi.org/10.1016/j.jss.2014.05.068.

      [16] Costa, F.F., 2014. Big data in biomedicine. Drug discovery today, 19(4), 433-440. https://doi.org/10.1016/j.drudis.2013.10.012.

      [17] Nilashi, M., bin Ibrahim, O., Ahmadi, H. and Shahmoradi, L., 2017.An Analytical Method for Diseases Prediction Using Machine Learning Techniques. Computers & Chemical Engineering, 106, 212-223. https://doi.org/10.1016/j.compchemeng.2017.06.011.

      [18] Cottle, M., Hoover, W., Kanwal, S., Kohn, M., Strome, T. and Treister, N., 2013. Transforming Health Care through Big Data Strategies for leveraging big data in the health care industry. Institute for Health Technology Transformation, http://ihealthtran. com/big-data-in-healthcare.

      [19] J. Yang et al., 2015. Computers in Industry Emerging information technologies for enhanced healthcare. Computers in Industry, 69, 3–11. https://doi.org/10.1016/j.compind.2015.01.012.

      [20] L. Ericson, T. Hammar, N. Schönström, and G. Petersson, 2017. Stakeholder consensus on the purpose of clinical evaluation of electronic health records is required. Health Policy and Technology, 6(2), 152–160. https://doi.org/10.1016/j.hlpt.2017.02.005.

      [21] C. Muriana, T. Piazza, and G. Vizzini, 2016. An expert system for financial performance assessment of health care structures based on fuzzy sets and KPIs. Knowledge-Based Systems, 97, 1–10. https://doi.org/10.1016/j.knosys.2016.01.026.

      [22] Chaves, R., Ramírez, J., Gorriz, J.M. and Alzheimer’s Disease Neuroimaging Initiative, 2013. Integrating discretization and association rule-based classification for Alzheimer’s disease diagnosis. Expert Systems with Applications, 40(5), 1571-1578. https://doi.org/10.1016/j.eswa.2012.09.003.

      [23] He, R., Xiong, N., Yang, L.T. and Park, J.H., 2011. Using multi-modal semantic association rules to fuse keywords and visual features automatically for web image retrieval. Information Fusion, 12 (3), 223-230. https://doi.org/10.1016/j.inffus.2010.02.001.

      [24] Cheruku, R., Edla, D.R., Kuppili, V. and Dharavath, R., 2017. RST-BatMiner: A Fuzzy Rule Miner Integrating Rough Set Feature Selection and Bat Optimization for Detection of Diabetes Disease. Applied Soft Computing, In Press. https://doi.org/10.1016/j.asoc.2017.06.032.

      [25] Tong, T., Wolz, R., Gao, Q., Guerrero, R., Hajnal, J.V., Rueckert, D. and Alzheimer’s Disease Neuroimaging Initiative, 2014. Multiple instance learning for classification of dementia in brain MRI. Medical image analysis, 18(5), 808-818. https://doi.org/10.1016/j.media.2014.04.006.

      [26] Gagliardi, F., 2011.Instance-based classifiers applied to medical databases: diagnosis and knowledge extraction. Artificial intelligence in medicine, 52(3), 123-139. https://doi.org/10.1016/j.artmed.2011.04.002.

      [27] Arabasadi, Z., Alizadehsani, R., Roshanzamir, M., Moosaei, H. and Yarifard, A.A., 2017. Computer aided decision making for heart disease detection using hybrid neural network-Genetic algorithm. Computer Methods and Programs in Biomedicine, 141, pp.19-26. https://doi.org/10.1016/j.cmpb.2017.01.004.

      [28] Dolatabadi, A.D., Khadem, S.E.Z. and Asl, B.M., 2017. Automated diagnosis of coronary artery disease (CAD) patients using optimized SVM. Computer methods and programs in biomedicine, 138, pp.117-126. https://doi.org/10.1016/j.cmpb.2016.10.011.

      [29] Sartakhti, J.S., Zangooei, M.H. and Mozafari, K., 2012. Hepatitis disease diagnosis using a novel hybrid method based on support vector machine and simulated annealing (SVM-SA). Computer methods and programs in biomedicine, 108(2), pp.570-579. https://doi.org/10.1016/j.cmpb.2011.08.003.

  • Downloads

  • How to Cite

    E.P., E., & R, S. (2017). Big data management with machine learning inscribed by domain knowledge for health care. International Journal of Engineering & Technology, 6(4), 98-102. https://doi.org/10.14419/ijet.v6i4.8214

    Received date: 2017-08-10

    Accepted date: 2017-09-11

    Published date: 2017-09-20