Multinode Cluster Analysis for Prediction of Heart Disease Using Biglm Algorithm

  • Authors

    • J Leelavathy
    • Dr S.Selva brundha
    2018-04-18
    https://doi.org/10.14419/ijet.v7i2.20.16721
  • Biglm, Linear Regression, , Spark, R
  • Abstract

    The manuscript proposes a novel approach to uncover pattern from a huge chunk of data with minimal time consumption. The goal of the manuscript is to build a robust predictive model to handle big data in patient data. The manuscript follows a master-slave  approach. The training data is divided into n number of chunks and processed. The data involves 32,96,168 row vectors and each row vector holds 14 attributes. The model is given a row vector with 14 attributes whether the system could predict the target class by linear modeling. From the experimentation done in single master and four slave setup, we infer that the system could fit a linear model in 1800 seconds. A comparative study has been made on training the huge chunk of data in a standalone device and in master-slave devices. From the experimental analysis, we infer that the training time of the proposed approach has been reduced by more than half since the proposed approach divides the huge chunk of data into independent chunks and computation is done at multiple nodes.


  • References

    1. [1] Blackwell, M. and Sen, “Large Datasets and You: A Field Guide. Political Methodologistâ€, Vol no20.1, p.3, 2012

      [2] Enea, “Fitting linear models and generalized linear models with large data sets in R. Statistical Methods for the Analysis of Large Datasets: book of short papersâ€, pp.411-414,2019

      [3] Yanish Pradhananga, Shridevi C. Karande “Memory Management in Bigdataâ€, International Journal of Science and Research, Vol 3, p.no 1183-1187, 2014

      [4] Assunção, M. D., Calheiros, R. N., Bianchi, S., Netto, M.A.S. , Buyya, R., “Big Data computing and clouds: Trends and future directionsâ€, Journal of Parallel and Distributed Computing, Volumes 79–80, May, Pages 3-15,2015 Bakshi, K., “Considerations for big data: Architecture and approachâ€.,proceedings of the IEEE Aerospace Conference, Big Sky Montana. pp. 1-7,2012

      [5] Beck, J., Mostow, J.†How who should practice: Using learning decomposition to evaluate the efficacy of different types of practice for different types of students†In: Proceedings of the 9th International Conference on Intelligent Tutoring Systems. pp. 353-362,2012

      [6] Benjamins, V. R., “Big data: From hype to reality? “, Proceedings of the 4th International Conference on Web Intelligence, Mining and Semantics (WIMS14). ACM, New York, NY, USA,pp. 1-5,2014.

      [7] Bhadani, A., and Chaudhary, S., “Performance evaluation of web servers using central load balancing policy over virtual machines on cloudâ€, Proceedings of the Third Annual ACM Conference, pp.1-10,2013.

      [8] Garlasu, D., Sandulescu, V., Halcu, I., Neculoiu, G., Grigoriu, O., Marinescu, M., Marinescu, V.,â€A big data implementation based on grid computingâ€,proceedings of the 11th Roedunet International Conference (RoEduNet), Sinaia. pp. 1-4,2014.Jacobs, The pathologies of big data, Communications of The ACM,52(8),,pp.36-44,2009

      [9] H. Zhu, Z. Xu and Y. Huang, “Research on the security technology of big data informationâ€, International Conference on Information Technology and Management Innovation, pp.1041-1044.,2015.

      [10] Merelli, H. Perez-sanchez, S. Gesing and D. D.Agostino, “Managing, analysing, and integrating big data in medical bioinformatics: open problems and future perspectivesâ€, BioMed Research International, pp.1-13,2014.

      [11] N. Mishra, C. Lin and H. Chang, “A cognitive adopted framework for iot big data management and knowledge discovery prospectiveâ€, International Journal of Distributed Sensor Networks, pp.1-13,2015.

      [12] X. Y.Chen and Z. G.Jin, “Research on key technology and applications for internet of thingsâ€, Physics Procedia, pp. 561-566.2012,

      [13] M. D. Assuno, R. N. Calheiros, S. Bianchi, M. a. S. Netto and R. Buyya, “Big data computing and clouds: Trends and future directionsâ€, Journal of Parallel and Distributed Computing, pp.3-15.,2015.

      [15] A. T. Hashem, I. Yaqoob, N. Badrul Anuar, S. Mokhtar, A. Gani and S. Ullah Khan, “The rise of big data on cloud computing: Review and open research issues, Information Systemsâ€, pp. 98-115.,2014.

  • Downloads

  • How to Cite

    Leelavathy, J., & S.Selva brundha, D. (2018). Multinode Cluster Analysis for Prediction of Heart Disease Using Biglm Algorithm. International Journal of Engineering & Technology, 7(2.20), 298-301. https://doi.org/10.14419/ijet.v7i2.20.16721

    Received date: 2018-08-03

    Accepted date: 2018-08-03

    Published date: 2018-04-18