An Enhanced K-Means Clustering Algorithm for Pattern Discovery in Big Data Analysis of 3-Phase Electrical Quantities

  • Authors

    • Dikpride Despa
    • Gigih Forda Nama
  • Data Mining, Electrical Quantities, Rapidminer, CRISP-DM, K-Mean, Clustering, 3-Phase, Internet of Things (IoT), Big Data.
  • The Unila Internet of Things Research Group (UIRG) was developed online monitoring of power distribution system based on Internet of Things (IoT) technology on Department of Electrical Engineering University of Lampung (Unila), has been running for several months, this system monitored electrical quantities of 3-phase main distribution panel of H-building. The measurement system involve multiple sensors such current sensors and voltage sensors, the measurement data stored in to database server and shown the information in a real-time through a web-based application.

    Main objective of this research was to capture, analyze, and identified the knowledge pattern of electrical quantities data measurements, using Cross-Industry Standard Process for Data Mining (CRISP-DM) data mining framework, for helping the stake holders to continuous improvement of the quality of electricity services, the initial research limited to total 770847 electrical quantities recorded data that save on database system, since 1 September - 31 October 2018, the dataset consist of 21 attribute electrical quantities such as; voltage, current, power factor values, energy consumption, frequency, on H building 3-Phase main panel control.

    Rapidminer as leading application on knowledge discovery application was used to analyze the big data, K-Mean cluster algorithm implemented to identify the data pattern, the result indicated that 3-Phase load was unbalanced, and Phase-0 was the most utilized phase, based on from total 5 cluster analysis result.


  • References

    1. [1] P.Gregor. Gartner 2018 Magic Quadrant for Advanced Analytics Platforms: who gained and who lost.

      [2] Introduction To Rapidminer,, 2018.

      [3] W. A. Al-Dhuraibi, J. Ali, Using Classification Techniques to Predict Gold Price Movement, 4th International Conference on Computer and Technology Applications, 2018.

      [4] J. Estrada, L. Vea, Sitting Posture Recognition for Computer Users using Smartphones and a Web Camera, Proc. of the 2017 IEEE Region 10 Conference (TENCON), Malaysia, November 5-8, 2017.

      [5] M. A. Alhaj, A. Y. A. Maghari, Cancer Survivability Prediction using Random Forest and Rule Induction Algorithms, 8th International Conference on Information Technology (ICIT), 2017.

      [6] A. Geetha, G. M. Nasira, Data Mining for Meteorological Applications: Decision Trees for Modeling Rainfall Prediction, IEEE International Conference on Computational Intelligence and Computing Research, 2014.

      [7] Rianto, L. E. Nugroho, P. I. Santosa, Pattern Discovery of Indonesian Customers in an Online Shop: A Case of Fashion Online Shop, Proc. of 2016 3rd Int. Conf. on Information Tech., Computer, and Electrical Engineering (ICITACEE), Oct 19-21st, Semarang, Indonesia., 2016.

      [8] M. R. Mahmud, M. A. Mamun, M. A. Hossain, M. P. Uddin, Comparative Analysis of K-Means and Bisecting K-Means Algorithms for Brain Tumor Detection, International Conference on Computer, Communication, Chemical, Material and Electronic Engineering (IC4ME2), 2018.

      [9] M. S. Rahim, T. Ahmed, An initial centroid selection method based on radial and angular coordinates for K-means algorithm, 20th International Conference of Computer and Information Technology (ICCIT), 2017.

      [10] M. A. Altuncu, B. Türkoğlu, M. A. Çavuşlu, S. SahIn, Implementation of K-means algorithm on FGGA, 26th Signal Processing and Communications Applications Conference (SIU), 2018.

      [11] P. Manivannan, P. Isakki Devi, Dengue fever prediction using K-means clustering algorithm, IEEE International Conference on Intelligent Techniques in Control, Optimization and Signal Processing (INCOS), 2017.

      [12] UC Business Analytics R Programming Guide - K-mean Algorithm., 2018.

      [13] P. Kalgotra, R. Sharda, Progression analysis of signals: Extending CRISP-DM to stream analytics, IEEE International Conference on Big Data (Big Data), 2016.

      [14] F. Chiheb, F. Boumahdi, H. Bouarfa, D. Boukraa, Predicting students performance using decision trees: Case of an Algerian University, International Conference on Mathematics and Information Technology (ICMIT), 2017

      [15] L. C. Chinchilla, K. A. R. Ferreira, Analysis of the behavior of customers in the social networks using data mining techniques, IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), 2016.

      [16] Z. Hou, Data Mining Method and Empirical Research for Extension Architecture Design, International Conference on Intelligent Transportation, Big Data & Smart City (ICITBS), 2018.

      [17] C.Shearer. The CRISP-DM Model: The New Blueprint for Data Mining, Journal of Data Warehousing, Volume 5, Number 4, pag. 13-22, 2000.

      [18] D. Despa, G. F. Nama, M. A. Muhammad, K. Anwar, The Implementation Internet of Things (IoT) Technology in Real Time Monitoring of Electrical Quantities, The 2nd International Conference on Mathematics, Science, Education and Technology, 5–6 October, Padang, West Sumatera, Indonesia, 2017.

      [19] G. F. Nama, D. Despa, Mardiana, Real-time monitoring system of electrical quantities on ICT Centre building University of Lampung based on Embedded Single Board Computer BCM2835, International Conference on Informatics and Computing (ICIC), 2016.

      [20] G. F. Nama, K. Muludi, Implementation of Two-Factor Authentication (2FA) to Enhance the Security of Academic Information System, Journal of Engineering and Applied Sciences 13 (8), 2209-2220, 2018.

      [21] G. F. Nama, G. I. Suhada, A. Zaenudin, Smart System Monitoring of Gradient Soil Temperature at the Anak Krakatoa Volcano, Asian Journal of Information Technology 16 (2), 337-347, 2017.

      [22] D. Despa, F. X. A. Setyawan, G. F. Nama, J. Delano, Artificial Neural Network Applications Use Measurements Of Electrical Quantities To Estimate Electric Power, International Conference on Engineering, Technologies, and Applied Sciences (ICETsAS 2018), 2018.

  • Downloads

  • How to Cite

    Despa, D., & Forda Nama, G. (2018). An Enhanced K-Means Clustering Algorithm for Pattern Discovery in Big Data Analysis of 3-Phase Electrical Quantities. International Journal of Engineering & Technology, 7(4.44), 8-16.