Analysis of large volume data processing using  clustering algorithms

Sarada. B; Vinayaka Murthy. M; Udaya Rani. V

doi:10.14419/ijet.v7i4.5.25058

Article Summary Abstract References Full Article How to cite

Authors
- Sarada. B
- Vinayaka Murthy. M
- Udaya Rani. V
2018-09-22

https://doi.org/10.14419/ijet.v7i4.5.25058
Big Data, Canopy Clustering, Hadoop, K-Mean Clustering, Data Processing Techniques, Mapreduce.
Abstract

The study of large dataset with velocity, variety and volume which is also known as Big data. When the dataset has limited number of clusters, low dimensions and small number of data points the existing traditional clustering algorithms can be used.. As we know this is the internet age, the data is growing very fast and existing clustering algorithms are not giving the acceptable results in terms of time complexity and spatial complexity. So there is a need to develop a new approach of applying clustering of large volume of data processing with low time and spatial complexity through MapReduce and Hadoop frame work applying to different clustering algorithms, k-means, Canopy clustering and proposed algorithm .The analysis shows that the large volume of data processing will take low time and spatial complexity when compared to small volume of data.
Â
Â

Â
References
1. [1] Ambika.s and Kavitha.G,â€ Overcoming the Defects of K-means clustering by using Canopy Clustering Algorithm IJSRD |Vol. 4, Issue 05, 2016 | ISSN (online): 2321-0613.
  [2] D. Napoleon & P. Ganga lakshmi â€œAn Efficient K-Means Clustering Algorithm for Reducing Time Complexity Using Uniform Distribution Data Pointsâ€ IEEE, 2010, pp, 42-45.
  [3] Dweepna Garg 1, Khushboo Trivedi 2, B.B.Panchal ,â€ A Comparative study of Clustering Algorithms using MapReduce 2321-0613 in Hadoopâ€ IJSRDt| Vol. 4, Issue 05,2016 | ISSN (online):
  [4] M. S. Chen, J. Han, and P. S. Yu. IEEE Trans Knowledge and Data Engineering Data mining. An overview from a database perspective, 8:866-883, 1996.
  [5] Ayman E. Kheer, Ahmed I. El Seddawy, Amira M. Idrees,â€ Performance Tuning of K-Mean Clustering Algorithm a Step towards Efficient DSSâ€, IJIRCST,ISSN: 2347-5552, Volume 2, Issue 6, November â€“ 2014.
  [6] A. Hunter and S. Parsons, "A review of uncertainty handling formalisms", Applications of Uncertainty Formalisms LNAI 1455, pp.8-37. Springer â€“ Verlag, 1998.
  [7] H.R. Shashidhar, G.T. Raju and M Vinayaka Murthy,â€œEfficient Estimation of Result Selectivity for Web Query Optimi zationâ€, International Journal of Pure and Applied Mathematics, Volume 17 No. 7 2017, PP 193-205, ISSN:311-8080.
  [8] H.R. Shashidhar, G.T. Raju and M Vinayaka Murthy, â€œEffective Cost Models for Web Query Optimizationâ€, International Journal of Pure and.
  [9] Applied Mathematics, Volume 117 No. 20, 17, PP 727-739, ISSN: 1311-8080.
  [10] M Vinayaka Murthy â€œSurvey On Web Query Optimization Trends and Future Researchâ€, International Conference On Advanced Material Technology 2016, Issue â€“V, pp 409 â€“ 417, Elsevier Materials Today: Proceedings.
  [11] M Vinayaka Murthy, â€œA Comparative Study on Mining the & Healthy Food Preferences of Women Clustersâ€, Journal of Scientific Engineering Research, Vol 6, Issue 7, pp 126 131, 2017, ISSN: 2229-5518.
Downloads
How to Cite
B, S., Murthy. M, V., & Rani. V, U. (2018). Analysis of large volume data processing using clustering algorithms. International Journal of Engineering & Technology, 7(4.5), 685-688. https://doi.org/10.14419/ijet.v7i4.5.25058
ACM

ACS

APA

ABNT

Chicago

Harvard

IEEE

MLA

Turabian

Vancouver

Download Citation

Endnote/Zotero/Mendeley (RIS)

BibTeX
Received date: 2018-12-30

Accepted date: 2018-12-30

Published date: 2018-09-22

Analysis of large volume data processing using clustering algorithms

Authors

Abstract

References

Downloads

How to Cite

Published