Efficient Document Clustering for Web Search Result

Sumathi Rani Manukonda (Asst. Prof., KMIT, Narayanguda, Hyderabad); Nomula Divya (Asst. Prof., CMRIT, Medchal, Hyderabad)

doi:10.14419/ijet.v7i3.3.14494


Authors
- Sumathi Rani Manukonda, Asst. Prof., KMIT, Narayanguda, Hyderabad
- Nomula Divya, Asst. Prof., CMRIT, Medchal, Hyderabad
2018-06-21

https://doi.org/10.14419/ijet.v7i3.3.14494
Keywords: Document clustering, hierarchical clustering, k-means, spherical k-means, GFA, Euclidean distance, cosine measure, text data mining, knowledge discovery in databases
Abstract

Document clustering is a traditional data mining approach in which similar, mutually relevant documents are grouped together. It contributes to the accuracy of information retrieval systems that identify the nearest neighbors of a document. Massive quantities of data are generated every day and need to be clustered. Although many clustering methods have been introduced to improve cluster quality, many challenges remain in improving document clustering. For web search, grouping documents allows results to be arranged efficiently for retrieval, so that users can explore the results of a search query in an organized way. Document clustering can also yield a hierarchical grouping of documents. Most clustering algorithms do not take a semantic approach, which leads to unsatisfactory clustering output. The automatic organization of web documents by services such as Google and Yahoo is often taken as a reference. A distinct approach identifies existing groups of similar items in previously organized documents and derives an effective document classifier for new documents. This paper concentrates on hierarchical clustering and k-means algorithms, argues that k-means and its variants are more efficient than hierarchical clustering, and considers an implementation of the greedy fast k-means algorithm (GFA) for clustering documents efficiently.
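To make the keywords concrete, the following is a minimal sketch of spherical k-means, the k-means variant that assigns documents by cosine similarity instead of Euclidean distance. It is an illustration only, not the authors' implementation and not the GFA variant, whose details are not given in the abstract. The toy documents, the plain term-frequency weighting, the fixed seed indices, and all function names are assumptions made for this example.

```python
import math
from collections import Counter


def normalize(vec):
    """Scale a vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else vec


def tf_vector(text, vocab):
    """Unit-length term-frequency vector for one document."""
    counts = Counter(text.lower().split())
    return normalize([float(counts[term]) for term in vocab])


def cosine(a, b):
    """Cosine similarity; inputs are unit length, so this is the dot product."""
    return sum(x * y for x, y in zip(a, b))


def spherical_kmeans(vectors, seeds, max_iter=20):
    """Cluster unit vectors by cosine similarity.

    `seeds` lists the indices of the documents used as initial centroids
    (a hypothetical, deterministic initialization chosen for this sketch).
    """
    centroids = [vectors[i][:] for i in seeds]
    labels = [-1] * len(vectors)
    for _ in range(max_iter):
        # Assignment step: each document goes to its most similar centroid.
        new_labels = [
            max(range(len(centroids)), key=lambda c: cosine(v, centroids[c]))
            for v in vectors
        ]
        if new_labels == labels:
            break  # assignments are stable, so the algorithm has converged
        labels = new_labels
        # Update step: each centroid becomes the normalized sum of its members.
        for c in range(len(centroids)):
            members = [v for v, lab in zip(vectors, labels) if lab == c]
            if members:
                centroids[c] = normalize([sum(col) for col in zip(*members)])
    return labels


docs = [
    "apple banana fruit juice",
    "banana fruit salad apple",
    "python code compiler bug",
    "compiler bug python debug",
]
vocab = sorted({term for d in docs for term in d.lower().split()})
vectors = [tf_vector(d, vocab) for d in docs]
labels = spherical_kmeans(vectors, seeds=[0, 2])
print(labels)
```

On this toy input the two fruit documents share one cluster and the two programming documents share the other. Each iteration costs time proportional to the number of documents times the number of clusters, whereas standard agglomerative hierarchical clustering compares all pairs of documents, which is the usual reason k-means variants scale better to large collections.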
How to Cite
Rani Manukonda, S., & Divya, N. (2018). Efficient Document Clustering for Web Search Result. International Journal of Engineering & Technology, 7(3.3), 90-92. https://doi.org/10.14419/ijet.v7i3.3.14494
Received date: 2018-06-21

Accepted date: 2018-06-21

Published date: 2018-06-21
