Silhouette index for determining optimal k-means clustering on images in different color models

  • Authors

    • Abd Rasid Mamat
    • Fatma Susilawati Mohamed
    • Mohamad Afendee Mohamed
    • Norkhairani Mohd Rawi
    • Mohd Isa Awang
    2018-04-06
    https://doi.org/10.14419/ijet.v7i2.14.11464
  • Cluster validation, Color model, Image filtering, K-means algorithm, Silhouette index.
  • Abstract

    Clustering process is an essential part of the image processing. Its aim to group the data according to having the same attributes or similarities of the images. Consequently, determining the number of the optimum clusters or the best (well-clustered) for the image in different color models is very crucial. This is because the cluster validation is fundamental in the process of clustering and it reflects the split between clusters. In this study, the k-means algorithm was used on three colors model: CIE Lab, RGB and HSV and the clustering process made up to k clusters. Next, the Silhouette Index (SI) is used to the cluster validation process, and this value is range between 0 to 1 and the greater value of SI illustrates the best of cluster separation. The results from several experiments show that the best cluster separation occurs when k=2 and the value of average SI is inversely proportional to the number of k cluster for all color model. The result shows in HSV color model the average SI decreased 14.11% from k = 2 to k = 8, 11.1% in HSV color model and 16.7% in CIE Lab color model. Comparisons are also made for the three color models and generally the best cluster separation is found within HSV, followed by the RGB and CIE Lab color models.

     

     

  • References

    1. [1] Arbelaitz O, Gurrutxaga I, Muguerza J, PéRez JM & Perona I (2013), An extensive comparative study of cluster validity indices. Pattern Recognition 46, 243–256.

      [2] MacQueen J (1967), Some methods for classification and analysis of multivariate observations. Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability, 281–297.

      [3] Dubey SR, Dixit P, Singh N & Gupta JP (2013), Infected fruit part detection using K-means clustering segmentation technique. International Journal of Artificial Intelligence and Interactive Multimedia 2, 65–72.

      [4] Vendramin L, Campello RJ & Hruschka ER (2010), Relative clustering validity criteria: A comparative overview. Statistical Analysis and Data Mining: The ASA Data Science Journal 3, 209–235.

      [5] Küçükkülahli E, Erdoğmuş P & Polat K (2016), Brain MRI segmentation based on different clustering algorithms. International Journal of Computer Applications 155, 37–40.

      [6] Pal NR & Biswas J (1997), Cluster validation using graph theoretic concepts. Pattern Recognition 30, 847–857.

      [7] Sree PK & Babu IR (2013), Face detection from still and video images using unsupervised cellular automata with K means clustering algorithm. ICGST-GVIP Journal 8, 1–7.

      [8] Bevilacqua V, Filograno G & Mastronardi G (2008), Face detection by means of skin detection. Proceedings of the International Conference on Intelligent Computing, pp. 1210–1220.

      [9] Dhanachandra N, Manglem K & Chanu YJ (2015), Image segmentation using K-means clustering algorithm and subtractive clustering algorithm. Procedia Computer Science 54, 764–771.

      [10] Salem NM (2014), Segmentation of white blood cells from microscopic images using K-means clustering. Proceedings of the IEEE 31st National Radio Science Conference, pp. 371–376.

      [11] Liu Y, Li Z, Xiong H, Gao X & Wu J (2010), Understanding of internal clustering validation measures. Proceedings of the IEEE 10th International Conference on Data Mining, pp. 911–916.

      [12] Qiao H & Edwards B (2009), A data clustering tool with cluster validity indices. IEEE International Conference on Computing, Engineering and Information, pp. 303–309.

      [13] Manoharan S & Sathappan S (2013), A novel approach for content based image retrieval using hybrid filter techniques. Proceedings of the IEEE 8th International Conference on Computer Science and Education, pp. 518–524.

      [14] Kannan A, Mohan V & Anbazhagan N (2010), An effective method of image retrieval using image mining techniques. International journal of Multimedia and Its Applications 2, 17–26.

      [15] Szeliski R (2010), Computer vision: Algorithms and applications, Springer Science and Business Media.

      [16] Zhao H, Kim P & Park J (2009), Feature analysis based on Edge Extraction and Median Filtering for CBIR. Proceedings of the IEEE 11th International Conference on Computer Modelling and Simulation, pp. 245–249.

      [17] Arumugadevi S & Seenivasagam V (2016), Color image segmentation using feedforward neural networks with FCM. International Journal of Automation and Computing 13, 491–500.

      [18] Arumugadevi S & Seenivasagam V (2016), Color image segmentation using feedforward neural networks with FCM. International Journal of Automation and Computing 13, 491–500.

      [19] Maulik U & Bandyopadhyay S (2002), Performance evaluation of some clustering algorithms and validity indices. IEEE Transactions on Pattern Analysis and Machine Intelligence 24, 1650–1654.

      [20] Dubes RC & Jain AK (1988), Algorithms for clustering data, Prentice-Hall.

      [21] Berry MJ & Linoff G (1997), Data mining techniques: For marketing, sales, and customer support. John Wiley and Sons.

      [22] Rendón E, Abundez I, Arizmendi A & Quiroz EM (2011), Internal versus external cluster validation indexes. International Journal of Computers and Communications 5, 27–34.

      [23] Chaimontree S, Atkinson K & Coenen F (2010), Best clustering configuration metrics: Towards multiagent based clustering. Proceedings of the International Conference on Advanced Data Mining and Applications, pp. 48–59.

      [24] Burney SA & Tariq H (2014), K-means cluster analysis for image segmentation. International Journal of Computer Applications 96, 1–8.

      [25] Rousseeuw PJ (1987), Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. Journal of Computational and Applied Mathematics 20, 53–65.

  • Downloads

  • How to Cite

    Rasid Mamat, A., Susilawati Mohamed, F., Afendee Mohamed, M., Mohd Rawi, N., & Isa Awang, M. (2018). Silhouette index for determining optimal k-means clustering on images in different color models. International Journal of Engineering & Technology, 7(2.14), 105-109. https://doi.org/10.14419/ijet.v7i2.14.11464

    Received date: 2018-04-12

    Accepted date: 2018-04-12

    Published date: 2018-04-06