Hybrid optimization for feature selection in opinion mining

  • Abstract
  • Keywords
  • References
  • PDF
  • Abstract

    A sub-discipline of Information Retrieval (IR) is opinion mining and the lexicon of computers is not concerned of the subject of thedocument, but about the opinion expressed.It has caused a large impact in the arena of academics and industry as it has a wide area of research and the applications are widespread.Feature selection is a vital step in opinion mining, as its individual feature decides the opinions expressed by the customers.Feature selection reduces the dimensionality of data by avoiding non-relevant features; it can be con-sidered as a necessary and excellent process for data mining applications.In this study, feature subset is optimized through Particle Swarm Optimization (PSO) algorithm, Cuckoo Search (CS) algorithm and hybridized PSO-CS algorithm.Classification is done through Naïve bayes and K-Nearest Neighbours (KNN) classifiers.Feature extraction has its basis on Term Frequency-Inverse DocumentFrequency (TF-IDF).The accuracy of classification precision is increased by the reduction in size of feature subset and computational com-plexity.

  • Keywords

    Opinion Mining; Feature Selection; Particle Swarm Optimization (PSO); Cuckoo Search (CS); Term Frequency-Inverse Document Frequency (TF-IDF); Naïve Bayes and K-Nearest Neighbours (KNN).

  • References

      [1] Jandail, R. R. S. (2014). A proposed Novel Approach for Sentiment Analysis and Opinion Mining. International Journal of UbiComp, 5(1/2), 1.https://doi.org/10.5121/iju.2014.5201.

      [2] Sharma, R., Nigam, S., & Jain, R. (2014). Opinion mining of movie reviews at document level. arXiv preprint arXiv:1408.3829.

      [3] Jeong, H., Shin, D., & Choi, J. (2011). Ferom: Feature extraction and refinement for opinion mining. Etri Journal, 33(5), 720-730.https://doi.org/10.4218/etrij.11.0110.0627.

      [4] Chandrashekar, G., &Sahin, F. (2014). A survey on feature selection methods. Computers & Electrical Engineering, 40(1), 16-28.https://doi.org/10.1016/j.compeleceng.2013.11.024.

      [5] Samsudin, N., Puteh, M., Hamdan, A. R., &Nazri, M. Z. A. (2013). Immune based feature selection for opinion mining. In Proceedings of the World Congress on Engineering (Vol. 3, pp. 3-5).

      [6] Sumathi, T., Karthik, S., &Marikkannan, M. (2014). Artificial Bee Colony Optimization for Feature Selection in Opinion Mining. Journal of Theoretical & Applied Information Technology, 66(1).

      [7] Wahyudi, M., &Kristiyanti, D. A. (2016). Sentiment Analysis of Smartphone Product Review Using Support Vector Machine Algorithm-Based Particle Swarm Optimization. Journal of Theoretical & Applied Information Technology, 91(1).

      [8] Isabella, J., & Suresh, R. (2012). Analysis and evaluation of Feature selectors in opinion mining. Indian Journal of Computer Science and Engineering (IJCSE), 3(6), pp. 757-762.

      [9] Behera, R. N., Manan, R., & Dash, S. (2016). Ensemble based Hybrid Machine Learning Approach for Sentiment Classification-A Review. International Journal of Computer Applications, 146(6).

      [10] Hai, Z., Chang, K., Kim, J. J., & Yang, C. C. (2014). Identifying features in opinion mining via intrinsic and extrinsic domain relevance. IEEE Transactions on Knowledge and Data Engineering, 26(3), 623-634.https://doi.org/10.1109/TKDE.2013.26.

      [11] Quan, C., & Ren, F. (2014). Unsupervised product feature extraction for feature-oriented opinion determination. Information Sciences, 272, 16-28.https://doi.org/10.1016/j.ins.2014.02.063.

      [12] Saraswathi, K., &Tamilarasi, A. (2016). Ant Colony Optimization Based Feature Selection for Opinion Mining Classification. Journal of Medical Imaging and Health Informatics, 6(7), 1594-1599.https://doi.org/10.1166/jmihi.2016.1856.

      [13] Keshavarz, H., &Abadeh, M. S. (2017). ALGA: Adaptive lexicon learning using genetic algorithm for sentiment analysis of microblogs. Knowledge-Based Systems, 122, 1-16.https://doi.org/10.1016/j.knosys.2017.01.028.

      [14] Shahid, R., Javed, S. T., & Zafar, K. (2017, April). Feature selection based classification of sentiment analysis using Biogeography optimization algorithm. In Innovations in Electrical Engineering and Computational Technologies (ICIEECT), 2017 International Conference on (pp. 1-5). IEEE.https://doi.org/10.1109/ICIEECT.2017.7916549.

      [15] Kumar, A., &Khorwal, R. (2017). Firefly Algorithm for Feature Selection in Sentiment Analysis. In Computational Intelligence in Data Mining (pp. 693-703). Springer, Singapore.https://doi.org/10.1007/978-981-10-3874-7_66.

      [16] Souza, E., Oliveira, A. L., Oliveira, G., Silva, A., & Santos, D. (2016, October). An Unsupervised Particle Swarm Optimization Approach for Opinion Clustering. In Intelligent Systems (BRACIS), 2016 5th Brazilian Conference on (pp. 307-312). IEEE.https://doi.org/10.1109/BRACIS.2016.063.

      [17] Shang, L., Zhou, Z., & Liu, X. (2016). Particle swarm optimization-based feature selection in sentiment classification. Soft Computing, 20(10), 3821-3834.https://doi.org/10.1007/s00500-016-2093-2.

      [18] He, R., &McAuley, J. (2016, April). Ups and downs: Modeling the visual evolution of fashion trends with one-class collaborative filtering. In Proceedings of the 25th International Conference on World Wide Web (pp. 507-517). International World Wide Web Conferences Steering Committee.https://doi.org/10.1145/2872427.2883037.

      [19] Vinodhini, G., & Chandrasekaran, R. M. (2012). Sentiment analysis and opinion mining: a survey. International Journal, 2(6), 282-292.

      [20] Kumar, J. A., &Abirami, S. (2015). An Experimental Study Of Feature Extraction Techniques In Opinion Mining. International Journal on Soft Computing, Artificial Intelligence and Applications (IJSCAI), 4(1).

      [21] Basari, A. S. H., Hussin, B., Ananta, I. G. P., &Zeniarja, J. (2013). Opinion mining of movie review using hybrid method of support vector machine and particle swarm optimization. Procedia Engineering, 53, 453-462.https://doi.org/10.1016/j.proeng.2013.02.059.

      [22] Patil, P. K., &Adhiya, K. P. (2015). Automatic Sentiment Analysis of Twitter Messages Using Lexicon Based Approach and Naive Bayes Classifier with Interpretation of Sentiment Variation. International Journal of Innovative Research in Science Engineering and Technology, 4(9).

      [23] Li, X., Li, J., & Wu, Y. (2015). A global optimization approach to multi-polarity sentiment analysis. PloS one, 10(4), e0124672.https://doi.org/10.1371/journal.pone.0124672.

      [24] Adnan, M. A., &Razzaque, M. A. (2013, March). A comparative study of particle swarm optimization and Cuckoo search techniques through problem-specific distance function. In Information and Communication Technology (ICoICT), 2013 International Conference of (pp. 88-92). IEEE.https://doi.org/10.1109/ICoICT.2013.6574619.

      [25] Gandomi, A. H., Yang, X. S., &Alavi, A. H. (2013). Cuckoo search algorithm: a metaheuristic approach to solve structural optimization problems. Engineering with computers, 29(1), 17-35.https://doi.org/10.1007/s00366-011-0241-y.

      [26] Pandey, A. C., Rajpoot, D. S., &Saraswat, M. (2017). Twitter sentiment analysis using hybrid cuckoo search method. Information Processing & Management, 53(4), 764-779.https://doi.org/10.1016/j.ipm.2017.02.004.

      [27] Ghodrati, A., &Lotfi, S. (2012). A hybrid CS/PSO algorithm for global optimization. Intelligent Information and Database Sstems, 89-98.https://doi.org/10.1007/978-3-642-28493-9_11.

      [28] Guo, J., Sun, Z., Tang, H., Jia, X., Wang, S., Yan, X., & Wu, G. (2016). Hybrid Optimization Algorithm of Particle Swarm Optimization and Cuckoo Search for Preventive Maintenance Period Optimization. Discrete Dynamics in Nature and Society, 2016.https://doi.org/10.1155/2016/1516271.

      [29] Dey, L., Chakraborty, S., Biswas, A., Bose, B., & Tiwari, S. (2016). Sentiment Analysis of Review Datasets Using Naive Bayes and K-NN Classifier. arXiv preprint arXiv:1610.09982.

      Alsaffar, A., & Omar, N. (2015). Integrating a Lexicon based approach and K nearest neighbour for Malay sentiment analysis. Journal of Computer Science, 11(4), 639.https://doi.org/10.3844/jcssp.2015.639.644




Article ID: 9668
DOI: 10.14419/ijet.v7i1.3.9668

Copyright © 2012-2015 Science Publishing Corporation Inc. All rights reserved.