A Survey on Sentiment Study in Twitter Data Using Hadoop Streaming API

  • Abstract

    Twitter is an online social networking site that produces a continuous stream of live data, much of it semi-structured or unstructured. This work discusses a system that analyzes tweets obtained through the Twitter API. To improve its scalability, the work is planned for the Java-Hadoop framework, a widely adopted distributed processing platform that uses the MapReduce parallel programming model. Finally, extensive experiments are to be conducted on real data sets, with the aim of achieving accuracy comparable to or higher than that of the systems proposed in the literature. The focus is on classifying tweets as positive, negative, or neutral through opinion mining.
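
    The MapReduce formulation of positive, negative, and neutral tweet classification can be illustrated with a minimal sketch. It assumes a Hadoop-streaming-style mapper and reducer that exchange tab-separated text records. The tiny word lexicon, the function names, and the local `run_locally` driver are all hypothetical and serve only as illustration; they are not the method described in the paper.

```python
from itertools import groupby

# Tiny illustrative lexicon (an assumption, not taken from the paper).
POSITIVE = {"good", "great", "love", "happy"}
NEGATIVE = {"bad", "awful", "hate", "sad"}


def classify(tweet):
    """Label a tweet by counting positive and negative lexicon hits."""
    words = [w.strip(".,!?#@").lower() for w in tweet.split()]
    score = sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


def mapper(lines):
    """Map step: emit one 'label<TAB>1' record per non-empty tweet line."""
    for line in lines:
        line = line.strip()
        if line:
            yield f"{classify(line)}\t1"


def reducer(sorted_lines):
    """Reduce step: sum the counts per label; input must be sorted by key."""
    pairs = (line.split("\t") for line in sorted_lines)
    for label, group in groupby(pairs, key=lambda kv: kv[0]):
        yield f"{label}\t{sum(int(count) for _, count in group)}"


def run_locally(tweets):
    """Simulate map, shuffle/sort, and reduce in a single process."""
    return list(reducer(sorted(mapper(tweets))))


if __name__ == "__main__":
    sample = [
        "I love this phone!",
        "Awful service, I hate it",
        "Flight lands at noon",
        "great day",
    ]
    for record in run_locally(sample):
        print(record)
```

    On a cluster, the map and reduce functions would typically live in two separate scripts that read standard input and write standard output, passed to the Hadoop streaming jar through its `-mapper` and `-reducer` options. The framework then performs the sort between the two steps that `sorted` stands in for here.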



  • Keywords

    Java-Hadoop; MapReduce; Opinion Mining; Positive analysis; Twitter-API.

Article ID: 10789
DOI: 10.14419/ijet.v7i1.1.10789

Copyright © 2012-2015 Science Publishing Corporation Inc. All rights reserved.