Events Tagging in Twitter Using Twitter Latent Dirichlet Allocation

Ghaidaa A. Al-Sultany; Hiba J. Aleqabie

doi:10.14419/ijet.v7i4.19.28065

Article Summary Abstract References Full Article How to cite

Authors
- Ghaidaa A. Al-Sultany
- Hiba J. Aleqabie
2018-11-27

https://doi.org/10.14419/ijet.v7i4.19.28065
Twitter, TLDA, PMI, and Perplexity.
Abstract

Twitter has become a great platform to publish and carrying news, advisements, events, topics and even daily events in our lives. Twitter Post has limitations on the length and noise. These limitations make that the post is unsuitable for topic modeling due to sparsity.Â Â In this paper, Twitter Latent Dirichlet allocation (TLDA) methodÂ for topics modeling was applied to overcome the sparsity problem of tweets modeling. Many steps were implemented for eventÂ tagging on Twitter. First: construct aÂ datasetÂ by hashtag pooling technique, and then theÂ preprocessingÂ was performed to extract the features.Â Secondly, find the suitable number of topics through Perplexity criterion, then,Â the topics are labeled by WordNet lexicon.Â Finally,Â events are tagging using Pricewise Mutual Information (PMI) criterion.Â The dataset is constructed about various topics including the American elections, Football world cup 2018, and a natural phenomenon and many others; the number of tweets is 63458. This study shows good results in training tweets dataset.
Â
References
1. [1] A. O. Steinskog, J. F. Therkelsen, and B. GambÃ¤ck, â€œTwitter Topic Modeling by Tweet Aggregation,â€ Proc. 21st Nord. Conf. Comput. Linguist., no. May, pp. 77â€“86, 2017.
  [2] H. Cai, Y. Yang, X. Li, and Z. Huang, â€œWhat are Popular : Exploring Twitter Features for Event Detection , Tracking and Visualization,â€ MM â€™15 Proc. 23rd ACM Int. Conf. Multimed., pp. 89â€“98, 2015.
  [3] X. Zhao, J. Jiang, and W. X. Zhao, â€œAn Empirical Comparison of Topics in Twitter and Traditional Media,â€ Singapore Manag. Univ. Sch. Inf. Syst. Tech. Pap. Ser., 2011.
  [4] R. Mehrotra, S. Sanner, W. Buntine, and L. Xie, â€œImproving LDA topic models for microblogs via tweet pooling and automatic labeling,â€ Proc. 36th Int. ACM SIGIR Conf. Res. Dev. Inf. Retr. - SIGIR â€™13, p. 889, 2013.
  [5] D. Alvarez-Melis and M. Saveski, â€œTopic Modeling in Twitter: Aggregating Tweets by Conversations,â€ $Icwsm16, no. Icwsm, pp. 519â€“522, 2016.
  [6] W. D. Penniman, Social Informatics, vol. 6430. 2010.
  [7] H. Kwak, C. Lee, H. Park, and S. Moon, â€œWhat is Twitter , a Social Network or a News Media?,â€ Int. World Wide Web Conf. Comm., pp. 1â€“10, 2010.
  [8] K. Sarkar and R. Law, â€œA Novel Approach to Document Classification using WordNet,â€ arXiv1510.02755 [cs], pp. 1â€“14, 2015.
  [9] G. Ifrim, B. Shi, and I. Brigadir, â€œEvent detection in Twitter using aggressive filtering and hierarchical tweet clustering,â€ CEUR Workshop Proc., vol. 1150, pp. 33â€“40, 2014.
  [10] L. Liu, L. Tang, W. Dong, S. Yao, and W. Zhou, â€œAn overview of topic modeling and its current applications in bioinformatics,â€ Springerplus, vol. 5, no. 1, 2016.
  [11] D. A. Ostrowski, â€œUsing latent dirichlet allocation for topic modelling in twitter,â€ Proc. 2015 IEEE 9th Int. Conf. Semant. Comput. IEEE ICSC 2015, pp. 493â€“497, 2015.
  [12] X. Wan and T. Wang, â€œAutomatic Labeling of Topic Models Using Text Summaries,â€ Proc. 54th Annu. Meet. Assoc. Comput. Linguist. (Volume 1 Long Pap., pp. 2297â€“2305, 2016.
  [13] C. C. MuÅŸat, Åž. TrÇŽuÅŸan-Matu, J. Velcin, and M.-A. Rizoiu, â€œAutomatic extraction of conceptual labels from topic models,â€ UPB Sci. Bull. Ser. C Electr. Eng., vol. 74, no. 2, pp. 57â€“68, 2012.
  [14] A. Huang, R. Lehavy, A. Zang, and R. Zheng, â€œAnalyst Information Discovery and Interpretation Roles: A Topic Modeling Approach,â€ Ssrn, 2014.
  [15] W. X. Zhao et al., â€œTopical keyphrase extraction from Twitter,â€ Proc. 49th Annu. Meet. Assoc. Comput. Linguist. Hum. Lang. Technol. 1, pp. 379â€“388, 2011.
Downloads
How to Cite
A. Al-Sultany, G., & J. Aleqabie, H. (2018). Events Tagging in Twitter Using Twitter Latent Dirichlet Allocation. International Journal of Engineering & Technology, 7(4.19), 884-888. https://doi.org/10.14419/ijet.v7i4.19.28065
ACM

ACS

APA

ABNT

Chicago

Harvard

IEEE

MLA

Turabian

Vancouver

Download Citation

Endnote/Zotero/Mendeley (RIS)

BibTeX
Received date: 2019-03-01

Accepted date: 2019-03-01

Published date: 2018-11-27

Events Tagging in Twitter Using Twitter Latent Dirichlet Allocation

Authors

Abstract

References

Downloads

How to Cite

Published