RANER: RDI Framework for Arabic Named Entity Recognition

  • Authors

    • Amr M. Sayed
    • Sherif Abdou
    • Mohsen Rashwan
    • Hassanin Al-Barhamtoshy
    https://doi.org/10.14419/ijet.v8i1.11.28187
  • named entitiy recognition, condetional random fields, machine learning-based approach, natural language processing.
  • Named Entity Recognition (NER) task allows NLP applications to extract proper names from text. In addition, a significant component affects the performance of other NLP tasks such as text summarization, topic detection and key phrase extraction. Although many researches are conducted to enhance the NER task, only limited researches in Arabic named Entity Recognition have been performed. Researches in this field use either rule-based approach or machine learning approach. This paper introduces a solution for the NER problem using a machine learning approach, which combines Conditional Random Fields (CRF) classifier and predefined gazetteers. Our system uses syntactic features and morphological lookup-table features to train the classifier. This features extraction approach saves the time of morphological features that depend on analyzers without affecting the precision of the system. We evaluate our system by way of experiments using different sets of features to extract three of main named-entity types; Person, Location, and Organization. Experimental results showed that our approach has achieved better performance that rule-based approaches. Our system achieved a performance of 70.7%, 90.0%, and 79.2% in term of recall, precision, and F-measure, respectively..

     

     

  • References

    1. [1] N. Chinchor, "Overview of MUC-7," in Proceedings of the Seventh Message Understanding Conference (MUC-7), 1998.

      [2] S. Abuleil, "Extracting Names from Arabic Text for Question-Answering Systems," in Proceedings of Coupling approaches, coupling media and coupling languages for information retrieval (RIAO 2004), Avignon, France, 2004.

      [3] A. Alâ€Sughaiyer and I. A. Alâ€Kharashi, "Arabic morphological analysis techniques: A comprehensive survey," Journal of the American Society for Information Science and Technology, pp. 189-213, 2004.

      [4] S. AbdelRahman, M. Elarnaoty, M. Magdy and A. Fahmy, "Integrated machine learning techniques for Arabic named entity recognition," IJCSI, vol. 7, pp. 27-36, 2010.

      [5] O. Bender, F. J. Och and H. Ney, "Maximum Entropy Models For Named Entity Recognition," in Proc. of CoNLL-2003, Edmonton, Canada, 2003.

      [6] J. Maloney and M. Niv, "TAGARAB: A Fast, Accurate Arabic Name Recogniser Using High Precision Morphological Analysis," in Proceedings of the Workshop on Computational Approaches to Semitic Languages, Montreal, Canada, 1998.

      [7] S. Abuleil, "Extracting Names from Arabic Text for Question-Answering Systems," in Proceedings of Coupling approaches, coupling media and coupling languages for information retrieval (RIAO 2004), Avignon, France, 2004.

      [8] D. Samy, A. Moreno and J. Guirao, "A Proposal for an Arabic Named Entity Tagger Leveraging a Parallel Corpus," in International Conference RANLP, Borovets, Bulgaria.

      [9] Y. Benajiba, P. Rosso and J. M. Bened´ıruiz, "Anersys:

      [10] Anarabicnamedentity recognition system based on maximum entropy," in Computational Linguistics and Intelligent Text Processing, 2007.

      [11] Y. Benajiba and P. Rosso, "Arabic named entity recognition using conditional random fields," in Proc. of Workshop on HLT & NLP within the Arabic World, LREC, 2008.

      [12] Y. Benajiba, M. Diab and P. Rosso, "Arabic named entity recognition: An svmbased approach," in Proceedings of 2008 Arab International Conference on Information Technology (ACIT), 2008.

      [13] Y. Shao, C. Hardmeier and J. Nivre, "Multilingual Named Entity Recognition using Hybrid Neural Networks," in The Sixth Swedish Language Technology Conference (SLTC), 2016.

      [14] M. Gridach, "Character-Aware Neural Networks for Arabic Named Entity Recognition for Social Media," in WSSANLP 2016 , 2016.

      N. Limsopatham and N. Collier, "Bidirectional lstm for named entity recognition in twitter messages," in WNUT 20
  • Downloads

  • How to Cite

    M. Sayed, A., Abdou, S., Rashwan, M., & Al-Barhamtoshy, H. (2019). RANER: RDI Framework for Arabic Named Entity Recognition. International Journal of Engineering & Technology, 8(1.11), 161-164. https://doi.org/10.14419/ijet.v8i1.11.28187