Arabic Part-of-Speech Tagger, an Approach Based on Neural Network Modelling

  • Abstract
  • Keywords
  • References
  • PDF
  • Abstract

    POS-tagging gained the interest of researchers in computational linguistics sciences in the recent years. Part-of-speech tagging systems assign the proper grammatical tag or morpho-syntactical category labels automatically to every word in the corpus per its appearance on the text. POS-tagging serves as a fundamental and preliminary step in linguistic analysis which can help in developing many natural language processing applications such as: word processing systems, spell checking systems, building dictionaries and in parsing systems. Arabic language gained the interest of researchers which led to increasing demand for Arabic natural language processing systems. Artificial neural networks has been applied in many applications such as speech recognition and part of speech prediction, but it is considered as a new approach in Part-of-speech tagging. In this research, we developed an Arabic POS-tagger using artificial neural network. A corpus of 20,620 words, which were manually assigned to the appropriate tags was developed and used to train the artificial neural network and to test the part of speech tagger systems’ overall performance. The accuracy of the developed tagger reaches 89.04% using the testing dataset. While, it reaches 98.94% using the training dataset. By combining the two datasets, the accuracy rate for the whole system is 96.96%.



  • Keywords

    Part of speech tagging; Arabic tagger; artificial neural networks

  • References

    1. [1] Khoja S. APT : Arabic Part-Of-speech Tagger. Proceedings of the Student Workshop at NAACL. 2001:20--5.

      [2] Jurafsky D, Martin JH. Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. Speech and Language Processing An Introduction to Natural Language Processing Computational Linguistics and Speech Recognition. 1999;21:0-934.

      [3] Larsen J. Introduction to Arti cial Neural Networks. 1999[November].

      [4] Kasabov NK. Foundations of neural networks, fuzzy systems, and knowledge engineering. Engineering. 1996:581-.

      [5] Alqrainy S. A morphological-syntactical analysis approach for Arabic textual tagging. 2008:1-2.

      [6] Abumalloh RA, Al-Sarhan HM, Abu-Ulbeh W. Building Arabic corpus applied to part-of-speech tagging. Indian Journal of Science and Technology. 2016;9[46].

      [7] Al-serhan HM, Montfort DE. Of Arabic Word Roots Extraction An Approach Based on. 2008.

      [8] Demuth H. Neural Networks. Mathworks inc. 2006;19[1]:1-7.

      [9] Al-Serhan HM, Muaidi H. Extraction of Arabic word roots: An Approach Based on Computational Model and Multi-Backpropagation Neural Networks. 2008.

      [10] Lai S, Serra M. Concrete strength prediction by means of neural network. Construction and Building Materials. 1997;11[2]:93-8.




Article ID: 14009
DOI: 10.14419/ijet.v7i2.29.14009

Copyright © 2012-2015 Science Publishing Corporation Inc. All rights reserved.