An open Architecture for enhancing performance of complex OCR applications

  • Authors

    • K. Jambi
    • H. Al-Barhamtoshy
    • A. Fattouh
    • A. Al-Ghamdi
    • F. Eassa
    • M. Khemakhem
    https://doi.org/10.14419/ijet.v8i1.11.28188
  • Open Architecture, Complex OCR applications, performance.
  • Abstract

    Taking advantages of the different existing computing environments, infrastructures and resources for running in the optimal and/or in a customized manner any given complex Optical Character Recognition (OCR) software or application constitutes nowadays a challenge. Indeed, we mean by a complex OCR software or application any document digitization and computerization process which includes either plenty heterogeneous documents to process as input and/or several strong and sometimes complex OCR techniques to use in order to achieve good accuracy (recognition) rates. Moreover, the diversity and the very high computing and storage powers provided by such computing environments and infrastructures in one hand and the lack of powerful software and tools allowing their optimal or good utilization in the other hand make the problem a challenge.

     

    Consequently, this paper proposes a novel open architecture, which attempts to use properly such environments and infrastructures in order to run at least in a pseudo optimal and/or a customized manner any given complex OCR software or application. Actually, the two most important features, which make the proposed architecture original, are first, its complete independency from the different existing distributed infrastructures that can run any given complex OCR application. Second, its flexibility, which allows any new distributed infrastructure to be considered during the scheduling process of any given complex OCR application since the scheduler, is able to detect automatically and consider any added infrastructure. Our architecture presents several advantages, indeed, it improves drastically the performances of any given complex OCR application, it is platform and software independent in addition to its flexibility as described and explained later.

     

     

     

  • References

    1. [1] H. El Abed, L. Wenyin and V. Margner, International Conference on Document Analysis and Recognition (ICDAR 2011) Competitions Overview, International Conference on Document Analysis and Recognition, 2011.

      [2] I. Abdelaziz, S. Abdou, and H. Al-Barhamtoshy, A large vocabulary system for Arabic online handwriting recognition, Pattern Analysis & Applications, Springer, Dec. 2015, DOI 10.1007/s10044-015-0526-7. http://link.springer.com/article/10.1007%2Fs10044-015-0526-7#page-1

      [3] A. Hesham, S. Abdou, A. Badr, M. Rashwan, H. Al-Barhamtoshy, A Zone Classification Approach for Arabic Documents using Hybrid Features, (IJACSA) International Journal of Advanced Computer Science and Applications, Vol. 7, No. 7, 2016, pp. 158-162, https://www.researchgate.net/profile/Sherif_Abdou/publication/30582077 3_A_Zone_Classification_Approach_for_Arabic_Documents_using_Hyb rid_Features/links/57b1c50d08ae15c76cbb2e8b.pdf

      [4] S. Eskenazi, P. Gomez-Krämer, J. Ogier, A comprehensive survey of mostly textual document segmentation algorithms since 2008. Pattern Recognition 64 (2017) 1–14.

      [5] D. Petcu, S. Panica, D. Banciu, V. Negru, A. Eckstein, Optical Character Recognition on a Grid Infrastructure», 3rd International Conference on Automated Production of Cross Media Content for Multi-channel Distribution, IEEE, Nov, 2007.

      [6] M. Khemakhem, A. Belghith: Agent based architecture for Parallel and Distributed Complex Information Processing, the International Revue on computers and software (IRECOS), Vol.2, No.1, p. 38-44, January 2007.

      [7] M. Khemakhem, A. Belghith and M. Labidi: The DTW data distribution over a grid computing architecture, International Journal of Computer Sciences and Engineering Systems (IJCSES), Vol.1, No. 4, p. 241-247, December 2007.

      [8] M. Labidi, M. Khemakhem and M. Jemni, Grid’5000 Based Large Scale OCR Using the DTW Algorithm: Case of the Arabic Cursive Writing, in the book, Recent Advances in Document Recognition and Understanding, ISBN 978-953-307-320-0 InTech, Rijeka, Croatia, October, 2011.

      [9] Z. Trifa, M. Labidi and M. Khemakhem: Arabic Cursive Characters Distributed Recognition using the DTW Algorithm on BOINC: Performance Analysis, the International Journal of Advanced Computer Science and Applications, Vol. 2 No. 3, March 2011.

      [10] H. Hamdi and M. Khemakhem: Distributing Arabic Handwriting Recognition system based on the combination of Grid Meta-Scheduling and Peer-to-Peer Technologies (Omnivore), the Universal Journal of Computer Science and Engineering Technology, 1(1), 31 – 35, Oct, 2010.

      [11] M. Khemakhem, A. Belghith: Towards a distributed Arabic OCR based on the DTW algorithm, the International Arab Journal of Information Technology (IAJIT), Vol. 6, No. 2, p. 153-161, April 2009.

      [12] M. Khemakhem, A. Belghith: A P2p Grid Architecture for Distributed Arabic OCR Based On the DTW Algorithm », The International Journal of Computers and Applications (ACTA PRESS, IJCA), Vol. 31, N°. 1, 2009.

      [13] H. Hamdi, Kay Dornemann and M. Khemakhem: Advanced Distributed Architecture for a Complex and Large Scale Arabic Handwriting Recognition Framework, Accepted paper in the International Journal of High Performance Computing and Networking, IJHPCN, 2016.

      [14] H. Hamdi, M. Khemakhem and Aisha Zaidan: Complementary Approaches built as Web Services for Arabic Handwriting OCR Systems via Amazon Elastic MapReduce (EMR) Model, Accepted paper in the International Arab journal of Information Technology IAJIT 2016.

      [15] H. Hamdi and M. Khemakhem: A Secured Distributed OCR System in a Pervasive Environment with Authentication as a Service in the Cloud, in Proc of the (IEEE) International Conference on Multimedia Computing and Systems, April 14-16, 2014, Marrakesh, Morocco.

      [16] H. Hamdi and M. Khemakhem: Arabic Islamic Manuscripts Digitization based on Hybrid K-NN/ SVM Approach and Cloud Computing Technologies, Accepted paper in Taibah University International Conference on Advances in Information Technology for the Holy Quran and Its Sciences (NOORIC2013), December 2013, Al-Madinah, Saudi Arabia.

      [17] H. Al-Barhamtoshy, M. Khemakhem, F. Eassa, A. Fattouh, A. Al-Ghamdi, K. Jambi, Universal Metadata Repository for Document Analysis and Recognition, The 13th ACS/IEEE International Conference on Computer Systems and Applications, AICCSA 2016.

  • Downloads

  • How to Cite

    Jambi, K., Al-Barhamtoshy, H., Fattouh, A., Al-Ghamdi, A., Eassa, F., & Khemakhem, M. (2019). An open Architecture for enhancing performance of complex OCR applications. International Journal of Engineering & Technology, 8(1.11), 154-157. https://doi.org/10.14419/ijet.v8i1.11.28188

    Received date: 2019-03-03

    Accepted date: 2019-03-03