Skew detection based on vertical projection in latin character recognition of text document image
Keywords:Optical Character Recognition, Preprocessing, Skew Detection, Projection Profile, Vertical Projection.
The accuracy of Optical Character Recognition is deeply affected by the skew of the image. Skew detection & correction is one of the steps in OCR preprocessing to detect and correct the skew of document image. This research measures the effect of Combined Vertical Projection skew detection method to the accuracy of OCR. Accuracy of OCR is measured in Character Error Rate, Word Error Rate, and Word Error Rate (Order Independent). This research also measures the computational time needed in Combined Vertical Projection with different iteration. The experiment of Combined Vertical Projection is conducted by using iteration 0.5, 1, and 2 with rotation angle within -10 until 10 degrees. The experiment results show that the use of Combined Vertical Projection could lower the Character Error Rate, Word Error Rate, and Word Error Rate (Order Independent) up to 35.53, 34.51, and 32.74 percent, respectively. Using higher iteration value could lower the computational time but also decrease the accuracy of OCR.
 Chandarana J & Kapadia MR, â€œOptical character recognitionâ€, International Journal of Emerging Technology and Advanced Engineering, Vol. 4, No. 5, (2014), pp. 219-223.
 Minoru M, Character Recognition, IntechOpen, (2010).
 Berchmans D & Kumar SS, â€œOptical character recognition: an overview and an insightâ€, Proceedings of International Control, Instrumentation, Communication and Computational Technologies (ICCICCT), (2014), pp: 1361-1365.
 Papandreou A & Gatos B, â€œA novel Skew Detection technique based on Vertical Projectionsâ€, Proceedings of International Document Analysis and Recognition (ICDAR), (2011), pp: 384-388.
 Postl W, â€œDetection of linear oblique structures and skew scan in digitized documentsâ€, Proceedings of International Conference on Pattern Recognition, (1986), pp: 687-689.
 Chauduri BB & Pal U, â€œAn improved document skew angle estimation techniqueâ€, Journal of Pattern Recognition Letters, Vol. 17, No. 8, (1996), pp. 899-904.
 Kant AJ & Vyavahare AJ, â€œDevanagari OCR using projection profile segmentation methodâ€, International Research Journal of Engineering and Technology, Vol. 3, No. 7, (2016), pp. 132-134.
 Carrasco RC, â€œAn open-source OCR evaluation toolâ€, Proceedings of the First International Conference on Digital Access to Textual Cultural Heritage, (2014), pp: 179-184.
 Smith R, et.al., â€œTesseract Open Source OCR Engineâ€, (2017), available online: https://github.com/tesseract-ocr/tesseract
 Vijayarani S & Sakila A, â€œPerformance comparison of OCR Toolsâ€, International Journal of UbiComp (IJU), Vol. 6, No. 3, (2015), pp. 19-30.
 Al-Khatatneh A, Pitchay SA, & Al-qudah M, â€œA Review of Skew Detection Techniques for Documentâ€, Proceedings of International Conference on Modelling and Simulation (UKSim), (2015), pp: 316-321.
 Jain B & Borah M, â€œA survey paper on skew detection of offline handwritten character recognition systemâ€, International Journal of Computer Engineering and Applications, Vol. 6, No. 1, (2014).
 Poovizhi P, â€œA study on preprocessing techniques for the character recognitionâ€, International Journal of Open Information Technologies, Vol. 2, No. 12, (2014), pp. 21-24.
View Full Article:
How to Cite
LicenseAuthors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under aÂ Creative Commons Attribution Licensethat allows others to share the work with an acknowledgement of the work''s authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal''s published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (SeeÂ The Effect of Open Access).