Pre-Processing of Mobile Camera Captured Images for OCR
Keywords:
Pre-processing, Mobile camera captured images, Skew detection and correction, Cropping, Perspective projection, Noise removal, BinarizationAbstract
Optical Character Recognition (OCR) systems are nowadays capable of recognizing different printed scripts but the accuracy of any OCR system mainly depends upon the quality of text image. Mobile phones have become the most popular handheld device in this era of technology. A new way to digitize the image is using the mobile camera. Although it is very easy to capture the image with mobile camera but it also brings a lot of challenges. Various challenges in mobile camera captured images are discussed in this paper. Various pre-processing operations need to be performed on the camera captured input image to enhance its quality. This paper also presents the implementation of different pre-processing techniques to improve the quality of camera captured image which can be further used in text recognition.
Downloads
References
Puneet & Garg, Naresh. (2013). Binarization Techniques used for Grey Scale Images. International Journal of Computer Applications. 71. 8-11, 2013.
Jyotsna, S. Chauhan, E. Sharma and A. Doegar, "Binarization techniques for degraded document images — A review," 2016 5th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO), Noida, India, 2016, pp. 163-166.
O’Gorman, L.: “The document spectrum for page layout analysis,”IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 15, no. 11,1993, pp. 1162–1173.
Al-Khatatneh, S. A. Pitchay and M. Al-qudah, "A Review of Skew Detection Techniques for Document," 2015 17th UKSim-AMSS International Conference on Modelling and Simulation (UKSim), Cambridge, UK, 2015, pp. 316-321.
W. Postl, “Detection of linear oblique structures and skew scan in digitized documents,” In Proceedings of the 8th International Conference on Pattern Recognition, pp. 687-689. 1986.
Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun, “Faster r-cnn: Towards real-time object detection with region proposal networks,” in Advances in neural information processing systems, pp. 91–99, 2015.
Lei Zhang, Fan Yang, Yimin Daniel Zhang, and Ying Julie Zhu, “Road crack detection using deep convolutional neural network,” in 2016 IEEE international conference on image processing (ICIP). IEEE, pp. 3708–3712, 2016.
Tianmei Guo, Jiwen Dong, Henjian Li, and Yunxing Gao, “Simple convolutional neural network on image classification,” in 2017 IEEE 2nd International Conference on Big Data Analysis (ICBDA), IEEE, pp. 721–724, 2017.
Wolfgang Postl, “Detection of linear oblique structures and skew scan in digitized documents,” in Proc. Int. Conf. on Pattern Recognition, pp. 687–689, 1986.
R.C. Gonzalez and R.E. Woods, Digital Image Processing 26, Sept. 1992., Addison- Wesley, 1992.
K. Chinnasarn, Y. Rangsanseri and P. Thitimajshima, "Removing salt-and-pepper noise in text/graphics images," IEEE. APCCAS 1998. 1998 IEEE Asia-Pacific Conference on Circuits and Systems. Microelectronics and Integrating Systems. Proceedings (Cat. No.98EX242), Chiang Mai, Thailand, 1998, pp. 459-462.
D. Maheshwari, V. Radha,” Noise removal in compound image using median filter”,” International Journal of Advanced Trends in Computer Science and Engineering”, January 2010, pp. 1359-1362.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2023 Pushpinder Singh, Dharam Veer Sharma
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
All papers should be submitted electronically. All submitted manuscripts must be original work that is not under submission at another journal or under consideration for publication in another form, such as a monograph or chapter of a book. Authors of submitted papers are obligated not to submit their paper for publication elsewhere until an editorial decision is rendered on their submission. Further, authors of accepted papers are prohibited from publishing the results in other publications that appear before the paper is published in the Journal unless they receive approval for doing so from the Editor-In-Chief.
IJISAE open access articles are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. This license lets the audience to give appropriate credit, provide a link to the license, and indicate if changes were made and if they remix, transform, or build upon the material, they must distribute contributions under the same license as the original.