Pre-Processing of Mobile Camera Captured Images for OCR

Authors

  • Pushpinder Singh Research Scholar, Department of Computer Science, Punjabi University, Patiala, Punjab, India
  • Dharam Veer Sharma Department of Computer Science, Punjabi University, Patiala, Punjab, India

Keywords:

Pre-processing, Mobile camera captured images, Skew detection and correction, Cropping, Perspective projection, Noise removal, Binarization

Abstract

Optical Character Recognition (OCR) systems are nowadays capable of recognizing different printed scripts but the accuracy of any OCR system mainly depends upon the quality of text image. Mobile phones have become the most popular handheld device in this era of technology.  A new way to digitize the image is using the mobile camera. Although it is very easy to capture the image with mobile camera but it also brings a lot of challenges. Various challenges in mobile camera captured images are discussed in this paper. Various pre-processing operations need to be performed on the camera captured input image to enhance its quality. This paper also presents the implementation of different pre-processing techniques to improve the quality of camera captured image which can be further used in text recognition.     

Downloads

Download data is not yet available.

References

Puneet & Garg, Naresh. (2013). Binarization Techniques used for Grey Scale Images. International Journal of Computer Applications. 71. 8-11, 2013.

Jyotsna, S. Chauhan, E. Sharma and A. Doegar, "Binarization techniques for degraded document images — A review," 2016 5th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO), Noida, India, 2016, pp. 163-166.

O’Gorman, L.: “The document spectrum for page layout analysis,”IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 15, no. 11,1993, pp. 1162–1173.

Al-Khatatneh, S. A. Pitchay and M. Al-qudah, "A Review of Skew Detection Techniques for Document," 2015 17th UKSim-AMSS International Conference on Modelling and Simulation (UKSim), Cambridge, UK, 2015, pp. 316-321.

W. Postl, “Detection of linear oblique structures and skew scan in digitized documents,” In Proceedings of the 8th International Conference on Pattern Recognition, pp. 687-689. 1986.

Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun, “Faster r-cnn: Towards real-time object detection with region proposal networks,” in Advances in neural information processing systems, pp. 91–99, 2015.

Lei Zhang, Fan Yang, Yimin Daniel Zhang, and Ying Julie Zhu, “Road crack detection using deep convolutional neural network,” in 2016 IEEE international conference on image processing (ICIP). IEEE, pp. 3708–3712, 2016.

Tianmei Guo, Jiwen Dong, Henjian Li, and Yunxing Gao, “Simple convolutional neural network on image classification,” in 2017 IEEE 2nd International Conference on Big Data Analysis (ICBDA), IEEE, pp. 721–724, 2017.

Wolfgang Postl, “Detection of linear oblique structures and skew scan in digitized documents,” in Proc. Int. Conf. on Pattern Recognition, pp. 687–689, 1986.

R.C. Gonzalez and R.E. Woods, Digital Image Processing 26, Sept. 1992., Addison- Wesley, 1992.

K. Chinnasarn, Y. Rangsanseri and P. Thitimajshima, "Removing salt-and-pepper noise in text/graphics images," IEEE. APCCAS 1998. 1998 IEEE Asia-Pacific Conference on Circuits and Systems. Microelectronics and Integrating Systems. Proceedings (Cat. No.98EX242), Chiang Mai, Thailand, 1998, pp. 459-462.

D. Maheshwari, V. Radha,” Noise removal in compound image using median filter”,” International Journal of Advanced Trends in Computer Science and Engineering”, January 2010, pp. 1359-1362.

Steps in pre-processing of Mobile camera captured image

Downloads

Published

31.01.2023

How to Cite

Singh, P. ., & Sharma, D. V. . (2023). Pre-Processing of Mobile Camera Captured Images for OCR . International Journal of Intelligent Systems and Applications in Engineering, 11(2s), 147–155. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/2518

Issue

Section

Research Article