Recognition of Printed Kannada Text in Scene Images using Machine Learning Techniques

Authors

  • Mahadeva Prasad Y. N. Department of computer Science , University of Mysore, Mysore- 570005, Karnataka, India
  • Chethan H. K. Department of computer Science and Engineering, MITT Mysore-571302 VTUniversity,Karnataka, India

Keywords:

Filters, Kannada text, Text Recognition, LSTM Model, YOLO V7

Abstract

Images are essential and plays a key and an important function in an electronics and communication media for sharing the information. In recent days, every activity has to be recorded as a digital image. Despite the rapid progress in the growth of strong deep learning representations for detecting and recognizing scene text, current state-of-the-art techniques fall short of delivering satisfactory outcomes in complex scenarios, such as those involving diverse logos, decorated boards, or text embedded within components. This proposed research article provides a new approach to enhancing the detection of scene text and recognition of scene text performances by detecting defects in the detection results. In this research, four important stages such as pre-processing, Kannada text detection, feature extraction and recognition are carried after the collection of multilingual scene text images. Initially, the multilingual scene text images are collected and is preprocessed for the effectual removal of noise using bilateral filter and wiener filter for enrich the image quality. After pre-processing, detection step is performed for analyzing only Kannada text by leaving the other texts using Convolutional block attention-based YOLO V7. The highly discriminative feature/s are picked out from the detected scene images by using Dense stacked LSTM network model. Finally, the Convolutional Residual network assisted auto encoder model is employed for optimal recognition of Kannada language can be created using machine-readable strings. This yields the better recognition rate compared to cutting-edge techniques.

Downloads

Download data is not yet available.

References

Zheng Xiao et. al., An extended attention mechanism for scene text recognition. Expert Systems With Applications, 203, pp.1-11, May 2022.

Sahana K Adyanthaya, Text Recognition from Images: A Study. International Journal of Engineering Research and Technology, Volume 8, Issue 13, pp.18-20, 2020.

Matteo Brisinello et. al., Review on text detection methods on scene images. 61st International Symposium ELMAR-2019, Zadar, Croatia, pp.51-56, 2019.

Xiaoqian Li,Jie Liu, Shuwu Zhang, Text Recognition in Natural Scenes: A Review. International Conference on Culture-oriented Science & Technology, pp.154-159, 2020.

Vishnuvardhan and Dhanalakshmi Miryala. Scene Text Recognition of Indian Languages in Natural Scene Images. International Journal of Advanced research in engineering and Technology [IJARET-2020].

Shangbang Long, Xin He and Cong Yao, Scene Text Detection and Recognition: The Deep Learning Era. Received: 14 April 2020 / Accepted: 8 August 2020.

Aggarwal, C. C. Neural Networks and Deep Learning: A Textbook. Basingstoke, England: Springer.2018.

Z. Cheng, Y. Xu, F. Bai, Y. Niu, S. Pu, and S. Zhou, AON: Towards arbitrarily-oriented text recognition. In: Proceedings of CVPR, 2018, pp. 5571–5579, [CVPR-2018].

Basavaraj S. Anami, Deepa S. Garag. A Semiautomatic Methodology for Recognition of Printed Kannada Character Primitives Useful in Character Construction. Recent Trends in Image Processing and Pattern Recognition. [Springer Singapore-2019].

Francis, L. M., & Sreenath, N, TEDLESS: Text detection using least-square SVM for natural scene. Journal of King Saud University-Computer and information sciences, 32(3), 87–299.

Roy, S., Shivakumara, P., Pal, U., Lu, T., & Kumar, G. H. (2020). Delaunay triangulation based text detection from multi-view images of natural scene. Pattern Recognition Letters, 129, 92–100.

Raghunandan, K. S., Shivakumara, P., Roy, S., Kumar, G. H., Pal, U., & Lu, T, Multi-script oriented text detection and recognition in video / scene /born digital images. In:IEEE transactions on circuits and systems for video technology, pp. 1145–1161.

Guo, J., You, R., & Hung, L, Mixed vertical and horizontal text traffic sign detection and recognition for street level scene. IEEE Access, 8, 69413–69425.

Liu, S., Xian, Y., Li, H., & Yu, Z. (2020). Detection in natural scene images using morphological component analysis and Laplacian dictionary. IEEE Journal of Automatic Sinica, 7(1), 214–222.

Panwar, M. A., Memon, K. A., Abro, A., Zhongliang, D., Khuhro, S. A., & Memon, S. (2020). Signboard detection and recognition using artificial neural networks. In: Proceedings on ICEIEC, pp. 16–19.

Dajian Zhong et. al., SGBANet: Semantic GAN and Balanced Attention Network for Arbitrarily Oriented Scene Text Recognition. arXiv:2207.10256v1 [cs.CV], July 2022.

H. T. Basavaraju et. al., Arbitrary oriented multilingual text detection and segmentation using level set and Gaussian mixture model. Evolutionary Intelligence, Vol. 14, pp.881– 894, 2021.

Hamam Mokayed et. al., A new defect detection method for improving text detection and Recognition performances in natural scene images. IEEE Explorer.

A. Vishnuvardhan, and Dhanalakshmi Miryala, Scene Text Recognition Of Indian Languages In Natural Scene Images. International Journal of Advanced Research in Engineering and Technology, Vol. 11, Issue.12, pp. 2773-2781 , 2020

A. Ram Bharadwaj et. al.,Telugu text extraction and recognition using convolutional and recurrent neural networks. International Journal of Engineering and Advanced Technology, pp.1449-1451, Vol.8, Issue-5, 2019.

Dr. Sana'a khudayer Jadwa,Wiener Filter based Medical Image De-noising. International Journal of Science and Engineering Applications Vol. 7–Issue 09, pp.318-323, 2018

T. E. De Campos, B. R. Babu and M. Varma, Character recognition in natural images. VISAPP 2009 (2).

C. Yao, X. Bai and W. Liu, ”A Unified Framework for Multioriented Text Detection and Recognition. In:IEEE Transactions on Image Processing, vol. 23, no. 11, Nov. 2014, pp. 4737-4749.

M. Valdenegro-Toro, P. Plo ̈ger, S. Eickeler and I. Konya,Histograms of Stroke Widths for Multi-script Text Detection and Verification in Road Scenes.2016 IFAC-Papers Online, 49(15), pp. 100–107. [Online].

M. R. Phangtriastu, J. Harefa, and D. F. Tanoto, Comparison between Neural Netwok and Support Vector Machine in Optical Character Recognition. Procedia Comput. Sci., vol. 116, 2017, pp. 351–357.

Janani, S., Dilip, R., Talukdar, S.B., Talukdar, V.B., Mishra, K.N., Dhabliya, D. IoT and machine learning in smart city healthcare systems (2023) Handbook of Research on Data-Driven Mathematical Modeling in Smart Cities, pp. 262-279.

Juneja, V., Singh, S., Jain, V., Pandey, K.K., Dhabliya, D., Gupta, A., Pandey, D. Optimization-based data science for an IoT service applicable in smart cities (2023) Handbook of Research on Data-Driven Mathematical Modeling in Smart Cities, pp. 300-321.

Downloads

Published

13.12.2023

How to Cite

Prasad Y. N., M. ., & H. K., C. . (2023). Recognition of Printed Kannada Text in Scene Images using Machine Learning Techniques. International Journal of Intelligent Systems and Applications in Engineering, 12(8s), 600–614. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/4233

Issue

Section

Research Article