Assamese Script Identification and Content-Based Image Retrieval using Improved Siamese Neural Network

Authors

  • Sathiyaviradhan Janarthanan, Sivagnana Raj Shanmuganathan

Keywords:

Assamese script, Grayscale Conversion, threshold technique, CBIR System, Inception V3, Segmentation, word recognition, shape features, deep learning, feature extraction.

Abstract

Content based image retrieval (CBIR) is a remarkable system that allows users to effortlessly search and retrieve images from a vast dataset. With a large number of images in the dataset, identifying similar images to our query image can be quite challenging. An effective and efficient approach is needed to utilize information from these image repositories. This system allows users to easily access Assamese Script images from the database by searching for specific content. Assamese is a language that falls under the Indo-Aryan language family and is one of the 22 scheduled languages in India. There is a wide range of Assamese printed documents available in digital format. However, retrieving information from these digital document images is a task of the highest priority. Current studies have shown remarkable precision rates in initial retrieval levels, such as the top 10 and top 20 images. However, these rates drop significantly in subsequent levels, like the top 40, 50, and 70. Therefore, the objective of this paper is to introduce a novel CBIR approach that attains high precision values across all retrieval levels. The system consists of three main stages: Preprocessing, feature extraction, and feature similarity matching. Effective feature extraction techniques are crucial for optimizing the performance of a CBIR system. Therefore, we suggest utilizing the Improved Siamese Neural Network (ISNN) as a technique for extracting shape-based features. The Cosine distance with ISNN is utilized to match the query image features provided by users and identify a particular word in the document. The experimental findings demonstrate that, in comparison to the other current approaches taken into consideration in this study, the suggested methods yield superior outcomes.

Downloads

Download data is not yet available.

References

Garg, M., & Dhiman, G. (2021). A novel content-based image retrieval approach for classification using GLCM features and texture fused LBP variants. Neural Computing and Applications, 33(4), 1311-1328.

Ahmad, F. (2022). Deep image retrieval using artificial neural network interpolation and indexing based on similarity measurement. CAAI Transactions on Intelligence Technology, 7(2), 200-218.

Alshehri, M. (2020). A content-based image retrieval method using neural network-based prediction technique. Arabian journal for science and engineering, 45(4), 2957-2973.

Fachruddin, F., Saparudin, S., Rasywir, E., &Pratama, Y. (2022). Network and layer experiment using convolutional neural network for content based image retrieval work. TELKOMNIKA (Telecommunication Computing Electronics and Control), 20(1), 118-128.

Kundu, S., Paul, S., Singh, P. K., Sarkar, R., &Nasipuri, M. (2020). Understanding NFC-Net: a deep learning approach to word-level handwritten Indic script recognition. Neural Computing and Applications, 32, 7879-7895.

Ghosh, S., Chatterjee, A., Sen, S., Kumar, N., & Sarkar, R. (2021). CTRL–CapTuRedLight: a novel feature descriptor for online Assamese numeral recognition. Multimedia Tools and Applications, 80, 30033-30056.

Medhi, K., & Kalita, S. K. (2020). Assamese Handwritten Character Recognition using Supervised Fuzzy Logic. Int. J. Recent Technol. Eng, 8(5), 3750-3758.

Choudhury, A., & Sarma, K. K. (2021). A CNN-LSTM based ensemble framework for in-air handwritten Assamese character recognition. Multimedia Tools and Applications, 80(28), 35649-35684.

Zhao, L., & Tang, J. (2010, October). Content-based image retrieval using optimal feature combination and relevance feedback. In 2010 International Conference on Computer Application and system modeling (iccasm 2010) (Vol. 4, pp. V4-436). IEEE.

Ghahremani, M., Ghadiri, H., &Hamghalam, M. (2021). Local features integration for content-based image retrieval based on color, texture, and shape. Multimedia Tools and Applications, 80(18), 28245-28263.

Li, X., Yang, J., & Ma, J. (2021). Recent developments of content-based image retrieval (CBIR). Neurocomputing, 452, 675-689.

Hafemann, L. G., Oliveira, L. S., & Sabourin, R. (2018). Fixed-sized representation learning from offline handwritten signatures of different sizes. International Journal on Document Analysis and Recognition (IJDAR), 21, 219-232.

Theetchenya, S., Ramasubbareddy, S., Sankar, S., & Basha, S. M. (2021). Hybrid approach for content-based image retrieval. International Journal of Data Science, 6(1), 45-56.

Maiwald, F., Lehmann, C., &Lazariv, T. (2021). Fully automated pose estimation of historical images in the context of 4D geographic information systems utilizing machine learning methods. ISPRS International Journal of Geo-Information, 10(11), 748.

Byju, A. P., Demir, B., & Bruzzone, L. (2020). A progressive content-based image retrieval in JPEG 2000 compressed remote sensing archives. IEEE Transactions on Geoscience and Remote Sensing, 58(8), 5739-5751.

Zhao, S., Wang, L., Qian, X., & Chen, J. (2022). Enhancing performance-based generative architectural design with sketch-based image retrieval: a pilot study on designing building facade fenestrations. The Visual Computer, 38(8), 2981-2997.

Mohite, N. B., & Gonde, A. B. (2022). Deep features based medical image retrieval. Multimedia Tools and Applications, 81(8), 11379-11392.

Hamreras, S., Boucheham, B., Molina-Cabello, M. A., Benitez-Rochel, R., & Lopez-Rubio, E. (2020). Content based image retrieval by ensembles of deep learning object classifiers. Integrated computer-aided engineering, 27(3), 317-331.

Ghahremani, M., Ghadiri, H., &Hamghalam, M. (2021). Local features integration for content-based image retrieval based on color, texture, and shape. Multimedia Tools and Applications, 80(18), 28245-28263.

Alsmadi, M. K. (2020). Content-based image retrieval using color, shape and texture descriptors and features. Arabian Journal for Science and Engineering, 45(4), 3317-3330.

Varish, N. (2022). A modified similarity measurement for image retrieval scheme using fusion of color, texture and shape moments. Multimedia Tools and Applications, 81(15), 20373-20405.

Dubey, S. R. (2021). A decade survey of content based image retrieval using deep learning. IEEE Transactions on Circuits and Systems for Video Technology, 32(5), 2687-2704.

Du, W. S. (2021). Subtraction and division operations on intuitionistic fuzzy sets derived from the Hamming distance. Information Sciences, 571, 206-224.

Müller, H., Michoux, N., Bandon, D., & Geissbuhler, A. (2004). A review of content-based image retrieval systems in medical applications—clinical benefits and future directions. International journal of medical informatics, 73(1), 1-23.

Ballerini, L., Li, X., Fisher, R. B., & Rees, J. (2010). A query-by-example content-based image retrieval system of non-melanoma skin lesions. In Medical Content-Based Retrieval for Clinical Decision Support: First MICCAI International Workshop, MCBR-CDS 2009, London, UK, September 20, 2009, Revised Selected Papers 1 (pp. 31-38). Springer Berlin Heidelberg.

Rasheed, A., Ali, N., Zafar, B., Shabbir, A., Sajid, M., & Mahmood, M. T. (2022). Handwritten Urdu characters and digits recognition using transfer learning and augmentation with AlexNet. IEEE Access, 10, 102629-102645.

Banerjee, D., Bhowal, P., Malakar, S., Cuevas, E., Pérez‑Cisneros, M., & Sarkar, R. (2022). Z-transform-based profile matching to develop a learning-free keyword spotting method for handwritten document images. International Journal of Computational Intelligence Systems, 15(1), 93.

Ghazal, T. M. (2022). Convolutional neural network based intelligent handwritten document recognition. Computers, Materials & Continua, 70(3), 4563-4581.

Lakshmi, K. M. (2021). An efficient telugu word image retrieval system using deep cluster. Turkish Journal of Computer and Mathematics Education (TURCOMAT), 12(11), 3247-3255.

Zagoris, K., Amanatiadis, A., &Pratikakis, I. (2021). Word spotting as a service: an unsupervised and segmentation-free framework for handwritten documents. Journal of Imaging, 7(12), 278.

Wunnava, A., Naik, M. K., Panda, R., Jena, B., & Abraham, A. (2020). A novel interdependence based multilevel thresholding technique using adaptive equilibrium optimizer. Engineering Applications of Artificial Intelligence, 94, 103836.

Yang, P., Song, W., Zhao, X., Zheng, R., & Qingge, L. (2020). An improved Otsu threshold segmentation algorithm. International Journal of Computational Science and Engineering, 22(1), 146-153.

Peng, Z., Li, X., Zhu, Z., Unoki, M., Dang, J., & Akagi, M. (2020). Speech emotion recognition using 3d convolutions and attention-based sliding recurrent networks with auditory front-ends. IEEE Access, 8, 16560-16572.

Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., ... & Rabinovich, A. (2015). Going deeper with convolutions. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1-9).

Wang, C., Chen, D., Hao, L., Liu, X., Zeng, Y., Chen, J., & Zhang, G. (2019). Pulmonary image classification based on inception-v3 transfer learning model. IEEE Access, 7, 146533-146541.

Subrahmanyeswara Rao, B. (2020). Accurate leukocoria predictor based on deep VGG‐net CNN technique. IET Image Processing, 14(10), 2241-2248.

Lu, S., Lu, Z., & Zhang, Y. D. (2019). Pathological brain detection based on AlexNet and transfer learning. Journal of computational science, 30, 41-47.

Madduri, A. (2021). Content based Image Retrieval System using Local Feature Extraction Techniques. International Journal of Computer Applications, 183(20), 16-20.

Chicco, D. (2021). Siamese neural networks: An overview. Artificial neural networks, 73-94.

Varish, N., Kumar, S., & Pal, A. K. (2017). A novel similarity measure for content based image retrieval in discrete cosine transform domain. FundamentaInformaticae, 156(2), 209-235.

Downloads

Published

16.01.2023

How to Cite

Sathiyaviradhan Janarthanan. (2023). Assamese Script Identification and Content-Based Image Retrieval using Improved Siamese Neural Network. International Journal of Intelligent Systems and Applications in Engineering, 11(1), 401–433. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/7153

Issue

Section

Research Article