Image Caption Generation Using Recurrent Convolutional Neural Network

BV Subba  Rao; K.  Meenakshi; K.  Kalaiarasi; Ramesh  Babu P.; J.  Kavitha; V.  Saravanan

Image Caption Generation Using Recurrent Convolutional Neural Network

Authors

BV Subba Rao Professor, Department of Information Technology, PVP Siddhartha Institute of Technology, Vijayawada, Andhra Pradesh, India
K. Meenakshi Professor, Department of Mathematics, VTU(RC), CMR Institute of Technology, Bengaluru, Karnataka, India
K. Kalaiarasi Assistant Professor, PG and Research Department of Mathematics, Cauvery College for Women (Autonomous), Tiruchirappalli, Tamil Nadu, India.
Ramesh Babu P. Associate Professor, Department of Computer Science, College of Engineering and Technology, Wollega University, Nekemte, Oromia Region, Ethiopia
J. Kavitha Associate Professor, Department of Basic Sciences, Cambridge Institute of Technology (CIT), Bengaluru, India.
V. Saravanan Associate Professor, Department of Computer Science, College of Engineering and Technology, Dambi Dollo University, Dambi Dollo, Oromia Region, Ethiopia.

Keywords:

Image Captioning, Recurrent neural network, convolutional layers

Abstract

This paper presents a residual learning (RL) approach to generate automated captions for any given image. In this approach, a convolutional neural network (CNN) is employed to extract the spectral and spatial characteristics of the image, which is essential to solve the caption generation problem, which necessitates the use of CNN. In addition to this, we consider the nuanced quality of language by incorporating an image annotation generator into the system that has been recommended. The results of the experiments that have been presented here provide convincing evidence that the developed model is an improvement upon the various approaches to image captioning that are currently being used.

Downloads

Download data is not yet available.

References

Ding, S., Qu, S., Xi, Y., & Wan, S. (2020). Stimulus-driven and concept-driven analysis for image caption generation. Neurocomputing, 398, 520-530.

Chen, S., Jin, Q., Wang, P., & Wu, Q. (2020). Say as you wish: Fine-grained control of image caption generation with abstract scene graphs. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 9962-9971).

Liu, M., Li, L., Hu, H., Guan, W., & Tian, J. (2020). Image caption generation with dual attention mechanism. Information Processing & Management, 57(2), 102178.

He, X., Shi, B., Bai, X., Xia, G. S., Zhang, Z., & Dong, W. (2019). Image caption generation with part of speech guidance. Pattern Recognition Letters, 119, 229-237.

Zhou, Z., Zhang, X., Li, Z., Huang, F., & Xu, J. (2022). Multilevel attention networks and policy reinforcement learning for image caption generation. Big Data, 10(6), 481-492.

Agrawal, V., Dhekane, S., Tuniya, N., & Vyas, V. (2021, July). Image Caption Generator Using Attention Mechanism. In 2021 12th International Conference on Computing Communication and Networking Technologies (ICCCNT) (pp. 1-6). IEEE.

Zhao, S., Li, L., Peng, H., Yang, Z., & Zhang, J. (2020). Image caption generation via unified retrieval and generation-based method. Applied Sciences, 10(18), 6235.

Liu, X., & Xu, Q. (2020). Adaptive attention-based high-level semantic introduction for image caption. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), 16(4), 1-22.

Mounika, S., & Vijaybabu, P. (2022). Image caption generator using cnn and lstm. South Asian Journal of Engineering and Technology, 12(3), 78-86.

Downloads

Published

05.12.2023

How to Cite

Rao, B. S. ., Meenakshi, K. ., Kalaiarasi, K. ., Babu P., R. ., Kavitha, J. ., & Saravanan, V. . (2023). Image Caption Generation Using Recurrent Convolutional Neural Network . International Journal of Intelligent Systems and Applications in Engineering, 12(7s), 76–80. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/4033

Download Citation

Issue

Vol. 12 No. 7s (2024)

Section

Research Article

License

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

All papers should be submitted electronically. All submitted manuscripts must be original work that is not under submission at another journal or under consideration for publication in another form, such as a monograph or chapter of a book. Authors of submitted papers are obligated not to submit their paper for publication elsewhere until an editorial decision is rendered on their submission. Further, authors of accepted papers are prohibited from publishing the results in other publications that appear before the paper is published in the Journal unless they receive approval for doing so from the Editor-In-Chief.

IJISAE open access articles are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. This license lets the audience to give appropriate credit, provide a link to the license, and indicate if changes were made and if they remix, transform, or build upon the material, they must distribute contributions under the same license as the original.

Image Caption Generation Using Recurrent Convolutional Neural Network

Authors

Keywords:

Abstract

Downloads

References

Downloads

Published

How to Cite

Issue

Section

License

Most read articles by the same author(s)

Similar Articles

Announcements

Information for Authors

ijisae

Information

trindex