Under Water Image Enhancement using Custom VGG19
Keywords:
Custom VGG-19 Architecture, Deep Learning, Encoder-Decoder Network, High-Resolution Imaging, Relational Bilevel Aggregation Graph Convolutional Network, Underwater Image Enhancement, Variational Bayesian-based Robust Adaptive Filtering.Abstract
This research introduces an innovative approach to underwater image processing through the use of a modified VGG-19 architecture, renowned for its exceptional image processing capabilities using Variational Bayesian-based Robust Adaptive Filtering (VBRAF). At the heart of this approach is the integration of custom layers within the Relational Bilevel Aggregation Graph Convolutional Network (RBAGCN)-VGG-19 frameworks. Enhancing the model's adaptability to the unique challenges of underwater condition using image processing; this method incorporates a specialized encoder based on the VGG-19 architecture, enriched with additional custom layers for detailed from underwater imagery. The decoder component then reconstructs these details into images of higher clarity and resolution. A notable aspect of method is the inclusion of a max unpooling mechanism, which streamlines the processing of complex deep learning models and improves the clarity of lower-resolution. This research marks a significant advancement in underwater image processing, setting a new standard for high-quality underwater photography and marine research, facilitated by the customized VGG-19 architecture.
Downloads
References
Y. Wang, J. Zhang, Y. Cao, and Z. Wang, 2017, September. A deep CNN method for underwater image enhancement. In 2017 IEEE international conference on image processing (ICIP) (pp. 1382-1386). IEEE.
S. Anwar, C. Li, and F. Porikli, 2018. Deep underwater image enhancement. arXiv preprint arXiv:1807.03528.
C. Li, C. Guo, W. Ren, R. Cong, J. Hou, S. Kwong, and D. Tao, 2019. An underwater image enhancement benchmark dataset and beyond. IEEE transactions on image processing, 29, pp.4376-4389.
C.O. Ancuti, C. Ancuti, C. De Vleeschouwer, and P. Bekaert, 2017. Color balance and fusion for underwater image enhancement. IEEE Transactions on image processing, 27(1), pp.379-393.
Y. Wang, J. Guo, H. Gao, and H. Yue, 2021. UIEC^ 2-Net: CNN-based underwater image enhancement using two color space. Signal Processing: Image Communication, 96, p.116250.
M. Jian, X. Liu, H. Luo, X. Lu, H. Yu, and J. Dong, 2021. Underwater image processing and analysis: A review. Signal Processing: Image Communication, 91, p.116088.
H. Lu, Y. Li, Y. Zhang, M. Chen, S. Serikawa, and H. Kim, 2017. Underwater optical image processing: a comprehensive review. Mobile networks and applications, 22, pp.1204-1211.
R. Eustice, O. Pizarro, H. Singh, and J. Howland, 2002, April. UWIT: Underwater Image Toolbox for optical image processing and mosaicking in MATLAB. In Proceedings of the 2002 Interntional Symposium on Underwater Technology (Cat. No. 02EX556) (pp. 141-145). IEEE.
Y. Ma, X. Feng, L. Chao, D. Huang, Z. Xia, and X. Jiang, 2018, November. A new database for evaluating underwater image processing methods. In 2018 Eighth International Conference on Image Processing Theory, Tools and Applications (IPTA) (pp. 1-6). IEEE.
D.G. Zawada, 2003. Image processing of underwater multispectral imagery. IEEE Journal of Oceanic Engineering, 28(4), pp.583-594.
McGlamery, B.L., 1980, March. A computer model for underwater camera systems. In Ocean Optics VI (Vol. 208, pp. 221-231). SPIE.
J.S. Jaffe, 1990. Computer modeling and the design of optimal underwater imaging systems. IEEE Journal of Oceanic Engineering, 15(2), pp.101-111.
C.J. Funk, S.B. Bryant, and P.J. Heckman, 1972. Handbook of underwater imaging system design (p. 0336). San Diego, CA: Ocean Technology Department, Naval Undersea Center.
https://www.kaggle.com/datasets/akshatsng/underwater-dataset-for-8-classes-with-label
C. Pan, Z. Li, J. Gao, and F. Li, 2021. A variational Bayesian-based robust adaptive filtering for precise point positioning using undifferenced and uncombined observations. Advances in Space Research, 67(6), pp.1859-1869.
L. Yuan, G. Huang, F. Li, X. Yuan, C.M. Pun, and G. Zhong, 2023. Rba-gcn: Relational bilevel aggregation graph convolutional network for emotion recognition. IEEE/ACM Transactions on Audio, Speech, and Language Processing.
J. Xiao, J. Wang, S. Cao, and B. Li, 2020, April. Application of a novel and improved VGG-19 network in the detection of workers wearing masks. In Journal of Physics: Conference Series (Vol. 1518, No. 1, p. 012041). IOP Publishing.
Downloads
Published
How to Cite
Issue
Section
License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
All papers should be submitted electronically. All submitted manuscripts must be original work that is not under submission at another journal or under consideration for publication in another form, such as a monograph or chapter of a book. Authors of submitted papers are obligated not to submit their paper for publication elsewhere until an editorial decision is rendered on their submission. Further, authors of accepted papers are prohibited from publishing the results in other publications that appear before the paper is published in the Journal unless they receive approval for doing so from the Editor-In-Chief.
IJISAE open access articles are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. This license lets the audience to give appropriate credit, provide a link to the license, and indicate if changes were made and if they remix, transform, or build upon the material, they must distribute contributions under the same license as the original.