Enhancing Faster-RCNN Performance: A Hybrid Approach Integrating VAE, HOG, and Recursive Feature Elimination

Authors

  • Rozzi Kesuma Dinata, Fajriana, Novia Hasdyna, Sujacka Retno

Keywords:

feature extraction, variational autoencoders, histogram of oriented gradients, recursive feature elimination, faster r-cnn optimization

Abstract

Deep learning has transformed image analysis by providing powerful feature extraction techniques, particularly beneficial for detecting microscopic pathogens such as Vibrio parahaemolyticus. Traditional methods often fail to capture the complex, high-dimensional features required for accurate bacterial classification. This study presents the enhancement of Faster-RCNN performance through advanced feature extraction techniques for the classification of Vibrio parahaemolyticus bacteria. The goal is to improve classification accuracy by integrating sophisticated feature extraction methods. Microscopic images observed directly are initially processed using the ResNet architecture to derive preliminary features. Variational Autoencoders (VAE) are then employed to extract high-dimensional, abstract features, while Histogram of Oriented Gradients (HOG) captures shape and orientation based features. Recursive Feature Elimination (RFE) is used to optimize these feature sets, leading to significant improvements in classification performance. The VAE+RFE + Faster-RCNN approach achieves a detection accuracy of 92%, precision of 88%, recall of 93%, and an F1-score of 90%. In comparison, the HOG + RFE + Faster-RCNN configuration results in 85% accuracy, 82% precision, 87% recall, and an F1-score of 84%. The conventional Faster-RCNN model records an accuracy of 89%, precision of 86%, recall of 90%, and an F1-score of 88%. These results underscore the substantial impact of advanced feature extraction techniques on optimizing Faster-RCNN performance, demonstrating significant improvements in classification accuracy and efficiency.

Downloads

Download data is not yet available.

References

K. Athanasopoulou, G. N. Daneva, P. G. Adamopoulos, A. Scorilas, “Artificial intelligence: the milestone in modern biomedical research”. BioMedInformatics, 2(4), 727-744, 2022, doi: 10.3390/biomedinformatics2040049.

I. H. Sarker “Deep learning: a comprehensive overview on techniques, taxonomy, applications and research directions”. SN computer science, 2(6), 420, 2021, doi: 10.1007/s42979-021-00815-1.

D. McNeely-White, J.R. Beveridge, B. A. Draper , “Inception and ResNet features are (almost) equivalent”. Cognitive Systems Research, 59, 312-318, 2020, doi: 10.1016/j.cogsys.2019.10.004

A. Siradjuddin, A. Muntasa, "Faster region-based convolutional neural network for mask face detection," in 2021 5th International Conference on Informatics and Computational Sciences (ICICoS), 2021, pp. 282-286. doi: 10.1109/ICICoS53627.2021.9651744.

P. Preethi, H. R. Mamatha, "Region-based convolutional neural network for segmenting text in epigraphical images," Artificial Intelligence and Applications, vol. 1, no. 2, pp. 119-127, 2023. doi: 10.47852/bonviewAIA2202293.

C. Duan, B. Hu, W. Liu, T. Ma, Q. Ma, H. Wang, "Infrared small target detection method based on frequency domain clutter suppression and spatial feature extraction," IEEE Access, 2023. doi: 10.1109/ACCESS.2023.3303486.

A. Baskar, T. G. Kumar, S. Samiappan, "A vision system to assist visually challenged people for face recognition using multi-task cascaded convolutional neural network (MTCNN) and local binary pattern (LBP)," Journal of Ambient Intelligence and Humanized Computing, vol. 14, no. 4, pp. 4329-4341, 2023. doi: 10.1007/s12652-023-04542-8.

V. A. Mohammed, M. A. Mohammed, M. A. Mohammed, R. Ramakrishnan, J. Logeshwaran, "The spreading prediction and severity analysis of blood cancer using scale-invariant feature transform," in 2023 International Conference on Network, Multimedia and Information Technology (NMITCON), 2023, pp. 1-7. doi: 10.1109/NMITCON58196.2023.10276289.

A. W. Muzaffar, F. Riaz, T. Abuain, W. A. K. Abu-Ain, F. Hussain, M. U. Farooq, M. A. Azad, "Gabor contrast patterns: A novel framework to extract features from texture images," IEEE Access, vol. 11, pp. 60324-60334, 2023. doi: 10.1109/ACCESS.2023.3280053.

T. Hayıt, H. Erbay, F. Varçın, F. Hayıt, N. Akci, "The classification of wheat yellow rust disease based on a combination of textural and deep features," Multimedia Tools and Applications, vol. 82, no. 30, pp. 47405-47423, 2023. doi: 10.1007/s11042-023-15199-y.

S. Y. Alaba, J. E. Ball, "Wcnn3d: Wavelet convolutional neural network-based 3d object detection for autonomous driving," Sensors, vol. 22, no. 18, pp. 7010, 2022. doi: 10.3390/s22187010.

S. Qiao, Q. Yu, Z. Zhao, L. Song, H. Tao, T. Zhang, C. Zhao, "Edge extraction method for medical images based on improved local binary pattern combined with edge-aware filtering," Biomed. Signal Process. Control, vol. 74, p. 103490, 2022. doi: 10.1016/j.bspc.2022.103490.

D. Cazzato, C. Cimarelli, J. L. Sanchez-Lopez, H. Voos, M. Leo, "A survey of computer vision methods for 2d object detection from unmanned aerial vehicles," J. Imaging, vol. 6, no. 8, pp. 78, 2020. doi: 10.3390/jimaging6080078.

[14] O. C. Koyun, R. K. Keser, I. B. Akkaya, B. U. Töreyin, "Focus-and-Detect: A small object detection framework for aerial images," Signal Process. Image Commun., vol. 104, p. 116675, 2022. doi: 10.1016/j.image.2022.116675.

E. T. Lee, Z. Fan, B. Sencer, "A new approach to detect surface defects from 3D point cloud data with surface normal Gabor filter (SNGF)," J. Manuf. Process., vol. 92, pp. 196-205, 2023. doi: 10.1016/j.jmapro.2023.02.047.

J. M. Fortuna-Cervantes, M. T. Ramírez-Torres, J. Martínez-Carranza, J. S. Murguía-Ibarra, M. Mejía-Carlos, "Object detection in aerial navigation using wavelet transform and convolutional neural networks: A first approach," Program. Comput. Softw., vol. 46, pp. 536-547, 2020. doi: 10.1134/S0361768820080113.

Downloads

Published

12.06.2024

How to Cite

Rozzi Kesuma Dinata. (2024). Enhancing Faster-RCNN Performance: A Hybrid Approach Integrating VAE, HOG, and Recursive Feature Elimination. International Journal of Intelligent Systems and Applications in Engineering, 12(4), 4523–4536. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/7141

Issue

Section

Research Article