Enhancing Faster-RCNN Performance: A Hybrid Approach Integrating VAE, HOG, and Recursive Feature Elimination
Keywords:
feature extraction, variational autoencoders, histogram of oriented gradients, recursive feature elimination, faster r-cnn optimizationAbstract
Deep learning has transformed image analysis by providing powerful feature extraction techniques, particularly beneficial for detecting microscopic pathogens such as Vibrio parahaemolyticus. Traditional methods often fail to capture the complex, high-dimensional features required for accurate bacterial classification. This study presents the enhancement of Faster-RCNN performance through advanced feature extraction techniques for the classification of Vibrio parahaemolyticus bacteria. The goal is to improve classification accuracy by integrating sophisticated feature extraction methods. Microscopic images observed directly are initially processed using the ResNet architecture to derive preliminary features. Variational Autoencoders (VAE) are then employed to extract high-dimensional, abstract features, while Histogram of Oriented Gradients (HOG) captures shape and orientation based features. Recursive Feature Elimination (RFE) is used to optimize these feature sets, leading to significant improvements in classification performance. The VAE+RFE + Faster-RCNN approach achieves a detection accuracy of 92%, precision of 88%, recall of 93%, and an F1-score of 90%. In comparison, the HOG + RFE + Faster-RCNN configuration results in 85% accuracy, 82% precision, 87% recall, and an F1-score of 84%. The conventional Faster-RCNN model records an accuracy of 89%, precision of 86%, recall of 90%, and an F1-score of 88%. These results underscore the substantial impact of advanced feature extraction techniques on optimizing Faster-RCNN performance, demonstrating significant improvements in classification accuracy and efficiency.
Downloads
References
K. Athanasopoulou, G. N. Daneva, P. G. Adamopoulos, A. Scorilas, “Artificial intelligence: the milestone in modern biomedical research”. BioMedInformatics, 2(4), 727-744, 2022, doi: 10.3390/biomedinformatics2040049.
I. H. Sarker “Deep learning: a comprehensive overview on techniques, taxonomy, applications and research directions”. SN computer science, 2(6), 420, 2021, doi: 10.1007/s42979-021-00815-1.
D. McNeely-White, J.R. Beveridge, B. A. Draper , “Inception and ResNet features are (almost) equivalent”. Cognitive Systems Research, 59, 312-318, 2020, doi: 10.1016/j.cogsys.2019.10.004
A. Siradjuddin, A. Muntasa, "Faster region-based convolutional neural network for mask face detection," in 2021 5th International Conference on Informatics and Computational Sciences (ICICoS), 2021, pp. 282-286. doi: 10.1109/ICICoS53627.2021.9651744.
P. Preethi, H. R. Mamatha, "Region-based convolutional neural network for segmenting text in epigraphical images," Artificial Intelligence and Applications, vol. 1, no. 2, pp. 119-127, 2023. doi: 10.47852/bonviewAIA2202293.
C. Duan, B. Hu, W. Liu, T. Ma, Q. Ma, H. Wang, "Infrared small target detection method based on frequency domain clutter suppression and spatial feature extraction," IEEE Access, 2023. doi: 10.1109/ACCESS.2023.3303486.
A. Baskar, T. G. Kumar, S. Samiappan, "A vision system to assist visually challenged people for face recognition using multi-task cascaded convolutional neural network (MTCNN) and local binary pattern (LBP)," Journal of Ambient Intelligence and Humanized Computing, vol. 14, no. 4, pp. 4329-4341, 2023. doi: 10.1007/s12652-023-04542-8.
V. A. Mohammed, M. A. Mohammed, M. A. Mohammed, R. Ramakrishnan, J. Logeshwaran, "The spreading prediction and severity analysis of blood cancer using scale-invariant feature transform," in 2023 International Conference on Network, Multimedia and Information Technology (NMITCON), 2023, pp. 1-7. doi: 10.1109/NMITCON58196.2023.10276289.
A. W. Muzaffar, F. Riaz, T. Abuain, W. A. K. Abu-Ain, F. Hussain, M. U. Farooq, M. A. Azad, "Gabor contrast patterns: A novel framework to extract features from texture images," IEEE Access, vol. 11, pp. 60324-60334, 2023. doi: 10.1109/ACCESS.2023.3280053.
T. Hayıt, H. Erbay, F. Varçın, F. Hayıt, N. Akci, "The classification of wheat yellow rust disease based on a combination of textural and deep features," Multimedia Tools and Applications, vol. 82, no. 30, pp. 47405-47423, 2023. doi: 10.1007/s11042-023-15199-y.
S. Y. Alaba, J. E. Ball, "Wcnn3d: Wavelet convolutional neural network-based 3d object detection for autonomous driving," Sensors, vol. 22, no. 18, pp. 7010, 2022. doi: 10.3390/s22187010.
S. Qiao, Q. Yu, Z. Zhao, L. Song, H. Tao, T. Zhang, C. Zhao, "Edge extraction method for medical images based on improved local binary pattern combined with edge-aware filtering," Biomed. Signal Process. Control, vol. 74, p. 103490, 2022. doi: 10.1016/j.bspc.2022.103490.
D. Cazzato, C. Cimarelli, J. L. Sanchez-Lopez, H. Voos, M. Leo, "A survey of computer vision methods for 2d object detection from unmanned aerial vehicles," J. Imaging, vol. 6, no. 8, pp. 78, 2020. doi: 10.3390/jimaging6080078.
[14] O. C. Koyun, R. K. Keser, I. B. Akkaya, B. U. Töreyin, "Focus-and-Detect: A small object detection framework for aerial images," Signal Process. Image Commun., vol. 104, p. 116675, 2022. doi: 10.1016/j.image.2022.116675.
E. T. Lee, Z. Fan, B. Sencer, "A new approach to detect surface defects from 3D point cloud data with surface normal Gabor filter (SNGF)," J. Manuf. Process., vol. 92, pp. 196-205, 2023. doi: 10.1016/j.jmapro.2023.02.047.
J. M. Fortuna-Cervantes, M. T. Ramírez-Torres, J. Martínez-Carranza, J. S. Murguía-Ibarra, M. Mejía-Carlos, "Object detection in aerial navigation using wavelet transform and convolutional neural networks: A first approach," Program. Comput. Softw., vol. 46, pp. 536-547, 2020. doi: 10.1134/S0361768820080113.
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
All papers should be submitted electronically. All submitted manuscripts must be original work that is not under submission at another journal or under consideration for publication in another form, such as a monograph or chapter of a book. Authors of submitted papers are obligated not to submit their paper for publication elsewhere until an editorial decision is rendered on their submission. Further, authors of accepted papers are prohibited from publishing the results in other publications that appear before the paper is published in the Journal unless they receive approval for doing so from the Editor-In-Chief.
IJISAE open access articles are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. This license lets the audience to give appropriate credit, provide a link to the license, and indicate if changes were made and if they remix, transform, or build upon the material, they must distribute contributions under the same license as the original.


