Improving Human Character Recognition Performance Based on Facial Images with the Addition of Channel Attention Module in ResNet50 Model

Authors

  • Nurul Khairina, Muhathir, Fadhillah Azmi, Ferawaty, Wenripin Chandra, Mega Puspita Sari, Tika Ermita Wulandari

Keywords:

CNN, CBAM, Human Characters, ResNet50

Abstract

This study investigates the performance of several convolutional neural network (CNN) architectures in classifying human characters based on facial images, considering the addition of the Channel Attention Module (CBAM) to the ResNet50 model. This research aims to evaluate and compare the capabilities of the ResNet50 model with and without the addition of CBAM, and to compare it with the EfficientNetB3 and GoogleNet architectures in recognizing human characters based on facial images. This research uses an experimental approach by utilizing a tagged facial image dataset. Accuracy, precision, recall, and F1-score metrics are used to quantitatively evaluate the performance of the models. The addition of CBAM to the ResNet50 model successfully improves its performance in classifying human characters, especially in identifying the Savory and Unsavory classes. ResNet50 with CBAM demonstrates higher accuracy compared to ResNet50 without CBAM, and outperforms EfficientNetB3 and GoogleNet. This research indicates that the addition of CBAM to the ResNet50 model can enhance accuracy in recognizing human characters based on facial images. These results provide valuable insights into the importance of integrating enrichment techniques into CNN architectures. However, this research has limitations in dataset variation and further research is needed with more varied datasets and additional experiments to understand the factors that affect model performance more deeply.

Downloads

Download data is not yet available.

References

H. Vasudevan, A. Michalas, N. Shekokar, and M. Narvekar, Eds., Advanced Computing Technologies and Applications: Proceedings of 2nd International Conference on Advanced Computing Technologies and Applications—ICACTA 2020. in Algorithms for Intelligent Systems. Singapore: Springer Singapore, 2020. doi: 10.1007/978-981-15-3242-9.

R. R. Koli and T. I. Bagban, “Human Action Recognition Using Deep Neural Networks,” in 2020 Fourth World Conference on Smart Trends in Systems, Security and Sustainability (WorldS4), London, United Kingdom: IEEE, Jul. 2020, pp. 376–380. doi: 10.1109/WorldS450073.2020.9210345.

B. R. Ilyas, B. Mohammed, M. Khaled, and K. Miloud, “Enhanced Face Recognition System Based on Deep CNN,” in 2019 6th International Conference on Image and Signal Processing and their Applications (ISPA), Mostaganem, Algeria: IEEE, Nov. 2019, pp. 1–6. doi: 10.1109/ISPA48434.2019.8966797.

V. R. R. Chirra, S. R. Uyyala, and V. K. K. Kolli, “Virtual facial expression recognition using deep CNN with ensemble learning,” J Ambient Intell Human Comput, vol. 12, no. 12, pp. 10581–10599, Dec. 2021, doi: 10.1007/s12652-020-02866-3.

T. J. Iyer, R. K., R. Nersisson, Z. Zhuang, A. N. Joseph Raj, and I. Refayee, “Machine Learning-Based Facial Beauty Prediction and Analysis of Frontal Facial Images Using Facial Landmarks and Traditional Image Descriptors,” Computational Intelligence and Neuroscience, vol. 2021, pp. 1–14, Aug. 2021, doi: 10.1155/2021/4423407.

V. D. A. Kumar, V. D. A. Kumar, G. K. Rajeswari, and M. Anitha, “Human Character Identification Based On a New Biometric Pattern – A Contemporary Approach,” Procedia Computer Science, vol. 133, pp. 99–107, 2018, doi: 10.1016/j.procs.2018.07.013.

B. Li and D. Lima, “Facial expression recognition via ResNet-50,” International Journal of Cognitive Computing in Engineering, vol. 2, pp. 57–64, Jun. 2021, doi: 10.1016/j.ijcce.2021.02.002.

D. Setyadi, T. Harsono, and S. Wasista, “Human character recognition application based on facial feature using face detection,” in 2015 International Electronics Symposium (IES), Surabaya, Indonesia: IEEE, Sep. 2015, pp. 263–267. doi: 10.1109/ELECSYM.2015.7380852.

E. A. Moh. Iqbal, R. Kusumawati, and I. B. Santoso, “Hybrid Model Transfer Learning ResNet50 and Support Vector Machine for Face Mask Detection,” International Journal of Advances in Data and Information Systems, vol. 4, no. 2, pp. 125–134, Sep. 2023, doi: 10.25008/ijadis.v4i2.1297.

J. Y. I. Alzamily, S. B. Ariffin, and S. S. A. Naser, “Classification of Encrypted Images using Deep Learning – ResNet50,” . Vol., no. 21, 2022.

K. Shivam, J.-C. Tzou, and S.-C. Wu, “Multi-Step Short-Term Wind Speed Prediction Using a Residual Dilated Causal Convolutional Network with Nonlinear Attention,” Energies, vol. 13, no. 7, p. 1772, Apr. 2020, doi: 10.3390/en13071772.

P. Chen, S. Liu, and S. Kolmanič, “Research on Vehicle Re-Identification Algorithm Based on Fusion Attention Method,” Applied Sciences, vol. 13, no. 7, p. 4107, Mar. 2023, doi: 10.3390/app13074107.

Z. Zhang, H. Mamat, X. Xu, A. Aysa, and K. Ubul, “FAS-Res2net: An Improved Res2net-Based Script Identification Method for Natural Scenes,” Applied Sciences, vol. 13, no. 7, p. 4434, Mar. 2023, doi: 10.3390/app13074434.

G. Wang, Y. Yuan, X. Chen, J. Li, and X. Zhou, “Learning Discriminative Features with Multiple Granularities for Person Re-Identification,” in Proceedings of the 26th ACM international conference on Multimedia, Seoul Republic of Korea: ACM, Oct. 2018, pp. 274–282. doi: 10.1145/3240508.3240552.

W. Islam, M. Jones, R. Faiz, N. Sadeghipour, Y. Qiu, and B. Zheng, “Improving Performance of Breast Lesion Classification Using a ResNet50 Model Optimized with a Novel Attention Mechanism,” Tomography, vol. 8, no. 5, pp. 2411–2425, Sep. 2022, doi: 10.3390/tomography8050200.

L. Yang, C. Wang, J. Yu, N. Xu, and D. Wang, “Method of Peanut Pod Quality Detection Based on Improved ResNet,” Agriculture, vol. 13, no. 7, p. 1352, Jul. 2023, doi: 10.3390/agriculture13071352.

J. Zhang, Y. Xie, Y. Xia, and C. Shen, “Attention Residual Learning for Skin Lesion Classification,” IEEE Trans. Med. Imaging, vol. 38, no. 9, pp. 2092–2103, Sep. 2019, doi: 10.1109/TMI.2019.2893944.

Rujito, Muhathir, N. Khairina, V. Ilhadi, M. Ula, and I. Sahputra, “Enhancing Larval Classification Accuracy Through Hyperparameter Optimization in ResNet50 with Three Different Optimizers,” in 2023 International Conference on Modeling & E-Information Research, Artificial Learning and Digital Applications (ICMERALDA), Karawang, Indonesia: IEEE, Nov. 2023, pp. 56–61. doi: 10.1109/ICMERALDA60125.2023.10458194.

J. Hemalatha, S. Roseline, S. Geetha, S. Kadry, and R. Damaševičius, “An Efficient DenseNet-Based Deep Learning Model for Malware Detection,” Entropy, vol. 23, no. 3, p. 344, Mar. 2021, doi: 10.3390/e23030344.

S. Tammina, “Transfer learning using VGG-16 with Deep Convolutional Neural Network for Classifying Images,” IJSRP, vol. 9, no. 10, p. p9420, Oct. 2019, doi: 10.29322/IJSRP.9.10.2019.p9420.

A. Michele, V. Colin, and D. D. Santika, “MobileNet Convolutional Neural Networks and Support Vector Machines for Palmprint Recognition,” Procedia Computer Science, vol. 157, pp. 110–117, 2019, doi: 10.1016/j.procs.2019.08.147.

R. Syuhada, Muhathir, N. Khairina, R. Muliono, Susilawati, and Z. Sembiring, “Analyzing the Effectiveness of VGG Deep Learning Architecture for Mushroom Type Classification,” in 2023 International Conference of Computer Science and Information Technology (ICOSNIKOM), 2023, pp. 1–6. doi: 10.1109/ICoSNIKOM60230.2023.10364551.

M. Muhathir, N. Khairina, R. K. I. Barus, M. Ula, and I. Sahputra, “Preserving Cultural Heritage Through AI: Developing LeNet Architecture for Wayang Image Classification,” IJACSA, vol. 14, no. 9, 2023, doi: 10.14569/IJACSA.2023.0140919.

A. Indrawati, A. Rahman, E. Pane, and Muhathir, “Classification of Diseases in Oil Palm Leaves Using the GoogLeNet Model,” Baghdad Sci.J, vol. 20, no. 6(Suppl.), p. 2508, Dec. 2023, doi: 10.21123/bsj.2023.8547.

K. He, X. Zhang, S. Ren, and J. Sun, “Deep Residual Learning for Image Recognition,” in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA: IEEE, Jun. 2016, pp. 770–778. doi: 10.1109/CVPR.2016.90.

M. Azwan, Muhathir, D. Noviandri, A. Indrawati, and R. Aziz, “Robust Classification of Red Chili Plant Leaves Using Smartphone Camera Data and ResNet Model in Noisy Environments,” in 2023 International Conference on Modeling & E-Information Research, Artificial Learning and Digital Applications (ICMERALDA), 2023, pp. 191–196. doi: 10.1109/ICMERALDA60125.2023.10458175.

M. Govindan, V. K. Dhakshnamurthy, K. Sreerangan, M. D. Nagarajan, and S. K. Rajamanickam, “A Framework for Early Detection of Glaucoma in Retinal Fundus Images Using Deep Learning,” in CC 2023, MDPI, Feb. 2024, p. 3. doi: 10.3390/engproc2024062003.

M. Muhathir, M. F. D. Ryandra, R. B. Y. Syah, N. Khairina, and R. Muliono, “Convolutional Neural Network (CNN) of Resnet-50 with Inceptionv3 Architecture in Classification on X-Ray Image,” in Artificial Intelligence Application in Networks and Systems, P. Silhavy Radek and Silhavy, Ed., Cham: Springer International Publishing, 2023, pp. 208–221.

S. Woo, J. Park, J.-Y. Lee, and I. S. Kweon, “CBAM: Convolutional Block Attention Module,” in Computer Vision – ECCV 2018, vol. 11211, V. Ferrari, M. Hebert, C. Sminchisescu, and Y. Weiss, Eds., in Lecture Notes in Computer Science, vol. 11211. , Cham: Springer International Publishing, 2018, pp. 3–19. doi: 10.1007/978-3-030-01234-2_1.

S. Dou, L. Wang, D. Fan, L. Miao, J. Yan, and H. He, “Classification of Citrus Huanglongbing Degree Based on CBAM-MobileNetV2 and Transfer Learning,” Sensors, vol. 23, no. 12, p. 5587, Jun. 2023, doi: 10.3390/s23125587.

Z. Bai, R. Zhu, D. He, S. Wang, and Z. Huang, “Adulteration Detection of Pork in Mutton Using Smart Phone with the CBAM-Invert-ResNet and Multiple Parts Feature Fusion,” Foods, vol. 12, no. 19, p. 3594, Sep. 2023, doi: 10.3390/foods12193594.

W. Sheng, X. Yu, J. Lin, and X. Chen, “Faster RCNN Target Detection Algorithm Integrating CBAM and FPN,” Applied Sciences, vol. 13, no. 12, p. 6913, Jun. 2023, doi: 10.3390/app13126913.

Y. Muhtar, M. Muhammat, N. Yadikar, A. Aysa, and K. Ubul, “FC-ResNet: A Multilingual Handwritten Signature Verification Model Using an Improved ResNet with CBAM,” Applied Sciences, vol. 13, no. 14, p. 8022, Jul. 2023, doi: 10.3390/app13148022.

S. Agac and O. Durmaz Incel, “On the Use of a Convolutional Block Attention Module in Deep Learning-Based Human Activity Recognition with Motion Sensors,” Diagnostics, vol. 13, no. 11, p. 1861, May 2023, doi: 10.3390/diagnostics13111861.

E. Haque and R. Ahmed, “Classification of Human Monkeypox Disease Using Deep Learning Models and Attention Mechanisms”.

Areesha Ijaz et al., “Modality Specific CBAM-VGGNet Model for the Classification of Breast Histopathology Images via Transfer Learning,” IEEE, pp. 15750–15762, 2023, doi: https://doi.org/10.1109/ACCESS.2023.3245023.

Sheng Yu, Shangzhu Jin, Jun Peng, Haiyang Liu, and Yuanyuan He, “Application of a new deep learning method with CBAM in clothing image classification,” presented at the 2021 IEEE International Conference on Emergency Science and Information Technology (ICESIT), Chongqing, China: IEEE, 2021. doi: https://doi.org/10.1109/ICESIT53460.2021.9696783.

Downloads

Published

24.03.2024

How to Cite

Nurul Khairina. (2024). Improving Human Character Recognition Performance Based on Facial Images with the Addition of Channel Attention Module in ResNet50 Model. International Journal of Intelligent Systems and Applications in Engineering, 12(3), 3544–3554. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/5990

Issue

Section

Research Article