American Sign Language Recognition Based on Transfer Learning Algorithms

Authors

  • Sanaa Mohsin, Business Informatics Technology Dept., College of Business Informatics, University of Information Technology and Communication, Baghdad, Iraq
  • Baraa Wasfi Salim, ITM Dept., Technical College of Administration, Duhok Polytechnic University, Duhok, Iraq
  • Ashty Kamal Mohamedsaeed, Lalav High School, Erbil Education Directorate, Ministry of Education, Erbil, Iraq
  • Banar Fareed Ibrahim, IT Dept., Lebanese French University, Erbil, Iraq
  • Subhi R. M. Zeebaree, Energy Eng. Dept., Technical College of Engineering, Duhok Polytechnic University, Duhok, Iraq

Keywords:

Gesture Recognition, American Sign Language (ASL), Deep Learning, Transfer Learning

Abstract

This research addresses the recognition of American Sign Language (ASL) letters and numbers, motivated by an evolving technology landscape and the growing demand for better user experiences among people who communicate primarily through sign language. The study uses deep learning, and transfer learning in particular, to improve ASL recognition. Several deep learning models (VGG16, ResNet50, MobileNetV2, InceptionV3, and a CNN) are evaluated on an ASL dataset sourced from the Modified National Institute of Standards and Technology (MNIST) database, which represents ASL alphabetic letters as hand gestures. InceptionV3 is the top-performing model, achieving an accuracy of 0.96. Transfer learning, which fine-tunes pre-trained models on ASL data, significantly improves recognition accuracy and is especially valuable when labeled ASL data is limited. The other pre-trained models, VGG16, MobileNetV2, and ResNet50, also perform acceptably, which leaves room to select a model according to application needs and available computational resources. These findings underscore the effectiveness of deep learning and transfer learning and provide a foundation for intuitive sign language recognition systems that help break down communication barriers for the deaf and mute community.
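
The transfer learning idea mentioned in the abstract can be illustrated with a deliberately tiny, pure-Python toy. This is only a sketch under stated assumptions and not the authors' pipeline: the "pre-trained base" here is a hypothetical fixed linear map with ReLU (standing in for a network such as InceptionV3), the four-value "images" and their labels are invented, and the base is kept fully frozen while only a new softmax head is trained, which is the simplest variant of transfer learning rather than full fine-tuning.

```python
import math

# Hypothetical frozen "base": fixed weights standing in for a pre-trained
# convolutional feature extractor. They are never updated during training.
BASE_WEIGHTS = [
    [1.0, -1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, -1.0],
]


def frozen_features(pixels):
    """Apply the frozen base: a fixed linear map followed by ReLU."""
    return [max(0.0, sum(w * p for w, p in zip(row, pixels)))
            for row in BASE_WEIGHTS]


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def head_scores(features, weights, biases):
    """Linear scores of the trainable classification head."""
    return [sum(w * f for w, f in zip(row, features)) + b
            for row, b in zip(weights, biases)]


def train_head(samples, labels, num_classes, epochs=200, lr=0.5):
    """Train only the new head with SGD on cross-entropy loss."""
    feats = [frozen_features(x) for x in samples]  # base runs once, frozen
    dim = len(feats[0])
    weights = [[0.0] * dim for _ in range(num_classes)]
    biases = [0.0] * num_classes
    for _ in range(epochs):
        for f, y in zip(feats, labels):
            probs = softmax(head_scores(f, weights, biases))
            for c in range(num_classes):
                # Gradient of cross-entropy w.r.t. the score of class c.
                grad = probs[c] - (1.0 if c == y else 0.0)
                for j in range(dim):
                    weights[c][j] -= lr * grad * f[j]
                biases[c] -= lr * grad
    return weights, biases


def predict(pixels, weights, biases):
    """Return the index of the highest-scoring class."""
    scores = head_scores(frozen_features(pixels), weights, biases)
    return max(range(len(scores)), key=scores.__getitem__)


# Invented toy data: four "images" of four values each, two gesture classes.
SAMPLES = [
    [0.9, 0.1, 0.2, 0.2],
    [0.8, 0.0, 0.1, 0.3],
    [0.2, 0.2, 0.9, 0.1],
    [0.1, 0.3, 0.8, 0.0],
]
LABELS = [0, 0, 1, 1]

weights, biases = train_head(SAMPLES, LABELS, num_classes=2)
print([predict(x, weights, biases) for x in SAMPLES])
```

In a real system the frozen function would be replaced by a network pre-trained on a large image corpus, and some of its upper layers would typically be unfrozen and trained at a low learning rate once the new head has converged.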

Published

24.11.2023

How to Cite

Mohsin, S., Salim, B. W., Mohamedsaeed, A. K., Ibrahim, B. F., & Zeebaree, S. R. M. (2023). American Sign Language Recognition Based on Transfer Learning Algorithms. International Journal of Intelligent Systems and Applications in Engineering, 12(5s), 390–399. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/3920

Section

Research Article
