Evaluates the Prognosis Of AI-Powered Voice-Guided System Aided for Individuals with Partial Disabilities Using Machine Learning.

Authors

  • Anil Kumar, Ravi Arora, Gaurav Mitra, Apurva Jain, Ruchi Sharma, Vijay Kumar

Keywords:

CNN, RNN, IVR, TTS, voice control, home automation, self-assistance, and disabled individuals

Abstract

Design of a new low-cost self-assistance system that facilitates the operation of household appliances and wheelchairs by using sophisticated voice commands from individuals with disabilities. With the modern advancements in technologies, each individual nowadays is moving over to an easier and more effective lifestyle. People are moving on with technology and finding solutions to problems faced in everyday life. Normal people are getting privileges of technology but sometimes the benefits could not reach the partially disabled ones. Partially disabled face a number of problems in day-to-day lives from navigation to communicating with others. The partially sighted people and hearing-impaired ones try to cope up with the normal ones but they do not get many opportunities. The study focuses on partially disabled people to provide them with some of the features to overcome few of the problems faced in the real world. This project demonstrates the aid for the partially visual and hearing impaired through communication via voice for the visually impaired and communication via text for the hearing impaired. The project is divided into two parts, initially consisting of text-to-speech and voice-to-speech capabilities, and object recognition for people with disabilities. This project includes a brief analysis of various models and algorithms such as interactive speech response (IVR), convolutional neural network (CNN), recurrent neural network (RNN), and Text-to-speech (TTS). Another part is the integration with Android applications. Here, the trained deep learning model serves as the source for the backend in object detection. Models are imported to predict outcomes, and text-to-speech helps people with disabilities access a variety of features such as voice-based email, object recognition, and virtual navigation.

Downloads

Download data is not yet available.

References

Yoon, S. W., Khatiwada, S., Shrestha, U., Ra, S., & Kwon, K. (2019). The impact of technology on skills and its rise. 17(sup1), 26–40, International Journal of Training Research.

Ahmetovic, D.; Kitani, K.; Asakawa, C.; Ahmetovic, D.; Sato, D.; Oh, U.; Ishihara, T. Recog: Assisting individuals who are blind in identifying personal items. In CHI Conference on Human Factors in Computing Systems 2020 Proceedings (pp. 1–12).

Anas Majid, P., K. M. Smrithi, V. Nikhitha, & V. P. Basheer (2020). A Mobility Aid for Blind People Based on Machine Learning. Machine Learning, 7 (1).

Ohn-Bar, E., Kitani, K. M., Asakawa, C., Sato, D., Ahmetovic, D., & Guerrero, J. (2020). Blind individuals can use virtual navigation to transfer their route expertise to the real world. 135, 102369, International Journal of Human-Computer Studies.

Tiwari, P. A., Wanjari, P. G., Rotke, T., Nimkar, H. P., Zodawan, P., & Samarth, U. (2020, February). A Review on Voice based E-Mail System for Blind. Inventive Computation Technologies International Conference (ICICT), 2020 (pp. 435-438). IEEE.

Hiralkar, M. K. R., Jawandhiya, P. M., Talole, M. M. S., Wagh, M. R. B., & Dandage, S. M. Voice-Activated Email for the Blind.

Patil, M., Tejankar, N., Alhat, A., and Kulkarni, O. A Voice Based Email System for the Blind. In 2019, Nikose, A. A., Sheikh, A., Kumari, P., Masurkar, B., and Mohadikar, P. Mail System for the Blind Using Voice. Ijics.net.

Narawade, P. S., Ghyar, C. J., Wankhede, J. S., and Dudhbale, P. (2018). Blind individuals can use voice-based systems on desktop and mobile devices. Journal of Scientific Research in Science and Technology: International Journal, 4, 188-193.

Yang, D. L., Wu, G. W., Li, J. X., Chen, W. J., Ying, J. C., & Li, C. Y. (2018, June). a sensory navigation device with deep learning for blind steering. 2018's IEEE 16th International Conference on Smart Cities, IEEE 20th International Conference on High Performance Computing and Communications, and IEEE 4th International Conference on Data Science and Systems (HPCC/SmartCity/DSS) (pp. 1195–1200). IEEE.

Nizar Banu, P. K., and Ummuhanysifa, U. (2013). voice-activated web page reader and search engine. Journal of International Computational Engineering Research (IJCER), 1–5.

Drake Jr., F. L., and Van Rossum, G. (1995). Tutorial on Python (Vol. 620). The Center for Mathematics and Informatics, Amsterdam.

Hinton, G., LeCun, Y., and Bengio, Y. (2015). Nature, 521(7553), 436-444, deep learning.

Shin, H. C., Summers, R. M. (2016); Roth, H. R., Gao, M.; Lu, L.; Xu, Z.; Nogues, I. Deep convolutional neural networks: CNN designs, dataset properties, and transfer learning for computer-aided detection. 1285–1298 in IEEE Transactions on Medical Imaging, 35(5).

Van Merriënboer, B., Gulcehre, C., Bougares, F., Schwenk, H., Bengio, Y., and Cho, K. (2014). Statistical machine translation employs RNN encoder-decoder to learn phrase representations. arXiv preprint

Drake Jr., F. L., and Van Rossum, G. (1995). Tutorial on Python (Vol. 620). The Center for Mathematics and Informatics, Amsterdam.

Hinton, G., LeCun, Y., and Bengio, Y. (2015). Nature, 521(7553), 436-444, deep learning.

Shin, H. C., Summers, R. M. (2016); Roth, H. R., Gao, M.; Lu, L.; Xu, Z.; Nogues, I. Deep convolutional neural networks: CNN designs, dataset properties, and transfer learning for computer-aided detection. 1285–1298 in IEEE Transactions on Medical Imaging, 35(5).

Bengio, Y., Schwenk, H., Bougares, F., Gulcehre, C., Bahdanau, D., Van Merriënboer, B., & Cho, K. (2014). Statistical machine translation employs RNN encoder-decoder to learn phrase representations. The preprint arXiv is arXiv:1406.1078.

Developments, A. (2011) [15]. What is an Android device? Download the file here: http://www. academia. edu/download/30551848/andoid--tech.pdf.

H. R. Esmaeel (2015). Install the SDK (Android Studio) tools. Computer Science and Software Engineering International Journal of Advanced Research, 5 (5).

S. Brahler (2010). An examination of the architecture of Android. Institute for Technology Karlsruhe, 7 (8).

Peters, J., & Klakow, D. (2002). examining the relationship between word mistake rate and confusion. 38(1-2) Speech Communication, 19–28.

Downloads

Published

30.06.2024

How to Cite

Anil Kumar. (2024). Evaluates the Prognosis Of AI-Powered Voice-Guided System Aided for Individuals with Partial Disabilities Using Machine Learning. International Journal of Intelligent Systems and Applications in Engineering, 12(22s), 33–41. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/6385

Issue

Section

Research Article