Enhancement in Real Time Deep Learning Object Detection and Direction Prediction for Visually Impaired using YOLO and OpenCV
Keywords:
Object Detection, Deep Learning, Visually Impaired, YOLO, OpenCV, Image ProcessingAbstract
Millions of individuals around the globe have permanent visual impairment, underscoring the importance of facilitating their understanding of people and the identification of essential daily-use products. To address this need, we propose the system to recognize such items within their daily routines. Numerous initiatives are underway in this field to aid the visually impaired without end to end deployment. The objective is to identify objects and translate them into auditory cues to inform individuals with visual impairment about these items with the system comprises a camera, a speaker, and an image processing system. The primary focus of this study is the amalgamation of real-time object detection and recognition using advanced deep learning techniques. The aim is to detect and label the position and names of multiple objects captured by the camera through an object detection algorithm.
Downloads
References
https://www.who.int/news-room/fact-sheets/detail/ blindness-and-visual-impairment
M. Mahendru and S. K. Dubey, "Real Time Object Detection with Audio Feedback using Yolo vs. Yolo_v3," 2021 11th International Conference on Cloud Computing, Data Science & Engineering (Confluence), Noida, India, 2021, pp. 734-740, doi: 10.1109/Confluence51648.2021.9377064.
Vaidya, Sunit, et al. "Real-time object detection for visually challenged people." 2020 4th International Conference on Intelligent Computing and Control Systems (ICICCS). IEEE, 2020.
M. I. Thariq Hussan, D. Saidulu, P. T. Anitha, A. Manikandan and P. Naresh (2022), Object Detection and Recognition in Real Time Using Deep Learning for Visually Impaired People. IJEER 10(2), 80-86. DOI: 10.37391/IJEER.100205.
Therese Yamuna Mahesh et al 2021 IOP Conf. Ser.: Mater. Sci. Eng. 1085 012006.
Ferdousi Rahman, Israt Jahan Ritun, Nafisa Farhin, and Jia Uddin. 2019. An assistive model for visually impaired people using YOLO and MTCNN. In Proceedings of the 3rd International Conference on Cryptography, Security and Privacy (ICCSP '19). Association for Computing Machinery, New York, NY, USA, 225–230. https://doi.org/10.1145/3309074.3309114
Downloads
Published
How to Cite
Issue
Section
License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
All papers should be submitted electronically. All submitted manuscripts must be original work that is not under submission at another journal or under consideration for publication in another form, such as a monograph or chapter of a book. Authors of submitted papers are obligated not to submit their paper for publication elsewhere until an editorial decision is rendered on their submission. Further, authors of accepted papers are prohibited from publishing the results in other publications that appear before the paper is published in the Journal unless they receive approval for doing so from the Editor-In-Chief.
IJISAE open access articles are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. This license lets the audience to give appropriate credit, provide a link to the license, and indicate if changes were made and if they remix, transform, or build upon the material, they must distribute contributions under the same license as the original.