SceneGuide: An Indoor and Outdoor Scene Recognition Wearable Aid for Visually Impaired People

Authors

  • Jyoti Madake Vishwakarma Institute of Technology, Pune, Maharashtra – 411037, India
  • Shripad Bhatlawande Vishwakarma Institute of Technology, Pune, Maharashtra – 411037, India
  • Anjali Solanke Marathwada Mitra Mandal’s College of Engineering, Pune, 411052, India

Keywords:

Computer vision, machine learning, scene recognition, visually impaired, wearable aid

Abstract

This paper proposes a new SceneGuide wearable aid for providing information about the surrounding scene to the visually impaired people. Its main feature is its ability to understand the scene and offer simplified information in an intuitive way. SceneGuide aid is designed as a wearable jacket with low-power embedded processing unit, monocular camera, and Bluetooth headphones.  It is a lightweight, low-cost, battery-operated blind assistive aid. The aid employs a novel, computationally efficient model, using multi-feature fusion and multi-level optimum feature selection approach. SceneGuide serves as a complementary assistive aid to the conventional white cane and helps reduce the cognitive information load and anxiety experienced by visually impaired people. The functional evaluation of the aid presented scene recognition accuracy of 95.25% on a custom dataset and 85.82% on the 15 Scene Standard Dataset. This aid was evaluated with 10 blindfolded volunteers. The volunteers expressed 77% acceptance towards usability to identify the scene with lower levels of confusion and anxiety. This highlights that the SceneGuide aid can enhance the understanding of visually impaired people about their surroundings.

Downloads

Download data is not yet available.

References

World Report on Vision. 2022. [Online]. Available: https://www.who.int/news-room/fact-sheets/detail/blindness-and-visual-impairment

Seybold, Diana. "Investigating stress associated with mobility training through consumer discussion groups." Journal of Visual Impairment & Blindness 87, no. 4 (1993): 111-112.

S. Bhatlawande, M. Mahadevappa, J. Mukherjee, M. Biswas, D. Das and S. Gupta, "Design, Development, and Clinical Evaluation of the Electronic Mobility Cane for Vision Rehabilitation," in IEEE Transactions on Neural Systems and Rehabilitation Engineering, vol. 22, no. 6, pp. 1148-1159, Nov. 2014..

Lopes, Sérgio I., José MN Vieira, Óscar FF Lopes, Pedro RM Rosa, and Nuno AS Dias. "MobiFree: a set of electronic mobility aids for the blind." Procedia Computer Science 14 (2012): 10-19.

Balakrishnan, Meenakshi Rao, Parigi Vedanti Madhusudhan Valiyaveetil, Sashi Kumar Paul, Rohan Venkatesan, Arun Kumar Harikesavan, Karthikeyan Kolappan, Bhagavatheesh Chanana, Piyush Mehra, Dheeraj, "A Split Grip Cane Handle Unit With Tactile Feedback for Directed Ranging", Patent, WO/2015/121872, issued 20/08/2015

Dernayka, Aya, Michel-Ange Amorim, Roger Leroux, Lucas Bogaert, and René Farcy. "Tom Pouce III, an electronic white cane for blind people: Ability to detect obstacles and mobility performances." Sensors 21, no. 20 (2021): 6854.

S. S. Bhatlawande, J. Mukhopadhyay and M. Mahadevappa, "Ultrasonic spectacles and waist-belt for visually impaired and blind person," 2012 National Conference on Communications (NCC), Kharagpur, India, 2012, pp. 1-4.

Katzschmann, Robert K., Brandon Araki, and Daniela Rus. "Safe local navigation for visually impaired users with a time-of-flight and haptic feedback device." IEEE Transactions on Neural Systems and Rehabilitation Engineering 26, no. 3 (2018): 583-593.

Kilian, Jakob, Alexander Neugebauer, Lasse Scherffig, and Siegfried Wahl. "The unfolding space glove: A wearable spatio-visual to haptic sensory substitution device for blind people." Sensors 22, no. 5 (2022): 1859.

J. Bai, S. Lian, Z. Liu, K. Wang and D. Liu, "Virtual-Blind-Road Following-Based Wearable Navigation Device for Blind People," in IEEE Transactions on Consumer Electronics, vol. 64, no. 1, pp. 136-143, Feb. 2018.

Meshram, V. V., Patil, K., Meshram, V. A., & Shu, F. C. (2019). An astute assistive device for mobility and objectrecognition for visually impaired people. IEEE Transactions on Human-Machine Systems, 49(5), 449-460.

Dutta, Senjuti, Mridul S. Barik, Chandreyee Chowdhury, and Deep Gupta. "Divya-Dristi: A smartphone-based campus navigation system for the visually impaired." In 2018 Fifth International Conference on Emerging Applications of Information Technology (EAIT), pp. 1-3. IEEE, 2018.

Garcia-Macias, J. Antonio, Alberto G. Ramos, Rogelio Hasimoto-Beltran, and Saul E. Pomares Hernandez. "Uasisi: A modular and adaptable wearable system to assist the visually impaired." Procedia Computer Science 151 (2019): 425-430.

Tap Tap See App, 2012 [Mobile App]. Available: https://play.google.com/store/apps/details?id=com.msearcher.taptapsee.android&hl=en_US

Be My Eyes App, 2015 [Mobile App]. Available: https://play.google.com/store/apps/details?id=com.bemyeyes.bemyeyes&hl=en&gl=US

Szummer, Martin, and Rosalind W. Picard. "Indoor-outdoor image classification." In Proceedings 1998 IEEE International Workshop on Content-Based Access of Image and Video Database, pp. 42-51. IEEE, 1998.

Oliva, Aude, and Antonio Torralba. "Modeling the shape of the scene: A holistic representation of the spatial envelope." International journal of computer vision 42 (2001): 145-175.

Han, Yina, and Guizhong Liu. "A hierarchical GIST model embedding multiple biological feasibilities for scene classification." In 2010 20th International Conference on Pattern Recognition, pp. 3109-3112. IEEE, 2010.

Lowe, David G. "Distinctive image features from scale-invariant keypoints." International journal of computer vision 60 (2004): 91-110.

Fei-Fei, Li, and Pietro Perona. "A bayesian hierarchical model for learning natural scene categories." In 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), vol. 2, pp. 524-531. IEEE, 2005.

Lazebnik, Svetlana, Cordelia Schmid, and Jean Ponce. "Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories." In 2006 IEEE computer society conference on computer vision and pattern recognition (CVPR'06), vol. 2, pp. 2169-2178. IEEE, 2006.

Yang, Jianchao, Kai Yu, Yihong Gong, and Thomas Huang. "Linear spatial pyramid matching using sparse coding for image classification." In 2009 IEEE Conference on computer vision and pattern recognition, pp. 1794-1801. IEEE, 2009.

Sánchez, Jorge, Florent Perronnin, Thomas Mensink, and Jakob Verbeek. "Image classification with the fisher vector: Theory and practice." International journal of computer vision 105 (2013): 222-245.

Ojala, Timo, Matti Pietikäinen, and David Harwood. "A comparative study of texture measures with classification based on featured distributions." Pattern recognition 29, no. 1 (1996): 51-59.

Xiao, Jianxiong, Krista A. Ehinger, James Hays, Antonio Torralba, and Aude Oliva. "Sun database: Exploring a large collection of scene categories." International Journal of Computer Vision 119 (2016): 3-22.

Ahonen, Timo, Jiří Matas, Chu He, and Matti Pietikäinen. "Rotation invariant image description with local binary pattern histogram fourier features." In Image Analysis: 16th Scandinavian Conference, SCIA 2009, Oslo, Norway, June 15-18, 2009. Proceedings 16, pp. 61-70. Springer Berlin Heidelberg, 2009.

Song, Kechen, and Yunhui Yan. "A noise robust method based on completed local binary patterns for hot-rolled steel strip surface defects." Applied Surface Science 285 (2013): 858-864.

Qian, Xueming, Xian-Sheng Hua, Ping Chen, and Liangjun Ke. "PLBP: An effective local binary patterns texture descriptor with pyramid representation." Pattern Recognition 44, no. 10-11 (2011): 2502-2515.

Wu, Jianxin, and Jim M. Rehg. "Centrist: A visual descriptor for scene categorization." IEEE transactions on pattern analysis and machine intelligence 33, no. 8 (2010): 1489-1501.

Dalal, Navneet, and Bill Triggs. "Histograms of oriented gradients for human detection." In 2005 IEEE computer society conference on computer vision and pattern recognition (CVPR'05), vol. 1, pp. 886-893. Ieee, 2005.

Xiao, Jianxiong, Krista A. Ehinger, James Hays, Antonio Torralba, and Aude Oliva. "Sun database: Exploring a large collection of scene categories." International Journal of Computer Vision 119 (2016): 3-22.

Ke, Yan, and Rahul Sukthankar. "PCA-SIFT: A more distinctive representation for local image descriptors." In Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004., vol. 2, pp. II-II. IEEE, 2004.

Malhi, Arnaz, and Robert X. Gao. "PCA-based feature selection scheme for machine defect classification." IEEE transactions on instrumentation and measurement 53, no. 6 (2004): 1517-1525.

Ye, Jieping, Ravi Janardan, and Qi Li. "Two-dimensional linear discriminant analysis." Advances in neural information processing systems 17 (2004).

Prewitt, Judith MS. "Object enhancement and extraction." Picture processing and Psychopictorics 10, no. 1 (1970): 15-19.

LeCun, Yann, Léon Bottou, Yoshua Bengio, and Patrick Haffner. "Gradient-based learning applied to document recognition." Proceedings of the IEEE 86, no. 11 (1998): 2278-2324.

Gong, Yunchao, Liwei Wang, Ruiqi Guo, and Svetlana Lazebnik. "Multi-scale orderless pooling of deep convolutional activation features." In Computer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part VII 13, pp. 392-407. Springer International Publishing, 2014.

Yang, Songfan, and Deva Ramanan. "Multi-scale recognition with DAG-CNNs." In Proceedings of the IEEE international conference on computer vision, pp. 1215-1223. 2015.

Chen, Liang-Chieh, George Papandreou, Florian Schroff, and Hartwig Adam. "Rethinking atrous convolution for semantic image segmentation." arXiv preprint arXiv:1706.05587 (2017).

He, Kaiming, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. "Deep residual learning for image recognition." In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 770-778. 2016.

Szegedy, Christian, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, and Andrew Rabinovich. "Going deeper with convolutions." In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 1-9. 2015.

Huang, Gao, Zhuang Liu, Laurens Van Der Maaten, and Kilian Q. Weinberger. "Densely connected convolutional networks." In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 4700-4708. 2017.

Wang, Chen, Guohua Peng, and Bernard De Baets. "Deep feature fusion through adaptive discriminative metric learning for scene recognition." Information Fusion 63 (2020): 1-12.

Wang, Chen, Guohua Peng, and Bernard De Baets. "Embedding metric learning into an extreme learning machine for scene recognition." Expert Systems with Applications 203 (2022): 117505.

Parshapa, P. ., & Rani, P. I. . (2023). A Survey on an Effective Identification and Analysis for Brain Tumour Diagnosis using Machine Learning Technique. International Journal on Recent and Innovation Trends in Computing and Communication, 11(3), 68–78. https://doi.org/10.17762/ijritcc.v11i3.6203

Muñoz, S., Hernandez, M., González, M., Thomas, P., & Anderson, C. Enhancing Engineering Education with Intelligent Tutoring Systems using Machine Learning. Kuwait Journal of Machine Learning, 1(2). Retrieved from http://kuwaitjournals.com/index.php/kjml/article/view/116

Downloads

Published

21.09.2023

How to Cite

Madake, J. ., Bhatlawande, S. ., & Solanke, A. . (2023). SceneGuide: An Indoor and Outdoor Scene Recognition Wearable Aid for Visually Impaired People. International Journal of Intelligent Systems and Applications in Engineering, 11(4), 623–637. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/3597

Issue

Section

Research Article