Enhancing Scene Identification Performance through Hierarchical Classification, Context-Aware Analysis, and Active Learning Operations

Authors

  • Meghana Deshmukh, Amit Gaikwad

Keywords:

Enhancing Scene Identification Performance through Hierarchical Classification, Context-Aware Analysis, and Active Learning Process

Abstract

The increasing demand for intelligent systems in various applications, such as smart homes, autonomous vehicles, and surveillance, has underscored the need for robust and efficient Scene Identification techniques. Accurate Scene Identification plays a critical role in understanding complex environments, enabling more effective decision-making and interaction with the environment. However, existing methods often suffer from high computational costs, low adaptability to new scenarios, and limited ability to capture context and object relationships. In this paper, we address these limitations by lodging a novel Scene Identification model that combines Hierarchical Scene Identification, Context-Aware Detection, and Active Learning techniques. Our approach capitalizes on the inherent hierarchical nature of objects, leveraging a two-stage object detector to categorize objects first into broad categories and then into specific objects. We introduce an efficient Graph Neural Network (GNN) to capture contextual information between objects, enhancing the detection process's accuracy and robustness. Active Learning is applied to actively query labels for uncertain instances, significantly reducing the manual labeling effort and improving model’s performance levels. The proposed model demonstrates remarkable improvements in various evaluation metrics compared to existing methods.

Downloads

Download data is not yet available.

References

E. Mohamed, K. Sirlantzis and G. Howells, "Indoor/Outdoor Semantic Segmentation Using Deep Learning for Visually Impaired Wheelchair Users," in IEEE Access, vol. 9, pp. 147914-147932, 2021, doi: 10.1109/ACCESS.2021.3123952.

A. Famili, A. Stavrou, H. Wang and J. -M. Park, "PILOT: High-Precision Indoor Localization for Autonomous Drones," in IEEE Transactions on Vehicular Technology, vol. 72, no. 5, pp. 6445-6459, May 2023, doi: 10.1109/TVT.2022.3229628.

P. Alves et al., "A Novel Approach for User Equipment Indoor/Outdoor Classification in Mobile Networks," in IEEE Access, vol. 9, pp. 162671-162686, 2021, doi: 10.1109/ACCESS.2021.3130429.

S. Zhao, L. Zhang, Y. Shen and Y. Zhou, "RefineDNet: A Weakly Supervised Refinement Framework for Single Image Dehazing," in IEEE Transactions on Image Processing, vol. 30, pp. 3391-3404, 2021, doi: 10.1109/TIP.2021.3060873.

V. Kachurka et al., "WeCo-SLAM: Wearable Cooperative SLAM System for Real-Time Indoor Localization Under Challenging Conditions," in IEEE Sensors Journal, vol. 22, no. 6, pp. 5122-5132, 15 March15, 2022, doi: 10.1109/JSEN.2021.3101121.

S. Montella, B. Berruet, O. Baala, V. Guillet, A. Caminada and F. Lassabe, "A Funnel Fukunaga–Koontz Transform for Robust Indoor–Outdoor Detection Using Channel-State Information in 5G IoT Context," in IEEE Internet of Things Journal, vol. 9, no. 15, pp. 14018-14029, 1 Aug.1, 2022, doi: 10.1109/JIOT.2022.3147068.

R. Li, P. Ji, Y. Xu and B. Bhanu, "MonoIndoor++: Towards Better Practice of Self-Supervised Monocular Depth Estimation for Indoor Environments," in IEEE Transactions on Circuits and Systems for Video Technology, vol. 33, no. 2, pp. 830-846, Feb. 2023, doi: 10.1109/TCSVT.2022.3207105.

M. Krawez, T. Caselitz, J. Sundram, M. Van Loock and W. Burgard, "Real-Time Outdoor Illumination Estimation for Camera Tracking in Indoor Environments," in IEEE Robotics and Automation Letters, vol. 6, no. 3, pp. 6084-6091, July 2021, doi: 10.1109/LRA.2021.3090455.

C. -J. Liu and T. -N. Lin, "DET: Depth-Enhanced Tracker to Mitigate Severe Occlusion and Homogeneous Appearance Problems for Indoor Multiple-Object Tracking," in IEEE Access, vol. 10, pp. 8287-8304, 2022, doi: 10.1109/ACCESS.2022.3144153.

S. Nazir, L. Vaquero, M. Mucientes, V. M. Brea and D. Coltuc, "Depth Estimation and Image Restoration by Deep Learning From Defocused Images," in IEEE Transactions on Computational Imaging, vol. 9, pp. 607-619, 2023, doi: 10.1109/TCI.2023.3288335.

S. Bakirtzis, K. Qiu, I. Wassell, M. Fiore and J. Zhang, "Deep-Learning-Based Multivariate Time-Series Classification for Indoor/Outdoor Detection," in IEEE Internet of Things Journal, vol. 9, no. 23, pp. 24529-24540, 1 Dec.1, 2022, doi: 10.1109/JIOT.2022.3190555.

M. Rong, H. Cui and S. Shen, "Efficient 3D Scene Semantic Segmentation via Active Learning on Rendered 2D Images," in IEEE Transactions on Image Processing, vol. 32, pp. 3521-3535, 2023, doi: 10.1109/TIP.2023.3286708.

R. Gao, X. Xiao, W. Xing, C. Li and L. Liu, "Unsupervised Learning of Monocular Depth and Ego-Motion in Outdoor/Indoor Environments," in IEEE Internet of Things Journal, vol. 9, no. 17, pp. 16247-16258, 1 Sept.1, 2022, doi: 10.1109/JIOT.2022.3151629.

Y. -T. Liu, J. -J. Chen, Y. -C. Tseng and F. Y. Li, "An Auto-Encoder Multitask LSTM Model for Boundary Localization," in IEEE Sensors Journal, vol. 22, no. 11, pp. 10940-10953, 1 June1, 2022, doi: 10.1109/JSEN.2022.3168253.

H. Cui, D. Tu, F. Tang, P. Xu, H. Liu and S. Shen, "VidSfM: Robust and Accurate Structure-From-Motion for Monocular Videos," in IEEE Transactions on Image Processing, vol. 31, pp. 2449-2462, 2022, doi: 10.1109/TIP.2022.3156375.

F. Gao, F. Deng, L. Li, L. Zhang, J. Zhu and C. Yu, "MGG: Monocular Global Geolocation for Outdoor Long-Range Targets," in IEEE Transactions on Image Processing, vol. 30, pp. 6349-6363, 2021, doi: 10.1109/TIP.2021.3093789.

G. M. Mendoza-Silva et al., "Beyond Euclidean Distance for Error Measurement in Pedestrian Indoor Location," in IEEE Transactions on Instrumentation and Measurement, vol. 70, pp. 1-11, 2021, Art no. 1001511, doi: 10.1109/TIM.2020.3021514.

A. Mingozzi, A. Conti, F. Aleotti, M. Poggi and S. Mattoccia, "Monitoring Social Distancing With Single Image Depth Estimation," in IEEE Transactions on Emerging Topics in Computational Intelligence, vol. 6, no. 6, pp. 1290-1301, Dec. 2022, doi: 10.1109/TETCI.2022.3171769.

K. Hatakeyama et al., "A Hybrid ToF Image Sensor for Long-Range 3D Depth Measurement Under High Ambient Light Conditions," in IEEE Journal of Solid-State Circuits, vol. 58, no. 4, pp. 983-992, April 2023, doi: 10.1109/JSSC.2023.3238031.

J. V. -V. Gerwen, K. Geebelen, J. Wan, W. Joseph, J. Hoebeke and E. De Poorter, "Indoor Drone Positioning: Accuracy and Cost Trade-Off for Sensor Fusion," in IEEE Transactions on Vehicular Technology, vol. 71, no. 1, pp. 961-974, Jan. 2022, doi: 10.1109/TVT.2021.3129917.

M. S. S. Suresh and V. Menon, "A Generic and Scalable Approach to Maximize Coverage in Diverse Indoor and Outdoor Multicamera Surveillance Scenarios," in IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 53, no. 2, pp. 1172-1182, Feb. 2023, doi: 10.1109/TSMC.2022.3194209.

D. Esparza and G. Flores, "The STDyn-SLAM: A Stereo Vision and Semantic Segmentation Approach for VSLAM in Dynamic Outdoor Environments," in IEEE Access, vol. 10, pp. 18201-18209, 2022, doi: 10.1109/ACCESS.2022.3149885.

V. Matus, V. Guerra, C. Jurado-Verdu, J. Rabadan and R. Perez-Jimenez, "Demonstration of a Sub-Pixel Outdoor Optical Camera Communication Link," in IEEE Latin America Transactions, vol. 19, no. 10, pp. 1798-1805, Oct. 2021, doi: 10.1109/TLA.2021.9477281.

M. Y. Moemen, H. Elghamrawy, S. N. Givigi and A. Noureldin, "3-D Reconstruction and Measurement System Based on Multimobile Robot Machine Vision," in IEEE Transactions on Instrumentation and Measurement, vol. 70, pp. 1-9, 2021, Art no. 5003109, doi: 10.1109/TIM.2020.3026719.

Downloads

Published

24.03.2024

How to Cite

Meghana Deshmukh. (2024). Enhancing Scene Identification Performance through Hierarchical Classification, Context-Aware Analysis, and Active Learning Operations. International Journal of Intelligent Systems and Applications in Engineering, 12(3), 3676 –. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/6029

Issue

Section

Research Article