High-Performance Video Retrieval Using SIFT and Deep Learning Methods

Authors

  • G. S. Naveen Kumar, V. S. K. Reddy, Leela Kumari Balivada

Keywords

SIFT, Shot Boundary Detection, CNN, RNN, LSTM, Key Frame Extraction, CBVR

Abstract

Shot boundary detection and key frame extraction are critical steps in video indexing, summarization, and retrieval. This paper proposes an approach that combines SIFT (Scale-Invariant Feature Transform) keypoint matching for shot boundary detection with deep learning-based key frame extraction. The SIFT-based method captures both abrupt and gradual shot transitions, addressing a limitation of existing algorithms that perform well only on abrupt changes. For key frame extraction, a deep learning framework uses convolutional neural networks (CNNs) for spatial feature representation and recurrent neural networks (RNNs/LSTMs) for temporal modeling. The framework automatically identifies the most informative and representative frames while reducing redundancy, which enables efficient processing of large-scale video data. Experiments on benchmark video datasets show that the proposed algorithms significantly outperform traditional methods in accuracy, robustness, and computational efficiency, making them suitable for modern video analysis applications.
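
The abstract does not give the decision rule used to separate abrupt from gradual transitions, so the following is only a minimal sketch of one common way to do it, a twin-threshold rule, and not the authors' algorithm. It assumes that a SIFT match ratio (the fraction of keypoints in frame i that find a match in frame i+1) has already been computed for every consecutive frame pair, for example with an off-the-shelf SIFT implementation. The function name `classify_transitions` and the values `cut_thresh`, `gradual_thresh`, and `min_gradual_len` are hypothetical choices made for illustration.

```python
from typing import List, Tuple

Transition = Tuple[str, int, int]  # (kind, first_pair_index, last_pair_index)


def classify_transitions(
    match_ratios: List[float],
    cut_thresh: float = 0.2,       # hypothetical: below this, a lone dip is a cut
    gradual_thresh: float = 0.5,   # hypothetical: below this, a pair is "dissimilar"
    min_gradual_len: int = 3,      # hypothetical: run length that marks a gradual change
) -> List[Transition]:
    """Label runs of low SIFT match ratio as abrupt cuts or gradual transitions.

    match_ratios[i] is assumed to be the fraction (0..1) of SIFT keypoints
    in frame i that were matched in frame i + 1.
    """
    transitions: List[Transition] = []
    i, n = 0, len(match_ratios)
    while i < n:
        if match_ratios[i] >= gradual_thresh:
            i += 1
            continue
        # Collect the whole run of consecutive dissimilar frame pairs.
        start = i
        while i < n and match_ratios[i] < gradual_thresh:
            i += 1
        end = i - 1  # inclusive index of the last dissimilar pair
        run = match_ratios[start : end + 1]
        if len(run) >= min_gradual_len:
            # A sustained dip suggests a fade, dissolve, or wipe.
            transitions.append(("gradual", start, end))
        elif min(run) < cut_thresh:
            # A short but deep dip suggests a hard cut.
            transitions.append(("cut", start, end))
        # Short, shallow dips are treated as noise and ignored.
    return transitions


# Illustrative ratios (made up): one hard cut, then one gradual transition.
ratios = [0.9, 0.85, 0.1, 0.9, 0.88, 0.45, 0.4, 0.35, 0.42, 0.9]
print(classify_transitions(ratios))
# prints [('cut', 2, 2), ('gradual', 5, 8)]
```

A single threshold tends to miss gradual transitions because no individual frame pair looks dissimilar enough. Examining the length of a low-similarity run as well as its depth is one way to capture both kinds of change, at the cost of extra parameters that need tuning per dataset.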

Published

30.11.2024

How to Cite

G. S. Naveen Kumar. (2024). High-Performance Video Retrieval Using SIFT and Deep Learning Methods. International Journal of Intelligent Systems and Applications in Engineering, 12(9s), 594 –. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/7879

Section

Research Article