Human Pose Detection System Using Machine Learning

Authors

  • Pratiksha Daphal Vishwakarma Institute of Information Technology, Pune – 411048, India
  • Srushti Pokale Vishwakarma Institute of Information Technology, Pune – 411048, India
  • Pranali Chavhan Vishwakarma Institute of Information Technology, Pune – 411048, India
  • Namrata Wasatkar Vishwakarma Institute of Information Technology, Pune – 411048, India
  • Snehal Rathi Vishwakarma Institute of Information Technology, Pune – 411048, India
  • Yashwant Dongre Vishwakarma Institute of Information Technology, Pune – 411048, India
  • Vikas Kolekar Vishwakarma Institute of Information Technology, Pune – 411048, India

Keywords:

Human pose detection, deep learning, machine learning, Human pose estimation, python, CNN, DNN

Abstract

Human pose detection is one of the essential factors in many surveillance-based applications such as fall detection, human-computer interaction activities, sports and fitness, motion or movement analysis, robotics, and many other 'Artificial Intelligence projects and applications. In this survey, we aim to cover the methods that are used before, for human pose detection- single person or multiple people, and examine their efficiency using required parameters, and their real-time compatibility. We will compare and discuss the different methods and technologies used for posture detection and their results. This research can be used to improve the results of systems that use pose detection as their primary parameter, hence can be very helpful for many life-saving applications such as fall detection. We also aim to use this research to develop an efficient model for human pose detection using deep neural networks. The model works on both images and video Human pose detection is one of the essential factors in many surveillance-based applications such as fall detection, human-computer interaction activities, sports and fitness, motion or movement analysis, robotics, and many other 'Artificial Intelligence projects and applications. In this survey, we aim to cover the methods that are used before, for human pose detection- single person or multiple people, and examine their efficiency using required parameters, and their real-time compatibility. We will compare and discuss the different methods and technologies used for posture detection and their results. This research can be used to improve the results of systems that use pose detection as their primary parameter, hence can be very helpful for many life-saving applications such as fall detection.

We also aim to use this research to develop an efficient model for human pose detection using deep neural networks. The model works on both images and videos. The model is built for single-person pose estimation with the help of Machine Learning.

The model is built for single-person pose estimation with the help of Machine Learning.

Downloads

Download data is not yet available.

References

Nishani, E., & Cico, B. (2017). Computer vision approaches based on deep learning and neural networks: Deep neural networks for video analysis of human pose estimation. 2017 6th Mediterranean Conference on Embedded Computing (MECO). doi:10.1109/meco.2017.7977207

Shotton, J., Girshick, R., Fitzgibbon, A., Sharp, T., Cook, M., Finocchio, M., ... & Blake, A. (2012). Efficient human pose estimation from single depth images. IEEE transactions on pattern analysis and machine intelligence, 35(12), 2821-2840.

Jalal, A., Kim, Y., & Kim, D. (2014). Ridge body parts features for human pose estimation and recognition from RGB-D video data. Fifth International Conference on Computing, Communications and Networking Technologies (ICCCNT). doi:10.1109/icccnt.2014.6963015

Yang, C., Wang, X., & Mao, S. (2021). RFID-Pose: Vision-Aided Three-Dimensional Human Pose Estimation With Radio-Frequency Identification. IEEE Transactions on Reliability, 70(3), 1218–1231. doi:10.1109/tr.2020.3030952

Amine Elforaici, M. E., Chaaraoui, I., Bouachir, W., Ouakrim, Y., & Mezghani, N. (2018). Posture Recognition Using an RGB-D Camera: Exploring 3D Body Modeling and Deep Learning Approaches. 2018 IEEE Life Sciences Conference (LSC). doi:10.1109/lsc.2018.8572079

Liaqat, S., Dashtipour, K., Arshad, K., Assaleh, K., & Ramzan, N. (2021). A Hybrid Posture Detection Framework: Integrating Machine Learning and Deep Neural Networks. IEEE Sensors Journal, 21(7), 9515–9522. doi:10.1109/jsen.2021.3055898

Shotton, J., Girshick, R., Fitzgibbon, A., Sharp, T., Cook, M., Finocchio, M., . . . Blake, A. (2013). Efficient Human Pose Estimation from Single Depth Images. IEEE Transactions on Pattern Analysis and Machine Intelligence, 35(12), 2821–2840. doi:10.1109/tpami.2012.241

Liaqat, S., Dashtipour, K., Arshad, K., Assaleh, K., & Ramzan, N. (2021). A Hybrid Posture Detection Framework: Integrating Machine Learning and Deep Neural Networks. IEEE Sensors Journal, 21(7), 9515–9522. doi:10.1109/jsen.2021.3055898

Yu, J., Weng, K., Liang, G., & Xie, G. (2013). A vision-based robotic grasping system using deep learning for 3D object recognition and pose estimation. 2013 IEEE International Conference on Robotics and Biomimetics (ROBIO). doi:10.1109/robio.2013.6739623

Huang, Z., Liu, Y., Fang, Y., & Horn, B. K. P. (2018). Videobased Fall Detection for Seniors with Human Pose Estimation. 2018 4th International Conference on Universal Village (UV). doi:10.1109/uv.2018.8642130

Zhao, L., Xu, J., Gong, C., Yang, J., Zuo, W., & Gao, X. (2020). Learning to Acquire the Quality of Human Pose Estimation. IEEE Transactions on Circuits and Systems for Video Technology, 1–1. doi:10.1109/tcsvt.2020.3005522

Chen, Y., Du, R., Luo, K., & Xiao, Y. (2021, March). Fall detection system based on real-time pose estimation and SVM. In 2021 IEEE 2nd international conference on big data, artificial intelligence and internet of things engineering (ICBAIE) (pp. 990-993). IEEE.

Wang, C., Wang, Y., Lin, Z., & Yuille, A. (2018). Robust 3D Human Pose Estimation from Single Images or Video Sequences. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1–1. doi:10.1109/tpami.2018.2828427

E. Insafutdinov, L. Pishchulin, B. Andres, M. Andriluka, and B. Schiele, Deepercut: A deeper, stronger, and faster multi-person pose estimation model, in European Conference on Computer Vision, 2016, pp. 34–50.

Q. Dang, J. Yin, B. Wang and W. Zheng, "Deep learning based 2D human pose estimation: A survey," in Tsinghua Science and Technology, vol. 24, no. 6, pp. 663-676, Dec. 2019, doi: 10.26599/TST.2018.9010100.

Wei, S., Ramakrishna, V., Kanade, T., & Sheikh, Y. (2016).Convolutional Pose Machines. ArXiv. /abs/1602.00134

Pavlakos, G., Zhu, L., Zhou, X., & Daniilidis, K. (2018). Learning to Estimate 3D Human Pose and Shape from a Single Color Image. ArXiv. /abs/1805.04092

G. Papandreou, T. Zhu, N. Kanazawa, A. Toshev, J. Tompson, C. Bregler, and K. Murphy, Towards accurate multiperson pose estimation in the wild, arXiv preprint arXiv:1701.01779, 2017.

E. Insafutdinov, L. Pishchulin, B. Andres, M. Andriluka, and B. Schiele, Deepercut: A deeper, stronger, and faster multi-person pose estimation model, in European Conference on Computer Vision, 2016, pp. 34–50.

Z. Cao, T. Simon, S.-E. Wei, and Y. Sheikh, Realtime multi-person 2d pose estimation using part affinity fields, in CVPR, 2017, vol. 1, p. 7.

S.-E. Wei, V. Ramakrishna, T. Kanade, and Y. Sheikh, Convolutional pose machines, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 2016, pp. 4724–4732.

A. Toshev and C. Szegedy, Deeppose: Human pose estimation via deep neural networks, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, WI, USA, 2014, pp. 1653–1660.

X. Chen and A. L. Yuille, Articulated pose estimation by a graphical model with image dependent pairwise relations, in Advances in Neural Information Processing Systems, 2014, pp. 1736–1744.

J. Martinez, R. Hossain, J. Romero, and J. J. Little, A simple yet effective baseline for 3d human pose estimation, in IEEE International Conference on Computer Vision, Venice, Italy, 2017, vol. 206, p. 3

Y. Chen, C. Shen, X.-S. Wei, L. Liu, and J. Yang, Adversarial posenet: A structure-aware convolutional network for human pose estimation, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017, pp. 1212–1221

L. Pishchulin, E. Insafutdinov, S. Tang, B. Andres, M. Andriluka, P. V. Gehler, and B. Schiele, Deepcut: Joint subset partition and labeling for multi person pose estimation, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 2016, pp. 4929–4937

Y. Yang and D. Ramanan, Articulated pose estimation with flexible mixtures-of-parts, in Computer Vision and Pattern Recognition (CVPR), Colorado Springs, CO, USA, 2011, pp. 1385–1392

Y. Yang and D. Ramanan, Articulated human detection with flexible mixtures of parts, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 35, no. 12, pp. 2878–2890, 2013

F. Wang and Y. Li, Beyond physical connections: Tree models in human pose estimation, in Computer Vision and Pattern Recognition (CVPR), Portland, OR, USA, 2013, pp. 596–603

M. Sun and S. Savarese, Articulated part-based model for joint object detection and pose estimation, in Computer Vision (ICCV), Barcelona, Spain, 2011, pp. 723–730

M. Eichner, M. Marin-Jimenez, A. Zisserman, and V. Ferrari, 2d articulated human pose estimation and retrieval in (almost) unconstrained still images, International Journal of Computer Vision, vol. 99, no. 2, pp. 190–214, 2012.

M. Eichner and V. Ferrari, We are family: Joint pose estimation of multiple persons, in European Conference on Computer Vision, Crete, Greece, 2010, pp. 228–242.

Y. Guo, Y. Liu, A. Oerlemans, S. Lao, S. Wu, and M. S. Lew, Deep learning for visual understanding: A review, Neurocomputing, vol. 187, pp. 27–48, 2016

R. Poppe, Vision-based human motion analysis: An overview, Computer Vision and Image Understanding, vol. 108, nos. 12, pp. 4–18, 2007

Z. Liu, J. Zhu, J. Bu, and C. Chen, A survey of human pose estimation: The body parts parsing based methods, Journal of Visual Communication and Image Representation, vol. 32, pp. 10–19, 2015.

H.-B. Zhang, Q. Lei, B.-N. Zhong, J.-X. Du, and J. Peng, A survey on human pose estimation, Intelligent Automation Soft Computing, vol. 22, no. 3, pp. 483–489, 2016

E. Murphy-Chutorian and M. M. Trivedi, Head pose estimation in computer vision: A survey, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 31, no. 4, pp. 607–626, 2009

A. Erol, G. Bebis, M. Nicolescu, R. D. Boyle, and X. Twombly, Visionbased hand pose estimation: A review, Computer Vision and Image Understanding, vol. 108, nos. 12, pp. 52–73, 2007.

M. Asadi-Aghbolaghi, A. Clapes, M. Bellantonio, H. ´ J. Escalante, V. Ponce-Lopez, X. Bar ´ o, I. Guyon, S. ´ Kasaei, and S. Escalera, A survey on deep learning based approaches for action and gesture recognition in image sequences, in Automatic Face Gesture Recognition (FG 2017), Washington, DC, USA, 2017, pp. 476–483

V. Ferrari, M. Marin-Jimenez, and A. Zisserman, Progressive search space reduction for human pose estimation, in Computer Vision and Pattern Recognition, Anchorage, AK, USA, 2008, pp. 1–8

J. Carreira, P. Agrawal, K. Fragkiadaki, and J. Malik, Human pose estimation with iterative error feedback, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 2016, pp. 4733–4742.

X. Sun, J. Shang, S. Liang, and Y. Wei, Compositional human pose regression, in The IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 2017, vol. 2

D. C. Luvizon, H. Tabia, and D. Picard, Human pose regression by combining indirect part detection and contextual information, arXiv preprint arXiv:1710.02322, 2017.

I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville, and Y. Bengio, Generative adversarial nets, in Advances in Neural Information Processing Systems, Montreal, Canada, 2014, pp. 2672–2680.

A. Newell, K. Yang, and J. Deng, Stacked hourglass networks for human pose estimation, in European Conference on Computer Vision, Amsterdam, Netherlands, 2016, pp. 483–499.

X. Chu, W. Yang, W. Ouyang, C. Ma, A. L. Yuille, and X. Wang, Multi-context attention for human pose estimation, arXiv preprint arXiv:1702.07432, 2017.

T. Pfister, K. Simonyan, J. Charles, and A. Zisserman, Deep convolutional neural networks for efficient pose estimation in gesture videos, in Asian Conference on Computer Vision, Singapore, 2014, pp. 538–552

B. Sapp and B. Taskar, Modec: Multimodal decomposable models for human pose estimation, in Computer Vision and Pattern Recognition (CVPR), Portland, OR, USA, 2013, pp. 3674–3681.

Y. Chen, Z. Wang, Y. Peng, Z. Zhang, G. Yu, and J. Sun, Cascaded pyramid network for multi-person pose estimation, arXiv preprint arXiv:1711.07319, 2017.

Xiao, B., Wu, H., & Wei, Y. (2018). Simple Baselines for Human Pose Estimation and Tracking. ArXiv. /abs/1804.06208

Fang, H., Li, J., Tang, H., Xu, C., Zhu, H., Xiu, Y., Li, Y., & Lu, C. (2022). AlphaPose: Whole-Body Regional Multi-Person Pose Estimation and Tracking in Real-Time. ArXiv. /abs/2211.03375

Kocabas, M., Karagoz, S., & Akbas, E. (2018). MultiPoseNet: Fast Multi-Person Pose Estimation using Pose Residual Network. ArXiv. /abs/1807.04067

Cao, Z., Hidalgo, G., Simon, T., Wei, S., & Sheikh, Y. (2018). OpenPose: Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields. ArXiv. /abs/1812.08008

Xiao, B., Wu, H., & Wei, Y. (2018). Simple Baselines for Human Pose Estimation and Tracking. ArXiv. /abs/1804.0620

Prof. Prachiti Deshpande. (2016). Performance Analysis of RPL Routing Protocol for WBANs. International Journal of New Practices in Management and Engineering, 5(01), 14 - 21. Retrieved from http://ijnpme.org/index.php/IJNPME/article/view/43

Kshirsagar, P. R., Reddy, D. H., Dhingra, M., Dhabliya, D., & Gupta, A. (2023). A Scalable Platform to Collect, Store, Visualize and Analyze Big Data in Real-Time. 2023 3rd International Conference on Innovative Practices in Technology and Management (ICIPTM), 1–6. IEEE.

Chinthamu, N. ., Gooda, S. K. ., Venkatachalam, C. ., S., S. ., & Malathy, G. . (2023). IoT-based Secure Data Transmission Prediction using Deep Learning Model in Cloud Computing. International Journal on Recent and Innovation Trends in Computing and Communication, 11(4s), 68–76. https://doi.org/10.17762/ijritcc.v11i4s.6308

Downloads

Published

16.07.2023

How to Cite

Daphal, P. ., Pokale, S. ., Chavhan, P. ., Wasatkar, N. ., Rathi, S. ., Dongre, Y. ., & Kolekar, V. . (2023). Human Pose Detection System Using Machine Learning. International Journal of Intelligent Systems and Applications in Engineering, 11(3), 553–561. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/3208

Issue

Section

Research Article