Human Pose Detection System Using Machine Learning
Keywords:
Human pose detection, deep learning, machine learning, Human pose estimation, python, CNN, DNNAbstract
Human pose detection is one of the essential factors in many surveillance-based applications such as fall detection, human-computer interaction activities, sports and fitness, motion or movement analysis, robotics, and many other 'Artificial Intelligence projects and applications. In this survey, we aim to cover the methods that are used before, for human pose detection- single person or multiple people, and examine their efficiency using required parameters, and their real-time compatibility. We will compare and discuss the different methods and technologies used for posture detection and their results. This research can be used to improve the results of systems that use pose detection as their primary parameter, hence can be very helpful for many life-saving applications such as fall detection. We also aim to use this research to develop an efficient model for human pose detection using deep neural networks. The model works on both images and video Human pose detection is one of the essential factors in many surveillance-based applications such as fall detection, human-computer interaction activities, sports and fitness, motion or movement analysis, robotics, and many other 'Artificial Intelligence projects and applications. In this survey, we aim to cover the methods that are used before, for human pose detection- single person or multiple people, and examine their efficiency using required parameters, and their real-time compatibility. We will compare and discuss the different methods and technologies used for posture detection and their results. This research can be used to improve the results of systems that use pose detection as their primary parameter, hence can be very helpful for many life-saving applications such as fall detection.
We also aim to use this research to develop an efficient model for human pose detection using deep neural networks. The model works on both images and videos. The model is built for single-person pose estimation with the help of Machine Learning.
The model is built for single-person pose estimation with the help of Machine Learning.
Downloads
References
Nishani, E., & Cico, B. (2017). Computer vision approaches based on deep learning and neural networks: Deep neural networks for video analysis of human pose estimation. 2017 6th Mediterranean Conference on Embedded Computing (MECO). doi:10.1109/meco.2017.7977207
Shotton, J., Girshick, R., Fitzgibbon, A., Sharp, T., Cook, M., Finocchio, M., ... & Blake, A. (2012). Efficient human pose estimation from single depth images. IEEE transactions on pattern analysis and machine intelligence, 35(12), 2821-2840.
Jalal, A., Kim, Y., & Kim, D. (2014). Ridge body parts features for human pose estimation and recognition from RGB-D video data. Fifth International Conference on Computing, Communications and Networking Technologies (ICCCNT). doi:10.1109/icccnt.2014.6963015
Yang, C., Wang, X., & Mao, S. (2021). RFID-Pose: Vision-Aided Three-Dimensional Human Pose Estimation With Radio-Frequency Identification. IEEE Transactions on Reliability, 70(3), 1218–1231. doi:10.1109/tr.2020.3030952
Amine Elforaici, M. E., Chaaraoui, I., Bouachir, W., Ouakrim, Y., & Mezghani, N. (2018). Posture Recognition Using an RGB-D Camera: Exploring 3D Body Modeling and Deep Learning Approaches. 2018 IEEE Life Sciences Conference (LSC). doi:10.1109/lsc.2018.8572079
Liaqat, S., Dashtipour, K., Arshad, K., Assaleh, K., & Ramzan, N. (2021). A Hybrid Posture Detection Framework: Integrating Machine Learning and Deep Neural Networks. IEEE Sensors Journal, 21(7), 9515–9522. doi:10.1109/jsen.2021.3055898
Shotton, J., Girshick, R., Fitzgibbon, A., Sharp, T., Cook, M., Finocchio, M., . . . Blake, A. (2013). Efficient Human Pose Estimation from Single Depth Images. IEEE Transactions on Pattern Analysis and Machine Intelligence, 35(12), 2821–2840. doi:10.1109/tpami.2012.241
Liaqat, S., Dashtipour, K., Arshad, K., Assaleh, K., & Ramzan, N. (2021). A Hybrid Posture Detection Framework: Integrating Machine Learning and Deep Neural Networks. IEEE Sensors Journal, 21(7), 9515–9522. doi:10.1109/jsen.2021.3055898
Yu, J., Weng, K., Liang, G., & Xie, G. (2013). A vision-based robotic grasping system using deep learning for 3D object recognition and pose estimation. 2013 IEEE International Conference on Robotics and Biomimetics (ROBIO). doi:10.1109/robio.2013.6739623
Huang, Z., Liu, Y., Fang, Y., & Horn, B. K. P. (2018). Videobased Fall Detection for Seniors with Human Pose Estimation. 2018 4th International Conference on Universal Village (UV). doi:10.1109/uv.2018.8642130
Zhao, L., Xu, J., Gong, C., Yang, J., Zuo, W., & Gao, X. (2020). Learning to Acquire the Quality of Human Pose Estimation. IEEE Transactions on Circuits and Systems for Video Technology, 1–1. doi:10.1109/tcsvt.2020.3005522
Chen, Y., Du, R., Luo, K., & Xiao, Y. (2021, March). Fall detection system based on real-time pose estimation and SVM. In 2021 IEEE 2nd international conference on big data, artificial intelligence and internet of things engineering (ICBAIE) (pp. 990-993). IEEE.
Wang, C., Wang, Y., Lin, Z., & Yuille, A. (2018). Robust 3D Human Pose Estimation from Single Images or Video Sequences. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1–1. doi:10.1109/tpami.2018.2828427
E. Insafutdinov, L. Pishchulin, B. Andres, M. Andriluka, and B. Schiele, Deepercut: A deeper, stronger, and faster multi-person pose estimation model, in European Conference on Computer Vision, 2016, pp. 34–50.
Q. Dang, J. Yin, B. Wang and W. Zheng, "Deep learning based 2D human pose estimation: A survey," in Tsinghua Science and Technology, vol. 24, no. 6, pp. 663-676, Dec. 2019, doi: 10.26599/TST.2018.9010100.
Wei, S., Ramakrishna, V., Kanade, T., & Sheikh, Y. (2016).Convolutional Pose Machines. ArXiv. /abs/1602.00134
Pavlakos, G., Zhu, L., Zhou, X., & Daniilidis, K. (2018). Learning to Estimate 3D Human Pose and Shape from a Single Color Image. ArXiv. /abs/1805.04092
G. Papandreou, T. Zhu, N. Kanazawa, A. Toshev, J. Tompson, C. Bregler, and K. Murphy, Towards accurate multiperson pose estimation in the wild, arXiv preprint arXiv:1701.01779, 2017.
E. Insafutdinov, L. Pishchulin, B. Andres, M. Andriluka, and B. Schiele, Deepercut: A deeper, stronger, and faster multi-person pose estimation model, in European Conference on Computer Vision, 2016, pp. 34–50.
Z. Cao, T. Simon, S.-E. Wei, and Y. Sheikh, Realtime multi-person 2d pose estimation using part affinity fields, in CVPR, 2017, vol. 1, p. 7.
S.-E. Wei, V. Ramakrishna, T. Kanade, and Y. Sheikh, Convolutional pose machines, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 2016, pp. 4724–4732.
A. Toshev and C. Szegedy, Deeppose: Human pose estimation via deep neural networks, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, WI, USA, 2014, pp. 1653–1660.
X. Chen and A. L. Yuille, Articulated pose estimation by a graphical model with image dependent pairwise relations, in Advances in Neural Information Processing Systems, 2014, pp. 1736–1744.
J. Martinez, R. Hossain, J. Romero, and J. J. Little, A simple yet effective baseline for 3d human pose estimation, in IEEE International Conference on Computer Vision, Venice, Italy, 2017, vol. 206, p. 3
Y. Chen, C. Shen, X.-S. Wei, L. Liu, and J. Yang, Adversarial posenet: A structure-aware convolutional network for human pose estimation, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017, pp. 1212–1221
L. Pishchulin, E. Insafutdinov, S. Tang, B. Andres, M. Andriluka, P. V. Gehler, and B. Schiele, Deepcut: Joint subset partition and labeling for multi person pose estimation, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 2016, pp. 4929–4937
Y. Yang and D. Ramanan, Articulated pose estimation with flexible mixtures-of-parts, in Computer Vision and Pattern Recognition (CVPR), Colorado Springs, CO, USA, 2011, pp. 1385–1392
Y. Yang and D. Ramanan, Articulated human detection with flexible mixtures of parts, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 35, no. 12, pp. 2878–2890, 2013
F. Wang and Y. Li, Beyond physical connections: Tree models in human pose estimation, in Computer Vision and Pattern Recognition (CVPR), Portland, OR, USA, 2013, pp. 596–603
M. Sun and S. Savarese, Articulated part-based model for joint object detection and pose estimation, in Computer Vision (ICCV), Barcelona, Spain, 2011, pp. 723–730
M. Eichner, M. Marin-Jimenez, A. Zisserman, and V. Ferrari, 2d articulated human pose estimation and retrieval in (almost) unconstrained still images, International Journal of Computer Vision, vol. 99, no. 2, pp. 190–214, 2012.
M. Eichner and V. Ferrari, We are family: Joint pose estimation of multiple persons, in European Conference on Computer Vision, Crete, Greece, 2010, pp. 228–242.
Y. Guo, Y. Liu, A. Oerlemans, S. Lao, S. Wu, and M. S. Lew, Deep learning for visual understanding: A review, Neurocomputing, vol. 187, pp. 27–48, 2016
R. Poppe, Vision-based human motion analysis: An overview, Computer Vision and Image Understanding, vol. 108, nos. 12, pp. 4–18, 2007
Z. Liu, J. Zhu, J. Bu, and C. Chen, A survey of human pose estimation: The body parts parsing based methods, Journal of Visual Communication and Image Representation, vol. 32, pp. 10–19, 2015.
H.-B. Zhang, Q. Lei, B.-N. Zhong, J.-X. Du, and J. Peng, A survey on human pose estimation, Intelligent Automation Soft Computing, vol. 22, no. 3, pp. 483–489, 2016
E. Murphy-Chutorian and M. M. Trivedi, Head pose estimation in computer vision: A survey, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 31, no. 4, pp. 607–626, 2009
A. Erol, G. Bebis, M. Nicolescu, R. D. Boyle, and X. Twombly, Visionbased hand pose estimation: A review, Computer Vision and Image Understanding, vol. 108, nos. 12, pp. 52–73, 2007.
M. Asadi-Aghbolaghi, A. Clapes, M. Bellantonio, H. ´ J. Escalante, V. Ponce-Lopez, X. Bar ´ o, I. Guyon, S. ´ Kasaei, and S. Escalera, A survey on deep learning based approaches for action and gesture recognition in image sequences, in Automatic Face Gesture Recognition (FG 2017), Washington, DC, USA, 2017, pp. 476–483
V. Ferrari, M. Marin-Jimenez, and A. Zisserman, Progressive search space reduction for human pose estimation, in Computer Vision and Pattern Recognition, Anchorage, AK, USA, 2008, pp. 1–8
J. Carreira, P. Agrawal, K. Fragkiadaki, and J. Malik, Human pose estimation with iterative error feedback, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 2016, pp. 4733–4742.
X. Sun, J. Shang, S. Liang, and Y. Wei, Compositional human pose regression, in The IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 2017, vol. 2
D. C. Luvizon, H. Tabia, and D. Picard, Human pose regression by combining indirect part detection and contextual information, arXiv preprint arXiv:1710.02322, 2017.
I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville, and Y. Bengio, Generative adversarial nets, in Advances in Neural Information Processing Systems, Montreal, Canada, 2014, pp. 2672–2680.
A. Newell, K. Yang, and J. Deng, Stacked hourglass networks for human pose estimation, in European Conference on Computer Vision, Amsterdam, Netherlands, 2016, pp. 483–499.
X. Chu, W. Yang, W. Ouyang, C. Ma, A. L. Yuille, and X. Wang, Multi-context attention for human pose estimation, arXiv preprint arXiv:1702.07432, 2017.
T. Pfister, K. Simonyan, J. Charles, and A. Zisserman, Deep convolutional neural networks for efficient pose estimation in gesture videos, in Asian Conference on Computer Vision, Singapore, 2014, pp. 538–552
B. Sapp and B. Taskar, Modec: Multimodal decomposable models for human pose estimation, in Computer Vision and Pattern Recognition (CVPR), Portland, OR, USA, 2013, pp. 3674–3681.
Y. Chen, Z. Wang, Y. Peng, Z. Zhang, G. Yu, and J. Sun, Cascaded pyramid network for multi-person pose estimation, arXiv preprint arXiv:1711.07319, 2017.
Xiao, B., Wu, H., & Wei, Y. (2018). Simple Baselines for Human Pose Estimation and Tracking. ArXiv. /abs/1804.06208
Fang, H., Li, J., Tang, H., Xu, C., Zhu, H., Xiu, Y., Li, Y., & Lu, C. (2022). AlphaPose: Whole-Body Regional Multi-Person Pose Estimation and Tracking in Real-Time. ArXiv. /abs/2211.03375
Kocabas, M., Karagoz, S., & Akbas, E. (2018). MultiPoseNet: Fast Multi-Person Pose Estimation using Pose Residual Network. ArXiv. /abs/1807.04067
Cao, Z., Hidalgo, G., Simon, T., Wei, S., & Sheikh, Y. (2018). OpenPose: Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields. ArXiv. /abs/1812.08008
Xiao, B., Wu, H., & Wei, Y. (2018). Simple Baselines for Human Pose Estimation and Tracking. ArXiv. /abs/1804.0620
Prof. Prachiti Deshpande. (2016). Performance Analysis of RPL Routing Protocol for WBANs. International Journal of New Practices in Management and Engineering, 5(01), 14 - 21. Retrieved from http://ijnpme.org/index.php/IJNPME/article/view/43
Kshirsagar, P. R., Reddy, D. H., Dhingra, M., Dhabliya, D., & Gupta, A. (2023). A Scalable Platform to Collect, Store, Visualize and Analyze Big Data in Real-Time. 2023 3rd International Conference on Innovative Practices in Technology and Management (ICIPTM), 1–6. IEEE.
Chinthamu, N. ., Gooda, S. K. ., Venkatachalam, C. ., S., S. ., & Malathy, G. . (2023). IoT-based Secure Data Transmission Prediction using Deep Learning Model in Cloud Computing. International Journal on Recent and Innovation Trends in Computing and Communication, 11(4s), 68–76. https://doi.org/10.17762/ijritcc.v11i4s.6308
Downloads
Published
How to Cite
Issue
Section
License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
All papers should be submitted electronically. All submitted manuscripts must be original work that is not under submission at another journal or under consideration for publication in another form, such as a monograph or chapter of a book. Authors of submitted papers are obligated not to submit their paper for publication elsewhere until an editorial decision is rendered on their submission. Further, authors of accepted papers are prohibited from publishing the results in other publications that appear before the paper is published in the Journal unless they receive approval for doing so from the Editor-In-Chief.
IJISAE open access articles are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. This license lets the audience to give appropriate credit, provide a link to the license, and indicate if changes were made and if they remix, transform, or build upon the material, they must distribute contributions under the same license as the original.