Neural Network Pruning Techniques for Efficient Model Compression

Authors

  • Kukati Aruna Kumari Sr. Assistant Professor, Department of Electronics and Communication Engineering, Prasad V Potluri Siddhartha Institute of Technology, Vijaywada, Andhra Pradesh, India
  • Shahanawaj Ahamad Associate Professor, Department of Software Engineering, College of Computer Science and Engineering, University of Hail, Hail City, Saudi Arabia
  • Trupti Patil Assistant Professor, Bharati Vidyapeeth Deemed to be University Department of Engineering and Technology, Navi Mumbai, Maharashtra, India
  • Kamal Sardana Assistant Professor, Department of Electronics and Communication Engineering, TIT&S, Bhiwani, Haryana, India
  • Elangovan Muniyandy Department of Biosciences, Saveetha School of Engineering, Saveetha Institute of Medical and Technical Sciences, Chennai, Tamil Nadu, India
  • Daniel Pilli Assistant Professor, Department of MBA, Koneru Lakshmaiah Education Foundation, Vaddeswaram, Guntur, Andhra Pradesh, India

Keywords:

Neural Network Pruning, compression, deep learning, performance, accuracy

Abstract

A network of neurons When it comes to meeting the growing need for deploying deep learning models on devices with limited resources, pruning has emerged as an essential strategy for model reduction. The purpose of this study is to offer a detailed review of several pruning approaches that attempt to reduce the size and computational complexity of neural networks while maintaining their predictive accuracy. Specifically, the major emphasis is placed on structured pruning techniques, which include the removal of whole neurons, channels, or layers in a methodical manner based on certain criteria. In this article, we go into the fundamental ideas that underlie magnitude-based pruning, weight clustering, and filter pruning, and we emphasize the usefulness of these techniques in achieving considerable model reduction. In addition, this work investigates the interaction between pruning procedures and fine-tuning tactics in order to reduce the possibility of accuracy loss. In addition, the research investigates unstructured pruning techniques, which entail the elimination of individual weights in order to bring about sparsity in the network. The difficulties that are connected with unstructured pruning are discussed, and methods such as iterative pruning and regularization procedures are investigated as potential ways to improve the effectiveness of this kind of pruning. The comparative comparison of these different pruning strategies gives insight on the advantages, disadvantages, and compromises associated with each of them. Additionally, we highlight recent breakthroughs, including the integration of neural architecture search with pruning and the examination of pruning in the context of specialized neural network topologies like transformers.

Downloads

Download data is not yet available.

References

X. Zhou, Z. Zhang, L. Wang, and P. Wang, “A Model Based on Siamese Neural Network for Online Transaction Fraud Detection,” Proc. Int. Jt. Conf. Neural Networks, vol. 2019-July, 2019, doi: 10.1109/IJCNN.2019.8852295.

H. Jang and J. Lee, “An Empirical Study on Modeling and Prediction of Bitcoin Prices with Bayesian Neural Networks Based on Blockchain Information,” IEEE Access, vol. 6, pp. 5427–5437, 2017, doi: 10.1109/ACCESS.2017.2779181.

F. Casino, T. K. Dasaklis, and C. Patsakis, “A systematic literature review of blockchain-based applications: Current status, classification and open issues,” Telemat. Informatics, vol. 36, no. November 2018, pp. 55–81, 2019, doi: 10.1016/j.tele.2018.11.006.

S. Teng, G. Chen, G. Liu, J. Lv, and F. Cui, “Modal strain energy-based structural damage detection using convolutional neural networks,” Appl. Sci., vol. 9, no. 16, 2019, doi: 10.3390/app9163376.

S. Liu, A. Borovykh, L. A. Grzelak, and C. W. Oosterlee, “A neural network-based framework for financial model calibration,” J. Math. Ind., vol. 9, no. 1, 2019, doi: 10.1186/s13362-019-0066-7.

S. Tanwar, Q. Bhatia, P. Patel, A. Kumari, P. K. Singh, and W. C. Hong, “Machine Learning Adoption in Blockchain-Based Smart Applications: The Challenges, and a Way Forward,” IEEE Access, vol. 8, pp. 474–448, 2020, doi: 10.1109/ACCESS.2019.2961372.

W. Gao and C. Su, “Analysis on block chain financial transaction under artificial neural network of deep learning,” J. Comput. Appl. Math., vol. 380, p. 112991, 2020, doi: 10.1016/j.cam.2020.112991.

W. Zhang, K. X. Tao, J. F. Li, Y. C. Zhu, and J. Li, “Modeling and Prediction of Stock Price with Convolutional Neural Network Based on Blockchain Interactive Information,” Wirel. Commun. Mob. Comput., vol. 2020, 2020, doi: 10.1155/2020/6686181.

D. Prashar et al., “Blockchain-Based Automated System for Identification and Storage of Networks,” Secur. Commun. Networks, vol. 2021, 2021, doi: 10.1155/2021/6694281.

Z. Zhang, X. Song, L. Liu, J. Yin, Y. Wang, and D. Lan, “Recent Advances in Blockchain and Artificial Intelligence Integration: Feasibility Analysis, Research Issues, Applications, Challenges, and Future Work,” Secur. Commun. Networks, vol. 2021, 2021, doi: 10.1155/2021/9991535.

R. C. Aguilera, M. P. Ortiz, A. A. Banda, and L. E. C. Aguilera, “Blockchain cnn deep learning expert system for healthcare emergency,” Fractals, vol. 29, no. 6, pp. 1–10, 2021, doi: 10.1142/S0218348X21502273.

M. Shafay, R. W. Ahmad, K. Salah, I. Yaqoob, R. Jayaraman, and M. Omar, “Blockchain for Deep Learning : Review and Open Challenges,” no. October, pp. 1–32, 2021, doi: 10.36227/techrxiv.16823140.v1.

T. Frikha, F. Chaabane, N. Aouinti, O. Cheikhrouhou, N. Ben Amor, and A. Kerrouche, “Implementation of Blockchain Consensus Algorithm on Embedded Architecture,” Secur. Commun. Networks, vol. 2021, no. June, 2021, doi: 10.1155/2021/9918697.

F. Jamil, N. Iqbal, Imran, S. Ahmad, and D. Kim, “Peer-to-Peer Energy Trading Mechanism Based on Blockchain and Machine Learning for Sustainable Electrical Power Supply in Smart Grid,” IEEE Access, vol. 9, pp. 39193–39217, 2021, doi: 10.1109/ACCESS.2021.3060457.

Kaushik Dushyant; Garg Muskan; Annu; Ankur Gupta; Sabyasachi Pramanik, "Utilizing Machine Learning and Deep Learning in Cybesecurity: An Innovative Approach," in Cyber Security and Digital Forensics: Challenges and Future Trends , Wiley, 2022, pp.271-293, doi: 10.1002/9781119795667.ch12.

M. Zhu and S. Gupta, “To prune, or not to prune: exploring the efficacy of pruning for model compression.” arXiv, 2017. doi: 10.48550/ARXIV.1710.01878.

V. Talukdar, D. Dhabliya, B. Kumar, S. B. Talukdar, S. Ahamad and A. Gupta, "Suspicious Activity Detection and Classification in IoT Environment Using Machine Learning Approach," 2022 Seventh International Conference on Parallel, Distributed and Grid Computing (PDGC), Solan, Himachal Pradesh, India, 2022, pp. 531-535, doi: 10.1109/PDGC56933.2022.10053312.

Y. Cheng, D. Wang, P. Zhou, and T. Zhang, “A Survey of Model Compression and Acceleration for Deep Neural Networks.” arXiv, 2017. doi: 10.48550/ARXIV.1710.09282.

V. Jain, S. M. Beram, V. Talukdar, T. Patil, D. Dhabliya and A. Gupta, "Accuracy Enhancement in Machine Learning During Blockchain Based Transaction Classification," 2022 Seventh International Conference on Parallel, Distributed and Grid Computing (PDGC), Solan, Himachal Pradesh, India, 2022, pp. 536-540, doi: 10.1109/PDGC56933.2022.10053213.

X. Ruan et al., "EDP: An Efficient Decomposition and Pruning Scheme for Convolutional Neural Network Compression," in IEEE Transactions on Neural Networks and Learning Systems, vol. 32, no. 10, pp. 4499-4513, Oct. 2021, doi: 10.1109/TNNLS.2020.3018177.

A. Gupta, D. Kaushik, M. Garg and A. Verma, "Machine Learning model for Breast Cancer Prediction," 2020 Fourth International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC), Palladam, India, 2020, pp. 472-477, doi: 10.1109/I-SMAC49090.2020.9243323

Y. Hu, S. Sun, J. Li, X. Wang, and Q. Gu, “A novel channel pruning method for deep neural network compression.” arXiv, 2018. doi: 10.48550/ARXIV.1805.11394.

V. Veeraiah, K. R. Kumar, P. Lalitha Kumari, S. Ahamad, R. Bansal and A. Gupta, "Application of Biometric System to Enhance the Security in Virtual World," 2022 2nd International Conference on Advance Computing and Innovative Technologies in Engineering (ICACITE), Greater Noida, India, 2022, pp. 719-723, doi: 10.1109/ICACITE53722.2022.9823850.

F. Tung and G. Mori, "Deep Neural Network Compression by In-Parallel Pruning-Quantization," in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 42, no. 3, pp. 568-579, 1 March 2020, doi: 10.1109/TPAMI.2018.2886192.

V. Veeraiah, G. P, S. Ahamad, S. B. Talukdar, A. Gupta and V. Talukdar, "Enhancement of Meta Verse Capabilities by IoT Integration," 2022 2nd International Conference on Advance Computing and Innovative Technologies in Engineering (ICACITE), Greater Noida, India, 2022, pp. 1493-1498, doi: 10.1109/ICACITE53722.2022.9823766.

S. Han, H. Mao, and W. J. Dally, “Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding.” arXiv, 2015. doi: 10.48550/ARXIV.1510.00149.

V. Veeraiah, H. Khan, A. Kumar, S. Ahamad, A. Mahajan and A. Gupta, "Integration of PSO and Deep Learning for Trend Analysis of Meta-Verse," 2022 2nd International Conference on Advance Computing and Innovative Technologies in Engineering (ICACITE), Greater Noida, India, 2022, pp. 713-718, doi: 10.1109/ICACITE53722.2022.9823883.

A. Salama, O. Ostapenko, T. Klein, and M. Nabi, “Pruning at a Glance: Global Neural Pruning for Model Compression.” arXiv, 2019. doi: 10.48550/ARXIV.1912.00200.

P. R. Kshirsagar, D. H. Reddy, M. Dhingra, D. Dhabliya and A. Gupta, "A Review on Comparative study of 4G, 5G and 6G Networks," 2022 5th International Conference on Contemporary Computing and Informatics (IC3I), Uttar Pradesh, India, 2022, pp. 1830-1833, doi: 10.1109/IC3I56241.2022.10073385.

L. Deng, G. Li, S. Han, L. Shi and Y. Xie, "Model Compression and Hardware Acceleration for Neural Networks: A Comprehensive Survey," in Proceedings of the IEEE, vol. 108, no. 4, pp. 485-532, April 2020, doi: 10.1109/JPROC.2020.2976475.

P. Venkateshwari, V. Veeraiah, V. Talukdar, D. N. Gupta, R. Anand and A. Gupta, "Smart City Technical Planning Based on Time Series Forecasting of IOT Data," 2023 International Conference on Sustainable Emerging Innovations in Engineering and Technology (ICSEIET), Ghaziabad, India, 2023, pp. 646-651, doi: 10.1109/ICSEIET58677.2023.10303480.

V. Veeraiah, J. Kotti, V. Jain, T. Sharma, S. Saini and A. Gupta, "Scope of IoT in Emerging Engineering Technology during Online Education," 2023 14th International Conference on Computing Communication and Networking Technologies (ICCCNT), Delhi, India, 2023, pp. 1-6, doi: 10.1109/ICCCNT56998.2023.10308107.

Bijender Bansal; V. Nisha Jenipher; Rituraj Jain; R. Dilip; Makhan Kumbhkar; Sabyasachi Pramanik; Sandip Roy; Ankur Gupta, "Big Data Architecture for Network Security," in Cyber Security and Network Security , Wiley, 2022, pp.233-267, doi: 10.1002/9781119812555.ch11.

K. A. Shukla, S. Almal, A. Gupta, R. Jain, R. Mishra and D. Dhabliya, "DL Based System for On-Board Image Classification in Real Time, Applied to Disaster Mitigation," 2022 Seventh International Conference on Parallel, Distributed and Grid Computing (PDGC), Solan, Himachal Pradesh, India, 2022, pp. 663-668, doi: 10.1109/PDGC56933.2022.10053139.

R. Bansal, A. Gupta, R. Singh and V. K. Nassa, "Role and Impact of Digital Technologies in E-Learning amidst COVID-19 Pandemic," 2021 Fourth International Conference on Computational Intelligence and Communication Technologies (CCICT), Sonepat, India, 2021, pp. 194-202, doi: 10.1109/CCICT53244.2021.00046.

A. Gupta, R. Singh, V. K. Nassa, R. Bansal, P. Sharma and K. Koti, "Investigating Application and Challenges of Big Data Analytics with Clustering," 2021 International Conference on Advancements in Electrical, Electronics, Communication, Computing and Automation (ICAECA), Coimbatore, India, 2021, pp. 1-6, doi: 10.1109/ICAECA52838.2021.9675483.

Mamta, V. Veeraiah, D. N. Gupta, B. S. Kumar, A. Gupta and R. Anand, "Prediction of Health Risk Based on Multi-Level IOT Data Using Decision Trees," 2023 International Conference on Sustainable Emerging Innovations in Engineering and Technology (ICSEIET), Ghaziabad, India, 2023, pp. 652-656, doi: 10.1109/ICSEIET58677.2023.10303560.

Downloads

Published

07.02.2024

How to Cite

Kumari, K. A. ., Ahamad, S. ., Patil, T. ., Sardana, K. ., Muniyandy, E. ., & Pilli, D. . (2024). Neural Network Pruning Techniques for Efficient Model Compression. International Journal of Intelligent Systems and Applications in Engineering, 12(15s), 565 –. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/4791

Issue

Section

Research Article

Most read articles by the same author(s)

Similar Articles

You may also start an advanced similarity search for this article.