Dynamic Fault Tolerance Management Algorithm for VM Migration in Cloud Data Centers

Authors

  • Bikash Chandra Pattanaik Gandhi Institute for Education and Technology, Affiliated to Biju Patnaik University of Technology Rourkela, Odisha, India
  • Bidush Kumar Sahoo GIET University, Gunupur, Odisha, India
  • Bibudhendu Pati Ramadevi Women’s University, Bhubaneswar, Odisha, India
  • Suprava Ranjan Laha Siksha ‘O’ Anusandhan University, Bhubaneswar, Odisha, India

Keywords:

fault tolerance, load balancing, reactive fault tolerance, proactive fault tolerance, resource utilization, cloud computing, dynamic VM migration, resource allocation

Abstract

Fault tolerance is critical in constructing robust cloud computing systems to ensure uninterrupted service delivery and maintain economic benefits despite potential faults. This paper presents a novel layered modeling architecture that combines reactive and proactive fault modeling theories to enable reliable, survivable cloud-based applications by addressing fault tolerance concerns. This paper examines the issues of dynamic fault tolerance management and virtual machine (VM) migration in cloud data centers. We introduce a comprehensive algorithm that efficiently manages fault tolerance through proactive measures by leveraging a layered modeling architecture. The algorithm considers defect prediction and resource allocation techniques to minimize service interruptions and maximize resource utilization. It incorporates reactive and proactive fault modeling to identify and respond to faults, anticipates potential faults, and takes preventative measures. This integration makes the cloud computing environment more robust and reliable. However, extensive simulations and evaluations demonstrate the proposed algorithm's effectiveness in reducing service downtime, ensuring application reliability, and sustaining optimal performance. The algorithm's ability to dynamically migrate virtual machines (VMs) based on defect prediction contributes to efficient resource allocation and load balancing, mitigating potential bottlenecks and enhancing system resilience. The results demonstrate the applicability and effectiveness of the proposed framework for maintaining cloud-based applications' dependability. Combining reactive and proactive fault modeling theories, the proposed algorithm provides a comprehensive method for keeping cloud-based applications reliable and fault-tolerant.

Downloads

Download data is not yet available.

References

Gowri, A. S. (2019). Impact of virtualization technologies in the development and management of cloud applications. International Journal of Intelligent Systems and Applications in Engineering, 7(2), 104-110.

Ahmet, E. F. E., & Isık, A. (2020). A general view of industry 4.0 revolution from cybersecurity perspective. International Journal of Intelligent Systems and Applications in Engineering, 8(1), 11-20.

Pattanayak, B. K., Mohanty, R. K., Venkataramana, T., Pattnaik, S., Ranjan, R., Laha, S. R. (2022) An IoT Enabled Cloud Based Virtual Classroom Assisted e-Learning System. In NeuroQuantology, Vol.20, 9.

Laha, S. R., Parhi, M., Pattnaik, S., Pattanayak, B. K., & Patnaik, S. (2020, October). Issues, Challenges and Techniques for Resource Provisioning in Computing Environment. In 2020 2nd International Conference on Applied Machine Learning (ICAML) (pp. 157-161). IEEE.

Hota, N., & Pattanayak, B. K. (2021). Cloud computing load balancing using Amazon web service technology. In Progress in Advanced Computing and Intelligent Engineering (pp. 661-669). Springer, Singapore.

Laha, S. R., Mahapatra, S. K., Pattnaik, S., Pattanayak, B. K., & Pati, B. (2021). U-INS: an android-based navigation system. In Cognitive Informatics and Soft Computing (pp. 125-132). Springer, Singapore.

The worst cloud outages of 2013, Online, cited 01.07.2013.URL:http://www.infoworld.com/slideshow/107783/the-worst-cloud-outages of 2013-so-far-221831.

The worst cloud outages of 2014, Online, cited 01.01.2015.URL:http://www.infoworld.com/article/2606209/cloud-computing/162288-The-worst-cloud-outages-of-2014-so-far.html.

CU, O. K., & Bhama, P. R. S. (2019). Fuzzy based energy efficient workload management system for flash crowd. Computer Communications, 147, 225-234.

Amoon, M., El-Bahnasawy, N., Sadi, S., & Wagdi, M. (2019). On the design of reactive approach with flexible checkpoint interval to tolerate faults in cloud computing systems. Journal of Ambient Intelligence and Humanized Computing, 10(11), 4567-4577.

Sivagami, V. M., & Easwarakumar, K. S. (2019). An improved dynamic fault tolerant management algorithm during VM migration in cloud data center. Future Generation Computer Systems, 98, 35-43.

Yao, G., Ren, Q., Li, X., Zhao, S., & Ruiz, R. (2020). A hybrid fault-tolerant scheduling for deadline-constrained tasks in Cloud systems. IEEE Transactions on Services Computing.

Hasan, M., & Goraya, M. S. (2019). Flexible fault tolerance in cloud through replicated cooperative resource group. Computer Communications, 145, 176-192.

Beheshti, M. K., & Safi-Esfahani, F. (2020). BFPF-Cloud: Applying SVM for Byzantine Failure Prediction to Increase Availability and Failure Tolerance in Cloud Computing. SN Computer Science, 1(5), 1-31.

Alaei, M., Khorsand, R., & Ramezanpour, M. (2021). An adaptive fault detector strategy for scientific workflow scheduling based on improved differential evolution algorithm in cloud. Applied Soft Computing, 99, 106895.

Zhong, J. H., Peng, Z. P., Li, Q. R., & He, J. G. (2019). Multi workflow fair scheduling scheme research based on reinforcement learning. Procedia Computer Science, 154, 117-123.

Ragmani, A., Elomri, A., Abghour, N., Moussaid, K., & Rida, M. (2020). FACO: A hybrid fuzzy ant colony optimization algorithm for virtual machine scheduling in high-performance cloud computing. Journal of Ambient Intelligence and Humanized Computing, 11(10), 3975-3987.

Tong, Z., Chen, H., Deng, X., Li, K., & Li, K. (2020). A scheduling scheme in the cloud computing environment using deep Q-learning. Information Sciences, 512, 1170-1191.

Hamdani, M., Aklouf, Y., & Bouarara, H. A. (2019, March). Improved fuzzy load-balancing algorithm for cloud computing system. In Proceedings of the 9th International Conference on Information Systems and Technologies (pp. 1-4).

Chaudhary, D., & Kumar, B. (2019). Cost optimized hybrid genetic-gravitational search algorithm for load scheduling in cloud computing. Applied Soft Computing, 83, 105627.

Kashikolaei, S. M. G., Hosseinabadi, A. A. R., Saemi, B., Shareh, M. B., Sangaiah, A. K., & Bian, G. B. (2020). An enhancement of task scheduling in cloud computing based on imperialist competitive algorithm and firefly algorithm. The Journal of Supercomputing, 76(8), 6302-6329.

Li, Y., Wang, S., Hong, X., & Li, Y. (2018, July). Multi-objective task scheduling optimization in cloud computing based on genetic algorithm and differential evolution algorithm. In 2018 37th Chinese Control Conference (CCC) (pp. 4489-4494). IEEE.

Jena, U. K., Das, P. K., & Kabat, M. R. (2020). Hybridization of meta-heuristic algorithm for load balancing in cloud computing environment. Journal of King Saud University-Computer and Information Sciences.

Sun, H., Yu, H., Fan, G., & Chen, L. (2020). QoS-aware task placement with fault-tolerance in the edge-cloud. IEEE Access, 8, 77987-78003.

Parshapa, P. ., & Rani, P. I. . (2023). A Survey on an Effective Identification and Analysis for Brain Tumour Diagnosis using Machine Learning Technique. International Journal on Recent and Innovation Trends in Computing and Communication, 11(3), 68–78. https://doi.org/10.17762/ijritcc.v11i3.6203

Sahoo, D. K. . (2022). A Novel Method to Improve the Detection of Glaucoma Disease Using Machine Learning. Research Journal of Computer Systems and Engineering, 3(1), 67–72. Retrieved from https://technicaljournals.org/RJCSE/index.php/journal/article/view/44

Downloads

Published

16.07.2023

How to Cite

Pattanaik, B. C. ., Sahoo, B. K. ., Pati, B. ., & Laha, S. R. . (2023). Dynamic Fault Tolerance Management Algorithm for VM Migration in Cloud Data Centers. International Journal of Intelligent Systems and Applications in Engineering, 11(3), 85–96. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/3145

Issue

Section

Research Article