A Cutting-Edge Data Mining Approach for Dynamic Data Replication That also Involves the Preventative Deletion of Data Centres That are Not Compatible with One Other

Authors

  • Bassam Talib Sabri Department of Business Information Technology /University of Information Technology and Communications Baghdad, Iraq

Keywords:

Raincloud compute, Replicating, Cloud-Sim, Fuzzing systems

Abstract

At this time, large cloud-based applications have produced an increase in the number of demands for data center storage. Replicating data offers an effective method for managing data files in an expansive Cloud environment, which ultimately results in increased data dependability and availability. In this research work, we made a proposal for a data replication approach that we referred to as the Hybrid Repetition Stratagem. This strategy implemented the facsimile assignment, assortment, then auxiliary processes. The Hybrid Repetition Stratagem system is designed to replicate data files in the cloud and includes three primary stages. it chooses optimal location (the location that is the greatest dominant and has the highest number of accesses) for storing the new copy in order to shorten the amount of time required to retrieve it. In the second phase, HRSs takes into consideration a number of different parameters in order to determine which replica node will provide the best experience for users. These parameters include micro chip procedure competence, net broadcast ability, Input and output competence of CDs, weight, then net dormancy. In the tertiary step, the choice to replace something is taken so that the system may have a faster reaction time. A uncertain implication scheme by 3 inputs constraints allows HRS to determine the significance of valued replicas based on their characteristics (amount of admissions, price, then the previous period the model was accessed time). Using the Cloud Sim toolkit package, the newly designed replication policy is modeled and tested. The data are replicated among the cloud nodes in an acceptable manner via the technique that we have developed, which is very simple to put into action in an actual setting. The results of the experiments demonstrate that HRSs may considerably improve the availability, performance, and load balancing of applications that need a large amount of data. In addition, there is no need to increase any extra overhead costs since it is still viable.

Downloads

Download data is not yet available.

References

Liu Q, Wang G, Liu X, Peng T, Wu J. Achieving reliable and secure services in cloud computing environments. Computers & Electrical Engineering, 2017; 59: 153-164.

Jakóbik A, Grzonk D, Palmieri F. Non-deterministic security driven meta scheduler for distributed cloud organizations. Simulation Modelling Practice and Theory, 2017; 76: 67-81.

Mishra S.K, Puthal D, Sahoo B, Jena S.K, Obaidat M.S. An adaptive task allocation technique for green cloud computing. The Journal of Supercomputing, 2017; 1-16.

Foster I, Zhao Y, Raicu I, Lu S. Cloud computing and grid computing 360- degree compared. In: Grid Computing Environments Workshop, GCE’08, 2008; 1-10.

Rajkumar B, Rajiv R, Calheiros R.N. Modeling and simulation of scalable cloud computing environments and the CloudSim toolkit: challenges and opportunities. High Performance Computing & Simulation, 2009; 1-11.

Ghemawat S, Gobioff H, Leung S. The Google file system. In: ACM Symposium on Operating Systems Principles, 2003; 29-43.

Borthakur D. The Hadoop distributed file system: Architecture and design. Available: http://hadoop.apache.org/common/docs/r0.18.3/ hdfs_design.html, 2007.

Feng D, Qin L. Adaptive object placement in object-based storage systems with minimal blocking probability. In: Proceeding of the 20th international conference on Advanced Information Networking and Applications, 2006.

López-Pires F, Barán B. Many-objective virtual machine placement. Journal of Grid Computing, 2017; 15 (2): 161-176.

Tao M, Ota O, Dong M. Dependency-aware dependable scheduling workflow applications with active replica placement in the cloud. IEEE Transactions on Cloud Computing, 2017; 99.

Mansouri N, Kuchaki Rafsanjani M, Javidi M.M. DPRS: A dynamic popularity aware replication strategy with parallel download scheme in cloud environments. Simulation Modelling and Theory, 2017; 77: 177–196.

Rahman R.M, Barker K, Alhajj R. Replica placement design with static optimality and dynamic maintainability. In: Sixth IEEE International Symposium on Cluster Computing and the Grid, 2006; 434-437.

Shvachko K, Kuang H, Radia S, Chansler R. The Hadoop distributed file system. In: IEEE 26th Symposium on Mass Storage Systems and Technologies, 2010; 1-10.

Mansouri N, Dastghaibyfard G.H. A dynamic replica management strategy in data grid. Journal of Network and Computer Applications, 2012; 35: 1297-1303.

Ibrahim I.A, Dai W, Bassiouni M. intelligent data placement mechanism for replicas distribution in cloudstorage systems. In: IEEE International Conference on Smart Cloud (SmartCloud), 2016; 134-139.

Mansouri N, Dastghaibyfard G.H, Mansouri E. Combination of data replication and scheduling algorithm for improving data availability in data grids. Journal of Network and Computer Applications, 2013; 36: 711-722.

Mansouri N, Dastghaibyfard G.H. Enhanced dynamic hierarchical replication and weighted scheduling strategy in data grid. Journal of Parallel and Distributed Computing, 2013; 73: 534-543.

Mansouri N. Adaptive data replication strategy in cloud computing for performance improvement. Frontiers of Computer Science, 2016; 1-11.

Sun D.W, Chang G.R, Gao S, Jin L.Z, Wang X.W. Modeling a dynamic data replication strategy to increase system availability in cloud computing environments. Journal of Computer Science and Technology, 2012; 27: 256-272.

Chang R.S, Chang H.P. A dynamic data replication strategy using access-weights in data grids. Journal of Supercomputing, 2008; 45(3): 277-295.

Kim Y.H, Jung M.J, Lee C.H. Energy-aware real-time task scheduling exploiting temporal locality. IEICE Transactions on Information and Systems, 2010; 93(5): 1147-1153.

Sun D.W, Chang G.R, Miao C, Jin L.Z, Wang X.W. Analyzing modeling and evaluating dynamic adaptive fault tolerance strategies in cloud computing environments. Journal of Supercomputing, 2013; 66: 193-228.

Zhang B, Wang X, Huang M. A PGSA based data replica selection scheme for accessing cloud storage system. Advanced Computer Architecture, 2014; 451: 140-151.

Ding X, You J. Plant growth simulation algorithm. Shanghai People’s Publishing House, 2011; 1-59. [25] Li B, Song S.L, Bezakova I, Cameron K.W. EDR: An energy-aware runtime load distribution system for data-intensive applications in the cloud. In: IEEE International Conference on Cluster Computing, 2013.

Lin J.W, Chen C.H, Chang J.M. QoS-aware data replication for data-intensive applications in cloud computing systems. IEEE Transactions on Cloud Computing, 2013; 1: 101-115.

Long S.Q, Zhao Y.L, Chen W. MORM: A multi-objective optimized replication management strategy for cloud storage cluster. Journal of Systems Architecture, 2014; 60: 234-244.

Luo Y, Li R, Tian F. Application of artificial immune algorithm to function optimization. In: Fifth World Congress on Intelligent Control and Automation, 2004; 3: 2248-2252.

Lou C, Zheng M, Liu X, Li X. Replica selection strategy based on individual QoS sensitivity constraints in cloud environment. Pervasive Computing and the Networked World, 2014; 8351: 393-399.

Kumar K.A, Quamar A, Deshpande A, Khuller S. SWORD: workload-aware data placement and replica selection for cloud data management systems. The VLDB Journal, 2014; 23: 845-870.

Saleh A, Javidan R, Fatehikhaje M.T. A four-phase data replication algorithm for data grid. Journal of Advanced Computer Science & Technology, 2015; 4.

Newman M. Networks: An introduction, Oxford University Press, 2009.

Korat C, Gohel P. A novel honey bee inspired algorithm for dynamic load balancing in cloud Environment. International Journal of Advanced Research in Electrical, Electronics and Instrumentation Engineering, 2015; 4.

Dasgupta K, Kumar Mondal J, Dutta P. Optimized video steganography using genetic algorithm. In: International Conference on Computational Intelligence: Modeling, Techniques and Applications, 2013; 10: 131-137.

Chang B, Tsai H, Huang C.F, Lin Z.Y, Chen C.M. Fast access security on cloud computing: Ubuntu enterprise server and cloud with face and fingerprint identification. In: Proceedings of the 2nd International Congress on Computer Applications and Computational Science, 2012; 451-457.

Calheiros R.N, Ranjan R, Beloglazov A, De Rose C.A.F, Buyya R. CloudSim: A toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms. Software: Practice and Experience, 2011; 41: 23-50.

Fittkau F, Frey S, Hasselbring W. Cloud user-centric enhancements of the simulator CloudSim to improve cloud deployment option analysis. In: Proceedings of the 1st European conference on Service-Oriented and Cloud Computing, 2012.

Lim S, Sharma B, Nam G, Kim E, Das C. Mdcsim: a multi-tier data center simulation, platform. In: Proceedings of IEEE International Conference on Cluster Computing and Workshops, 2009.

Nunez A, Vazquez-Poletti J.L, Caminero A.C, Castane G.G, Carretero J, Llorente I.M. iCanCloud: a flexible and scalable cloud infrastructure simulator. Journal of Grid Computing, 2012; 10 (1): 185-209.

Jararweh Y, Alshara Z, Jarrah M, Kharbutli M, Alsaleh M. Teachcloud: a cloud computing educational toolkit. In: Proceedings of the 1st International IBM Cloud Academy Conference, 2012.

Garg S, Buyya R. Networkcloudsim: modelling parallel applications in cloud simulations. In: Proceedings of the 4th IEEE/ACM International Conference on Utility and Cloud Computing, 2011; 105-113.

Kliazovich D, Bouvry P, Khan S.U. GreenCloud: a packet-level simulator of energy-aware cloud computing data centers. The Journal of Supercomputing, 2012; 62(3): 1263-1283.

Barroso L.A, Clidaras J, Holzle U. The datacenter as a computer: an introduction to the design of warehouse-scale machines. 2nd ed. Morgan and Claypool Publishers, 2013.

Howell F, Mcnab R. SimJava: A discrete event simulation library for java. In: Proceedings of the first International Conference on Web-Based Modeling and Simulation, 1998.

Khlebus, S.F., Hasoun, R.K., Sabri, B.T., " A modification of the Cayley-purser algorithm" International Journal of Nonlinear Analysis and Applications, 2022, 13(1), pp. 707–716.

Razzaq Abdul Hussein, R., Hamza, Z.F., Sabri, B.T." Forecasting the number of COVID-19 infections in Iraq using the ARIMA model" Journal of Applied Science and Engineering (Taiwan)this link is disabled, 2021, 24(5), pp. 729–734.

Bassam Talib Sabri, Noaman Ahmed Yaseen AL-Falahi, Isam Adil Salman, " Option for optimal extraction to indicate recognition of gestures using the self-improvement of the micro genetic algorithm" Journal of international journal of nonlinear analysis and application, vol. 12, no. 2, pp. 2295-2302, 2021.

Bassam S Ali, Osman N Ucan, " Lossy Hyperspectral Image Compression Based on Intraband Prediction and Inter-band Fractal," in 2018 Proceedings of the Fourth International Conference on Engineering & MIS 2018, turkey, Istanbul, 2018/6/19.

Structure for replicating data across many, distinct data centers in the cloud

Downloads

Published

27.12.2022

How to Cite

Sabri , B. T. . (2022). A Cutting-Edge Data Mining Approach for Dynamic Data Replication That also Involves the Preventative Deletion of Data Centres That are Not Compatible with One Other. International Journal of Intelligent Systems and Applications in Engineering, 10(3s), 88–99. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/2416

Issue

Section

Research Article