AI-Driven Predictive Auto-Scaling for Cloud-Native Systems with Real-Time Anomaly Detection

Authors

  • Mahender Singh

Keywords:

AI-driven auto-scaling, cloud observability, anomaly detection, predictive scaling, AWS infrastructure, time-series forecasting, self-healing automation.

Abstract

Cloud-native architectures demand highly dynamic resource scaling to handle fluctuating workloads efficiently. Traditional reactive scaling methods often lead to over-provisioning, under-utilization, or performance degradation. This paper introduces an AI-driven predictive auto-scaling framework that leverages machine learning-based observability data to anticipate resource demand proactively. By integrating real-time anomaly detection, this approach minimizes system failures due to unexpected surges or resource misallocations. Our proposed solution utilizes Long Short-Term Memory (LSTM) networks for predictive analytics and an unsupervised anomaly detection model to optimize AWS-based cloud infrastructures. Experimental results demonstrate improved cost efficiency, reduced latency, and enhanced system resilience, outperforming conventional auto-scaling mechanisms.

Downloads

Download data is not yet available.

References

Ahmad, S., Lavin, A., Purdy, S., & Agha, Z. (2017). Unsupervised real-time anomaly detection for streaming data. Neurocomputing, 262, 134–147. https://doi.org/10.1016/j.neucom.2017.04.070

Ahmed, M., Mahmood, A., & Hu, J. (2016). A survey of network anomaly detection techniques. Journal of Network and Computer Applications, 60, 19-31.

Akyildiz, I. F., Kak, A., & Nie, S. (2020). 6G and Beyond: The Future of Wireless Communications Systems. IEEE Access, 8, 133995–134030. https://doi.org/10.1109/access.2020.3010896

Amte, R. (n.d.). Next-generation cloud infrastructure: The role of AI in automating provisioning and scaling. ResearchGate.

Anand, A. (n.d.). AI-driven infrastructure management: The future of cloud computing. Management.

Anbalagan, K. (2024). AI in cloud computing: Enhancing services and performance. International Journal of Computer Engineering and Applications.

Chen, P., Yang, S., & McCann, J. A. (2014). Distributed Real-Time anomaly detection in networked industrial sensing Systems. IEEE Transactions on Industrial Electronics, 62(6), 3832–3842. https://doi.org/10.1109/tie.2014.2350451

Chen, Y., Alspaugh, S., & Katz, R. (2012). Interactive analytical processing in big data systems: A cross-industry study of MapReduce workloads. Proceedings of the VLDB Endowment, 5(12), 1802-1813.

Dash Karan, M. S. (2022). AI-driven cloud computing: Enhancing scalability, security, and efficiency. ResearchGate.

Gupta, A., & Reddy, C. K. (2020). Feature selection and activity recognition system using a smartphone accelerometer sensor. IEEE Transactions on Information Technology in Biomedicine, 14(3), 691-698.

Huang, G., Li, Y., & Wang, Z. (2019). Data stream processing and mining in the edge computing era. arXiv preprint arXiv:1909.04847.

Liu, F. T., Ting, K. M., & Zhou, Z. H. (2008). Isolation forest. Proceedings of the 2008 Eighth IEEE International Conference on Data Mining, 413-422.

Luo, J., Hong, T., & Yue, M. (2018). Real-time anomaly detection for very short-term load forecasting. Journal of Modern Power Systems and Clean Energy, 6(2), 235–243. https://doi.org/10.1007/s40565-017-0351-7

Morocho-Cayamcela, M. E., Lee, H., & Lim, W. (2019). Machine learning for 5G/B5G mobile and wireless communications: potential, limitations, and future directions. IEEE Access, 7, 137184–137206. https://doi.org/10.1109/access.2019.2942390

Nama, P., Pattanayak, S., & Meka, H. S. (2023). AI-driven innovations in cloud computing: Transforming scalability, resource management, and predictive analytics in distributed systems. International Research Journal.

Pentyala, D. K. (2021). Enhancing data reliability in cloud-native environments through AI-orchestrated processes. The Computertech.

Pentyala, D. K. (2024). Artificial intelligence for fault detection in cloud-optimized data engineering systems. International Journal of Social Trends.

Smith, A., & Elkan, C. (2007). A Bayesian network framework for rejecting noise in anomaly detection. Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 726-734.

Somanathan, S. (n.d.). AI-powered decision-making in cloud transformation: Enhancing scalability and resilience through predictive analytics. ResearchGate.

Wang, J., & Ye, J. (2015). Two-stage confidence interval estimation in high-dimensional linear models. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 77(3), 613-637.

Zhang, X., & Qi, G. (2021). Stock market prediction based on generative adversarial network. Procedia Computer Science, 183, 108-113.

Downloads

Published

29.07.2024

How to Cite

Mahender Singh. (2024). AI-Driven Predictive Auto-Scaling for Cloud-Native Systems with Real-Time Anomaly Detection. International Journal of Intelligent Systems and Applications in Engineering, 12(22s), 2004–2018. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/7420

Issue

Section

Research Article