AI-Driven Predictive Auto-Scaling for Cloud-Native Systems with Real-Time Anomaly Detection
Keywords:
AI-driven auto-scaling, cloud observability, anomaly detection, predictive scaling, AWS infrastructure, time-series forecasting, self-healing automation.Abstract
Cloud-native architectures demand highly dynamic resource scaling to handle fluctuating workloads efficiently. Traditional reactive scaling methods often lead to over-provisioning, under-utilization, or performance degradation. This paper introduces an AI-driven predictive auto-scaling framework that leverages machine learning-based observability data to anticipate resource demand proactively. By integrating real-time anomaly detection, this approach minimizes system failures due to unexpected surges or resource misallocations. Our proposed solution utilizes Long Short-Term Memory (LSTM) networks for predictive analytics and an unsupervised anomaly detection model to optimize AWS-based cloud infrastructures. Experimental results demonstrate improved cost efficiency, reduced latency, and enhanced system resilience, outperforming conventional auto-scaling mechanisms.
Downloads
References
Ahmad, S., Lavin, A., Purdy, S., & Agha, Z. (2017). Unsupervised real-time anomaly detection for streaming data. Neurocomputing, 262, 134–147. https://doi.org/10.1016/j.neucom.2017.04.070
Ahmed, M., Mahmood, A., & Hu, J. (2016). A survey of network anomaly detection techniques. Journal of Network and Computer Applications, 60, 19-31.
Akyildiz, I. F., Kak, A., & Nie, S. (2020). 6G and Beyond: The Future of Wireless Communications Systems. IEEE Access, 8, 133995–134030. https://doi.org/10.1109/access.2020.3010896
Amte, R. (n.d.). Next-generation cloud infrastructure: The role of AI in automating provisioning and scaling. ResearchGate.
Anand, A. (n.d.). AI-driven infrastructure management: The future of cloud computing. Management.
Anbalagan, K. (2024). AI in cloud computing: Enhancing services and performance. International Journal of Computer Engineering and Applications.
Chen, P., Yang, S., & McCann, J. A. (2014). Distributed Real-Time anomaly detection in networked industrial sensing Systems. IEEE Transactions on Industrial Electronics, 62(6), 3832–3842. https://doi.org/10.1109/tie.2014.2350451
Chen, Y., Alspaugh, S., & Katz, R. (2012). Interactive analytical processing in big data systems: A cross-industry study of MapReduce workloads. Proceedings of the VLDB Endowment, 5(12), 1802-1813.
Dash Karan, M. S. (2022). AI-driven cloud computing: Enhancing scalability, security, and efficiency. ResearchGate.
Gupta, A., & Reddy, C. K. (2020). Feature selection and activity recognition system using a smartphone accelerometer sensor. IEEE Transactions on Information Technology in Biomedicine, 14(3), 691-698.
Huang, G., Li, Y., & Wang, Z. (2019). Data stream processing and mining in the edge computing era. arXiv preprint arXiv:1909.04847.
Liu, F. T., Ting, K. M., & Zhou, Z. H. (2008). Isolation forest. Proceedings of the 2008 Eighth IEEE International Conference on Data Mining, 413-422.
Luo, J., Hong, T., & Yue, M. (2018). Real-time anomaly detection for very short-term load forecasting. Journal of Modern Power Systems and Clean Energy, 6(2), 235–243. https://doi.org/10.1007/s40565-017-0351-7
Morocho-Cayamcela, M. E., Lee, H., & Lim, W. (2019). Machine learning for 5G/B5G mobile and wireless communications: potential, limitations, and future directions. IEEE Access, 7, 137184–137206. https://doi.org/10.1109/access.2019.2942390
Nama, P., Pattanayak, S., & Meka, H. S. (2023). AI-driven innovations in cloud computing: Transforming scalability, resource management, and predictive analytics in distributed systems. International Research Journal.
Pentyala, D. K. (2021). Enhancing data reliability in cloud-native environments through AI-orchestrated processes. The Computertech.
Pentyala, D. K. (2024). Artificial intelligence for fault detection in cloud-optimized data engineering systems. International Journal of Social Trends.
Smith, A., & Elkan, C. (2007). A Bayesian network framework for rejecting noise in anomaly detection. Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 726-734.
Somanathan, S. (n.d.). AI-powered decision-making in cloud transformation: Enhancing scalability and resilience through predictive analytics. ResearchGate.
Wang, J., & Ye, J. (2015). Two-stage confidence interval estimation in high-dimensional linear models. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 77(3), 613-637.
Zhang, X., & Qi, G. (2021). Stock market prediction based on generative adversarial network. Procedia Computer Science, 183, 108-113.
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
All papers should be submitted electronically. All submitted manuscripts must be original work that is not under submission at another journal or under consideration for publication in another form, such as a monograph or chapter of a book. Authors of submitted papers are obligated not to submit their paper for publication elsewhere until an editorial decision is rendered on their submission. Further, authors of accepted papers are prohibited from publishing the results in other publications that appear before the paper is published in the Journal unless they receive approval for doing so from the Editor-In-Chief.
IJISAE open access articles are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. This license lets the audience to give appropriate credit, provide a link to the license, and indicate if changes were made and if they remix, transform, or build upon the material, they must distribute contributions under the same license as the original.