Intent-Driven Fleets: An Agentic AI Framework for Cloud Elasticity
Keywords:
Cloud Elasticity, Multi-Agent Systems, Intent-Driven Orchestration, Model Context Protocol, Autonomous Infrastructure
Abstract
The evolution of cloud infrastructure toward hyper-scale deployments has exposed the fundamental inadequacy of reactive, threshold-based auto-scaling mechanisms. As digital services grow to serve global user bases during concentrated seasonal demand windows, the gap between high-level business objectives and low-level infrastructure execution has widened into a structural operational failure. Intent-Driven Fleets (IDF) addresses this gap through an autonomous orchestration framework that coordinates four specialized AI agents (Commander, Forecasting, Provisioner, and Efficiency) via the Model Context Protocol (MCP), enabling infrastructure to reason about goals rather than execute pre-written rules. The framework proposes the Plan-Execute-Observe-Reflect (PEOR) cycle, a formalized iterative process in which the infrastructure anticipates demand, decomposes business intent into dependency-based execution plans, ingests business-layer telemetry as context for its decisions, and continually optimizes provisioning behaviour through long-term memory. Security is enforced by a deterministic guardrail layer implemented directly in the MCP server: it bounds the financial impact of agent actions by accepting only signed authorization tokens, which are verified before each tool call. Three engineering challenges of this architecture are identified: context window congestion when ingesting full telemetry, tool-call latency accumulation across multi-region provisioning sequences, and concurrency conflicts that necessitate distributed intent locking. The IDF framework establishes intent as the most effective abstraction for managing hyper-scale cloud environments, pointing toward a genuinely autonomous operational paradigm in which global infrastructure responds directly to business goals without requiring continuous human translation at every execution step.
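The guardrail idea described in the abstract, verifying a signed authorization token before each tool call so that agent spending stays bounded, can be illustrated with a minimal sketch. This is not the paper's implementation and does not use the MCP SDK: the token layout, the names `sign_authorization`, `verify_authorization`, and `guarded_tool_call`, the `max_cost_usd` claim, and the hard-coded key are all assumptions made for illustration, using only Python's standard `hmac` module.

```python
import hashlib
import hmac
import json

# Hypothetical signing key; a real deployment would load this from a secret manager.
SECRET_KEY = b"demo-signing-key"


def _sign(claims: dict, key: bytes) -> str:
    """Return an HMAC-SHA256 signature over a canonical JSON encoding of the claims."""
    payload = json.dumps(claims, sort_keys=True).encode()
    return hmac.new(key, payload, hashlib.sha256).hexdigest()


def sign_authorization(tool: str, max_cost_usd: float, key: bytes = SECRET_KEY) -> dict:
    """Issue a token that authorizes one named tool up to a cost ceiling (assumed claim set)."""
    claims = {"tool": tool, "max_cost_usd": max_cost_usd}
    return {"claims": claims, "signature": _sign(claims, key)}


def verify_authorization(token: dict, tool: str, estimated_cost_usd: float,
                         key: bytes = SECRET_KEY) -> bool:
    """Deterministic check: the signature must match, the tool must match,
    and the estimated cost must not exceed the authorized ceiling."""
    expected = _sign(token["claims"], key)
    # compare_digest avoids leaking information through comparison timing.
    if not hmac.compare_digest(expected, token["signature"]):
        return False
    claims = token["claims"]
    return claims["tool"] == tool and estimated_cost_usd <= claims["max_cost_usd"]


def guarded_tool_call(token: dict, tool: str, estimated_cost_usd: float, action):
    """Run `action` only if the token authorizes this tool at this cost."""
    if not verify_authorization(token, tool, estimated_cost_usd):
        raise PermissionError(f"tool call '{tool}' rejected by guardrail")
    return action()


if __name__ == "__main__":
    token = sign_authorization("provision_nodes", max_cost_usd=500.0)
    # Within the authorized ceiling: the action runs.
    print(guarded_tool_call(token, "provision_nodes", 120.0, lambda: "provisioned"))
    # Above the ceiling: the guardrail refuses before the action is invoked.
    try:
        guarded_tool_call(token, "provision_nodes", 900.0, lambda: "provisioned")
    except PermissionError as err:
        print(err)
```

Because the check is ordinary deterministic code rather than a model decision, an agent cannot talk its way past it: a tampered claim changes the payload and invalidates the signature, and an over-budget request fails the comparison regardless of how the request is phrased.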
License

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
IJISAE open access articles are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. Under this license, users must give appropriate credit, provide a link to the license, and indicate if changes were made; if they remix, transform, or build upon the material, they must distribute their contributions under the same license as the original.


