Intent-Driven Fleets: An Agentic AI Framework for Cloud Elasticity

Authors

  • Somdutt Brajaraj Patnaik

Keywords:

Cloud Elasticity, Multi-Agent Systems, Intent-Driven Orchestration, Model Context Protocol, Autonomous Infrastructure

Abstract

The evolution of cloud infrastructure toward hyper-scale deployments has exposed the fundamental inadequacy of reactive, threshold-based auto-scaling mechanisms. As digital services grow to serve global user bases during concentrated seasonal demand windows, the gap between high-level business objectives and low-level infrastructure execution has widened into a structural operational failure. Intent-Driven Fleets (IDF) address this gap through an autonomous orchestration framework that coordinates specialized AI agents, Commander, Forecasting, Provisioner, and Efficiency via the Model Context Protocol (MCP), enabling infrastructure to reason about goals rather than execute pre-written rules. The framework proposes the Plan-Execute-Observe-Reflect (PEOR) cycle, a formalized iterative process in which infrastructure anticipates demand, breaks business intent down into dependency-based execution plans, accepts business-layer telemetry to provide a context in which decisions are made, and continually optimizes provisioning behaviour via long-term memory. Security is also achieved by a deterministic guardrail layer, which is directly implemented as part of the MCP server that ensures that agent actions are not unlimited financially by permitting only signed authorization tokens, which are verified prior to each tool call. Individually identified engineering issues of this architecture are: context window congestion when receiving full telemetry, tool-call latency buildup amidst multi-region provisioning sequences, and concurrency conflicts necessitating distributed intent locking. The IDF framework establishes intent as the most effective abstraction for managing hyper-scale cloud environments, pointing toward a genuinely autonomous operational paradigm where global infrastructure responds directly to business goals without requiring continuous human translation at every execution step.

DOI: https://doi.org/10.17762/ijisae.v14i1s.8220

Downloads

Download data is not yet available.

References

Ahmed Barnawi et al., "The views, measurements and challenges of elasticity in the cloud: A review," Computer Communications, 2020. [Online]. Available: https://www.sciencedirect.com/science/article/abs/pii/S0140366419319516

AWS, "Unlock new value with agentic AI on AWS". [Online]. Available: https://aws.amazon.com/ai/agentic-ai/?trk=624522eb-7b74-4623-b50b-12a12eff8837&sc_channel=ps&ef_id=Cj0KCQjw7IjOBhDyARIsAFzrWQz5JpMg7ygr-lbiV599sk7CJbLb22dXmKSXlzW7AS9t_177UPdawF0aAtdMEALw_wcB:G:s&s_kwcid=AL!4422!3!795924513590!p!!g!!agentic%20ai!23528572469!196289121721&gad_campaignid=23528572469&gbraid=0AAAAADjHtp81WbJO_Wn5zyt2VgxzURdFs&gclid=Cj0KCQjw7IjOBhDyARIsAFzrWQz5JpMg7ygr-lbiV599sk7CJbLb22dXmKSXlzW7AS9t_177UPdawF0aAtdMEALw_wcB

Yuxing Zhang et al., "Predictive Auto-Scaling in Distributed Cloud Environments Using Machine Learning," ICDIS '25: Proceedings of the 2nd International Symposium on Integrated Circuit Design and Integrated Systems, 2025. [Online]. Available: https://dl.acm.org/doi/10.1145/3772326.3774726

Model Context Protocol, "Architecture overview". [Online]. Available: https://modelcontextprotocol.io/docs/learn/architecture

Rzadca, Krzysztof, et al. "Autopilot: workload autoscaling at Google." EuroSys '20: Proceedings of the Fifteenth European Conference on Computer Systems. 2020.[Online]. Available: https://dl.acm.org/doi/abs/10.1145/3342195.3387524

Stefan Nastic et al., "A Serverless Real-Time Data Analytics Platform for Edge Computing," IEEE Internet Computing, 2017. [Online]. Available: https://ieeexplore.ieee.org/abstract/document/7994559

IBM, "Multi-agent orchestration". [Online]. Available: https://www.ibm.com/products/watsonx-orchestrate/multi-agent-orchestration?utm_content=SRCWW&p1=Search&p4=2368123929001&p5=p&p9=187402692430&gclsrc=aw.ds&gad_source=1&gad_campaignid=22029767487&gbraid=0AAAAAD-_QsT1gyIkcksIsL9yClbGu20SB&gclid=Cj0KCQjw7IjOBhDyARIsAFzrWQx9ik1nw1gagLHvux02YxaIBX9GZQEYdwziu--96zlaGqmt9rLcqaMaAu_uEALw_wcB

Perspicientia Consultancy Private Limited, "Designing Guardrails for Autonomous AI Systems," LinkedIn 2026. [Online]. Available: https://www.linkedin.com/pulse/designing-guardrails-autonomous-ai-systems-perspicientia-bk4gc/

Benjamin Hindman et al. "Mesos: A platform for {Fine-Grained} resource sharing in the data center". [Online]. Available: https://www.usenix.org/legacy/event/nsdi11/tech/full_papers/Hindman.pdf

Ghaith Dkmak et al., "AI-Driven Anomaly Detection in Cloud-Native Microservices: The Night’s Watch Algorithm," Appl. Sci. 2025. [Online]. Available: https://www.mdpi.com/2076-3417/15/23/12762

Uchi Uchibeke et al., "Before the Tool Call: Deterministic Pre-Action Authorization for Autonomous AI Agents", 2026. [Online]. Available: https://arxiv.org/pdf/2603.20953

McKinsey, "What is a context window?", 2024. [Online]. Available: https://www.mckinsey.com/featured-insights/mckinsey-explainers/what-is-a-context-window

Downloads

Published

14.02.2026

How to Cite

Somdutt Brajaraj Patnaik. (2026). Intent-Driven Fleets: An Agentic AI Framework for Cloud Elasticity. International Journal of Intelligent Systems and Applications in Engineering, 14(1s), 570–577. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/8220

Issue

Section

Research Article