When public cloud computing emerged in the late 2000s and began scaling through the early 2010s, the prevailing theory was straightforward. Cloud computing theory claimed nearly every workload would eventually migrate off-premises. The economics were compelling, the convenience undeniable, and the momentum felt unstoppable. More than 15 years later, enterprise IT leaders are still managing substantial on-premises infrastructure — and many are actively investing in upgrades.

Today, various research reports estimate that between 35 and 50 percent of workloads have moved to the cloud. Whether that figure is above or below 50 percent, it’s clear that workloads remain distributed. Cost, data sensitivity, regulatory compliance, latency requirements, and operational control all shape where a given workload belongs. The public cloud became one option in a complex portfolio, and the same will be true for AI workload placement.

Organizations have absorbed that lesson. In the early days, everyone ran AI proofs of concept in the cloud, but enterprise leaders are asking more specific questions on how to design scalable AI architecture. Today’s discussion centers on which AI workloads belong where and under what conditions? Cost recently surfaced as a major concern as AI token use skyrocketed. In many cases, AI costs escalated due to flawed policies that incentivized employees to consume as many tokens as possible to prove they were using AI.

Today, organizations are considering the rationale for keeping workloads on-premises and whether upgrading their on-premises technology will be a cost-benefit or a disadvantage. It is not an easy question to answer. No technology vendor has delivered a definitive framework or spreadsheet that simplifies that decision. What has happened is that every major cloud and hardware vendor now offers some combination of managed AI services and pre-validated AI factory reference designs intended to help organizations scale AI deployments beyond proof of concept.

The Hybrid AI Reality Is Already Here and Continues to Gain Momentum

Walk into a strategy conversation with a large enterprise today, and you will rarely encounter a pure cloud or pure on-premises AI strategy. Hybrid deployments are the operating assumption. The more substantive discussion is about placement logic — what drives a workload toward private infrastructure, what drives it toward a public cloud, and what governance and connectivity model bridges the two. It is worth noting up front that HPE, as a hardware and infrastructure company, has a clear commercial interest in upgrading on-premises deployments. That context does not invalidate the market need, but it is relevant when evaluating how the company frames its positioning.

Regulated financial institutions, healthcare systems, defense contractors, and national governments all have data residency requirements, sovereignty mandates, and security classifications must consider partitioning workloads between cloud and updated on-premises infrastructure.

The AI Infrastructure Market Responds to Private and Sovereign Demand

In 2026, most vendors are discussing which infrastructure advancements are needed to support agentic AI. While there were many hardware announcements at HPE’s annual Discover conference, the company also discussed updates to private cloud and sovereign AI infrastructure for agentic AI. HPE launched its first private cloud offerings roughly 2 years ago.

HPE is not alone in this space. Dell, Lenovo, and others have announced comparable on-premises AI infrastructure products built around Nvidia accelerators. Google Cloud was among the early movers in offering air-gapped, disconnected cloud offerings for regulated industries. The differentiation between these offerings — at the architecture, software, and services layer — is still being established in the market and warrants scrutiny from buyers evaluating alternatives.

HPE CEO Antonio Neri described the AI infrastructure decision as inseparable from data governance and sovereignty. Lopez Research has found this framing is consistent with what enterprise buyers in regulated industries report when asked about deployment constraints. HPE is delivering a pre-validated, purpose-built on-premises environment for AI workloads that reduces integration complexity and accelerates deployment timelines relative to assembling components independently. HPE also announced a Sovereign AI Factory configuration targeting governments and regulated industries, with built-in defense-grade security hardening, federal compliance readiness, and air-gapped operation. Let’s talk specifically about some of the ways HPE is addressing the agentic AI challenge.

Agentic AI Adds New Complexity to the Security and Governance Problem

Neri articulated a theme that was consistent across the 2026 enterprise technology conference season when he discussed how AI has moved beyond generative assistants to autonomous AI agents. Technology vendors have discussed AI agents for more than a year. Many enterprises are already encountering agentic AI through their existing SaaS platforms. More advanced organizations are building and deploying their own agents. Lopez Research’s conversations with early adopters consistently surface the same operational pain points, including multi-agent orchestration across enterprise applications, securing and permissioning AI agents, governance, and company-wide observability into what agents are doing.

Agentic AI introduces a security and governance surface for every organization that most existing enterprise security stacks were not designed to handle. Today’s security products were designed for individuals, not AI agents. These solutions are anchored on user credentials, access policies, and behavioral patterns. An AI agent, if allowed to do so, can operate autonomously, continuously, and at machine speed across multiple systems simultaneously. Bad situations propagate across workflows fast before any human reviewer is aware that something has gone wrong.

HPE’s response at Discover included a three-tier identity model for agentic workloads that includes user verification, agent-level governance, and human approval gates for sensitive actions. It also offers the ability to wrap agents built in any framework with security controls, including API protection, identity management, and encryption, without requiring code changes. Integration with Nvidia OpenShell provides isolated execution environments per agent. NeMo Guardrails enforce policy at the model level. Zerto integration enables rollback to a clean state if an agent executes incorrectly.

Private Cloud AI now includes a governed data layer with deep integration into the Nvidia AI Data Platform, giving enterprises a unified way to access, prepare, and manage data across their existing environments — no custom pipelines required. The HPE Alletra Storage MP Extend 1000 serves as the storage foundation, purpose-built for the performance demands of modern AI workloads. It adds real-time metadata enrichment and native MCP support, so agents and applications can retrieve the right data and context faster across both structured and unstructured data. HPE claims the result is a 7–12x faster time to value compared to building the environment yourself.

Once your data is governed and ready, Private Cloud AI delivers the infrastructure to scale inference. Multi-node inference allows larger models to be served across multiple systems, so capacity grows naturally with demand. A new unified gateway gives teams a single API for accessing both frontier and open-source models, with centralized credentials, budgets, and policies built in. New configurations now scale up to 256 GPUs, which includes the new ProLiant DL394 with Nvidia GPUs optimized specifically for inferencing and long-context workloads. Additionally, shared KV cache capabilities eliminate the need to repeatedly recompute context, reducing cost per first token and delivering significant performance gains across the board.

No architecture from any specific vendor will ever fully address a company’s governance or security problems. Still, buyers need to ensure that their vendors are addressing the problem and that they are willing to work with others to support a holistic approach. What HPE’s offering does represent is a concrete, specific engineering response to a recognized gap. Rami Rahim, HPE’s EVP and President of Networking, expanded on this in a separate day 2 session, arguing that the network itself must become an active enforcement layer for agentic security through zero-trust architecture, AI-driven anomaly detection, and automated policy enforcement.

Familiar AI Themes, With A Focus on Execution

Assessed across the first half of the 2026 enterprise technology conference season, HPE Discover did not introduce themes that were new to the industry conversation. Sovereign AI requirements, the governance gap in agentic systems, and hybrid placement logic have all been visible in analyst briefings, vendor roadmaps, and customer conversations for some time.

That observation, however, should not diminish the significance of what Neri and Rahim presented. Identifying a trend early is necessary but not sufficient. The value of HPE Discover lies not in the novelty of the concept but in the focus of execution. What HPE customers were looking for was the degree to which the company has translated an accurate read of market direction into deployable infrastructure that enterprises and governments can procure and operate today. Whether HPE’s implementation proves durable competitive differentiation in a space where competitors are moving quickly is a question the next twelve months will answer.

HPE’s position is architecturally sound and consistent with broader industry direction. The customer deployments already underway illustrate there’s real demand in this space from various industry segments. The U.S. Defense Information Systems Agency (DISA) awarded HPE a ten-year contract to modernize its digital and AI platform capabilities, requiring a NIST-compliant private cloud environment that meets federal security classifications. In Europe, HPE is building the HammerHAI system at the High-Performance Computing Center Stuttgart (HLRS) in Germany. It is a sovereign AI installation delivering more than 15 exaflops of peak AI inference performance for research institutions and industrial organizations that must comply with European data residency requirements. In healthcare, St. Jude Children’s Research Hospital is using HPE Private Cloud AI to bring AI capabilities to its clinical and research teams while protecting sensitive pediatric oncology data. These three deployments — federal defense, national research infrastructure, and regulated healthcare — represent the segment of buyers for whom private and sovereign AI is a requirement, not a preference.

Scaling AI Requires A Portfolio Approach

The cloud never won all of the workloads. The economics, the regulations, and the operational realities of enterprise IT ensured that on-premises infrastructure remained relevant long after public cloud momentum suggested otherwise. The same dynamics are now shaping AI infrastructure decisions.

For organizations in regulated industries, private and sovereign AI is not a conservative hedge against innovation. For many, it is the enabling condition for AI adoption at all. But private AI infrastructure is not a complete exit from cloud services, and buyers should be cautious about treating it as such.

Even organizations running heavily on-premises AI environments (regulated or not) will almost certainly rely on the public cloud for a portion of their AI workloads. The question is how much. AI model training is the most frequently cited example — training large foundation models requires burst compute capacity that few enterprises can justify owning outright. But the dependencies extend further. AI experimentation and prototyping typically benefit from the speed and low commitment of cloud environments before workloads are validated for on-premises production. And accessing specialized frontier models — from providers like Anthropic, Google, OpenAI, or open-source alternatives — will often happen via cloud APIs, particularly for capabilities that do not require fine-tuning on proprietary data.

In some cases, even those cloud touchpoints will need to meet sovereign requirements. Major hyperscalers have developed sovereign cloud offerings for regulated industries. A growing tier of neoclouds is positioning explicitly around sovereign infrastructure for organizations requiring local data residency, compliance certification, and jurisdictional control within a managed cloud model. The decision, in most cases, is not binary — it is a portfolio question about which workloads run where and what governance conditions each environment must satisfy.

Regardless of the type of organization you are, validated reference designs for private deployment that can scale in a reasonable timeframe with minimal execution risk serve a real purpose. AI is not trivial to engineer independently. Pre-validated stacks lower the barrier for organizations that need private AI the most but have the least tolerance for deployment risk. HPE has made a substantive set of announcements in this space. So have others. Organizations keep getting better solutions to the “How do I build AI?” question nearly every month. Perhaps, the bigger question businesses need to focus on is “What do we need to build?”

Share.
Exit mobile version