BIP America Latest News

collapse
Home / Daily News Analysis / Why the future of AI is on-premises - business advice from Dell Tech World 2026

Why the future of AI is on-premises - business advice from Dell Tech World 2026

May 31, 2026  Twila Rosenbaum  43 views
Why the future of AI is on-premises - business advice from Dell Tech World 2026

Nearly every technology conference today has a focus on artificial intelligence, and this week's Dell Technologies World was no exception. But what stood out was the focus on how businesses can actually execute on AI, especially by building more AI capabilities into their infrastructure. With rising costs, sovereignty requirements, and agent adoption, Dell's latest conference provided actionable advice for transitioning AI workloads to a hybrid infrastructure.

Top trends at Dell Tech World 2026 that are pushing businesses to increase their on-premises AI capabilities include increasing demand for data and AI sovereignty, the need for tighter governance, especially for agents, and more direct control over these critical systems. These factors are driving a paradigm shift away from purely cloud-based AI towards distributed, on-premises compute resources.

'Intelligence is becoming infrastructure'

In the opening keynote, Dell chairman and CEO Michael Dell set the tone by stating that the company is working to move AI closer to the data and infrastructure. "Abundant intelligence is here," Michael Dell told attendees. "Intelligence is becoming infrastructure." This sentiment captures a fundamental change in how enterprises perceive AI: no longer just a service accessed via APIs, but a core component of the IT stack that must be owned and managed internally.

Enterprises are realizing that piloting AI through a public cloud API is simple, but moving that pilot into large-scale production requires internal, dedicated server and compute resources. Without on-premises or hybrid architecture, enterprises face hurdles around data capacity and latency, especially as businesses move from traditional AI to agentic systems. The transition from proof-of-concept to production often reveals hidden costs and performance bottlenecks that cloud-only deployments cannot easily address.

One of the main pressures driving this shift is the increased costs of using cloud-based large language models (LLMs). In his keynote, Dell discussed the concept of "tokenomics," emphasizing that every token consumed in the cloud carries a price tag. Dell Technologies vice chairman and COO Jeff Clarke, during his Day 2 keynote, highlighted that token usage for AI has risen by 320-fold and that, by 2030, global token consumption is predicted to grow 3,400%. Such exponential growth makes cloud-based AI increasingly unsustainable for enterprises with heavy inference workloads.

To combat these costs, Dell's portfolio is designed to enable businesses to move more AI workloads from the cloud to on-prem compute. Dell speakers outlined multiple ways AI compute can be utilized internally, from local workstations to huge data center racks to edge devices, all of which can greatly reduce the cost of tokens for a business. For example, running an open-source model on dedicated hardware can slash per-token expenses by orders of magnitude compared to API-based consumption models.

Along with costs, the move to more internal compute resources is also being driven by trends in AI and data sovereignty. Research from Aberdeen shows that companies across the globe and all business sectors are putting a high value on keeping data and AI training out of the cloud and protected in company data centers. Sovereign AI requirements are becoming critical as regulations like GDPR in Europe, China's data security laws, and similar frameworks elsewhere mandate that sensitive data must remain within national borders.

Growing requirements for sovereign AI

At Dell Technologies World, several Dell speakers discussed the growing requirements for sovereign AI and how Dell can help customers meet these needs, including the introduction of the new Dell AI Data Platform. This platform aims to provide a unified data layer that supports on-premises AI workloads while ensuring compliance with local regulations. The platform integrates data management, governance, and AI orchestration in a way that keeps data under the enterprise's control.

Requirements for sovereign AI become even more important as businesses begin to adopt agents and agentic systems. With agents, not only do costs around tokens see significant growth (one case study at the show stated that a company exceeded an entire year's token budget by March once agents entered the equation), but the need for strict security, governance, and control becomes vital to prevent unintended consequences. Agents that operate autonomously can make decisions that have real-world impacts, so enterprises must be able to audit, trace, and roll back any action.

"When an agent takes an action on your behalf, you need to know what it did, why it did it, and how to undo it if it got it wrong," Jeff Clarke said in his keynote. This statement underscores the governance challenges that come with agentic AI. To address these, Dell announced several new offerings designed to help businesses build and manage agents responsibly. These include Dell Deskside Agentic AI, a development offering that includes workstations, Nvidia NemoClaw software, and Dell services. Also announced was support for Nvidia OpenShell, a sandboxed environment for building agents and enforcing corporate governance and privacy policies.

Dell Deskside Agentic AI provides a dedicated local workstation where developers can prototype agents without incurring cloud costs or exposing sensitive data. The integration with Nvidia NemoClaw allows for fine-tuning and evaluation of agent behaviors in a controlled setting. OpenShell, meanwhile, creates a safe sandbox where agents can be tested against corporate policies before being deployed into live environments.

Conflicting advice: Move fast versus go slow

Many of these announcements, sessions, and discussions at Dell Technologies World highlighted one of the main balancing acts of today's AI infrastructures. There were often seemingly conflicting statements, with talk about helping businesses "move fast" and "not be left behind" contrasting with practical sessions that highlighted going slow, ensuring security and governance, and starting small with AI and agents. This tension reflects the reality that AI adoption is not a one-size-fits-all process; organizations must find the right pace based on their risk tolerance, regulatory obligations, and existing infrastructure.

Also, many of the software offerings touted as solutions to AI and agent hurdles are still in beta or even alpha, and often state that they should not be used in production. This means that companies that want to stay secure and meet compliance requirements need to weigh whether these software tools are mature and secure enough to meet their requirements. Furthermore, the rapid evolution of the AI landscape means that enterprises must remain agile and continuously reassess their technology choices.

Beyond the specific announcements, Dell Tech World 2026 provided a broader framework for understanding the shift to on-premises AI. The conference emphasized that AI is not just about algorithms but about the entire infrastructure that supports them. From storage and networking to compute and data management, every component must be optimized for AI workloads. Dell highlighted its PowerEdge servers, PowerScale storage, and networking solutions as key enablers for on-prem AI deployments.

Another significant theme was the importance of data readiness. Many enterprises struggle to leverage AI effectively because their data is siloed, unclean, or not properly labeled. Dell's AI Data Platform aims to address this by providing a consolidated view of enterprise data and offering tools for data preparation and labeling. This allows data scientists to focus on model development rather than data engineering.

The conference also showcased several customer success stories. For instance, a large financial institution moved its fraud detection AI from the cloud to on-premises infrastructure and saw a 40% reduction in inference latency and a 60% decrease in operational costs. A healthcare provider adopted edge-based AI to process medical images locally, ensuring patient data never left the facility while still benefiting from advanced diagnostics.

Dell's emphasis on hybrid infrastructure reflects a broader industry trend. As the AI market matures, enterprises are realizing that a balanced approach—using the cloud for burst capacity and experimentation while relying on on-premises systems for steady-state production—offers the best of both worlds. This hybrid model also provides resilience against cloud outages and vendor lock-in.

For businesses looking for more measured and practical discussions about AI and agentic systems, Dell Technologies World 2026 offered many sessions, product offerings, and demonstrations of tools and solutions to help them take practical steps toward building an AI enterprise. From workshops on token optimization to hands-on labs for agent development, the conference provided concrete guidance for organizations at every stage of their AI journey.


Source: ZDNET News


Share:

Your experience on this site will be improved by allowing cookies Cookie Policy