Lenovo Expands Hybrid AI Portfolio to Cut Enterprise Inference Costs by Up to 8X

By Ridhika Basnet | June 25, 2026 | 5 min read

The company said its on-premise AI deployments can reduce cost per million tokens by up to 18X compared with model-as-a-service APIs.

On June 24, 2026, Lenovo expanded its Lenovo Hybrid AI Advantage platform with new inferencing and agentic AI capabilities designed to help enterprises deploy AI across devices, edge environments, data centers, and cloud infrastructure.

The company said the additions are aimed at improving AI economics, accelerating deployment, and supporting autonomous AI workloads.

"As captured by the Lenovo CIO Playbook 2026, 94% of organizations are planning to increase their AI investment over the next year, and enterprises are moving beyond AI experimentation and demanding measurable business outcomes," said Ashley Gorakhpurwalla, President of Infrastructure Solutions at Lenovo.

According to him, Lenovo's infrastructure helps businesses deploy AI where it creates the most value. He noted that this approach lowers data processing costs, speeds up deployment, and maintains strict corporate security and governance.

The expanded portfolio is built with technologies from NVIDIA, Intel, Red Hat, and Canonical and forms part of Lenovo's Hybrid AI Factory approach, which is designed to support AI deployment across hybrid environments, the company said.

What the Platforms Do

Lenovo introduced two new Hybrid AI Platform configurations. The first is a CPU-only platform powered by Intel Xeon 6 processors and Red Hat AI Enterprise. The company said the platform is designed to process approximately 2x more AI requests concurrently and supports workloads such as retrieval-augmented generation (RAG), customer service, and human resources applications.

The second is the Hybrid AI Platform (221), available with either Canonical Ubuntu and Kubernetes or Red Hat AI Enterprise. Lenovo said the platform can be deployed in as little as a few weeks and is designed to support enterprise AI workloads with governance and data sovereignty requirements.

According to the company, industry research shows that 92% of organizations deploying agentic AI report costs exceeding expectations. Lenovo said its total cost of ownership analysis found that on-premise deployments can deliver up to 8X lower cost per token than cloud infrastructure-as-a-service environments and up to 18X lower cost per million tokens than model-as-a-service APIs for sustained CPU and graphics processing unit (GPU) workloads.

The company also announced one-click deployment capabilities for AI agents across desktop and data center environments. Lenovo said its Knowledge Super Agent use cases, validated through its AI Library, have demonstrated savings of thousands of employee hours per organization.

Lenovo is also co-developing AI agents and skills for NVIDIA NemoClaw with enterprise customers and plans to extend agentic AI capabilities into retail through an AI-powered kiosk designed to assist shoppers in stores.

Key Takeaways

Lenovo's new Hybrid AI platform reduces enterprise inference costs by up to 18X compared to traditional APIs.
The expanded portfolio supports AI deployment across devices, edge, data centers, and cloud environments.
94% of organizations plan to increase AI investments, focusing on measurable business outcomes.
New configurations enhance AI request processing and support diverse applications like customer service and HR.
Lenovo's approach ensures data processing efficiency while maintaining corporate security and governance.

Lenovo Expands Hybrid AI Portfolio to Cut Enterprise Inference Costs by Up to 8X

Key Takeaways

Related Articles