vRAN market disappoints – just like OpenRAN and mobile 5G

Most wireless network operators are not convinced virtual RAN (vRAN) [1.] is worth the effort to deploy. Omdia, an analyst company owned by Informa, put vRAN’s share of the total market for RAN baseband products at just 10% in 2023. It is growing slowly, with 20% market share forecast by 2028, but it far from being the default RAN architectural choice.

Among the highly touted benefits of virtualization is the ability for RAN developers to exploit the much bigger economies of scale found in the mainstream IT market. “General-purpose technology will eventually have so much investment in it that it will outpace custom silicon,” said Sachin Katti, the general manager of Intel’s network and edge group, during a previous Light Reading interview.

Note 1. The key feature of vRAN is the virtualization of RAN functions, allowing operators to perform baseband operations on standard servers instead of dedicated hardware.  The Asia Pacific region is currently leading in vRAN adoption due to rapid 5G deployment in countries like China, South Korea, and Japan. Samsung has established a strong presence as a supplier of vRAN equipment and software.

The whole market for RAN products generated revenues of just $40 billion in 2023. Intel alone made $54.2 billion in sales that same year.  Yet Huawei, Ericsson and Nokia, the big players in RAN base station technology, have continued to miniaturize and advance their custom chips. Nokia boasts 5-nanometer chips in its latest products and last year lured Derek Urbaniak, a highly regarded semiconductor expert, from Ericsson in a sign it wants to play an even bigger role in custom chip development.

Ericsson collaborates closely with Intel on virtual RAN, and yet it has repeatedly insisted its application-specific integrated circuits (ASICs) perform better than Intel’s CPUs in 5G. One year ago, Michael Begley, Ericsson’s head of RAN compute, told Light Reading that “purpose-built hardware will continue to be the most energy-efficient and compact hardware for radio site deployments going forward.”

Intel previously suffered delays when moving to smaller designs and there is gloominess about its prospects as note in several IEEE Techblog posts like this one and this one. Intel suffered a $17 billion loss for the quarter ending in September, after reporting a small $300 million profit a year before. Sales fell 6% year-over-year, to $13.3 billion, over this same period.

Unfortunately, for telcos eyeing virtualization, Intel is all they really have. Its dominance of the small market for virtual RAN has not been weakened in the last couple of years, leaving operators with no viable alternatives. This was made apparent in a recent blog post by Ericsson, which listed Intel as the only commercial-grade chip solution for virtual RAN. AMD was at the “active engagement” stage, said Ericsson last November. Processors based on the blueprints of ARM, a UK-based chip designer that licenses its designs, were not even mentioned.

The same economies-of-scale case for virtual RAN is now being made about Nvidia and its graphical processing units (GPUs), which Nvidia boss Jensen Huang seems eager to pitch as a kind of general-purpose AI successor to more humdrum CPUs. If the RAN market is too small, and its developers must ride in the slipstream of a much bigger market, Nvidia and its burgeoning ecosystem may seem a safer bet than Intel. And the GPU maker already has a RAN pitch, including a lineup of Arm-based CPUs to host some of the RAN software.

Semiconductor-related economies of scale, should not be the sole benefit of a virtual RAN. “With a lot of the work that’s been done around orchestration, you can deploy new software to hundreds of sites in a couple of hours in a way that was not feasible before,” said Alok Shah of Samsung Electronics. Architecturally, virtualization should allow an operator to host its RAN on the same cloud-computing infrastructure used for other telco and IT workloads. With a purpose-built RAN, an operator would be using multiple infrastructure platforms.

In telecom markets without much fiber or fronthaul infrastructure there is unlikely to be much centralization of RAN compute. This necessitates the deployment of servers at mast sites, where it is hard to see them being used for anything but the RAN. Even if a company wanted to host other applications at a mobile site, the processing power of Sapphire Rapids, the latest Intel generation, is fully consumed by the functions of the virtual distributed unit (vDU), according to Shah. “I would say the vDU function is kind of swallowing up the whole server,” he said.

Indeed, for all the talk of total cost of ownership (TCO) savings, some deployments of Sapphire Rapids have even had to feature two servers at a site to support a full 5G service, according to Paul Miller, the chief technology officer of Wind River, which provides the cloud-computing platform for Samsung’s virtual RAN in Verizon’s network.  Miller expects that to change with Granite Rapids, the forthcoming successor technology to Sapphire Rapids. “It’s going to be a bit of a sea change for the network from a TCO perspective – that you may be able to get things that took two servers previously, like low-band and mid-band 5G, onto a single server,” he said.

Samsung’s Shah is hopeful Granite Rapids will even free up compute capacity for other types of applications. “We’ll have to see how that plays out, but the opportunity is there, I think, in the future, as we get to that next generation of compute.” In the absence of many alternative processor platforms, especially for telcos rejecting the inline virtual RAN approach, Intel will be under pressure to make sure the journey for Granite Rapids is less turbulent than it sounds.

Another challenge is the mobile backhaul, which is expected to limit the growth of the vRAN industry. Backhaul connectivity ia central s widely used in wireless networks to transfer a signal from a remote cell site to the core network (typically the edge of the Internet). The two main methods of mobile backhaul implementations are fiber-based and wireless point-to-point backhaul.

The pace of data delivery suffers in tiny cell networks with poor mobile network connectivity. Data management is becoming more and more important as tiny cells are employed for network connectivity. Increased data traffic across small cells, which raises questions about data security, is mostly to blame for poor data management. vRAN solutions promise improved network resiliency and utilization, faster network routing, and better-optimized network architecture to meet the diverse 5G requirements of enterprise customers.

References:

https://www.lightreading.com/5g/virtual-ran-still-seems-to-be-not-worth-the-effort

https://www.ericsson.com/en/blog/north-america/2024/open-ran-progress-report

https://www.sdxcentral.com/5g/ran/definitions/vran/

https://www.businessresearchinsights.com/market-reports/virtualized-radio-access-network-vran-market-106129

https://www.globalgrowthinsights.com/market-reports/virtualized-radio-access-network-vran-market-100486

LightCounting: Open RAN/vRAN market is pausing and regrouping

Dell’Oro: Private 5G ecosystem is evolving; vRAN gaining momentum; skepticism increasing

Huawei CTO Says No to Open RAN and Virtualized RAN

Heavy Reading: How network operators will deploy Open RAN and cloud native vRAN

CES 2025: Intel announces edge compute processors with AI inferencing capabilities

At CES 2025 today, Intel unveiled the new Intel® Core™ Ultra (Series 2) processors, designed to revolutionize mobile computing for businesses, creators and enthusiast gamers. Intel said “the new processors feature cutting-edge AI enhancements, increased efficiency and performance improvements.”

“Intel Core Ultra processors are setting new benchmarks for mobile AI and graphics, once again demonstrating the superior performance and efficiency of the x86 architecture as we shape the future of personal computing,” said Michelle Johnston Holthaus, interim co-CEO of Intel and CEO of Intel Products. “The strength of our AI PC product innovation, combined with the breadth and scale of our hardware and software ecosystem across all segments of the market, is empowering users with a better experience in the traditional ways we use PCs for productivity, creation and communication, while opening up completely new capabilities with over 400 AI features. And Intel is only going to continue bolstering its AI PC product portfolio in 2025 and beyond as we sample our lead Intel 18A product to customers now ahead of volume production in the second half of 2025.”

Intel also announced new edge computing processors, designed to provide scalability and superior performance across diverse use cases. Intel Core Ultra processors were said to deliver remarkable power efficiency, making them ideal for AI workloads at the edge, with performance gains that surpass competing products in critical metrics like media processing and AI analytics. Those edge processors are targeted at compute servers running in hospitals, retail stores, factory floors and other “edge” locations that sit between big data centers and end-user devices. Such locations are becoming increasingly important to telecom network operators hoping to sell AI capabilities, private wireless networks, security offerings and other services to those enterprise locations.

Intel edge products launching today at CES include:

  • Intel® Core™ Ultra 200S/H/U series processors (code-named Arrow Lake).
  • Intel® Core™ 200S/H series processors (code-named Bartlett Lake S and Raptor Lake H Refresh).
  • Intel® Core™ 100U series processors (code-named Raptor Lake U Refresh).
  • Intel® Core™ 3 processor and Intel® Processor (code-named Twin Lake).

“Intel has been powering the edge for decades,” said Michael Masci, VP of product management in Intel’s edge computing group, during a media presentation last week.  According to Masci, AI is beginning to expand the edge opportunity through inferencing [1.].  “Companies want more local compute. AI inference at the edge is the next major hotbed for AI innovation and implementation,” he added.

Note 1. Inferencing in AI refers to the process where a trained AI model makes predictions or decisions based on new data, rather than previously stored “training models.” It’s essentially AI’s ability to apply learned knowledge on fresh inputs in real-time. Edge computing plays a critical role in inferencing, because it brings it closer to users. That lowers latency (much faster AI responses) and can also reduce bandwidth costs and ensure privacy and security as well.

Editor’s Note: Intel’s edge compute business – the one pursuing AI inferencing – is in in its Client Computing Group (CCG) business unit. Intel’s chips for telecom operators reside inside its NEX business unit.

Intel’s Masci specifically called out Nvidia’s GPU chips, claiming Intel’s new silicon lineup supports up to 5.8x faster performance and better usage per watt.  Indeed, Intel claims their “Core™ Ultra 7 processor  uses about one-third fewer TOPS (Trillions Operations Per Second) than Nvidia’s Jetson AGX Orin, but beats its competitor with media performance that is up to 5.6 times faster, video analytics performance that is up to 3.4x faster and performance per watt per dollar up to 8.2x better.”

………………………………………………………………………………………………………………………………………………………………………………………………………………………………………………………….

However, Nvidia has been using inference in its AI chips for quite some time. Company officials last month confirmed that 40% of Nvidia’s revenues come from AI inference, rather than AI training efforts in big data centers.  Colette Kress, Nvidia Executive Vice President and Chief Financial Officer, said, “Our architectures allows an end-to-end scaling approach for them to do whatever they need to in the world of accelerated computing and Ai. And we’re a very strong candidate to help them, not only with that infrastructure, but also with the software.”

“Inference is super hard. And the reason why inference is super hard is because you need the accuracy to be high on the one hand. You need the throughput to be high so that the cost could be as low as possible, but you also need the latency to be low,” explained Nvidia CEO Jensen Huang during his company’s recent quarterly conference call.

“Our hopes and dreams is that someday, the world does a ton of inference. And that’s when AI has really succeeded, right? It’s when every single company is doing inference inside their companies for the marketing department and forecasting department and supply chain group and their legal department and engineering, and coding, of course. And so we hope that every company is doing inference 24/7.”

……………………………………………………………………………………………………………………………………………………………………………………………………………………………………………………………….

Sadly for its many fans (including this author), Intel continues to struggle in both data center processors and AI/ GPU chips. The Wall Street Journal recently reported that “Intel’s perennial also-ran, AMD, actually eclipsed Intel’s revenue for chips that go into data centers. This is a stunning reversal: In 2022, Intel’s data-center revenue was three times that of AMD.”

Even worse for Intel, more and more of the chips that go into data centers are GPUs and Intel has minuscule market share of these high-end chips. GPUs are used for training and delivering AI.  The WSJ notes that many of the companies spending the most on building out new data centers are switching to chips that have nothing to do with Intel’s proprietary architecture, known as x86, and are instead using a combination of a competing architecture from ARM and their own custom chip designs.  For example, more than half of the CPUs Amazon has installed in its data centers over the past two years were its own custom chips based on ARM’s architecture, Dave Brown, Amazon vice president of compute and networking services, said recently.

This displacement of Intel is being repeated all across the big providers and users of cloud computing services. Microsoft and Google have also built their own custom, ARM-based CPUs for their respective clouds. In every case, companies are moving in this direction because of the kind of customization, speed and efficiency that custom silicon supports.

References:

https://www.intel.com/content/www/us/en/newsroom/news/2025-ces-client-computing-news.html#gs.j0qbu4

https://www.intel.com/content/www/us/en/newsroom/news/2025-ces-client-computing-news.html#gs.j0qdhd

https://seekingalpha.com/article/4741811-nvidia-corporation-nvda-ubs-global-technology-conference-transcript

https://www.wsj.com/tech/intel-microchip-competitors-challenges-562a42e3

https://www.lightreading.com/the-edge-network/intel-desperate-for-an-edge-over-nvidia-with-ai-inferencing

Massive layoffs and cost cutting will decimate Intel’s already tiny 5G network business

WSJ: China’s Telecom Carriers to Phase Out Foreign Chips; Intel & AMD will lose out

The case for and against AI-RAN technology using Nvidia or AMD GPUs

Superclusters of Nvidia GPU/AI chips combined with end-to-end network platforms to create next generation data centers

FT: Nvidia invested $1bn in AI start-ups in 2024

AI winner Nvidia faces competition with new super chip delayed

AI Frenzy Backgrounder; Review of AI Products and Services from Nvidia, Microsoft, Amazon, Google and Meta; Conclusions