Nvidia CEO Huang: AI is the largest infrastructure buildout in human history; AI Data Center CAPEX will generate new revenue streams for operators

Executive Summary:

In a February 6, 2026 CNBC interview with Scott Wapner, Nvidia CEO Jensen Huang [1.] characterized the current AI build‑out as “the largest infrastructure buildout in human history,” driven by exceptionally high demand for compute from hyperscalers and AI companies. He described AI infrastructure spending as “through the roof” and called this a “once-in-a-generation infrastructure buildout,” specifically highlighting that demand for Nvidia’s Blackwell chips and the upcoming Vera Rubin platform is “sky-high.” He emphasized that the shift from experimental AI to AI as a fundamental utility has reached a definitive inflection point for every major industry.

Jensen forecasts that a roughly 7‑ to 8‑year AI investment cycle lies ahead, with capital intensity justified because deployed AI infrastructure is already generating rising cash flows for operators. He maintains that the widely cited ~$660 billion AI data center capex pipeline is sustainable, on the grounds that GPUs and surrounding systems are revenue‑generating assets, not speculative overbuild. In his view, as long as customers can monetize AI workloads profitably, they will “keep multiplying their investments,” which underpins continued multi‑year GPU demand, including for prior‑generation parts that remain fully leased.

Note 1.  As the undisputed leader in AI hardware (GPU chips and networking equipment via its Mellanox acquisition), Nvidia has every incentive to make positive remarks and forecasts about the AI build‑out boom. Reader discretion is advised regarding Huang’s extremely bullish, “all-in on AI” remarks.

Huang reiterated that AI will “fundamentally change how we compute everything,” shifting data centers from general‑purpose CPU‑centric architectures to accelerated computing built around GPUs and dense networking. He emphasizes Nvidia’s positioning as a full‑stack infrastructure and computing platform provider—chips, systems, networking, and software—rather than a standalone chip vendor. He accurately stated that Nvidia designs “all components of AI infrastructure” so that system‑level optimization (GPU, NIC, interconnect, software stack) can deliver performance gains that outpace what is possible with a single chip under a slowing Moore’s Law. The installed base is presented as productive: even six‑year‑old A100‑class GPUs are described as fully utilized through leasing, underscoring persistent elasticity of AI compute demand across generations.

AI Poster Children – OpenAI and Anthropic:

Huang praised OpenAI and Anthropic, the two leading artificial intelligence labs, which both use Nvidia chips through cloud providers. Nvidia invested $10 billion in Anthropic last year, and Huang said earlier this week that the chipmaker will invest heavily in OpenAI’s next fundraising round.

“Anthropic is making great money. Open AI is making great money,” Huang said. “If they could have twice as much compute, the revenues would go up four times as much.”

He said that all the graphics processing units that Nvidia has sold in the past — even six-year-old chips such as the A100 — are currently being rented, reflecting sustained demand for AI computing power.

“To the extent that people continue to pay for the AI and the AI companies are able to generate a profit from that, they’re going to keep on doubling, doubling, doubling, doubling,” Huang said.

Economics, utilization, and returns:

On economics, Huang’s central claim is that AI capex converts into recurring, growing revenue streams for cloud providers and AI platforms, which differentiates this cycle from prior overbuilds. He highlights very high utilization: GPUs from multiple generations remain in service, with cloud operators effectively turning them into yield‑bearing infrastructure.

This utilization and monetization profile underlies his view that the capex “arms race” is rational: when AI services are profitable, incremental racks of GPUs, network fabric, and storage can be modeled as NPV‑positive infrastructure projects rather than speculative capacity. He implies that concerns about a near‑term capex cliff are misplaced so long as end‑market AI adoption continues to inflect.

Competitive and geopolitical context:

Huang acknowledges intensifying global competition in AI chips and infrastructure, including from Chinese vendors such as Huawei, especially under U.S. export controls that have reduced Nvidia’s China revenue share to roughly half of pre‑control levels. He frames Nvidia’s strategy as maintaining an innovation lead so that developers worldwide depend on its leading‑edge AI platforms, which he sees as key to U.S. leadership in the AI race.

He also ties AI infrastructure to national‑scale priorities in energy and industrial policy, suggesting that AI data centers are becoming a foundational layer of economic productivity, analogous to past buildouts in electricity and the internet.

Implications for hyperscalers and chips:

Hyperscalers (and also Nvidia customers) Meta, Amazon, Google/Alphabet and Microsoft recently stated that they plan to dramatically increase spending on AI infrastructure in the years ahead. In total, these hyperscalers could spend $660 billion on capital expenditures in 2026 [2.], with much of that spending going toward buying Nvidia’s chips. Huang’s message to them is that AI data centers are evolving into “AI factories” where each gigawatt of capacity represents tens of billions of dollars of investment spanning land, compute, and networking. He suggests that the hyperscaler industry—roughly a $2.5 trillion sector with about $500 billion in annual capex transitioning from CPU to GPU‑centric generative AI—still has substantial room to run.

Note 2.  An understated point is that while these hyperscalers are spending hundreds of billions of dollars on AI data centers and Nvidia chips/equipment, they are simultaneously laying off tens of thousands of employees. For example, Amazon recently announced 16,000 job cuts this year after 14,000 layoffs last October.

From a chip‑level perspective, he argues that Nvidia’s competitive moat stems from tightly integrated hardware, networking, and software ecosystems rather than any single component, positioning the company as the systems architect of AI infrastructure rather than just a merchant GPU vendor.

References:

https://www.cnbc.com/2026/02/06/nvidia-rises-7percent-as-ceo-says-660-billion-capex-buildout-is-sustainable.html

Big tech spending on AI data centers and infrastructure vs the fiber optic buildout during the dot-com boom (& bust)

Analysis: Cisco, HPE/Juniper, and Nvidia network equipment for AI data centers

Networking chips and modules for AI data centers: Infiniband, Ultra Ethernet, Optical Connections

Will billions of dollars big tech is spending on Gen AI data centers produce a decent ROI?

Superclusters of Nvidia GPU/AI chips combined with end-to-end network platforms to create next generation data centers

184K global tech layoffs in 2025 to date; ~27.3% related to AI replacing workers

 

 

Analysis: Edge AI and Qualcomm’s AI Program for Innovators 2026 – APAC for startups to lead in AI innovation

Qualcomm is a strong believer in Edge AI as an enabler of faster, more secure, and energy-efficient processing directly on devices—rather than the cloud—unlocking real-time intelligence for industries like robotics and smart cities.

In support of that vision, the fabless SoC company announced the official launch of its Qualcomm AI Program for Innovators (QAIPI) 2026 – APAC, a regional startup incubation initiative that supports startups across Japan, Singapore, and South Korea in advancing the development and commercialization of innovative edge AI solutions.

Building on Qualcomm’s commitment to edge AI innovation, the second edition of QAIPI-APAC invites startups to develop intelligent solutions across a broad range of edge-AI applications using Qualcomm Dragonwing™ and Snapdragon® platforms, together with the new Arduino® UNO Q development board, strengthening their pathway toward global commercialization.

Startups gain comprehensive support and resources, including access to Qualcomm Dragonwing™ and Snapdragon® platforms, the Arduino® UNO Q development board, technical guidance and mentorship, a grant of up to US$10,000, and eligibility for up to US$5,000 in patent filing incentives, accelerating AI product development and deployment.

Applications are open now through April 30, 2026 and will be evaluated based on innovation, technical feasibility, potential societal impact, and commercial relevance. The program will be implemented in two phases. The application phase is open to eligible startups incorporated and registered in Japan, Singapore, or South Korea. Shortlisted startups will enter the mentorship phase, receiving one-on-one guidance, online training, technical support, and access to Qualcomm-powered hardware platforms and development kits for product development. They will also receive a shortlist grant of up to US$10,000 and may be eligible for a patent filing incentive of up to US$5,000. At the conclusion of the program, shortlisted startups may be invited to showcase their innovations at a signature Demo Day in late 2026, engaging with industry leaders, investors, and potential collaborators across the APAC innovation ecosystem.

Comment and Analysis:

Qualcomm is a strong believer in Edge AI—the practice of running AI models directly on devices (smartphones, cars, IoT, PCs) rather than in the cloud—because they view it as the next major technological paradigm shift, overcoming limitations inherent in cloud computing. Despite the challenges of power consumption and processing limitations, Qualcomm’s strategy hinges on specialized, heterogeneous computing rather than relying solely on RISC-based CPU cores.

Key Issues for Qualcomm’s Edge AI solutions:

1.  The “Heterogeneous” Solution to Processing Limits
While it is true that standard CPU cores (even RISC-based ones) are inefficient for AI, Qualcomm does not use them for AI workloads. Instead, they use a heterogeneous architecture:
  • Qualcomm® AI Engine: This combines specialized hardware, including the Hexagon NPU (Neural Processing Unit), Adreno GPU, and CPU. The NPU is specifically designed to handle high-performance, complex AI workloads (like Generative AI) far more efficiently than a generic CPU.
  • Custom Oryon CPU: The latest Snapdragon X Elite platform features customized cores that provide high performance while outperforming traditional x86 solutions in power efficiency for everyday tasks.
2. Overcoming Power Consumption (Performance/Watt)
Qualcomm focuses on “Performance per Watt” rather than raw power.
  • Specialization Saves Power: By using specialized AI engines (NPUs) rather than general-purpose CPU/GPU cores, Qualcomm can run inference tasks at a fraction of the power cost.
  • Lower Overall Energy: Doing AI at the edge can save total energy by avoiding the need to send data to a power-hungry data center, which requires network infrastructure, and then sending it back.
  • Intelligent Efficiency: The Snapdragon 8 Elite, for example, saw a 27% reduction in power consumption while increasing AI performance significantly.
3. Critical Advantages of Edge over Cloud
Qualcomm believes edge is essential because cloud AI cannot solve certain critical problems:
  • Instant Responsiveness (Low Latency): For autonomous vehicles or industrial robotics, a few milliseconds of latency to the cloud can be catastrophic. Edge AI provides real-time, instantaneous analysis.
  • Privacy and Security: Data never leaves the device. This is crucial for privacy-conscious users (biometrics) and compliance (GDPR), which is a major advantage over cloud-based AI.
  • Offline Capability: Edge devices, such as agricultural sensors or smart home devices in remote areas, continue to function without internet connectivity.
4. Market Expansion and Economic Drivers
  • Diversification: With the smartphone market maturing, Qualcomm sees the “Connected Intelligent Edge” as a huge growth opportunity, extending their reach into automotive, IoT, and PCs.
  • “Ecosystem of You”: Qualcomm aims to connect billions of devices, making AI personal and context-aware, rather than generic.
5. Bridging the Gap: Software & Model Optimization
Qualcomm is not just providing hardware; they are simplifying the deployment of AI:
  • Qualcomm AI Hub: This makes it easier for developers to deploy optimized models on Snapdragon devices.
  • Model Optimization: They specialize in making AI models smaller and more efficient (using quantization and specialized AI inference) to run on devices without requiring massive, cloud-sized computing power (a minimal quantization sketch follows this list).
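To make the quantization point concrete, here is a minimal, illustrative sketch of symmetric post-training weight quantization in Python/NumPy. This shows the generic technique only—it is not Qualcomm AI Hub’s actual toolchain or API: float32 weights are mapped to int8 with a single per-tensor scale, cutting memory and bandwidth roughly 4x at the cost of a small rounding error.

```python
import numpy as np

def quantize_int8(weights: np.ndarray):
    """Symmetric post-training quantization: map float32 weights to int8."""
    scale = np.max(np.abs(weights)) / 127.0              # one scale per tensor
    q = np.clip(np.round(weights / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
    return q.astype(np.float32) * scale

w = np.random.randn(256, 256).astype(np.float32)         # toy weight matrix
q, scale = quantize_int8(w)
print(f"memory: {w.nbytes // 1024} KB -> {q.nbytes // 1024} KB")  # 256 KB -> 64 KB
print(f"max abs rounding error: {np.max(np.abs(w - dequantize(q, scale))):.4f}")
```

Production flows add calibration data, per-channel scales, and NPU-specific operator support, but the memory/power trade-off shown here is the core reason quantized models fit within edge-device constraints.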
In summary, Qualcomm believes in Edge AI because they are building highly specialized hardware designed to excel within tight power and thermal constraints.
……………………………………………………………………………………………………………………………………………………………………………

References:

https://www.prnewswire.com/apac/news-releases/qualcomm-ai-program-for-innovators-2026–apac-officially-kicks-off—empowering-startups-across-japan-singapore-and-south-korea-to-lead-the-ai-innovation-302676025.html

Qualcomm CEO: AI will become pervasive, at the edge, and run on Snapdragon SoC devices

Huawei, Qualcomm, Samsung, and Ericsson Leading Patent Race in $15 Billion 5G Licensing Market

Private 5G networks move to include automation, autonomous systems, edge computing & AI operations

Nvidia’s networking solutions give it an edge over competitive AI chip makers

Nvidia AI-RAN survey results; AI inferencing as a reinvention of edge computing?

CES 2025: Intel announces edge compute processors with AI inferencing capabilities

Qualcomm CEO: expect “pre-commercial” 6G devices by 2028

 

Analysis: SpaceX FCC filing to launch up to 1M LEO satellites for solar powered AI data centers in space

SpaceX has applied to the Federal Communications Commission (FCC) for permission to launch up to 1 million LEO satellites for a new solar-powered AI data center system in space. The private company, 40% owned by Elon Musk, envisions an orbital data center system with “unprecedented computing capacity” needed to run large-scale AI inference and applications for billions of users, according to SpaceX’s filing submitted late on Friday.

Data centers are the physical backbone of artificial intelligence, requiring massive amounts of power. “By directly harnessing near-constant solar power with little operating or maintenance costs, these satellites will achieve transformative cost and energy efficiency while significantly reducing the environmental impact associated with terrestrial data centers,” the FCC filing said. Musk would need the telecom regulator’s approval to move forward.

Credit: Blueee/Alamy Stock Photo

The proposed new satellites would operate in “narrow orbital shells” of up to 50 kilometers each, at altitudes between 500 kilometers and 2,000 kilometers, with inclinations ranging from 30 degrees up to “sun-synchronous orbit inclinations” to capture power from the sun. The system is designed to be interconnected via optical links with existing Starlink broadband satellites, which would transmit data traffic back to ground Earth stations.

SpaceX’s request bets heavily on reduced costs of Starship, the company’s next-generation reusable rocket under development.  Starship has test-launched 11 times since 2023. Musk expects the rocket, which is crucial for expanding Starlink with more powerful satellites, to put its first payloads into orbit this year.
“Fortunately, the development of fully reusable launch vehicles like Starship that can deploy millions of tons of mass per year to orbit when launching at rate, means on-orbit processing capacity can reach unprecedented scale and speed compared to terrestrial buildouts, with significantly reduced environmental impact,” SpaceX said.
SpaceX is positioning orbital AI compute as the definitive solution to the terrestrial capacity crunch, arguing that space-based infrastructure represents the most efficient path for scaling next-generation workloads. As ground-based data centers face increasing grid density constraints and power delivery limitations, SpaceX intends to leverage high-availability solar irradiation to bypass Earth’s energy bottlenecks. The company’s technical rationale hinges on several key architectural advantages:
  • Energy Density & Sustainability: By tapping into “near-constant solar power,” SpaceX aims to utilize a fraction of the Sun’s output—noting that even a millionth of its energy exceeds current civilizational demand by four orders of magnitude.
  • Thermal Management: To address the cooling requirements of high-density AI clusters, these satellites will utilize radiative heat dissipation, eliminating the water-intensive cooling loops required by terrestrial facilities.
  • Opex & Scalability: The financial viability of this orbital layer is tethered to the Starship launch platform. SpaceX anticipates that the radical reduction in $/kg launch costs provided by a fully reusable heavy-lift vehicle will enable rapid scaling and ensure that, within years, the lowest LCOA (Levelized Cost of AI) will be achieved in orbit.
The transition to orbital AI compute introduces a fundamental shift in network topology, moving processing from terrestrial hubs to a decentralized, space-based edge layer. The latency implications are characterized by three primary architectural factors (a back-of-the-envelope latency comparison follows this list):
  • Vacuum-Speed Data Transmission: In a vacuum, light propagates roughly 50% faster than through terrestrial fiber optic cables. By utilizing Starlink’s optical inter-satellite links (OISLs)—a “petabit” laser mesh—data can bypass terrestrial bottlenecks and subsea cables. This potentially reduces intercontinental latency for AI inference to under 50ms, surpassing many long-haul terrestrial routes.
  • Edge-Native Processing & Data Gravity: Current workflows require downlinking massive raw datasets (e.g., Synthetic Aperture Radar imagery) for terrestrial processing, a process that can take hours. Shifting to orbital edge computing allows for “in-situ” AI inference, processing data onboard to deliver actionable insights in minutes rather than hours. This “Space Cloud” architecture eliminates the need to route raw data back to the Earth’s internet backbone, reducing data transmission volumes by up to 90%.
  • LEO Proximity vs. Terrestrial Hops: While terrestrial fiber remains the “gold standard” for short-range latency (typically 1–10ms), it is often hindered by inefficient routing and multiple hops. SpaceX’s LEO constellation, operating at altitudes between 340km and 614km, currently delivers median peak-hour latencies of ~26ms in the US. Future orbital configurations may feature clusters at varying 50km intervals to optimize for specific workload and latency tiers.
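The latency claims above can be sanity-checked with a simple propagation model. The Python sketch below assumes a typical fiber refractive index of ~1.47, a ~5,570 km New York–London great-circle path, and ~550 km up/down hops to a Starlink-like LEO mesh; it ignores routing, switching, and queuing delay, so the numbers are illustrative rather than measured.

```python
C_VACUUM_KM_S = 299_792                      # speed of light in vacuum
FIBER_INDEX = 1.47                           # typical optical fiber (assumption)
C_FIBER_KM_S = C_VACUUM_KM_S / FIBER_INDEX   # ~204,000 km/s in glass

def one_way_ms(distance_km: float, speed_km_s: float) -> float:
    """Propagation-only latency; ignores routing/switching delay."""
    return distance_km / speed_km_s * 1_000

PATH_KM = 5_570                              # ~New York-London great circle
print(f"fiber : {one_way_ms(PATH_KM, C_FIBER_KM_S):.1f} ms")             # ~27 ms
print(f"vacuum: {one_way_ms(PATH_KM, C_VACUUM_KM_S):.1f} ms")            # ~19 ms
# LEO laser mesh: add up/down hops, but propagate at vacuum speed in between.
print(f"LEO   : {one_way_ms(PATH_KM + 2 * 550, C_VACUUM_KM_S):.1f} ms")  # ~22 ms
```

The ~47% speed advantage of vacuum over glass is where the “roughly 50% faster” figure comes from; real fiber routes are also longer than the great-circle distance, which widens the gap in practice.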

………………………………………………………………………………………………………………………………………………………………………………………

The SpaceX FCC filing on Friday follows an exclusive report by Reuters that Elon Musk is considering merging SpaceX with his xAI (Grok chatbot) company ahead of an IPO later this year. Under the proposed merger, shares of xAI would be exchanged for shares in SpaceX. Two entities have been set up in Nevada to facilitate the transaction, Reuters said.  Musk also runs electric automaker Tesla, tunnel company The Boring Co. and neurotechnology company Neuralink.

………………………………………………………………………………………………………………………………………………………………………………………

References:

https://www.reuters.com/business/aerospace-defense/spacex-seeks-fcc-nod-solar-powered-satellite-data-centers-ai-2026-01-31/

https://www.lightreading.com/satellite/spacex-seeks-fcc-approval-for-mega-ai-data-center-constellation

https://www.reuters.com/world/musks-spacex-merger-talks-with-xai-ahead-planned-ipo-source-says-2026-01-29/

Google’s Project Suncatcher: a moonshot project to power ML/AI compute from space

Blue Origin announces TeraWave – satellite internet rival for Starlink and Amazon Leo

China ITU filing to put ~200K satellites in low earth orbit while FCC authorizes 7.5K additional Starlink LEO satellites

Amazon Leo (formerly Project Kuiper) unveils satellite broadband for enterprises; Competitive analysis with Starlink

Telecoms.com’s survey: 5G NTNs to highlight service reliability and network redundancy

 

Huge significance of EchoStar’s AWS-4 spectrum sale to SpaceX

U.S. BEAD overhaul to benefit Starlink/SpaceX at the expense of fiber broadband providers

Telstra selects SpaceX’s Starlink to bring Satellite-to-Mobile text messaging to its customers in Australia

SpaceX launches first set of Starlink satellites with direct-to-cell capabilities

AST SpaceMobile to deliver U.S. nationwide LEO satellite services in 2026

GEO satellite internet from HughesNet and Viasat can’t compete with LEO Starlink in speed or latency

How will fiber and equipment vendors meet the increased demand for fiber optics in 2026 due to AI data center buildouts?

Subsea cable systems: the new high-capacity, high-resilience backbone of the AI-driven global network

China’s open source AI models to capture a larger share of 2026 global AI market

Overview of AI Models – China vs. U.S.:

Chinese AI language models (LMs) have advanced rapidly and are now contending with the U.S. for global market leadership. Alibaba’s Qwen-Image-2512 is emerging as a top-performing, free, open-source model capable of high-fidelity human, landscape, and text rendering. Other key, competitive models include Zhipu AI’s GLM-Image (trained on domestic chips), ByteDance’s Seedream 4.0, and UNIMO-G.

Today, Alibaba-backed Moonshot AI released an upgrade of its flagship AI model, heating up a domestic arms race ahead of an expected rollout by Chinese AI hotshot DeepSeek. The latest iteration of Moonshot’s Kimi can process text, images, and videos simultaneously from a single prompt, the company said in a statement, aligning with a trend toward so-called omni models pioneered by industry leaders like OpenAI and Alphabet Inc.’s Google.

Moonshot AI Kimi website. Photographer: Raul Ariano/Bloomberg

………………………………………………………………………………………………………………………………………………………………………………………………………………………………………………..

Chinese AI models are rapidly narrowing the gap with Western counterparts in quality and accessibility. That shift is forcing U.S. AI leaders like Alphabet’s Google, Microsoft’s Copilot, OpenAI, and Anthropic to fight harder to maintain their technological lead in AI. That’s despite their humongous spending on AI data centers, related AI models and infrastructure.

In early 2025, investors seized on DeepSeek’s purportedly lean $5.6 million LM training bill as a sign that Nvidia’s high-end GPUs were already a relic and that U.S. hyperscalers had overspent on AI infrastructure. Instead, the opposite dynamic played out: as models became more capable and more efficient, usage exploded, proving out a classic Jevons’ Paradox and validating the massive data-center build-outs by Microsoft, Amazon, and Google.

The real competitive threat from DeepSeek and its peers is now coming from a different direction. Many Chinese foundation models are released as “open source” or “open weight” AI models, which makes them effectively free to download, easy to modify, and cheap to run at scale. By contrast, most leading U.S. players keep tight control over their systems, restricting access to paid APIs and higher-priced subscriptions that protect margins but limit diffusion.

That strategic divergence is visible in how these systems are actually used. U.S. models such as Google’s Gemini, Anthropic’s Claude, and OpenAI’s GPT series still dominate frontier benchmarks [1.] and high‑stakes reasoning tasks. Yet according to a recently published report by OpenRouter, a third-party AI model aggregator, and venture capital firm Andreessen Horowitz, Chinese open-source models have captured roughly 30% of the “working” AI market. They are especially strong in coding support and roleplay-style assistants—where developers and enterprises optimize for cost efficiency, local customization, and deployment freedom rather than raw leaderboard scores.

Note 1. A frontier benchmark for AI models is a high-difficulty evaluation designed to test the absolute limits of artificial intelligence in complex, often unsolved, reasoning tasks. FrontierMath, for example, is a prominent benchmark focusing on expert-level mathematics, requiring AI to solve hundreds of unpublished problems that challenge, rather than merely measure, current capabilities.

China’s open playbook:

China’s more permissive stance on model weights is not just a pricing strategy — it’s an acceleration strategy. Opening weights turns the broader developer community into an extension of the R&D pipeline, allowing users to inspect internals, pressure‑test safety, and push incremental improvements upstream.

As Kyle Miller at Georgetown’s Center for Security and Emerging Technology argues, China is effectively trading away some proprietary control to gain speed and breadth: by letting capability diffuse across the ecosystem, it can partially offset the difficulty of going head‑to‑head with tightly controlled U.S. champions like OpenAI and Anthropic. That diffusion logic is particularly potent in a system where state planners, big tech platforms, and startups are all incentivized to show visible progress in AI.

Even so, the performance gap has not vanished. Estimates compiled by Epoch AI suggest that Chinese models, on average, trail leading U.S. releases by about seven months. The window briefly narrowed during DeepSeek’s R1 launch in early 2025, when it looked like Chinese labs might have structurally compressed the lag; since then, the gap has widened again as U.S. firms have pushed ahead at the frontier.

Capital, chips, and the power problem:

The reason the U.S. lead has held is massive AI infrastructure spending. Consensus forecasts put capital expenditure by largely American hyperscalers at roughly $400 billion in 2025 and more than $520 billion in 2026, according to Goldman Sachs Research. By comparison, UBS analysts estimate that China’s major internet platforms collectively spent only about $57 billion last year—a fraction of U.S. outlays, even if headline Chinese policy rhetoric around AI is more aggressive.

But sustaining that level of investment runs into a physical constraint that can’t be hand‑waved away: electricity. The newest data-center designs draw more than a gigawatt of power each—about the output of a nuclear reactor—turning grid capacity into a strategic bottleneck. China now generates more than twice as much power as the U.S., and its centralized planning system can more readily steer incremental capacity toward AI clusters than America’s fragmented, heavily regulated electricity market.

That asymmetry is already shaping how some on Wall Street frame the race. Christopher Wood, global head of equity strategy at Jefferies, recently reiterated that China’s combination of open‑source models and abundant cheap power makes it a structurally formidable AI competitor. In his view, the “DeepSeek moment” of early last year remains a warning that markets have largely chosen to ignore as they rotate back into U.S. AI mega‑caps.

A fragile U.S. AI advantage:

For now, U.S. companies still control the most important chokepoint in the stack: advanced AI accelerators. Access to Nvidia’s cutting‑edge GPUs remains a decisive advantage. Yesterday, Microsoft announced the Maia 200 chip – its first silicon and system platform optimized specifically for AI inference. The chip was designed for efficiency, both in terms of its ability to deliver tokens per dollar and performance per watt of power used.

“Maia 200 can deliver 30% better performance per dollar than the latest generation hardware in our fleet today,” Microsoft EVP for Cloud and AI Scott Guthrie wrote in a blog post.

Image Credit: Microsoft

……………………………………………………………………………………………………………………………………………………………………………………………………………………………….

Leading Chinese AI research labs have struggled to match training results using only domestically designed silicon. DeepSeek, which is developing the successor to its flagship model and is widely expected to release it around Lunar New Year, reportedly experimented with chips from Huawei and other local vendors before concluding that performance was inadequate and turning to Nvidia GPUs for at least part of the training run.

That reliance underscores the limits of China’s current self‑reliance push—but it also shouldn’t be comforting to U.S. strategists. Chinese firms are actively working around the hardware gap, not waiting for it to close. DeepSeek’s latest research focuses on training larger models with fewer chips through more efficient memory design, an incremental but important reminder that architectural innovation can partially offset disadvantages in raw compute.

From a technology‑editorial perspective, the underlying story is not simply “China versus the U.S.” at the model frontier. It is a clash between two AI industrial strategies: an American approach that concentrates capital, compute, and control in a handful of tightly integrated platforms, and a Chinese approach that leans on open weights, diffusion, and state‑backed infrastructure to pull the broader ecosystem forward.

The question for 2026 is whether U.S. AI firms’ lead in capability and chips can keep outrunning China’s advantages in openness and power—or whether the market will again wait for a shock like DeepSeek to re‑price that risk.

DeepSeek and Other Chinese AI Models:

DeepSeek published research this month outlining a method of training larger models using fewer chips through a more efficient memory design. “We view DeepSeek’s architecture as a new, promising engineering solution that could enable continued model scaling without a proportional increase in GPU capacity,” wrote UBS analyst Timothy Arcuri.

Export controls haven’t prevented Chinese companies from training advanced models, but challenges emerge when the models are deployed at scale. Zhipu AI, which released its open-weight GLM 4.7 model in December, said this month it was rationing sales of its coding product to 20% of previous capacity after demand from users overwhelmed its servers.

Moonshot, Zhipu AI and MiniMax Group Inc are among a handful of AI LM front-runners in a hotly contested battle among Chinese large language model makers, which at one point was dubbed the “War of One Hundred Models.”

“I don’t see compute constraints limiting [Chinese companies’] ability to make models that are better and compete near the U.S. frontier,” Georgetown’s Miller says. “I would say compute constraints hit on the wider ecosystem level when it comes to deployment.”

Gaining access to Nvidia AI chips:

U.S. President Donald Trump’s plan to allow Nvidia to sell its H200 chips to China could be pivotal. Alibaba Group and ByteDance, TikTok’s parent company, have privately indicated interest in ordering more than 200,000 units each, Bloomberg reported.  The H200 outperforms any Chinese-produced AI chip, with a roughly 32% processing-power advantage over Huawei’s Ascend 910C.

With access to Nvidia AI chips, Chinese labs could build AI-training supercomputers as capable as American ones at roughly 50% extra cost, according to the Institute for Progress. Subsidies by the Chinese government could cover that differential, leveling the playing field, the institute says.

Conclusions:

A combination of open-source innovation and loosened chip controls could create a cheaper, more capable Chinese AI ecosystem. The possibility is emerging just as OpenAI and Anthropic consider public stock listings (IPOs) and U.S. hyperscalers such as Microsoft and Meta Platforms face pressure to justify heavy spending.

The risk for U.S. AI leaders is no longer theoretical; China’s open‑weight, low‑cost model ecosystem is already eroding the moat that Google, OpenAI, and Anthropic thought they were building. By prioritizing diffusion over tight control, Chinese firms are seeding a broad developer base, compressing iteration cycles, and normalizing expectations that powerful models should be cheap—or effectively free—to adapt and deploy.

U.S. AI leaders could face pressure on pricing and profit margins from China AI competitors while having to deal with AI infrastructure costs and power constraints. Their AI advantage remains real, but fragile—highly exposed to regulatory shifts, export controls, and any breakthrough in China’s workarounds on hardware and training efficiency. The uncomfortable prospect for U.S. AI incumbents is that they could win the race for the best models and still lose ground in the market if China’s diffusion‑driven strategy defines how AI is actually consumed at scale.

…………………………………………………………………………………………………………………………………………………………………………………………………………………..

References:

https://www.barrons.com/articles/deepseek-ai-gemini-chatgpt-stocks-ccde892c

https://blogs.microsoft.com/blog/2026/01/26/maia-200-the-ai-accelerator-built-for-inference/

https://www.bloomberg.com/news/articles/2026-01-27/china-s-moonshot-unveils-new-ai-model-ahead-of-deepseek-release

https://www.scmp.com/tech/tech-trends/article/3335602/chinas-open-source-models-make-30-global-ai-usage-led-qwen-and-deepseek

China gaining on U.S. in AI technology arms race- silicon, models and research

U.S. export controls on Nvidia H20 AI chips enables Huawei’s 910C GPU to be favored by AI tech giants in China

Goldman Sachs: Big 3 China telecom operators are the biggest beneficiaries of China’s AI boom via DeepSeek models; China Mobile’s ‘AI+NETWORK’ strategy

Bloomberg: China Lures Billionaires Into Race to Catch U.S. in AI

Comparing AI Native mode in 6G (IMT 2030) vs AI Overlay/Add-On status in 5G (IMT 2020)

Telecom operators investing in Agentic AI while Self Organizing Network AI market set for rapid growth

Telecom companies are planning to use Agentic AI [1.] for customer experience and network automation. A recent RADCOM survey shows 71% of network operators plan to deploy agentic AI in 2026, while 14% have already begun, prioritizing areas that directly influence trust and customer satisfaction: security and fraud prevention (57%) and customer service and support (56%).  The top use cases are automated customer complaint resolution and autonomous fault resolution.

Operators are betting on agentic AI to remove friction before customers feel it, with the highest-value use cases reflecting this shift, including:

  • 57% – automated customer complaint resolution
  • 54% – autonomous fault resolution before it impacts service
  • 52% – predicting experience to prevent churn

This technology is shifting networks from simply detecting issues to preventing them before customers notice. In contact centers, 2026 is expected to see a rise in human and AI agent collaboration to improve efficiency and customer service.

Note 1.  Agentic AI refers to autonomous artificial intelligence systems that can perceive, reason, plan, and act independently to achieve complex goals with minimal human intervention, going beyond simple command-response to manage multi-step tasks, use various tools, and adapt to new information for proactive automation in dynamic environments. These intelligent agents function like digital coworkers, coordinating internally and with other systems to execute sophisticated workflows.
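As a rough illustration of that definition, the toy Python loop below sketches the perceive → reason/plan → act-with-tools → adapt cycle that distinguishes agentic AI from single-shot prompting. All names here (llm, tools, the JSON action format) are hypothetical placeholders, not any vendor’s actual API.

```python
import json

def agent_loop(goal: str, llm, tools: dict, max_steps: int = 10):
    """Toy agentic cycle: the model plans an action, a tool executes it,
    and the observed result feeds the next planning step."""
    history = [{"role": "user", "content": goal}]
    for _ in range(max_steps):
        # Reason/plan: the model returns JSON such as
        # {"tool": "resolve_fault", "args": {...}} or {"done": "summary"}
        action = json.loads(llm(history))
        if "done" in action:
            return action["done"]                          # goal achieved
        result = tools[action["tool"]](**action["args"])   # act via a tool
        history.append({"role": "tool", "content": str(result)})  # adapt
    return "stopped: step budget exhausted"
```

An operator’s “autonomous fault resolution” agent would plug network telemetry and ticketing systems in as tools; the loop structure stays the same.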

……………………………………………………………………………………………………………………………………………………………………………………………

ResearchAndMarkets.com has just published a “Self-Organizing Network Artificial Intelligence (AI) Global Market Report 2025.” The market research firm says the self-organizing network AI [2.] market is forecast to expand from $5.19 billion in 2024 to $6.18 billion in 2025, at a CAGR of 19.2%. This surge is driven by the integration of machine learning and AI in telecom networks, investment in smart network management, and growing demand for features like self-healing, self-optimization, and predictive maintenance, alongside the expansion of 5G, increasing automation demands, and AI integration for network optimization. Opportunities include AI-driven RRM and predictive maintenance. Asia-Pacific emerges as the fastest-growing region, boosting telecom innovations amid global trade shifts.

Note 2.  Self-organizing network AI leverages software, hardware, and services to dynamically optimize and manage telecom networks, applicable across various network types and deployment modes. The market encompasses a broad range of solutions, from network optimization software to AI-driven planning products, underscoring its expansive potential.

Looking further ahead, the market is expected to reach $12.32 billion by 2029, with a CAGR of 18.8%. Key drivers during this period include heightened demand for automation, increased 5G deployments, and growing network densification, accompanied by rising data traffic and subscriber numbers. Trends such as AI-driven network automation advancements, machine learning integration for real-time optimization, and the rise of generative AI for analytics are reshaping the landscape.
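The reported figures are internally consistent, as a quick compounding check shows (a sketch in Python, using only the numbers quoted above):

```python
def project(value_bn: float, cagr: float, years: int) -> float:
    """Compound a market size (in $ billions) at an annual growth rate."""
    return value_bn * (1 + cagr) ** years

print(f"{project(5.19, 0.192, 1):.2f}")  # ~6.19 -> matches the $6.18B 2025 estimate
print(f"{project(6.18, 0.188, 4):.2f}")  # ~12.31 -> matches the $12.32B 2029 forecast
```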

The expansion of 5G networks plays a pivotal role in propelling this growth. These networks, characterized by high-speed data and ultra-low latency, significantly enhance the capabilities of self-organizing network AI. The integration facilitates real-time data processing, supporting automation, optimization, and predictive maintenance, thereby improving service quality and user experience. A notable development in 2023 saw UK outdoor 5G coverage rise to 85-93%, reflecting growing demand and technological advancement.

Huawei Technologies and other major tech companies are pioneering innovative solutions like AI-driven radio resource management (RRM), which optimizes network performance and enhances user experience. These solutions rely on AI and machine learning for dynamic spectrum and network resource management. For instance, Huawei’s AI Core Network, introduced at MWC 2025, marks a substantial leap in intelligent telecommunications, integrating AI into core systems for seamless connectivity and real-time decision-making.

Strategic acquisitions are also shaping the market, exemplified by Amdocs Limited acquiring TEOCO Corporation in 2023 to bolster its network optimization and analytics capabilities. This acquisition aims to enhance end-to-end network intelligence and operational efficiency.

Leading players in the market include Huawei, Cisco Systems Inc., Qualcomm Incorporated, and many others, driving innovation and competition. Europe held the largest market share in 2024, with Asia-Pacific poised to be the fastest-growing region through the forecast period.

References:

Operator Priorities for 2026 and Beyond: Data, Automation, Customer Experience

https://uk.finance.yahoo.com/news/self-organizing-network-artificial-intelligence-105400706.html

Ericsson integrates agentic AI into its NetCloud platform for self healing and autonomous 5G private network

Agentic AI and the Future of Communications for Autonomous Vehicles (V2X)

IDC Report: Telecom Operators Turn to AI to Boost EBITDA Margins

Omdia: How telcos will evolve in the AI era

Palo Alto Networks and Google Cloud expand partnership with advanced AI infrastructure and cloud security

Hyperscaler capex > $600 bn in 2026, a 36% increase over 2025, while global spending on cloud infrastructure services skyrockets

Hyperscaler capex for the “big five” (Amazon, Alphabet/Google, Microsoft, Meta/Facebook, Oracle) is now widely forecast to exceed $600 bn in 2026, a 36% increase over 2025. Roughly 75%, or $450 bn, of that spend is directly tied to AI infrastructure (i.e., servers, GPUs, datacenters, equipment), rather than traditional cloud. Hyperscalers are increasingly leaning on debt markets to bridge the gap between rapidly rising AI capex budgets and internal free cash flow, transforming historically cash-funded business models into ones utilizing leverage, albeit with still very strong balance sheets. Aggregate capex for the “big five,” after buybacks and dividends are included, is now above projected cash flows, necessitating external funding.
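A quick back-of-the-envelope check of those figures (a sketch in Python, using only the numbers quoted above):

```python
capex_2026 = 600e9                 # > $600 bn forecast for the "big five"
capex_2025 = capex_2026 / 1.36     # implied by the 36% year-over-year increase
ai_capex_2026 = 0.75 * capex_2026  # ~75% tied to AI infrastructure

print(f"implied 2025 capex: ${capex_2025 / 1e9:.0f} bn")    # ~$441 bn
print(f"2026 AI-related   : ${ai_capex_2026 / 1e9:.0f} bn") # $450 bn, as cited
```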

……………………………………………………………………………………………………………………………………………………………………………………………………………………..

According to market research from Omdia (owned by Informa), global spending on cloud infrastructure services reached $102.6 billion in Q3 2025 — a 25% year-on-year increase. It was the fifth consecutive quarter in which cloud spending growth remained above 20%. Omdia says this “reflects a significant shift in the technology landscape as enterprise demand for AI moves beyond early experimentation toward scaled production deployment.” The top three providers – AWS, Microsoft Azure, and Google Cloud – maintained their market rankings from the previous quarter and collectively accounted for 66% of global cloud infrastructure spending. Together, the three firms had 29% year-on-year growth in their cloud spending.

Hyperscaler AI strategies are shifting from a focus on incremental model performance to platform-driven, production-ready approaches. Enterprises are now evaluating AI platforms based not solely on model capabilities, but also on their support for multi-model strategies and agent-based applications. This evolution is accelerating hyperscalers’ move toward platform-level AI capabilities. According to the report, Amazon Web Services (AWS), Microsoft Azure, and Google Cloud are integrating proprietary foundation models with a growing range of third-party and open-weight models to meet these new demands.

“Collaboration across the ecosystem remains critical,” said Rachel Brindley, Senior Director at Omdia. “Multi-model support is increasingly viewed as a production requirement rather than a feature, as enterprises seek resilience, cost control, and deployment flexibility across generative AI workloads.”

Facing challenges with practical application, major cloud providers are boosting resources for AI agent lifecycle management, including creation and operationalization, as enterprise-level deployment proves more intricate than anticipated.

Yi Zhang, Senior Analyst at Omdia, said, “Many enterprises still lack standardized building blocks that can support business continuity, customer experience, and compliance at the same time, which is slowing the real-world deployment of AI agents. This is where hyperscalers are increasingly stepping in, using platform-led approaches to make it easier for enterprises to build and run agents in production environments.”

This past October, Omdia released a report forecasting that growth of cloud adoption among communications service providers (CSPs) will double this year. It also forecasted a compound annual growth rate (CAGR) of 7.3% to 2030, resulting in the telco cloud market being worth $24.8 billion.

………………………………………………………………………………………………………………………………………………………………………………………………………………………..

Editor’s Note:  Does anyone remember the stupendous increase in fiber optic spending from 1998-2001 till that bubble burst?  Caveat Emptor!

………………………………………………………………………………………………………………………………………………………………………………………………………………………..

References:

https://www.mufgamericas.com/sites/default/files/document/2025-12/AI_Chart_Weekly_12_19_Financing_the_AI_Supercycle.pdf

https://www.telecoms.com/public-cloud/global-cloud-infrastructure-spend-up-25-in-q3

https://www.telecoms.com/public-cloud/telco-investment-in-cloud-infrastructure-is-accelerating-omdia

AI infrastructure spending boom: a path towards AGI or speculative bubble?

Expose: AI is more than a bubble; it’s a data center debt bomb

Can the debt fueling the new wave of AI infrastructure buildouts ever be repaid?

AI spending boom accelerates: Big tech to invest an aggregate of $400 billion in 2025; much more in 2026!

Gartner: AI spending >$2 trillion in 2026 driven by hyperscalers data center investments

AI spending is surging; companies accelerate AI adoption, but job cuts loom large

Will billions of dollars big tech is spending on Gen AI data centers produce a decent ROI?

Big tech spending on AI data centers and infrastructure vs the fiber optic buildout during the dot-com boom (& bust)

Canalys & Gartner: AI investments drive growth in cloud infrastructure spending

Sovereign AI infrastructure for telecom companies: implementation and challenges

AI Echo Chamber: “Upstream AI” companies huge spending fuels profit growth for “Downstream AI” firms

Custom AI Chips: Powering the next wave of Intelligent Computing

 

Sovereign AI infrastructure for telecom companies: implementation and challenges

Sovereign AI infrastructure refers to the domestic capability of a nation or an organization to own and control the entire technology stack for artificial intelligence (AI) systems within its own borders, subject to local laws and governance. This includes the physical data centers, specialized hardware (like GPUs), software, data, and skilled workforce.  Sovereign AI infrastructure involves a full “stack” designed to ensure national control and reduce reliance on foreign providers. A few key features:

  • Data Sovereignty: Policies and technical controls (e.g., data localization, encryption) to ensure that sensitive data used for training and inference remains within the jurisdiction.
  • Local Models and Frameworks: Development and hosting of proprietary or locally tailored AI models and software frameworks that align with national values, languages, and ethical standards.
  • Workforce Development: Investing in domestic talent, including data scientists, engineers, and legal experts, to build and maintain the local AI ecosystem.
  • Regulatory Framework: A comprehensive legal and ethical framework for AI development and deployment that ensures compliance with national laws and standards.

Why It’s Important – The pursuit of sovereign AI infrastructure is driven by several strategic considerations for both governments and private enterprises:

  • National Security: To ensure that critical systems in defense, intelligence, and public infrastructure are not dependent on potentially adversarial foreign technologies or subject to extraterritorial access laws (like the U.S. CLOUD Act).
  • Economic Competitiveness: To foster a domestic tech industry, create high-skilled jobs, protect intellectual property, and capture the significant economic benefits of AI-driven growth.
  • Data Privacy and Compliance: To comply with stringent local data protection regulations (e.g., GDPR in the EU) and build public trust by ensuring citizen data is handled securely and according to local laws.
  • Cultural Preservation: To train AI models on local datasets and languages, preserving cultural nuances and avoiding bias found in generalized, globally trained models.

Image Credit: Nvidia

………………………………………………………………………………………………………………………………………………………………………………………………………..

Governments around the world are starting to build sovereign AI infrastructure, and according to a new report from Morningstar DBRS, major telecommunications companies are uniquely positioned to benefit from that shift. Here are a few takeaways from the report:

  • Sovereign AI funding opens a new growth path for telcos – Governments investing in domestic AI infrastructure are increasingly turning to operators, whose network and regulatory strengths position them to capture a large share of this emerging market.
  • Telcos’ capabilities align with sovereignty needs – Their expertise in large-scale networks, local presence, and established government relationships give them an edge over hyperscalers for sensitive, sovereignty-focused AI projects.
  • Early adopters gain advantage – Operators in Canada and Europe are already moving into sovereign AI, positioning themselves to secure higher-margin enterprise and government workloads as national AI buildouts accelerate.
Infrastructure advantages provide a strategic head start for telecommunications companies. Telcos currently manage extensive data centers, fiber optic networks, and computing infrastructure nationwide. Leveraging these established physical assets can significantly reduce the barriers to implementing sovereign AI solutions, contrasting favorably with the greenfield development required by other entities. 
The sophisticated data governance expertise within telcos is well-suited for the stringent requirements of sovereign AI. Their decades of experience managing and processing massive datasets have resulted in mature data handling practices directly applicable to the data infrastructure demands of secure, sovereign AI systems.
Furthermore, existing edge computing capabilities offer a distinct competitive advantage. Telecom networks facilitate localized AI processing near data sources while adhering to data residency requirements—a crucial combination for sovereign AI deployments.  This translates to “embedding AI within their network fabric for both optimization and distributed inference,” enabling AI consumption that offers lower latency, reduced cost, and applicability for high-sensitivity use cases in sectors like government and national security.
The opportunity to integrate AI workloads with emerging 5G and 6G infrastructures creates additional strategic value. Sovereign AI represents a pivotal opportunity for telecom operators to position themselves as central players in national AI strategies, evolving their role beyond primary connectivity provisioning.
……………………………………………………………………………………………………………………………………………………………………………….
Implementing sovereign AI presents substantial challenges despite its strategic potential. Key bottlenecks and technical complexities include:
  • Infrastructure Demands: Building robust domestic AI ecosystems requires specialized expertise spanning hardware, software, data governance, and policy.
  • Resource Constraints: Dr. Matt Hasan, CEO at aiRESULTS and a former AT&T executive, highlights specific bottlenecks:
    • Compute Density at Scale.
    • Spectrum Allocation amidst political pressures.
    • Energy Demand exceeding existing grid capacity.
  • Intensified Reliability Requirements: Sovereign AI implementation places heightened demands on telecom providers for system uptime, reliability, quality, and data privacy. This necessitates a focus on efficient power consumption, resilient routing and backups, robust encryption, and comprehensive cybersecurity measures.
  • Supply Chain Vulnerabilities: Geopolitical tensions introduce risks to the supply of critical components such as GPUs and specialized chips, underscoring the interconnected nature of global hardware supply chains.
  • Rapid Technology Evolution: The pace of AI advancement mandates continuous investment and technical agility to ensure sovereign deployments remain current.
Competitive landscape dynamics:
  • The interplay between global hyperscalers and regional telecom operators is expected to shift.
  • Hasan predicts a collaborative model, with regional telcos leveraging their position as sovereign partners through joint ventures, rather than an outright displacement of hyperscalers.
Ultimately, the objective of sovereign AI is strategic resilience, not complete digital isolation. Nations must judiciously balance sovereignty goals with the advantages of global technological collaboration. For telecom operators, adeptly managing these complexities and investment demands will define sovereign AI’s realization as a viable growth opportunity.
…………………………………………………………………………………………………………………………………………………………………………….

References:

Telcos Across Five Continents Are Building NVIDIA-Powered Sovereign AI Infrastructure

https://dbrs.morningstar.com/research/468155/telecoms-are-well-placed-to-benefit-from-sovereign-ai-infrastructure-plans

How “sovereign AI” could shape telecom

https://www.rcrwireless.com/20251202/ai/sovereign-ai-telcos

Subsea cable systems: the new high-capacity, high-resilience backbone of the AI-driven global network

Analysis: OpenAI and Deutsche Telekom launch multi-year AI collaboration

AI infrastructure spending boom: a path towards AGI or speculative bubble?

Market research firms Omdia and Dell’Oro: impact of 6G and AI investments on telcos

Omdia: How telcos will evolve in the AI era

OpenAI announces new open weight, open source GPT models which Orange will deploy

Expose: AI is more than a bubble; it’s a data center debt bomb

Can the debt fueling the new wave of AI infrastructure buildouts ever be repaid?

Custom AI Chips: Powering the next wave of Intelligent Computing

AI spending boom accelerates: Big tech to invest an aggregate of $400 billion in 2025; much more in 2026!

IBM and Groq Partner to Accelerate Enterprise AI Inference Capabilities

Dell’Oro: Analysis of the Nokia-NVIDIA-partnership on AI RAN

 

Analysis: OpenAI and Deutsche Telekom launch multi-year AI collaboration

Deutsche Telekom (DT) has formalized a strategic, multi-year collaboration with OpenAI to integrate advanced artificial intelligence (AI) solutions across its internal operations and customer engagement platforms. The partnership aims to co-develop “simple, personal, and multi-lingual AI experiences” focused on enhancing communication and productivity. Initial pilot programs are slated for deployment in Q1 2026. AI will also play a larger role in customer care, internal copilots, and network operations as the Group advances toward more autonomous, self-healing networks. DT plans a company-wide rollout of ChatGPT Enterprise, leveraging AI to streamline core functions including:

  • Customer Care: Deploying sophisticated virtual assistants to manage billing inquiries, service outages, plan modifications, roaming support, and device troubleshooting [1].
  • Internal Operations: Utilizing AI copilots to increase internal efficiency.
  • Network Management: Optimizing core network provisioning and operations.
This collaboration underscores DT’s long-standing strategic imperative to establish itself as a leader in European cloud and AI infrastructure, emphasizing digital sovereignty. Some historical initiatives supporting this strategy include:
  • Sovereign Cloud (2021): DT’s T-Systems division partnered with Google Cloud to offer sovereign cloud services.
  • T Cloud Suite (Early 2025): The launch of a comprehensive suite providing sovereign public, private, and AI cloud options leveraging hybrid infrastructure.
  • Industrial AI Cloud (Early 2025): A collaboration with Nvidia to build a dedicated industrial AI data center in Munich, scheduled for Q1 2026 operations.

The integration of OpenAI technology strategically positions DT to offer a comprehensive value proposition to enterprise clients, combining connectivity, data center capabilities, and specialized AI software under a sovereign framework, according to Recon Analytics Founder Roger Entner.  “There are not that many AI data centers in Europe and in Germany,” Entner explained, noting this leaves the door open for operators like DT to fill in the gap. “In the U.S. you have a ton of data centers that you can do AI. Therefore, it doesn’t make sense for a network operator to have also a data center. They tried to compete with hyperscalers, and it failed. And the scale in the U.S. is a lot bigger than in Europe.”
OpenAI and Deutsche Telekom collaborate. © Deutsche Telekom
…………………………………………………………………………………………………………………………………………………………….
Tekonyx President and Chief Research Officer Sid Nag suggests the integration could extend to employing ChatGPT-based coding tools for developing proprietary Operational Support Systems (OSS) and Business Support Systems (BSS).   He anticipates the partnership will generate new revenue streams through offerings including:
  • Edge AI compute services for enterprises.
  • Vertical AI solutions tailored for healthcare, retail, and manufacturing sectors.
  • Integrated private 5G and AI bundles for industrial logistical hubs.

“Telcos – if they execute – will have a big play in the edge inferencing space as well as providing hosting and colo services that can host domain specific SLMs that need to be run closer to the user data,” he said. “Furthermore, telcos will play a role in connectivity services across Neocloud providers such as CoreWeave, Lambda Labs, Digital Ocean, Vast.AI etc. OpenAI does not want to lose the opportunity to partner with telcos so they are striking early,” Nag added.

Other Voices:

  • Roger Entner notes the model is highly applicable to European incumbents (e.g., Orange, Telefonica) due to the relative scarcity of existing AI data centers in the region, allowing operators to fill a critical infrastructure gap.  Conversely, the model is less viable for U.S. operators, where hyperscalers already dominate the extensive data center market.
  • AvidThink Founder and colleague Roy Chua cautions that while DT presents a robust “reference blueprint,” replicating this strategy requires significant scale, substantial financial investment, and regulatory alignment—factors not easily accessible to all network operators.
  • Futurum Group VP and Practice Lead Nick Patience told Fierce Network, “This deal elevates DT from being a user of AI to being a co-developer, which is pretty significant. DT is one of the few operators building a full-stack AI story. This is an example of OpenAI treating telcos as high-scale distribution and data channels – customer care, billing, network telemetry, national reach and government relationships. This suggests OpenAI is deliberately building an operator channel in key regions (U.S., Korea, EU) but still in partnership with existing cloud and infra providers rather than displacing them.”
………………………………………………………………………………………………………………………………………………………….
OpenAI’s Telco Deals:

OpenAI has established significant partnerships with several telecom network providers and related technology companies to integrate AI into network operations, enhance customer experience, and develop new AI-native platforms. Those deals and collaborations include:

  • T-Mobile: T-Mobile has a multi-year agreement with OpenAI and is actively testing the integration of AI (specifically IntentCX) into its business operations for customer service improvements. T-Mobile is also collaborating with Nokia and Nvidia on AI-RAN (Radio Access Network) technologies for 6G innovation.
  • SK Telecom (SKT): SK Telecom has an in-house AI company and collaborates with OpenAI and other AI leaders like Anthropic to enhance its AI capabilities, build sovereign AI infrastructure, and explore new services for its customers in South Korea and globally. They are also reportedly integrating Perplexity into their offerings.
  • Deutsche Telekom (DT): DT is partnering with OpenAI to offer ChatGPT Enterprise across its business to help teams work more effectively, improve customer service, and automate network operations.
  • Circles: This global telco technology company and OpenAI announced a strategic global collaboration to build a fully AI-native telco SaaS platform, which will first launch in Singapore. The platform aims to revolutionize the consumer experience and drive operational efficiencies for telcos worldwide.
  • Rakuten: Rakuten and OpenAI launched a strategic partnership to develop AI tools and a platform aimed at leveraging Rakuten’s Open RAN expertise to revolutionize the use of AI in telecommunications.
  • Orange: Orange is working with OpenAI to drive new use cases for enterprise needs, manage networks, and enable innovative customer care solutions, including those that support African regional languages.
  • Indian Telecoms (Reliance Jio, Airtel): Telecom providers in India are integrating AI tools from companies like Google and Perplexity into their mobile subscriptions, providing millions of users access to advanced intelligence resources.
  • Nokia & Nvidia: In a broader industry collaboration, Nvidia invested $1 billion in Nokia to add Nvidia-powered AI-RAN products to Nokia’s portfolio, enabling telecom service providers to launch AI-native 5G-Advanced and 6G networks. This partnership also includes T-Mobile US for testing.

Conclusions:

With more than 261 million mobile customers globally, Deutsche Telekom provides a strong foundation to bring AI into everyday use at scale. The new collaboration marks the next step in Deutsche Telekom’s AI journey – moving from early pilots to large-scale products that make AI useful for everyone.

………………………………………………………………………………………………………………………………………………………………….

Deutsche Telekom: successful completion of the 6G-TakeOff project with “3D networks”

Deutsche Telekom and Google Cloud partner on “RAN Guardian” AI agent

Deutsche Telekom offers 5G mmWave for industrial customers in Germany on 5G SA network

Deutsche Telekom migrates IP-based voice telephony platform to the cloud

Open AI raises $8.3B and is valued at $300B; AI speculative mania rivals Dot-com bubble

OpenAI and Broadcom in $10B deal to make custom AI chips

Custom AI Chips: Powering the next wave of Intelligent Computing

OpenAI orders HBM chips from SK Hynix & Samsung for Stargate UAE project

OpenAI announces new open weight, open source GPT models which Orange will deploy

OpenAI partners with G42 to build giant data center for Stargate UAE project

Reuters & Bloomberg: OpenAI to design “inference AI” chip with Broadcom and TSMC

Custom AI Chips: Powering the next wave of Intelligent Computing

by the Indxx team of market researchers with Alan J Weissberger

The Market for AI Related Semiconductors:

Several market research firms and banks forecast that revenue from AI-related semiconductors will grow at about 18% annually over the next few years—five times faster than non-AI semiconductor market segments.

  • IDC forecasts that global AI hardware spending, including chip demand, will grow at an annual rate of 18%.
  • Morgan Stanley analysts predict that AI-related semiconductor revenue will grow at an 18% annual rate for one specific company, Taiwan Semiconductor (TSMC).
  • Infosys notes that data center semiconductor sales are projected to grow at an 18% CAGR.
  • MarketResearch.biz and the IEEE IRDS predict an 18% annual growth rate for AI accelerator chips.
  • Citi also forecasts aggregate chip sales for potential AI workloads to grow at a CAGR of 18% through 2030. 

AI-focused chips are expected to represent nearly 20% of global semiconductor demand in 2025, contributing approximately $67 billion in revenue [1].  A separate, more narrowly scoped estimate puts the global AI chip market at $40.79 billion in 2025 [2], expanding rapidly toward $165 billion by 2030.
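As a quick arithmetic aside, the sketch below (a minimal Python example, using only the dollar figures from the estimates above) shows what an 18% CAGR implies for the ~$67 billion 2025 base, and backs out the growth rate embedded in the $40.79B-to-$165B projection:

```python
# Compound annual growth rate (CAGR) arithmetic for the figures cited above.
# Dollar figures are the sources' estimates [1][2], not ours; scopes differ.

def project(value, cagr, years):
    """Project a value forward at a constant compound annual growth rate."""
    return value * (1 + cagr) ** years

def implied_cagr(start, end, years):
    """Back out the CAGR implied by start and end values over a horizon."""
    return (end / start) ** (1 / years) - 1

# The ~18% CAGR consensus applied to the ~$67B 2025 revenue estimate [1]:
for year in range(2025, 2031):
    print(year, round(project(67, 0.18, year - 2025), 1), "B USD")  # 2030: ~153.3

# CAGR embedded in the $40.79B (2025) -> $165B (2030) projection [2]:
print(round(implied_cagr(40.79, 165, 5) * 100, 1), "% implied CAGR")  # ~32.2%
```

The ~32% CAGR implied by the narrower projection [2] is well above the ~18% consensus cited above, a reminder that these sources measure differently scoped market segments.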

…………………………………………………………………………………………………………………………………………………

Types of AI Custom Chips:

Artificial intelligence is advancing at a speed that traditional computing hardware can no longer keep pace with. To meet the demands of massive AI models, lower latency, and higher computing efficiency, companies are increasingly turning to custom AI chips, which are purpose-built processors optimized for neural networks, training, and inference workloads.

Those AI chips range from Application-Specific Integrated Circuits (ASICs) and Field-Programmable Gate Arrays (FPGAs) to Neural Processing Units (NPUs) and Google’s Tensor Processing Units (TPUs).  They are optimized for core AI tasks like matrix multiplications and convolutions, delivering far higher performance-per-watt than CPUs or GPUs. This efficiency is key as AI workloads grow exponentially with the rise of Large Language Models (LLMs) and generative AI.
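To make the matrix-multiplication point concrete, here is a minimal FLOP-counting sketch; the layer dimensions below are illustrative assumptions on our part, not taken from any particular model:

```python
# Why AI accelerators target matrix multiplication: a naive FLOP count.
# Layer dimensions below are illustrative assumptions, not from a real model.

def matmul_flops(m, n, k):
    """A dense (m x k) @ (k x n) matrix multiply costs ~2*m*n*k FLOPs
    (one multiply plus one add per inner-product term)."""
    return 2 * m * n * k

batch_tokens = 4096    # assumed tokens processed per step
d_model = 8192         # assumed hidden dimension
d_ff = 4 * d_model     # common feed-forward expansion factor

# Two projections of one transformer-style feed-forward block:
flops = (matmul_flops(batch_tokens, d_ff, d_model) +
         matmul_flops(batch_tokens, d_model, d_ff))
print(f"{flops / 1e12:.1f} TFLOPs for one feed-forward block")  # ~4.4 TFLOPs
```

Because nearly all of this arithmetic takes one regular, highly parallel form, purpose-built chips can devote their silicon to matrix units rather than general-purpose control logic, which is the source of the performance-per-watt advantage cited above.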

OpenAI – Broadcom Deal:

Perhaps the biggest custom AI chip effort is an OpenAI partnership with Broadcom, a multi-year, multi-billion-dollar deal announced in October 2025.  In this arrangement, OpenAI will design the hardware, embedding its AI model knowledge directly into the silicon for efficiency, and Broadcom will develop and deploy the custom chips.

Here’s a summary of the partnership:

  • Division of Labor: OpenAI designs its own AI accelerators and systems, embedding its AI insights directly into the hardware. Broadcom develops and deploys these custom chips and the surrounding infrastructure, using its Ethernet networking solutions to scale the systems.
  • Massive Scale: The agreement covers 10 gigawatts (GW) of AI compute, with deployments expected over four years, potentially extending to 2029 (a rough sizing sketch follows this list).
  • Cost Savings: This custom silicon strategy aims to significantly reduce costs compared to off-the-shelf Nvidia or AMD chips, potentially saving 30-40% on large-scale deployments.
  • Strategic Goal: The collaboration allows OpenAI to build tailored hardware to meet the intense demands of developing frontier AI models and products, reducing reliance on other chip vendors.
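To put 10 GW in perspective, here is a back-of-the-envelope sizing sketch. The per-accelerator power draw and the facility overhead (PUE) are illustrative assumptions on our part, not figures from the OpenAI–Broadcom announcement:

```python
# Back-of-the-envelope sizing of a 10 GW AI-compute commitment.
# All per-unit figures below are illustrative assumptions.

total_power_gw = 10.0
pue = 1.3                      # assumed power usage effectiveness (cooling, etc.)
watts_per_accelerator = 1200   # assumed draw per accelerator incl. host share

it_power_w = total_power_gw * 1e9 / pue            # power left for IT load
accelerators = it_power_w / watts_per_accelerator
print(f"~{accelerators / 1e6:.1f} million accelerators")  # ~6.4 million
```

At anything like that unit count, even the 30-40% per-unit cost saving cited above translates into very large absolute savings, which is the economic logic behind the custom-silicon bet.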

AI Silicon Market Share of Key Players:

  • Nvidia, with its extremely popular AI GPUs and CUDA software ecosystem, is expected to maintain its market leadership. It currently holds an estimated 86% share of the AI GPU market segment according to one source [2]; other estimates put Nvidia’s AI chip market share between 80% and 92%.
  • AMD holds a smaller, but growing, AI chip market share, with estimates placing its discrete GPU market share around 4% to 7% in early to mid-2025. AMD is projected to grow its AI chip division significantly, aiming for a double-digit share with products like the MI300X.  In response to the extraordinary demand for advanced AI processors, AMD CEO Dr. Lisa Su presented a strategic initiative to the Board of Directors: pivot the company’s core operational focus toward artificial intelligence, arguing that the “insatiable demand for compute” represents a sustained market trend. The reorientation has yielded significant financial returns: AMD’s market capitalization has nearly quadrupled, surpassing $350 billion [1], and the company has secured major contracts to supply cutting-edge silicon to key industry players, including OpenAI and Oracle.
  • Intel accounts for approximately 1% of the discrete GPU market share, but is focused on expanding its presence in the AI training accelerator market with its Gaudi 3 platform, where it aims for an 8.7% share by the end of 2025.  The former microprocessor king has recently invested heavily in both its design and manufacturing businesses and is courting customers for its advanced data-center processors.
  • Qualcomm, which is best known for designing chips for mobile devices and cars, announced in October that it would launch two new AI accelerator chips. The company said the new AI200 and AI250 are distinguished by their very high memory capabilities and energy efficiency.

Big Tech Custom AI chips vs Nvidia AI GPUs:

Big tech companies, including Google, Meta, Amazon, and Apple, are designing their own custom AI silicon to reduce costs, accelerate performance, and scale AI across industries. Yet nearly all rely on TSMC for manufacturing, thanks to its leadership in advanced chip fabrication technology [3].

  • Google recently announced Ironwood, its 7th-generation Tensor Processing Unit (TPU), a major AI chip for LLM training and inference. Google claims over 4x improvement in training and inference performance versus the previous-generation Trillium (6th-gen) TPU. Ironwood scales to super-pods of up to 9,216 interconnected chips, enabling huge computational power for cutting-edge models, and it is optimized for high-volume, low-latency AI inference, handling complex reasoning models and real-time chatbots efficiently. By powering demanding workloads like Gemini at scale for Google Cloud and major partners such as Meta, it challenges Nvidia’s dominance.
  • Meta is in advanced talks to purchase and rent large quantities of Google’s custom AI chips (TPUs), starting with cloud rentals in 2026 and moving to direct purchases for data centers in 2027, a significant move to diversify beyond Nvidia and challenge the AI hardware market. This multi-billion dollar deal could reshape AI infrastructure by giving Meta access to Google’s specialized silicon for workloads like AI model inference, signaling a major shift in big tech’s chip strategy, notes this TechRadar article. 
  • According to a Wall Street Journal report published on December 2, 2025, Amazon’s new Trainium3 custom AI chip presents a challenge to Nvidia’s market position by providing a more affordable option for AI development.  Amazon said Trainium3 (produced by AWS’s Annapurna Labs custom-chip design business) is four times as fast as its previous generation of AI chips and can reduce the cost of training and operating AI models by up to 50% compared with systems that use equivalent graphics processing units, or GPUs.  AWS acquired Israeli startup Annapurna Labs in 2015 and began designing chips to power AWS’s data-center servers, including network security chips, central processing units, and later its AI processor series, known as Inferentia and Trainium.  “The main advantage at the end of the day is price performance,” said Ron Diamant, an AWS vice president and the chief architect of the Trainium chips. He added that his main goal is giving customers more options for different computing workloads. “I don’t see us trying to replace Nvidia,” Diamant said.
  • Interestingly, many of the biggest buyers of Amazon’s chips are also Nvidia customers. Chief among them is Anthropic, which AWS said in late October is using more than one million Trainium2 chips to build and deploy its Claude AI model. Nvidia announced a month later that it was investing $10 billion in Anthropic as part of a massive deal to sell the AI firm computing power generated by its chips.

Image Credit: Emil Lendof/WSJ, iStock

Other AI Silicon Facts and Figures:

  • Edge AI chips are forecast to reach $13.5 billion in 2025, driven by IoT and smartphone integration.
  • AI accelerators based on ASIC designs are expected to grow by 34% year-over-year in 2025.
  • Automotive AI chips are set to surpass $6.3 billion in 2025, thanks to advancements in autonomous driving.
  • Google’s TPU v5p reached 30% faster matrix math throughput in benchmark tests.
  • U.S.-based AI chip startups raised over $5.1 billion in venture capital in the first half of 2025 alone.

Conclusions:

Custom silicon is now essential for deploying AI in real-world applications such as automation, robotics, healthcare, finance, and mobility. As AI expands across every sector, these purpose-built chips are becoming the true backbone of modern computing, driving a hardware race that is just as important as advances in software. More and more AI firms are seeking to diversify their suppliers by buying chips and other hardware from companies other than Nvidia.  Cloud providers gain advantages like cost-effectiveness, specialization, lower power consumption, and strategic independence from developing their own in-house AI silicon.  By developing their own chips, hyperscalers can create a vertically integrated AI stack (hardware, software, and cloud services) optimized for their specific internal workloads and cloud platforms. This allows them to tailor performance precisely to their needs, potentially achieving better total cost of ownership (TCO) than general-purpose Nvidia GPUs (a toy TCO sketch follows below).
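To illustrate the TCO argument, here is a toy comparison; every number (unit prices, power draws, electricity cost, PUE, lifetime) is a hypothetical placeholder, since hyperscalers do not disclose real figures:

```python
# Toy total-cost-of-ownership (TCO) comparison: in-house accelerator vs.
# merchant GPU. All numbers are hypothetical placeholders for illustration.

def tco(unit_price, watts, years=4, usd_per_kwh=0.08, pue=1.3):
    """Capex plus lifetime energy cost for one accelerator."""
    hours = years * 365 * 24
    energy_cost = watts / 1000 * pue * hours * usd_per_kwh
    return unit_price + energy_cost

merchant_gpu = tco(unit_price=30_000, watts=1000)  # assumed merchant part
in_house = tco(unit_price=12_000, watts=700)       # assumed custom ASIC
print(f"merchant ~${merchant_gpu:,.0f}  in-house ~${in_house:,.0f}")
# -> merchant ~$33,644  in-house ~$14,551 (under these assumptions only)
```

Even with assumptions tilted toward the custom part, the in-house chip only pays off if its performance per dollar on the owner’s actual workloads is competitive, which is why these programs mainly make sense for hyperscalers with huge, homogeneous internal demand.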

However, Nvidia is convinced it will retain a huge lead in selling AI silicon.  In a post on X, Nvidia wrote that it was “delighted by Google’s success with its TPUs,” before adding that Nvidia “is a generation ahead of the industry—it’s the only platform that runs every AI model and does it everywhere computing is done.” The company said its chips offer “greater performance, versatility, and fungibility” than more narrowly tailored custom chips made by Google and AWS.

The race is far from over, but we can surely expect more competition in the AI silicon arena.

………………………………………………………………………………………………………………………………………………………………………………….

Links for Notes:

1.  https://www.mckinsey.com/industries/semiconductors/our-insights/artificial-intelligence-hardware-%20new-opportunities-for-semiconductor-companies/pt-PT

2. https://sqmagazine.co.uk/ai-chip-statistics/

3. https://www.ibm.com/think/news/custom-chips-ai-future

References:

https://www.wsj.com/tech/ai/amazons-custom-chips-pose-another-threat-to-nvidia-8aa19f5b

https://www.techradar.com/pro/meta-and-google-could-be-about-to-sign-a-mega-ai-chip-deal-and-it-could-change-everything-in-the-tech-space

https://www.wsj.com/tech/ai/nvidia-ai-chips-competitors-amd-broadcom-google-amazon-6729c65a

AI infrastructure spending boom: a path towards AGI or speculative bubble?

OpenAI and Broadcom in $10B deal to make custom AI chips

Reuters & Bloomberg: OpenAI to design “inference AI” chip with Broadcom and TSMC

RAN silicon rethink – from purpose built products & ASICs to general purpose processors or GPUs for vRAN & AI RAN

Dell’Oro: Analysis of the Nokia-NVIDIA partnership on AI RAN

Cisco CEO sees great potential in AI data center connectivity, silicon, optics, and optical systems

Expose: AI is more than a bubble; it’s a data center debt bomb

China gaining on U.S. in AI technology arms race- silicon, models and research

 

AI infrastructure spending boom: a path towards AGI or speculative bubble?

by Rahul Sharma, Indxx with Alan J Weissberger, IEEE Techblog

Introduction:

The ongoing wave of artificial intelligence (AI) infrastructure investment by U.S. mega-cap tech firms marks one of the largest corporate spending cycles in history. Aggregate annual AI investments, mostly for cloud-resident mega-data centers, are expected to exceed $400 billion in 2025 and potentially surpass $500 billion by 2026; the scale of this buildout rivals that of past industrial revolutions, from railroads to the internet era.[1]

At its core, this spending surge represents a strategic arms race for computational dominance. Meta, Alphabet, Amazon and Microsoft are racing to secure leadership in artificial intelligence capabilities — a contest where access to data, energy, and compute capacity are the new determinants of market power.

AI Spending & Debt Financing:

Leading technology firms are racing to secure dominance in compute capacity — the new cornerstone of digital power:

  • Meta plans to spend $72 billion on AI infrastructure in 2025.
  • Alphabet (Google) has expanded its capex guidance to $91–93 billion.[3]
  • Microsoft and Amazon are doubling data center capacity, while AWS will drive most of Amazon’s $125 billion 2026 investment.[4]
  • Even Apple, typically conservative in R&D, has accelerated AI infrastructure spending.

Their capex by company is charted in the original post.

Analysts estimate that AI could add up to 0.5% to U.S. GDP annually over the next several years. Reflecting this optimism, Morgan Stanley forecasts $2.9 trillion in AI-related investments between 2025 and 2028. The scale of commitment from Big Tech is reshaping expectations across financial markets, enterprise strategies, and public policy, marking one of the most intense capital spending cycles in corporate history.[2]

Meanwhile, OpenAI’s trillion-dollar partnerships with Nvidia, Oracle, and Broadcom have redefined the scale of ambition, turning compute infrastructure into a strategic asset comparable to energy independence or semiconductor sovereignty.[5]

Growth Engine or Speculative Bubble?

As Big Tech pours hundreds of billions of dollars into AI infrastructure, analysts and investors remain divided — some view it as a rational, long-term investment cycle, while others warn of a potential speculative bubble.  Yet uncertainty remains — especially around Meta’s long-term monetization of AGI-related efforts.[8]

Some analysts view this huge AI spending as a necessary step towards achieving Artificial General Intelligence (AGI) – an unrealized type of AI that possesses human-level cognitive abilities, allowing it to understand, learn, and adapt to any intellectual task a human can. Unlike narrow AI, which is designed for specific functions like playing chess or image recognition, AGI could apply its knowledge to a wide range of different situations and problems without needing to be explicitly programmed for each one.

Other analysts believe this is a speculative bubble, fueled by debt that can never be repaid. Tech sector valuations have soared to dot-com era levels – and, based on price-to-sales ratios, are well beyond them. Some of AI’s biggest proponents acknowledge that valuations are overinflated, including OpenAI chairman Bret Taylor: “AI will transform the economy… and create huge amounts of economic value in the future,” Taylor told The Verge. “I think we’re also in a bubble, and a lot of people will lose a lot of money,” he added.

Here are a few AI bubble points and charts:

  • AI-related capex is projected to consume up to 94% of operating cash flows by 2026, according to Bank of America.[6]
  • Over $75 billion in AI-linked corporate bonds have been issued in just two months — a signal of mounting leverage. Still, strong revenue growth from AI services (particularly cloud and enterprise AI) keeps optimism alive.[7]
  • Meta, Google, Microsoft, Amazon and xAI (Elon Musk’s company) are all using off-balance-sheet debt vehicles, including special-purpose vehicles (SPVs) to fund part of their AI investments. A slowdown in AI demand could render the debt tied to these SPVs worthless, potentially triggering another financial crisis.
  • Alphabet (Google’s parent company) CEO Sundar Pichai sees “elements of irrationality” in the current scale of AI investing, which he says exceeds even the excessive investment of the dot-com/fiber-optic build-out boom of the late 1990s. If the AI bubble bursts, Pichai said, no company will be immune, including Alphabet, despite its breakthrough Gemini technology fueling gains in the company’s stock price.

…………………………………………………………………………………………………………………..

From Infrastructure to Intelligence:

Executives justify the massive spend by citing acute compute shortages and exponential demand growth:

  • Microsoft’s CFO Amy Hood admitted, “We’ve been short on capacity for many quarters” and confirmed that the company will increase its spending on GPUs and CPUs in 2026 to meet surging demand.
  • Amazon’s Andy Jassy noted that “every new tranche of capacity is immediately monetized”, underscoring strong and sustained demand for AI and cloud services.
  • Google reported billions in quarterly AI revenue, offering early evidence of commercial payoff.

Macro Ripple Effects – Industrializing Intelligence:

AI data centers have become the factories of the digital age, fueling demand for:

  • Semiconductors, especially GPUs (Nvidia, AMD, Broadcom)
  • Cloud and networking infrastructure (Oracle, Cisco)
  • Energy and advanced cooling systems for AI data centers (Vertiv, Schneider Electric, Johnson Controls, and other specialists such as LiquidStack and Green Revolution Cooling).
Leading Providers of Energy and Cooling Systems for AI Data Centers:
  • Vertiv – Critical infrastructure (power & cooling): Offers full-stack solutions with air and liquid cooling, power distribution units (PDUs), and monitoring systems, including the AI-ready Vertiv 360AI portfolio.
  • Schneider Electric – Energy management & automation: Provides integrated power and thermal management via its EcoStruxure platform, specializing in modular and liquid cooling solutions for HPC and AI applications.
  • Johnson Controls – HVAC & building solutions: Offers integrated, energy-efficient solutions from design to maintenance, including Silent-Aire cooling and YORK chillers, with a focus on large-scale operations.
  • Eaton – Power management: Specializes in electrical distribution systems, uninterruptible power supplies (UPS), and switchgear, which are crucial for reliable energy delivery to high-density AI racks.
The companies below focus heavily on innovative liquid cooling technologies, which are essential for managing the extreme heat generated by high-density AI servers and GPUs:
  • LiquidStack: A leader in two-phase and modular immersion cooling and direct-to-chip systems, trusted by large cloud and hardware providers.
  • Green Revolution Cooling (GRC): Pioneers in single-phase immersion cooling solutions that help simplify thermal management and improve energy efficiency.
  • Iceotope: Focuses on chassis-level precision liquid cooling, delivering dielectric fluid directly to components for maximum efficiency and reduced operational costs.
  • Asetek: Specializes in direct-to-chip (D2C) liquid cooling solutions and rack-level Coolant Distribution Units (CDUs) for high-performance computing.
  • CoolIT Systems: Known for its custom direct liquid cooling technologies, working closely with server OEMs (Original Equipment Manufacturers) to integrate cold plates and CDUs for AI and HPC workloads. 

This new AI ecosystem is reshaping global supply chains — but also straining local energy and water resources. For example, Meta’s massive data center in Georgia has already triggered environmental concerns over energy and water usage.

Global Spending Outlook:

  • According to UBS, global AI capex will reach $423 billion in 2025, $571 billion by 2026, and $1.3 trillion by 2030, growing at a 25% CAGR over 2025-2030 (a quick compounding check follows this list).
  • Compute demand is outpacing expectations: Google’s Gemini saw a 130x rise in AI token usage over the past 18 months, and Meta’s infrastructure needs are expanding sharply.[9]
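As a quick check, the UBS figures are internally consistent; compounding the 2025 base at 25% per year lands almost exactly on the 2030 figure:

```python
# Sanity check: does $423B growing at 25%/yr reach ~$1.3T by 2030?
capex_2025_b = 423   # UBS 2025 global AI capex estimate, in $B
cagr = 0.25          # UBS 2025-2030 growth rate
print(round(capex_2025_b * (1 + cagr) ** 5))  # -> 1291, i.e. ~$1.3T in 2030
```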

Conclusions:

The AI infrastructure boom reflects a bold, forward-looking strategy by Big Tech, built on the belief that compute capacity will define the next decade’s leaders. If Artificial General Intelligence (AGI) or large-scale AI monetization unfolds as expected, today’s investments will be seen as visionary and transformative; if not, much of this spending could come to look like the speculative excess skeptics warn of. Either way, the AI era is well underway — and the race for computational excellence is reshaping the future of global markets and innovation.

…………………………………………………………………………………………………………………………………………………………………………………………………………………………….

Footnotes:

[1] https://www.investing.com/news/stock-market-news/ai-capex-to-exceed-half-a-trillion-in-2026-ubs-4343520?utm_medium=feed&utm_source=yahoo&utm_campaign=yahoo-www

[2] https://www.venturepulsemag.com/2025/08/01/big-techs-400-billion-ai-bet-the-race-thats-reshaping-global-technology/#:~:text=Big%20Tech’s%20$400%20Billion%20AI%20Bet:%20The%20Race%20That’s%20Reshaping%20Global%20Technology,-3%20months%20ago&text=The%20world’s%20largest%20technology%20companies,enterprise%20strategy%2C%20and%20public%20policy.

[3] https://www.businessinsider.com/big-tech-capex-spending-ai-earnings-2025-10?

[4] https://www.investing.com/analysis/meta-plunged-12-amazon-jumped-11–same-ai-race-different-economics-200669410

[5] https://www.cnbc.com/2025/10/15/a-guide-to-1-trillion-worth-of-ai-deals-between-openai-nvidia.html

[6] https://finance.yahoo.com/news/bank-america-just-issued-stark-152422714.html

[7] https://news.futunn.com/en/post/64706046/from-cash-rich-to-collective-debt-how-does-wall-street?level=1&data_ticket=1763038546393561

[8] https://www.businessinsider.com/big-tech-capex-spending-ai-earnings-2025-10?

[9] https://finance.yahoo.com/news/ai-capex-exceed-half-trillion-093015889.html

……………………………………………………………………………………………………………………………………………………………………………………………………………………………

About the Author:

Rahul Sharma is President & Co-Chief Executive Officer at Indxx, a provider of end-to-end indexing services, data and technology products.  He has been instrumental in leading the firm’s growth since 2011. Rahul manages Indxx’s Sales, Client Engagement, Marketing and Branding teams while also helping to set the firm’s overall strategic objectives and vision.

Rahul holds a BS from Boston College and an MBA with Beta Gamma Sigma honors from Georgetown University’s McDonough School of Business.

……………………………………………………………………………………………………………………………………………………………………………………………………………………………

References:

Curmudgeon/Sperandeo: New AI Era Thinking and Circular Financing Deals

Expose: AI is more than a bubble; it’s a data center debt bomb

Can the debt fueling the new wave of AI infrastructure buildouts ever be repaid?

AI spending boom accelerates: Big tech to invest an aggregate of $400 billion in 2025; much more in 2026!

Big tech spending on AI data centers and infrastructure vs the fiber optic buildout during the dot-com boom (& bust)

FT: Scale of AI private company valuations dwarfs dot-com boom

Amazon’s Jeff Bezos at Italian Tech Week: “AI is a kind of industrial bubble”

AI Data Center Boom Carries Huge Default and Demand Risks

Will billions of dollars big tech is spending on Gen AI data centers produce a decent ROI?

Dell’Oro: Analysis of the Nokia-NVIDIA partnership on AI RAN

RAN silicon rethink – from purpose built products & ASICs to general purpose processors or GPUs for vRAN & AI RAN

Nokia in major pivot from traditional telecom to AI, cloud infrastructure, data center networking and 6G

Reuters: US Department of Energy forms $1 billion AI supercomputer partnership with AMD

………………………………………………………………………………………………………………………………………………………………………….

 
