Telecom data centers must be redesigned for the AI era with rack scale architectures, enhanced power & cooling requirements

Recent analysis by SiliconANGLE and Morgan Stanley highlights that the bottleneck for generative AI in telecom has shifted from software capabilities to physical hardware availability. While telecom network operators have successfully designed AI models for network optimization, predictive maintenance, and autonomous traffic routing, they lack the raw compute power to run them at scale. Traditional telecom data centers were built for central office workloads and basic virtualization, not the massive parallel processing required by modern Large Language Models (LLMs) and real-time AI inference. As a result, carriers are trapped in a compute-constrained environment, forced to queue workloads or ration processing power.

……………………………………………………………………………………………………………………………………….

The growth slope of generative AI in networking through 2026–2027 is now entirely bound to the physical deployment speed of raw, rack-scale data center infrastructure. This dependency is driven by three main factors:

Gigawatt-Scale Power and Liquid Cooling: Next-generation AI clusters require unprecedented power density, often exceeding 40kW to 100kW per rack. Telcos cannot simply drop these into existing facilities; they require entirely new or heavily retrofitted data centers featuring advanced liquid cooling architectures to prevent thermal throttling.
The Fragmented Edge vs. Centralized Fortresses: Operators are realizing that centralized hyperscale data centers (like AWS or Azure clusters in Virginia) cannot support latency-sensitive “Physical AI” or real-time agentic workflows. To make AI-native networking work, carriers must deploy high-density compute racks directly at the network edge, a highly complex and capital-intensive roll-out.
Neutral Interconnection Hubs: Multi-cloud setups and distributed training workloads are putting immense pressure on backbones. The expansion rate of neutral interconnect hubs (like Equinix and Digital Realty) is directly gating how fast enterprises and telcos can orchestrate data between fragmented training clusters and edge inference nodes.
Rack-scale architecture is rapidly emerging as the primary deployment unit as enterprises transition from discrete servers to fully integrated systems capable of supporting the power density, thermal constraints, and interconnect requirements of production-scale AI workloads.

Image Credit: AMD

……………………………………………………………………………………………………………………………………………………………………………………………

AI data centers supporting telecom networks require fundamentally different power and cooling infrastructure compared to legacy enterprise facilities. The transition to generative AI and real-time edge processing has pushed power density per rack from an average of 5–10 kW up to 40–100+ kW.

Dell Technologies Inc. has been strategically aligning its portfolio to this shift, and at Dell Technologies World 2026, the company introduced an expanded PowerRack portfolio that integrates compute, networking, and storage within a unified rack-scale platform. This evolution underscores a broader transition in system design priorities—from server-centric architectures to tightly coupled, rack-level systems—driven by the escalating demands of AI infrastructure. As Arun Narayanan, senior vice president of compute and networking product management at Dell, indicated, increasing power density and system complexity are making rack-level architectural optimization not just advantageous, but essential.

“Go back two years ago, the largest, most powerful rack was 80 kilowatts,” Narayanan said. “Come to Vera Rubin, you’re going to get racks of 235 kilowatts, and then get to the next generation of Rubin Ultra and Kyber, you’re going to very quickly get to one megawatt racks. You have to fundamentally redesign everything from power distribution to cooling.”

Power Requirements and Delivery:

To prevent massive line losses and voltage drops at high densities, data centers must completely overhaul their internal alternating current (AC) distribution.

- Medium-Voltage Power Distribution: Traditional facilities step utility power down to 480V AC far from the rack. High-density AI data centers run medium-voltage or power directly down to the row or container level before stepping down. This minimizes conduction losses through the heavy copper busbars.
- The Move to 48V DC Busbars: Within the server chassis, power shelf architectures are shifting from traditional 12V DC distribution to DC busbars. A delivery architecture reduces the current required to deliver the same wattage by a factor of four. Resistive power loss occurs when electrical energy is converted into heat due to the inherent opposition to current flow in a conductor. The formula (P{loss} = I^2 R dictates that this power dissipation is highly sensitive to current changes. Therefore, cutting the current to one-fourth reduces internal rack heat and conduction power losses by 93.75%
- Grid Interconnection and Substation Constraints: A single rack-scale AI cluster (such as a cluster of 32 or 64 interconnected nodes) can easily pull 2 to 3 Megawatts (MW). Operators are bypassing traditional local distribution grids entirely. They are building dedicated on-site substations tied directly to transmission-level lines to guarantee upstream capacity.

Cooling Requirements and Technologies:

Air cooling hits a hard physical performance ceiling at roughly 30–35 kW per rack. Beyond this threshold, the volume of air required to pass through the server chassis creates unacceptable fan power consumption and audible noise. AI data centers deploy liquid-based thermodynamics to dissipate the thermal energy.

       [ Liquid Cooling Architectures for AI Racks ]
       
 ┌───────────────────────────┐      ┌───────────────────────────┐
 │       Direct-to-Chip      │      │     Immersion Cooling     │
 ├───────────────────────────┤      ├───────────────────────────┤
 │ Closed loop micro-channels│      │ Entire server submerged   │
 │ bolted directly onto GPUs │      │ in dielectric fluid tank  │
 │                           │      │                           │
 │     [ GPU ] ──► [ Liquid] │      │    ┌───┐ ┌───┐ ┌───┐      │
 │   Cold Plate   Coolant    │      │    │GPU│ │CPU│ │RAM│      │
 │    Circuit     Circuit    │      │    └───┴─┴───┴─┴───┘      │
 └───────────────────────────┘      └───────────────────────────┘

- Direct-to-Chip (Cold Plate) Cooling: This is the primary architecture for 2026 deployments. A closed-loop copper block with micro-channels is bolted directly onto high-thermal-flux components like the GPU or CPU. A specialized dielectric or water-glycol fluid circulates through the block. This absorbs heat directly from the silicon via conduction and pumps it away to a secondary heat exchanger.
- Immersion Cooling (Single-Phase and Two-Phase):
  - - Single-Phase: The entire server blade is submerged in a bath of non-conductive, hydrocarbon- or synthetic-based dielectric fluid. The fluid circulates through the chassis via natural convection or pumps to remove heat.
    - Two-Phase: The dielectric fluid has a low boiling point (\(50^{\circ }\text{C}\)). The heat from the chips boils the fluid into a vapor. The vapor rises to a condenser coil at the top of the sealed tank, condenses back into liquid, and falls back into the pool. This utilizes the latent heat of vaporization, making it highly efficient.
- Cooling Distribution Units (CDUs): High-density loops rely on CDUs to act as the barrier between the internal facility water loops (which can be lower quality) and the ultra-pure, treated water circuit flowing directly through the server cold plates.

Strategic Market Outlook (2026–2027):

Because hardware deployment cannot be short-circuited by software updates, a clear divide is emerging in the telecom sector. Operators who secured early private capital, locked in GPU supply chains, and invested in dark fiber infrastructure are positioned to scale their AI capabilities rapidly. Conversely, carriers relying on incremental, legacy virtualization upgrades will face a hard performance ceiling. Through 2027, the market winners will be determined not by who has the best AI algorithms, but by who can build and power physical rack space the fastest.

……………………………………………………………………………………………………………..

References:

https://siliconangle.com/2026/05/25/rack-scale-infrastructure-key-ai-factory-era-delltechworld/

https://www.datacenterknowledge.com/data-center-chips/rack-scale-revolution-ai-drives-new-era-of-data-center-architecture

https://www.datacenterknowledge.com/build-design/ai-transforms-data-centers-into-power-and-cooling-plants

https://www.sixfivemedia.com/content/why-rack-scale-architecture-matters-preparing-data-centers-for-the-next-wave-of-ai—six-five-on-the-road

https://www.amd.com/en/blogs/2025/amd-delivering-open-rack-scale-ai-infrastructure-to-unlock-agentic-ai.html

https://www.nae.edu/349862/The-Evolving-Power-Architecture-of-AI-Data-Centers-?layoutChange=Normal

One thought on “Telecom data centers must be redesigned for the AI era with rack scale architectures, enhanced power & cooling requirements”

IEEE Member says:

July 24, 2026 at 20:40

On July 23, 2026, a transmission line fault in Northern Virginia’s “Data Center Alley” (the world’s largest concentration of data centers) caused hyperscale facilities to automatically switch to backup power, removing more than 3 GW of demand from the PJM grid within seconds. While the event created brief voltage fluctuations from Virginia to Chicago, it also demonstrated how today’s modern data centers are engineered to protect both operations and the grid exactly as intended during unexpected disruptions.

Shumate Engineering is an award-winning designer of AI data centers and a mission-critical MEP engineering firm that works with data center developers and operators nationwide.

IEEE ComSoc Technology Blog

Telecom data centers must be redesigned for the AI era with rack scale architectures, enhanced power & cooling requirements

References:

China vs U.S.: Race to Generate Power for AI Data Centers as Electricity Demand Soars

Big tech spending on AI data centers and infrastructure vs the fiber optic buildout during the dot-com boom (& bust)

Will billions of dollars big tech is spending on Gen AI data centers produce a decent ROI?

AWS to deploy AI inference chips from Cerebras in its data centers; Anapurna Labs/Amazon in-house AI silicon products

Analysis: Cisco, HPE/Juniper, and Nvidia network equipment for AI data centers

Networking chips and modules for AI data centers: Infiniband, Ultra Ethernet, Optical Connections

Lumen Technologies to connect Prometheus Hyperscale’s energy efficient AI data centers

One thought on “Telecom data centers must be redesigned for the AI era with rack scale architectures, enhanced power & cooling requirements”

Leave a Reply Cancel Reply

Archives

Archives

Recent Posts