Thought Leaders

Protecting Your AI Investment: Why Cooling Strategy Matters More Than Ever

mm

Data center operators are gambling millions on outdated cooling technology. The conversation around data center cooling isn’t just changing—it’s being completely redefined by the economics of AI. The stakes have never been higher.

The rapid advancement of AI has transformed data center economics in ways few predicted. When a single rack of AI servers costs around $3 million—as much as a luxury home—the risk calculation fundamentally changes. As Andreessen Horowitz co-founder Ben Horowitz recently cautioned, data centers financing these massive hardware investments “could get upside down very fast” if they don’t carefully manage their infrastructure strategy.

This new reality demands a fundamental rethinking of cooling approaches. While traditional metrics like PUE and operating costs are still important, they are secondary to protecting these multi-million-dollar hardware investments. The real question data center operators should be asking is: How do we best protect our AI infrastructure investment?

The Hidden Risks of Traditional Cooling

The industry’s historic reliance on single-phase, water-based cooling solutions carries increasingly unacceptable risks in the AI era. While it has served data centers well for years, the thermal demands of AI workloads have pushed this technology beyond its practical limits. The reason is simple physics: single-phase systems require higher flow rates to manage today’s thermal loads, increasing the risk of leaks and catastrophic failures.

This isn’t a hypothetical risk. A single water leak can instantly destroy millions in AI hardware—hardware that often has months-long replacement lead times in today’s supply-constrained market. The cost of even a single catastrophic failure can exceed a data center’s cooling infrastructure budget for an entire year. Yet many operators continue to rely on these systems, effectively gambling their AI investment on aging technology.

At Data Center World 2024, Dr. Mohammad Tradat, NVIDIA’s Manager of Data Center Mechanical Engineering, asked, “How long will single-phase cooling live? It’ll be phased out very soon…and then the need will be for two-phase, refrigerant-based cooling.” This isn’t just a growing opinion—it’s becoming an industry consensus backed by physics and financial reality.

A New Approach to Investment Protection

Two-phase cooling technology, which uses dielectric refrigerants instead of water, fundamentally changes this risk equation. The cost of implementing a two-phase cooling system—typically around $200,000 per rack—should be viewed as insurance for protecting a $5 million AI hardware investment. To put this in perspective, that’s a 4% premium to protect your asset—considerably lower than insurance rates for other multi-million dollar business investments. The business case becomes even clearer when you factor in the potential costs of AI training disruption and idle infrastructure during unplanned downtime.

For data center operators and financial stakeholders, the decision to invest in two-phase cooling should be evaluated through the lens of risk management and investment protection. The relevant metrics should include not just operating costs or energy efficiency but also the total value of hardware being protected, the cost of potential failure scenarios, the future-proofing value for next-generation hardware and the risk-adjusted return on cooling investment.

As AI continues to drive up the density and value of data center infrastructure, the industry must evolve its approach to cooling strategy. The question isn’t whether to move to two-phase cooling but when and how to transition while minimizing risk to existing operations and investments.

Smart operators are already making this shift, while others risk learning an expensive lesson. In an era where a single rack costs more than many data centers’ annual operating budgets, gambling on outdated cooling technology isn’t just risky – it’s potentially catastrophic. The time to act is now—before that risk becomes a reality.

Josh Claman is the Executive Chairman and Founding CEO of Accelsius, a global leader in two-phase direct-to-chip liquid cooling technology for AI data centers. He co-founded Accelsius in 2022 to address one of the sector’s most pressing challenges: the enormous energy and water demands placed on communities by AI infrastructure. Under his leadership, the company developed NeuCool, a closed-loop, waterless cooling platform engineered for the thermal demands of next-generation AI chips—delivering up to 50% energy savings compared to traditional air cooling while eliminating facility dependence on local water supplies.

Claman brings more than 30 years of global technology leadership to the role. He previously served as Vice President of Public and Large Enterprise for the Americas at Dell Technologies, overseeing multi-billion dollar business units across North and South America, and as General Manager of Dell UK—part of a career that spanned five continents and included senior roles at both AT&T and NCR. He later served as President of ReachLocal, Chief Business Officer at 3D printing pioneer Stratasys, and Chief Executive Officer of digital health company Rimidi.

Claman earned a BA with High Distinction in History and Government from the University of Illinois Urbana-Champaign and an MBA from the Darla Moore School of Business at the University of South Carolina. He writes and speaks frequently on the intersection of technology, energy, and public policy, and has addressed data center sustainability at forums across North America and Europe. He believes the industry’s long-term license to operate depends as much on community trust as on technical performance.