Exploring the Potential: Unlocking New Possibilities with Mining Overclocking
2025年03月26日
Liquid cooling technology: the core engine for efficient heat dissipation in AI training rooms
With the explosive development of large AI models (such as ChatGPT and DeepSeek), the demand for computing power in AI training rooms has increased exponentially, and behind the high-density computing power are the dual challenges of power consumption and heat dissipation. Traditional air cooling technology can no longer meet the heat dissipation requirements of a single cabinet with a power exceeding 100kW. Liquid cooling technology has become an indispensable solution for AI training rooms with its high efficiency, energy saving, stability and controllability. The following analyzes the importance of liquid cooling technology to AI training rooms from the aspects of technical advantages, practical applications, economy and future trends.
1. Heat dissipation challenges in AI training rooms and the core advantages of liquid cooling technology
①. Heat dissipation limit under high computing power
The training of large AI models requires the support of massive GPU clusters. For example, the single-chip power consumption of NVIDIA H100 GPU is 700W, and the power density of a single cabinet can reach 120kW19. Traditional air cooling technology can only cover cabinet power below 20kW, while liquid cooling has a heat dissipation efficiency that is more than 3,000 times that of air cooling, and can support a single cabinet power of up to 200kW.
②. Energy saving and consumption reduction and PUE optimization
The high energy consumption problem of AI computer rooms is becoming increasingly prominent, and the annual power consumption of global data centers has accounted for 1.5% of the total social power consumption5. Liquid cooling technology can reduce PUE (energy usage efficiency) from 1.5-1.8 of air cooling to 1.05-1.2 through efficient heat exchange, significantly reducing the energy consumption of the cooling system. For example, Inspur's cold plate liquid cooling solution has achieved PUE≤1.15, and the immersion solution can even be close to 1.05.
③ Improved stability and reliability
High-density computing equipment is prone to performance degradation or failure due to local overheating. Liquid cooling technology can reduce server failure rate by more than 30% through uniform heat dissipation, while reducing computer room noise (fan speed is reduced by 60%).
2. Practical application of liquid cooling technology in AI training rooms
①. Mainstream technical solutions: cold plate and immersion in parallel
Cold plate liquid cooling: indirect heat exchange through metal cold plates, high compatibility and low transformation cost, suitable for 30-80kW cabinets, with a market share of 90%17. For example, Inspur's cold plate solution increased the cabinet power density by 8 times and reduced PUE by 23% in a certain operator project in Beijing.
Immersion liquid cooling: The equipment is completely immersed in the coolant, with the highest heat dissipation efficiency (supporting 80-200kW cabinets), but the initial cost is relatively high. Sugon's phase change immersion liquid cooling technology has been applied to more than ten large data centers, with the longest stable operation of nearly 10 years.
②. Benchmark cases and innovative products
Huahong Digital's all-liquid-cooled intelligent computing all-in-one machine: through the dual drive of "liquid cooling + scenario-based computing power", high-density parallel computing is achieved, and a single cabinet supports the extreme performance of 7 graphics cards to meet the needs of large model training.
Sugon Digital's "air-liquid hybrid cooling" architecture: Combining the advantages of liquid cooling and air cooling, it can flexibly adapt to different equipment in the intelligent computing center, and the measured PUE is as low as 1.128.
3. The dual support of economic efficiency and policy-driven
①. TCO (total cost of ownership) advantage is evident
Although liquid cooling has a high initial investment, its long-term operating cost is significantly lower than air cooling. Taking the East China region as an example, the TCO of cold plate liquid cooling is lower than air cooling after 5 years, and the cost of immersion liquid cooling can be reduced by 5-10% per year1. In addition, liquid cooling technology can extend the life of servers and further optimize the cost structure.
②. Strong promotion of policies and standards
China's "East Data West Computing" project requires that the PUE of new data centers be ≤1.25, and liquid cooling has become the core means to meet the standard. The three major operators plan to achieve more than 50% of the project application of liquid cooling by 2025, and Shanghai and other regions require that the proportion of liquid cooling cabinets exceed 50%.
4. Future trends and challenges
①. Technology integration and scenario expansion
Air-liquid hybrid mode: air cooling is retained for low-power devices such as storage and network, and liquid cooling is used for high-computing power devices, taking into account both cost and performance.
Edge computing and miniaturization: Huahong Digital and other companies explore the deployment of liquid cooling in edge intelligent computing centers to achieve low-latency, high-efficiency real-time computing support.
②. Coolant and safety optimization
Traditional fluorinated liquids are gradually replaced by synthetic oils due to environmental issues. Inspur and other manufacturers develop negative pressure systems to eliminate the risk of leakage. In the future, coolants will evolve towards a more environmentally friendly and high thermal conductivity direction.
③. Ecological co-construction and standardization
Institutions such as the Global Computing Consortium (GCC) are promoting the unification of liquid cooling technology standards. For example, the "Double Zero Action" aims to achieve zero problems and zero waiting in the deployment of liquid cooling rooms.
Conclusion
Liquid cooling technology is not only the key to breaking through the heat dissipation bottleneck of AI training rooms, but also the core path to achieve green computing power. As technology matures, costs fall, and policies are promoted, liquid cooling will move from large-scale applications to full popularization and become the "standard configuration" of computing infrastructure in the AI era. In the future, through technological innovation and ecological collaboration, liquid cooling technology will further unleash the potential of AI and promote the high-quality development of the digital economy.
Recommended news
In the cryptocurrency industry, computing power is power. From Bitcoin mining to AI-driven smart contracts, computing power demand is growing exponentially.
As an important innovation in the field of data center heat dissipation, liquid cooling technology is moving from edge technology to mainstream application with the explosive growth of computing power demand and strict policy requirements on energy efficiency.