Scaling Laws Now Apply to Post-Training, Inference

Advertisements

The advent of DeepSeek has dramatically changed the landscape of artificial intelligence and computational power in the corporate worldNo longer do businesses face prohibitive costs associated with accessing top-tier large models, which had once been a significant barrier to entryThis seismic shift in accessibility is expected to catalyze growth and contribute to the wider distribution of computational power throughout various industries.

According to a report jointly released by the International Data Corporation (IDC) and Inspur Information, titled “Assessment Report on the Development of Artificial Intelligence Computing Power in China 2025,” the country's intelligent computing power is projected to experience rapid growth over the next couple of yearsThe report forecasts that by 2025, smart computing power in China will reach a staggering 1,037.3 exaFLOPS, an impressive 43% increase from the previous yearLooking ahead to 2026, predictions suggest that this number will double, hitting approximately 1,460.3 exaFLOPSThe report anticipates that market size for AI computing power in China will grow to around $25.9 billion in 2025, reflecting a 36.2% increase from 2024, while in 2026, the market is expected to expand to about $33.7 billion, or 1.77 times that of 2024.

One of the most significant impacts of DeepSeek has been its role in accelerating the growth of the inference marketEarlier this year, the launch of the DeepSeek-R1 model created a ripple effect that has prompted industry stakeholders to reconsider new frameworks for AI developmentFollowing its release, many technology stocks in the U.S. experienced a temporary downturn, demonstrating the model's immediate impact on the marketFor instance, on January 28, shares of Nvidia plunged over 10%, with a market value evaporating by more than $350 billionOther major players, including Taiwan Semiconductor Manufacturing Company (TSMC) and Avago Technologies, also saw significant price drops.

This initial market turbulence, however, proved transitory

Advertisements

The decrease in costs associated with large model computational power has simultaneously lowered the threshold for businesses eager to adopt such advanced modelsThe paradox identified by economist Jevons posits that improvements in algorithmic efficiency often result in increased computational demands rather than a reductionThe influx of new users and applications has accelerated the proliferation and practical application of large models, transforming the innovation paradigm within the industryAs a result, there has been a surge in the construction of data centers, edge computing, and endpoint computational power to accommodate this rising demand.

With the open-source nature of DeepSeek, lowering the barriers of entry is becoming a prevalent trendAs pointed out by Zhou Zhenggang, the Vice President of IDC China, the introduction of the open-source framework significantly invites more users into the arena of large models, thereby fostering the growth of the computational ecosystemHe further notes that the "Scaling Low" principle remains dominant in the current AI landscape, with organizations' demand for intelligent computing continuing to rise at an impressive rate.

Building upon this foundation, the scalability paradigm is now extending from pre-training phases to post-training and inference stagesZhou emphasizes the necessity for greater computational investment in post-training and inference, leveraging innovations such as reinforcement learning and advanced cognitive processing capabilitiesThis advancement presents new opportunities for increasing the depth of machine learning models, which ultimately enhances their cognitive capabilities.

Moreover, the open-source movement initiated by DeepSeek is stimulating a broader ecosystem of innovationThe increase in application development on model platforms is becoming evident, with even low-code tools beginning to integrate into these development environments, paving the way for a more inclusive and expansive approach to model creation

Advertisements

Zhou’s perspective aligns with the notion that this signifies a new era for model development platforms.

The surging demand for AI servers further emphasizes the dynamic impact of DeepSeekAccording to IDC data, the global AI server market is projected to be valued at $125.1 billion in 2024, with anticipated growth to $158.7 billion by 2025 and potentially reaching $222.7 billion by 2028. Of particular note, the proportion of generative AI servers is expected to rise significantly from 29.6% in 2025 to 37.7% by 2028.

In the context of China's computational market, IDC forecasts that the size of intelligent computing will reach 1,037.3 exaFLOPS by 2025 and surge to 2,781.9 exaFLOPS by 2028, while general computing power is expected to expand from 85.8 exaFLOPS to 140.1 exaFLOPS during the same periodZhou emphasizes the trend in increased requirements for intelligent computing, projecting a compound annual growth rate of 46.2% between 2023 and 2028, while general computing is predicted to comparatively grow at 18.8%.

The effects of DeepSeek on the AI server market are already becoming apparentObservations in the server market indicate a spike in inquiries and orders for AI servers shortly after the Lunar New YearLiu Jun, Senior Vice President of Inspur Information, noted that many clients are now seeking servers capable of running the DeepSeek-R1 671B modelLiu reported a sharp increase in inquiries over the past two weeks, as companies evaluate their options between cloud-based solutions and local deployments, both of which require robust AI servers to underpin model inference operationsThis demand surge is injecting fresh energy into the AI server market.

In Liu’s view, the growth in AI servers will not be a mere flash in the pan but rather indicative of a sustained upward trendHe anticipates that users will first undergo a proof of concept (POC) stage, trialing to identify scalable business applications before moving toward widespread deployment

Advertisements

Advertisements

Advertisements

Post Comment