Huawei Cloud: Fostering the Fertile Ground for Compute, Empowering AI Pioneers for Industries
SHANGHAI, Sept. 19, 2025 /CNW/ -- On the second day of HUAWEI CONNECT 2025, Zhang Ping'an, Huawei's Executive Director of the Board and CEO of Huawei Cloud, delivered a keynote speech titled "All Intelligence: Empowering AI Pioneers for Industries". He shared Huawei Cloud's innovation and practices in AI compute services, foundation models, embodied AI, AI agents, and much more.
Constant innovation in AI Compute Service: Unleashing powerful compute in the intelligent era
This year, Huawei Cloud announced its AI Compute Service powered by CloudMatrix384. The Huawei CloudMatrix supernode will be upgraded from 384 cards to 8,192 cards, and these supernodes can support a hyperscale cluster of 500,000 to 1 million cards, providing robust AI compute, an invaluable resource in the intelligent era. Huawei Cloud also announced its Elastic Memory Service (EMS), a memory-storage innovation that it describes as an industry first in using memory to expand video RAM. This drastically reduces the latency of multi-turn conversations on foundation models, greatly improving user experience.
Huawei Cloud has deployed fully liquid-cooled AI data centers in China's Guizhou, Inner Mongolia, and Anhui. These AI data centers support 80 kW heat dissipation per cabinet, reduce power usage effectiveness (PUE) to 1.1, and offer AI-enabled O&M. This means enterprises do not need to reconstruct traditional data centers or build new ones. Instead, they require only a pair of optical fibers in order to connect to the data center and access efficient AI compute, as well as full-stack dedicated AI cloud services, on Huawei Cloud.
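As a rough illustration of what a PUE of 1.1 implies, the sketch below applies the standard definition (PUE = total facility power / IT equipment power) to a single cabinet. It assumes, purely for illustration, that the cabinet's IT load equals the 80 kW heat-dissipation figure quoted above; the function names are hypothetical and are not part of any Huawei Cloud interface.

```python
def facility_power_kw(it_load_kw: float, pue: float) -> float:
    """Total facility power implied by an IT load and a PUE value.

    PUE is defined as total facility power divided by IT equipment power,
    so total = IT load * PUE. A PUE below 1.0 is not physically meaningful.
    """
    if pue < 1.0:
        raise ValueError("PUE cannot be lower than 1.0")
    if it_load_kw < 0:
        raise ValueError("IT load cannot be negative")
    return it_load_kw * pue


def overhead_power_kw(it_load_kw: float, pue: float) -> float:
    """Power spent on cooling, power conversion, and other non-IT overhead."""
    return facility_power_kw(it_load_kw, pue) - it_load_kw


if __name__ == "__main__":
    # Assumption: one cabinet drawing 80 kW of IT load at PUE 1.1.
    total = facility_power_kw(80.0, 1.1)
    overhead = overhead_power_kw(80.0, 1.1)
    print(f"total: {total:.1f} kW, overhead: {overhead:.1f} kW")
```

Under these assumptions, each 80 kW cabinet would account for roughly 88 kW at the facility level, with about 8 kW going to cooling and other overhead.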
Zhang Ping'an pointed out that Huawei Cloud's AI Token Service abstracts away the underlying technical complexity and directly provides users with the final AI computing results. This allows users to use inference computing power in the most efficient way possible. The CloudMatrix384 supernode fully pools compute, memory, and storage resources, decouples compute tasks, storage tasks, and AI expert systems, and converts serial tasks into distributed parallel tasks, greatly improving the inference performance of the system. In scenarios involving inference tasks with different latency requirements, such as online, nearline, and offline inference, CloudMatrix384 delivers an average inference performance per card that is 3 to 4 times that of the H20.
At the conference, Zhang Ping'an announced the official launch of the AI Token Service powered by CloudMatrix384. The service delivers superior performance, service, and quality to customers.
Tackling challenges head-on: Helping enterprises build their own models
Huawei Cloud has been honing its Pangu Models by diving into industry-specific scenarios, and has worked with its customers to tackle their most pressing challenges head-on, reimagining what is possible in these industries. Huawei uses openPangu to provide best practices for AI training and inference, making it easier for developers to efficiently use AI computing power. Zhang Ping'an noted that, at the same time, Huawei is developing the closed-source Pangu Model. Huawei will continually increase investment in Pangu Models, constantly study industry scenarios to better understand customer requirements, and support customers in developing their own industry-specific models, thus accelerating intelligent transformation across industries.
Pangu Models have been applied in more than 500 scenarios across over 30 industries. They have played a significant role in fields like government services, finance, manufacturing, healthcare, coal mining, steel, railways, autonomous driving, and meteorology.
Moving beyond terminals: Enabling infinite intelligence evolution on the cloud
This year, Huawei Cloud launched the CloudRobo Embodied AI Platform, which deploys complex algorithms and intelligent logic on the cloud to realize more lightweight robots. By taking advantage of the massive computing power and advanced AI models on the cloud, the platform makes robot execution more intelligent. Cloud intelligence overcomes the limitations that have been holding robots back, making them applicable to more scenarios.
To build a unified, open, and secure communication channel between robots and the cloud, Huawei Cloud has launched the Robot to Cloud (R2C) Protocol. Zhang Ping'an announced that the first 20 partners of the R2C Protocol were officially on board.
Kunpeng Cloud Services: Empowering industry innovation with software-hardware synergy and an open ecosystem
One of Huawei Cloud's key strategies is to develop Kunpeng-powered ARM cloud services that deliver performance, security, and reliability. In the past year, the number of Kunpeng compute cores on Huawei Cloud has increased from 9 million to 15 million, an increase of about 67%. In addition, the Kunpeng platform has been continuously improved for compatibility with mainstream software, and has been adapted to more than 25,000 applications. The Kunpeng platform provides solid support for Kunpeng Cloud Services to be applied to even more general-computing scenarios, in addition to transcoding, databases, web applications, and cloud phones.
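The growth figure quoted for Kunpeng compute cores can be checked with a simple percentage-change calculation. The following is a minimal sketch using only the two core counts given above; the helper name is illustrative.

```python
def percent_increase(before: float, after: float) -> float:
    """Relative growth from `before` to `after`, expressed in percent."""
    if before <= 0:
        raise ValueError("the starting value must be positive")
    return (after - before) / before * 100.0


if __name__ == "__main__":
    # Core counts from the text: 9 million a year ago, 15 million now.
    growth = percent_increase(9_000_000, 15_000_000)
    print(f"growth: {growth:.1f}%")
```

Going from 9 million to 15 million cores is an increase of two thirds, which rounds to the 67% stated in the announcement.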
GaussDB: Building efficient, reliable data foundations based on supernodes and full pooling
Based on general-purpose computing supernodes, Huawei Cloud's GaussDB databases pool compute, memory, and storage resources in layers, and allow any node to serve reads and writes concurrently, breaking free from the restrictions of the traditional architecture in which only the primary node supports data read/write. GaussDB databases also support dynamic load scheduling, greatly enhancing the performance of concurrent transaction processing. A GaussDB cluster deployed on computing supernodes can process 5.4 million transactions per minute, marking a 2.9-fold performance increase over non-supernode clusters.
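To put the GaussDB throughput figure in more familiar units, the sketch below converts transactions per minute to transactions per second and estimates the implied non-supernode baseline. It assumes that "2.9-fold" means the supernode cluster delivers 2.9 times the baseline throughput, which is an interpretation and is not stated explicitly in the announcement; the function names are illustrative.

```python
def per_second(transactions_per_minute: float) -> float:
    """Convert a per-minute transaction rate to a per-second rate."""
    return transactions_per_minute / 60.0


def implied_baseline(measured: float, fold: float) -> float:
    """Baseline throughput, assuming measured = fold * baseline."""
    if fold <= 0:
        raise ValueError("fold must be positive")
    return measured / fold


if __name__ == "__main__":
    tpm = 5_400_000  # supernode cluster figure quoted in the text
    print(f"{per_second(tpm):,.0f} transactions per second")
    # Assumption: 2.9-fold is read as a 2.9x multiple of the baseline.
    print(f"{implied_baseline(tpm, 2.9):,.0f} baseline transactions per minute")
```

Under that reading, 5.4 million transactions per minute corresponds to 90,000 transactions per second, and the non-supernode baseline would be a little under 1.9 million transactions per minute.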
All-scenario distributed cloud: Ubiquitous and best possible compute with local access
Huawei Cloud has built a distributed cloud solution covering all scenarios, including CloudOcean, CloudSea, CloudLake, and CloudPond. It spans central regions, hotspot areas, and edge sites, bringing a consistent Huawei Cloud experience to wherever customers' businesses are located.
Building an easy-to-use, effective, and open platform for developing and running agents
Huawei Cloud has launched Versatile, an enterprise-grade agent platform. It aims to serve as an easy-to-use, effective, and open platform for developing and running AI agents. With this platform, customers will be better equipped to quickly develop AI agents that suit their application scenarios.
With Versatile, users only need to prepare and submit business description documents and flowcharts. After a simple confirmation, the agent can be generated in two steps, greatly enhancing generation efficiency.
In addition to the keynote speeches given at the event, Huawei Cloud hosted a variety of sessions, such as summit forums and roundtables. Huawei Cloud also worked with customers and partners to exhibit a wide array of innovative technologies and practices in fields such as cloud infrastructure, large models, databases, AI agents, and embodied AI, demonstrating how technology can facilitate the digital and intelligent transformation of industries.
SOURCE HUAWEI CLOUD

Liang Xu, [email protected]