How Huawei Tamed a Trillion-Parameter Sparse Model? Key Technical Breakthroughs in MoE Training on Ascend NPU

In the competition among large models, sparse models built on Mixture of Experts (MoE) are becoming the new favorites in the AI field thanks to their outstanding efficiency. Recently, Huawei released a technical report titled “Pangu Ultra …

Huawei Ascend NPU Achieves Near-Trillion Parameter Large Model, Showcasing Domestic Computing Power Strength

Huawei has made a significant breakthrough in training large AI models: its Ascend NPU has successfully run a near-trillion-parameter model, placing domestic computing platforms among the world leaders in large-model training. Previously, training trillion-parameter models faced numerous challenges, such as difficulties in load balancing, high communication …

New Ascend NPU and MindIE Support Capabilities in SmartX AI Infrastructure: Solutions and Evaluations

Recently, SmartX launched the DeepSeek solution based on AI infrastructure, providing a unified infrastructure resource stack for both AI large models and other enterprise business systems, helping enterprises quickly advance the implementation and validation processes of large models. At the same time, considering the needs of users in industries such as finance and healthcare for …

Performance Evaluation of PyTorch and MindSpore Frameworks for Ascend NPU: Adapting Open Source Software Stack vs. Internal Software Stack

This article evaluates the performance of MindSpore and PyTorch on Huawei’s Ascend Neural Processing Unit (Ascend NPU). The Ascend NPU is a chip designed specifically for accelerating neural network computations; it not only supports Huawei’s self-developed framework MindSpore but has also been adapted for PyTorch. However, there is currently a lack of comparative evaluation studies on MindSpore and …

Deploying DeepSeek Large Model on Kunpeng Server with openEuler and Ascend NPU

1. Deployment Method

The deployment uses openEuler 22.03 LTS SP4 with two Ascend NPUs, with Docker launching the MindIE image; the model used is deepseek-32b.

The deployment process is as follows:
1. Install the server operating system
2. Install NPU drivers
3. Install Docker and download the MindIE image
4. Download the large model files
5. Launch the MindIE image and load the large model
6. Test the output …