Modern C++: Achieving High-Performance Computing with SIMD Instruction Sets

0. Introduction There are two common ways to improve efficiency: one is to increase concurrency, and the other is to increase data throughput. In previous articles, we discussed multithreading, a common way to increase concurrency; this article introduces SIMD, which increases data throughput, allowing for the processing of more … Read more

Introduction to Arm NEON and CPU Optimization Techniques

This article is selected from the Extreme Technology column “Embedded AI” and is reproduced with authorization from “Mobile Algorithm Optimization” by the Zhihu author High Performance Computing Institute. Earlier articles in this series covered how to quickly get started with NEON programming, Arm NEON optimization techniques, and “Arm NEON Learning (3) NEON Assembly and Intrinsics Programming”. This article … Read more