Introduction to Arm NEON and CPU Optimization Techniques

Introduction to Arm NEON and CPU Optimization Techniques

This article is selected from the Extreme Technology column “Embedded AI” and is authorized to be reproduced from the Zhihu author High Performance Computing Institute’s “Mobile Algorithm Optimization“. Previously, we learned how to quickly get started with NEON programming, Arm NEON optimization techniques, and Arm NEON Learning (3) NEON Assembly and Intrinsics Programming. This article … Read more

Introduction to CPU Optimization Techniques – NEON

Introduction to CPU Optimization Techniques - NEON

This article is reprinted from the Jishu Community Jishu Column: Embedded AI Author: High Performance Computing Research Institute 1. SIMD ARM NEON is a SIMD (Single Instruction Multiple Data) extension architecture suitable for ARM Cortex-A and Cortex-R series processors. SIMD uses a controller to manage multiple processors, executing the same operation on each data item … Read more

Faster and More Accurate Data Computation with Next-Gen DSP

Faster and More Accurate Data Computation with Next-Gen DSP

Traditionally, the architecture of Digital Signal Processors (DSP) has been limited in specific signal processing computation applications. The combination of Very Long Instruction Word (VLIW), Single Instruction Stream and Multiple Data Stream (SIMD) architectures provides the throughput necessary for high computational performance, with data sizes typically being 16, 24, and 32 fixed-point. This algorithm is … Read more

Faster And More Accurate Data Computation With Next-Gen DSP

Faster And More Accurate Data Computation With Next-Gen DSP

The architecture of traditional Digital Signal Processors (DSP) has become inadequate for specific signal processing computational applications. However, the combination of Very Long Instruction Word (VLIW) and Single Instruction/Multi Data (SIMD) architectures can provide the parallel throughput required for high computational performance, with data typically being 16, 24, and 32 bits wide. This is particularly … Read more

Understanding Arm NEON: CPU Optimization Techniques and Instruction Introduction

Understanding Arm NEON: CPU Optimization Techniques and Instruction Introduction

Click the card below to follow Arm Technology Academy This article is selected from the Extreme Technology column “Embedded AI” and is authorized to be reprinted from Zhihu author High-Performance Computing Institute’s “Mobile Algorithm Optimization“. Previously, we learned how to quickly get started with NEON programming, Arm NEON optimization technology, and Arm NEON Learning (III) … Read more

Getting Started with Arm NEON Programming: A Quick Guide

Getting Started with Arm NEON Programming: A Quick Guide

Follow the card below to subscribe to Arm Technology Academy This article is selected from the Jishu column “Infrastructure Open Source Software on Arm” in the Arm NEON learning series. NEON refers to a high-level SIMD (Single Instruction Multiple Data) extension instruction set suitable for Arm Cortex-A series processors, which can accelerate multimedia and signal … Read more