vLLM Framework Source Code Analysis: Block Allocation and Management

vLLM Framework Source Code Analysis: Block Allocation and Management

1. Block Overview A significant innovation of vLLM is the division of the physical layer GPU and CPU available memory into several blocks, which effectively reduces memory fragmentation issues. Specifically, vLLM’s blocks are divided into logical and physical levels, with a mapping relationship between the two. The following diagram explains the relationship between the two … Read more

DAC 2019 Low Power Object Detection System Design Challenge: GPU and FPGA Dual Championship Solutions Explained

DAC 2019 Low Power Object Detection System Design Challenge: GPU and FPGA Dual Championship Solutions Explained

Machine Heart Released Author: Zhang Xiaofan On June 5, 2019, the second “Low Power Object Detection System Design Challenge” hosted by the top electronic automation design conference DAC concluded in Las Vegas (Machine Heart reported on the first competition last year). This competition aimed to design high-precision and energy-efficient object detection systems for terminal devices, … Read more

Understanding CPU, MCU, MPU, SoC, DSP, ECU, GPU, and FPGA

Understanding CPU, MCU, MPU, SoC, DSP, ECU, GPU, and FPGA

In electronic engineering and embedded development, terms like CPU, MCU, MPU, SoC, DSP, ECU, GPU, and FPGA are often mentioned. However, due to similar names and overlapping uses, beginners often feel confused. This article will comprehensively analyze their characteristics from four perspectives: definition, performance, usage, and price. — 1. CPU (Central Processing Unit) Definition The … Read more

Cost-Effective Fine-Tuning with LoRA

Cost-Effective Fine-Tuning with LoRA

Selected from Sebastian Raschka’s blog Translated by Machine Heart Editor: Jiaqi This is the experience derived from hundreds of experiments by the author Sebastian Raschka, worth reading. Increasing the amount of data and the number of model parameters is a widely recognized direct method to improve neural network performance. Currently, mainstream large models have parameter … Read more

S-LoRA: Enabling Thousands of Large Models on a GPU

S-LoRA: Enabling Thousands of Large Models on a GPU

Machine Heart reports Editor: Danjiang Generally, the deployment of large language models adopts a “pre-training – then fine-tuning” approach. However, when fine-tuning the base model for numerous tasks (such as personalized assistants), the training and service costs can become extremely high. Low-Rank Adaptation (LoRA) is a parameter-efficient fine-tuning method, typically used to adapt the base … Read more

Unveiling the NVIDIA Jetson Xavier Development Kit

Unveiling the NVIDIA Jetson Xavier Development Kit

Previously, we reported on NVIDIA’s launch of the new generation embedded high-performance product, Jetson Xavier.NVIDIA has released a new “pocket-sized” development kit: This is how Jetson Xavier looks. Today, let’s take a closer look at the upcoming Jetson Xavier development kit! The dimensions should be 105mm * 105mm. Let’s take a look at the specifications: … Read more

Fun with PowerVR and BeagleBone Black Development

Fun with PowerVR and BeagleBone Black Development

The development board is cool, and the BeagleBone® Black (BBB) is one of the most interesting development boards. This widely used microcontroller board costs around £35 and boots Linux in just 10 seconds, allowing any interested developer to get started quickly. To this end, Imagination has been collaborating with the BeagleBoard.org® Foundation to enable users … Read more

Application of Edge GPU Computing in In-Vehicle Entertainment and Information Systems

Application of Edge GPU Computing in In-Vehicle Entertainment and Information Systems

With the rapid development of smart automotive technology, in-vehicle entertainment and information systems have become an indispensable part of modern vehicles. These systems not only cater to the entertainment needs of the car owner but also involve multiple functions such as navigation, driving assistance, and vehicle networking. As technology continues to advance, the demand for … Read more