Running Large Models on Mobile Devices Made Easy

Reported by the Machine Heart editorial team. For some large-model inference tasks, the bottleneck is not compute (FLOPS). Recently, many people in the open-source community have been exploring optimization methods for large models. A project called llama.cpp rewrites LLaMA inference in pure C++, achieving excellent results and … Read more
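The excerpt's point is that quantized LLaMA inference on consumer hardware tends to be memory-bound rather than FLOPS-bound, which is why a lean C++ runtime works well on laptops and phones. As a minimal sketch of driving a llama.cpp model from Python, assuming the llama-cpp-python bindings are installed and a quantized GGUF file is available locally (the model path and prompt below are placeholders, not taken from the article):

```python
# Minimal llama.cpp inference via the llama-cpp-python bindings.
# Assumes: pip install llama-cpp-python, and a quantized GGUF model on disk.
from llama_cpp import Llama

llm = Llama(
    model_path="models/llama-7b.Q4_K_M.gguf",  # placeholder path to a quantized model
    n_ctx=2048,    # context window
    n_threads=4,   # CPU threads; tune for the target device
)

output = llm(
    "Q: Why can a 7B model run on a laptop CPU? A:",
    max_tokens=64,
    stop=["Q:"],
)
print(output["choices"][0]["text"])
```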

Deep Learning Model Inference on Raspberry Pi Zero W Using Python

In a machine learning workflow, once a model has been trained, the next step is model inference. Depending on the deployment environment, inference falls into three types of scenarios: Edge Computing: generally refers to mobile phones and embedded devices, where inference runs directly on the device where the data is … Read more
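The excerpt does not show which runtime the article uses on the Raspberry Pi Zero W; as an illustrative sketch, assuming a TensorFlow Lite model and the lightweight tflite_runtime package (the model filename is a placeholder):

```python
# Run a TFLite model on a Raspberry Pi with the lightweight tflite_runtime package.
# Assumes: pip install tflite-runtime, and a converted .tflite model on disk.
import numpy as np
from tflite_runtime.interpreter import Interpreter

interpreter = Interpreter(model_path="model.tflite")  # placeholder model file
interpreter.allocate_tensors()

input_details = interpreter.get_input_details()
output_details = interpreter.get_output_details()

# Feed a dummy input with the expected shape and dtype, then run inference.
dummy = np.zeros(input_details[0]["shape"], dtype=input_details[0]["dtype"])
interpreter.set_tensor(input_details[0]["index"], dummy)
interpreter.invoke()

result = interpreter.get_tensor(output_details[0]["index"])
print(result.shape)
```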

Complete Guide to Embedded AI Framework Tengine: Architecture, Operator Customization, and Engine Inference

Produced by Smart Things Open Class. Instructor: Wang Haitao, co-founder of OPEN AI LAB and chief architect of Tengine. Introduction: On the evening of April 8, Smart Things Open Class launched … Read more

Deploying and Evaluating the RK3588 YOLOv5s Model

MEGAWAY TECHNOLOGY RK3588 YOLOv5s Model Deployment and Evaluation. 01/ Model Overview: Model Name: YOLOv5s; Model Type: Object Detection; Official Repository: GitHub – ultralytics/yolov5 (YOLOv5 in PyTorch > ONNX > CoreML > TFLite), v7.0; Parameters: 7,225,885; Computation: 16.4 GFLOPs; Deployment Device: RK3588; Deployment Environment: Ubuntu 20.04 / rknn_toolkit2 v1.6.0 / OpenCV 4.5.1. 02/ Model Analysis: The YOLOv5 model outputs three … Read more
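As a sketch of the usual rknn-toolkit2 conversion flow implied by the environment above (YOLOv5s exported to ONNX, then built into an RKNN model for RK3588); the file names, normalization values, and calibration dataset below are assumptions, not taken from the article:

```python
# Convert an exported YOLOv5s ONNX model to RKNN format with rknn-toolkit2.
from rknn.api import RKNN

rknn = RKNN(verbose=True)

# Preprocessing config: assumes 0-255 input scaled to 0-1; adjust to match training.
rknn.config(mean_values=[[0, 0, 0]], std_values=[[255, 255, 255]],
            target_platform="rk3588")

rknn.load_onnx(model="yolov5s.onnx")                       # placeholder ONNX export
rknn.build(do_quantization=True, dataset="./dataset.txt")  # list of calibration images
rknn.export_rknn("yolov5s.rknn")

rknn.release()
```

The exported .rknn file can then be loaded on the board with the runtime API for on-device inference and accuracy/latency evaluation.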

How to Convert Open Source Framework Models to Ascend Models Based on Orange Pi AIpro

In the previous installment, we covered how to develop AI inference applications on the Orange Pi AIpro and saw that, before inference, the original network model (PyTorch, TensorFlow, Caffe, etc.) must first be converted into an .om model. Only then can model execution interfaces such as the Ascend aclmdlExecute be called for inference … Read more
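The conversion itself is done offline with the ATC command-line tool; once an .om file exists, the sketch below shows how it could be loaded and released through the pyACL Python bindings. The model file, device ID, and ATC options are assumptions for illustration; the article itself works with the C interfaces such as aclmdlExecute.

```python
# Load an Ascend .om model through pyACL (Python bindings for AscendCL).
# The .om file is assumed to have been produced beforehand with ATC, e.g. (assumed flags):
#   atc --model=model.onnx --framework=5 --output=model --soc_version=<your SoC>
import acl

DEVICE_ID = 0

acl.init()
acl.rt.set_device(DEVICE_ID)

# Load the offline model; returns a model ID used by the execution interfaces.
model_id, ret = acl.mdl.load_from_file("model.om")  # placeholder .om file
assert ret == 0, f"model load failed, ret={ret}"

# ... prepare aclmdlDataset inputs/outputs, then call acl.mdl.execute(model_id, in, out) ...

acl.mdl.unload(model_id)
acl.rt.reset_device(DEVICE_ID)
acl.finalize()
```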