Deploy AI Models in Just Three Lines of Code!

The development of artificial intelligence applications is accelerating, and the deployment work that developers face is becoming increasingly complex. The sheer variety of algorithm models, AI hardware architectures, deployment targets (server, service, embedded, mobile, etc.), and operating systems and programming languages poses significant challenges for AI developers bringing projects to production. To solve … Read more

Streaming Output for Model Inference in Transformers

This article introduces how to implement streaming output for model inference with the transformers module. The transformers module provides built-in streamer classes for streaming output during inference, and model-serving frameworks such as vLLM and TGI offer even stronger support for streaming inference. Below, we will detail how … Read more
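
As a minimal sketch of the built-in approach (the model name "gpt2" is an illustrative assumption; any causal language model would work), the TextStreamer class from transformers prints tokens to stdout as they are generated:

```python
# A minimal sketch of streaming output with transformers' built-in
# TextStreamer (model name "gpt2" is an illustrative assumption).
from transformers import AutoModelForCausalLM, AutoTokenizer, TextStreamer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

inputs = tokenizer("Streaming output lets you", return_tensors="pt")

# TextStreamer decodes and prints each token as soon as it is generated;
# skip_prompt=True omits the echoed input prompt from the output.
streamer = TextStreamer(tokenizer, skip_prompt=True)
model.generate(**inputs, streamer=streamer, max_new_tokens=50)
```

For programmatic consumption, such as forwarding chunks from a web service, transformers also provides TextIteratorStreamer, which exposes the same generated chunks through a Python iterator instead of printing them.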