Efficient Transformer for TinyML: Long-Short Distance Attention
Click the card below to follow the “LiteAI” public account Hi, everyone, I am Lite. Recently, I shared the Efficient Large Model Full-Stack Technology from Part 1 to 19, including large model quantization, fine-tuning, efficient inference of LLMs, quantum computing, generative AI acceleration, etc. The content links are as follows: Efficient Large Model Full-Stack Technology … Read more