From Solo Comedy to Group Debate: Sun Yat-sen University & Meituan Open Source MultiTalk: A State-of-the-Art Model for Multi-Character Dialogue Generation with Voice-Visual Alignment Accuracy of 98.7%!

From Solo Comedy to Group Debate: Sun Yat-sen University & Meituan Open Source MultiTalk: A State-of-the-Art Model for Multi-Character Dialogue Generation with Voice-Visual Alignment Accuracy of 98.7%!

MultiTalk, open-sourced by Sun Yat-sen University, Meituan, and Hong Kong University of Science and Technology, enables the generation of multi-character dialogue videos. It achieves state-of-the-art performance in synchronizing voice with lip movements and supports interactions between characters, objects, and scenes through prompts. Related Links Homepage: https://meigen-ai.github.io/multi-talk/ Code: https://github.com/MeiGen-AI/MultiTalk Paper: https://arxiv.org/abs/2505.22647 Paper Introduction In recent years, … Read more