MultiTalk: A State-of-the-Art Model for Multi-Character Dialogue Generation with 98.7% Speech-Visual Alignment Accuracy!


Developed by Sun Yat-sen University, Meituan, and the Hong Kong University of Science and Technology, MultiTalk generates multi-character dialogue videos. It achieves state-of-the-art performance in synchronizing speech with lip movements and supports prompt-driven interactions between characters, objects, and scenes.

Related Links
Homepage: https://meigen-ai.github.io/multi-talk/
Code: https://github.com/MeiGen-AI/MultiTalk
Paper: https://arxiv.org/abs/2505.22647

Paper Introduction
In recent years, …