MeiGen-MultiTalk: Enabling Multi-Person Interactive Video Generation from a Single Photo

Reposted from Big Company Talk. Meituan recently launched MultiTalk, an audio-driven framework for generating multi-person dialogue videos, and has open-sourced it on GitHub. The framework introduces L-RoPE (Label Rotary Position Embedding), a binding technique that addresses two challenges: handling multiple audio streams, and audio being matched to the wrong character. It also employs partial parameter training and multi-task learning …