Individual audio track comparisons with baseline methods

Example 2

Track 1: "Curtains rustle gently" (All videos are identical)

MMAudio | MMAudio + Negative prompting | MMAudio + NAG (ours)

Track 2: "Keyboard clicks intermittently"

MMAudio | MMAudio + Negative prompting | MMAudio + NAG (ours)

Track 3: "Cat meows softly"

MMAudio | MMAudio + Negative prompting | MMAudio + NAG (ours)

Track 4: "TV hums quietly in the distance"

MMAudio | MMAudio + Negative prompting | MMAudio + NAG (ours)

Track 5: "Footsteps echo faintly"

MMAudio | MMAudio + Negative prompting | MMAudio + NAG (ours)

Composite audio

MMAudio | MMAudio + Negative prompting | MMAudio + NAG (ours)