Individual audio track comparisons with baseline methods

Example 1 Back to index

Track 1: "Wind whispers through the fence" (All videos are identical)

MMAudio MMAudio + Negative prompting MMAudio + NAG (ours)

Track 2: "Hammer strikes wood repeatedly" Back to index

MMAudio MMAudio + Negative prompting MMAudio + NAG (ours)

Track 3: "Pigeones coo softly" Back to index

MMAudio MMAudio + Negative prompting MMAudio + NAG (ours)

Track 4: "Birds chirp faintly in the distance" Back to index

MMAudio MMAudio + Negative prompting MMAudio + NAG (ours)

Track 5: "Metal clinks softly against metal" Back to index

MMAudio MMAudio + Negative prompting MMAudio + NAG (ours)

Composite audio Back to index

MMAudio MMAudio + Negative prompting MMAudio + NAG (ours)