Decomposed atomic editing actions
- Remove the sound of explosion.
- Add the sound of video game playing at left by 3 dB.
Declarative instruction: “Make it sound like a lively parade”
ALM-inferred atomic editing steps:
Original
SmartDJ (Ours)
Declarative instruction: “Make this sound like a windy farm.”
ALM-inferred atomic editing steps:
Original
SmartDJ (Ours)
Declarative instruction: “Make this sound like a stormy day.”
ALM-inferred atomic editing steps:
Original
SmartDJ (Ours)
Declarative instruction: “Simulate the sounds of a busy highway”
ALM-inferred atomic editing steps:
Original
SmartDJ (Ours)
Edit instruction: “Change the sound of a man talking and plastic crinkling and crumpling at the right front to the front”
Original
Audit
SmartDJ (Ours)
Target (Ground truth)
Edit instruction: “Change the sound of woman speaking, food frying at the front to the right”
Original
Audit
SmartDJ (Ours)
Target (Ground truth)
Edit instruction: “Reverb the sound of laughter and speech at the right with high reverberations”
Original
Audit
SmartDJ (Ours)
Target (Ground truth)
Edit instruction: “Reverb the sound of baby cries and adult male speaks at the right front with mid reverberations”
Original
Audit
SmartDJ (Ours)
Target (Ground truth)
Edit instruction: “Shift the sound of a baby is crying at the front by -3 seconds”
Original
Audit
SmartDJ (Ours)
Target (Ground truth)
Edit instruction: “Shift the sound of engines hum and squealing tires at the right by 3 seconds”
Original
Audit
SmartDJ (Ours)
Target (Ground truth)
Edit instruction: “Change the timbre of the sound of loud humming and wind blowing at the left to be more muffled”
Original
Audit
SmartDJ (Ours)
Target (Ground truth)
Edit instruction: “Change the timbre of the sound of train horns blowing at the left front to be more bright”
Original
Audit
SmartDJ (Ours)
Target (Ground truth)
Declarative instruction: “Make this sound like in a game room.”
Decomposed atomic editing actions
Original
Edited
Declarative instruction: “Make this sound like in a farm.”
Decomposed atomic editing actions
Original
Edited
Declarative instruction: “Make this sound like a busy office.”
Decomposed atomic editing actions
Original
Before any editing
Step 1 – Remove drilling
“remove the sound of drilling”
Step 2 – Turn up typewriter
“turn up the sound of typewriter type by 2dB”
Step 3 – Add phone ringing
“Add the sound of phone ringing at right by 3 dB”
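The steps above express level changes in decibels ("by 2dB", "by 3 dB"). As a rough illustration only, the sketch below converts such a value to a linear amplitude factor, assuming the standard 20·log10 amplitude convention. The page does not say how SmartDJ interprets these values internally, and the helper names `db_to_gain` and `apply_gain` are hypothetical.

```python
def db_to_gain(db: float) -> float:
    """Convert a level change in dB to a linear amplitude factor.

    Assumption: amplitude convention, gain = 10 ** (dB / 20).
    """
    return 10.0 ** (db / 20.0)


def apply_gain(samples: list[float], db: float) -> list[float]:
    """Scale every sample by the linear factor that corresponds to `db`."""
    gain = db_to_gain(db)
    return [s * gain for s in samples]


# Hypothetical example: turn a short clip up by 2 dB.
clip = [0.0, 0.5, -0.5, 0.25]
louder = apply_gain(clip, 2.0)
```

Under this convention, 2 dB scales amplitudes by about 1.26 and 3 dB by about 1.41.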
Declarative instruction: “Make this sound like a workshop by the dock.”
Decomposed atomic editing actions
Original
Before any editing
Step 1 – Remove metal knock
“remove the sound of metal knock”
Step 2 – Add seagulls
“add seagulls squawking at left by 3 dB”
Step 3 – Add waves
“add the sound of waves lapping at right by 2 dB”
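One way to picture the decomposition above is as an ordered list of small records, one per atomic step. The sketch below is illustrative only: the `AtomicEdit` class and its field names are hypothetical and are not taken from SmartDJ; only the three step texts come from this example.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class AtomicEdit:
    """One atomic editing step (hypothetical schema, for illustration)."""
    action: str                      # e.g. "remove", "add"
    target: str                      # the sound event being edited
    direction: Optional[str] = None  # e.g. "left", "right"
    gain_db: Optional[float] = None  # level in dB, if the step gives one

    def to_instruction(self) -> str:
        """Render the step as a text instruction like the ones shown above."""
        parts = [f"{self.action} the sound of {self.target}"]
        if self.direction is not None:
            parts.append(f"at {self.direction}")
        if self.gain_db is not None:
            parts.append(f"by {self.gain_db:g} dB")
        return " ".join(parts)


# "Make this sound like a workshop by the dock." as an ordered step list.
WORKSHOP_BY_THE_DOCK = [
    AtomicEdit("remove", "metal knock"),
    AtomicEdit("add", "seagulls squawking", direction="left", gain_db=3),
    AtomicEdit("add", "waves lapping", direction="right", gain_db=2),
]

for step_number, step in enumerate(WORKSHOP_BY_THE_DOCK, start=1):
    print(f"Step {step_number}: {step.to_instruction()}")
```

Keeping the steps as an ordered list mirrors the page's presentation, where each step is applied to the result of the previous one.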
Declarative instruction: “Make this sound like a workshop by the dock”
ALM-inferred atomic editing steps:
Original
ZETA
AudioEditor
Audit
SmartDJ (Ours)
Declarative instruction: “Make this sound like a protest in a city”
ALM-inferred atomic editing steps:
Original
ZETA
AudioEditor
Audit
SmartDJ (Ours)
Declarative instruction: “Make this sound like a serene beach”
ALM-inferred atomic editing steps:
Original
ZETA
AudioEditor
Audit
SmartDJ (Ours)
Declarative instruction: “Make this sound like a busy city street”
ALM-inferred atomic editing steps:
Original
ZETA
AudioEditor
Audit
SmartDJ (Ours)
Declarative instruction: “Make this sound like a cozy living room”
ALM-inferred atomic editing steps:
Original
ZETA
AudioEditor
Audit
SmartDJ (Ours)
Declarative instruction: “Make this sound like in an outdoor concert”
ALM-inferred atomic editing steps:
Original
ZETA
AudioEditor
Audit
SmartDJ (Ours)
Declarative instruction: “Make this sound like a busy daycare center”
ALM-inferred atomic editing steps:
Original
ZETA
AudioEditor
Audit
SmartDJ (Ours)
Edit instruction: “Add the sound of music playing and people singing at the right with 0 dB”
Original
ZETA
AudioEditor
Audit
SmartDJ (Ours)
Edit instruction: “Remove the sound of baby crying at the front”
Original
ZETA
AudioEditor
Audit
SmartDJ (Ours)
Target (Ground truth)