<!DOCTYPE html>
<html lang="en"><head>
  <meta charset="utf-8">
  <meta http-equiv="X-UA-Compatible" content="IE=edge">
  <meta name="viewport" content="width=device-width, initial-scale=1"><!-- Begin Jekyll SEO tag v2.8.0 -->
<meta name="generator" content="Jekyll v4.3.2" />
<meta property="og:locale" content="en_US" />
<link rel="canonical" href="http://localhost:4000/" />
<meta property="og:url" content="http://localhost:4000/" />
<meta property="og:type" content="website" />
<meta name="twitter:card" content="summary" />
<script type="application/ld+json">
{"@context":"https://schema.org","@type":"WebSite","url":"http://localhost:4000/"}</script>
<!-- End Jekyll SEO tag -->
<link rel="stylesheet" href="assets/main.css"><link type="application/atom+xml" rel="alternate" href="http://localhost:4000/feed.xml" /></head>
<body><header class="site-header" role="banner">

  <div class="wrapper"><a class="site-title" rel="author" href="index.html"></a><nav class="site-nav">
        <input type="checkbox" id="nav-trigger" class="nav-trigger" />
        <label for="nav-trigger">
          <span class="menu-icon">
            <svg viewBox="0 0 18 15" width="18px" height="15px">
              <path d="M18,1.484c0,0.82-0.665,1.484-1.484,1.484H1.484C0.665,2.969,0,2.304,0,1.484l0,0C0,0.665,0.665,0,1.484,0 h15.032C17.335,0,18,0.665,18,1.484L18,1.484z M18,7.516C18,8.335,17.335,9,16.516,9H1.484C0.665,9,0,8.335,0,7.516l0,0 c0-0.82,0.665-1.484,1.484-1.484h15.032C17.335,6.031,18,6.696,18,7.516L18,7.516z M18,13.516C18,14.335,17.335,15,16.516,15H1.484 C0.665,15,0,14.335,0,13.516l0,0c0-0.82,0.665-1.483,1.484-1.483h15.032C17.335,12.031,18,12.695,18,13.516L18,13.516z"/>
            </svg>
          </span>
        </label>

        <div class="trigger"></div>
      </nav></div>
</header>
<main class="page-content" aria-label="Content">
      <div class="wrapper">
        <h1 id="multi-source-diffusion-models">Multi Source Diffusion Models</h1>

<ul>
  <li><strong>Paper:</strong> <strong><em>Multi-Source Diffusion Models for Simultaneous Audio Generation and Separation</em></strong>
 <br /><br /></li>
  <li><strong>Abstract:</strong> In this work, we define a diffusion-based generative model capable of both music synthesis and source separation by learning the score of the joint probability density of sources sharing a context. Alongside the classic total inference tasks (i.e. generating a mixture, separating the sources), we also introduce and experiment on the partial inference task of source imputation, where we generate a subset of the sources given the others (e.g., play a piano track that goes well with the drums). Additionally, we introduce a novel inference method for the separation task. We train our model on Slakh2100, a standard dataset for musical source separation, provide qualitative results in the generation settings, and showcase competitive quantitative results in the separation setting. Our method is the first example of a single model that can handle both generation and separation tasks, thus representing a step toward general audio models.
Below are some examples of the tasks we can perform with our approach.</li>
</ul>

<h2 id="generation">Generation</h2>
<p>Here we ask the neural model to randomly generate some new music:</p>

<table>
  <thead>
    <tr>
      <th style="text-align: center">Sample #1</th>
      <th style="text-align: center">Sample #2</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td style="text-align: center"><audio controls=""><source src="media/generation/1/mix.mp3" /></audio></td>
      <td style="text-align: center"><audio controls=""><source src="media/generation/2/mix.mp3" /></audio></td>
    </tr>
  </tbody>
</table>

<table>
  <thead>
    <tr>
      <th style="text-align: center">Sample #3</th>
      <th style="text-align: center">Sample #4</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td style="text-align: center"><audio controls=""><source src="media/generation/3/mix.mp3" /></audio></td>
      <td style="text-align: center"><audio controls=""><source src="media/generation/4/mix.mp3" /></audio></td>
    </tr>
  </tbody>
</table>

<table>
  <thead>
    <tr>
      <th style="text-align: center">Sample #5</th>
      <th style="text-align: center">Sample #6</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td style="text-align: center"><audio controls=""><source src="media/generation/5/mix.mp3" /></audio></td>
      <td style="text-align: center"><audio controls=""><source src="media/generation/6/mix.mp3" /></audio></td>
    </tr>
  </tbody>
</table>

<table>
  <thead>
    <tr>
      <th style="text-align: center">Sample #7</th>
      <th style="text-align: center">Sample #8</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td style="text-align: center"><audio controls=""><source src="media/generation/7/mix.mp3" /></audio></td>
      <td style="text-align: center"><audio controls=""><source src="media/generation/8/mix.mp3" /></audio></td>
    </tr>
  </tbody>
</table>

<table>
  <thead>
    <tr>
      <th style="text-align: center">Sample #9</th>
      <th style="text-align: center">Sample #10</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td style="text-align: center"><audio controls=""><source src="media/generation/9/mix.mp3" /></audio></td>
      <td style="text-align: center"><audio controls=""><source src="media/generation/10/mix.mp3" /></audio></td>
    </tr>
  </tbody>
</table>

<p><br /><br /></p>

<h2 id="source-imputation-aka-partial-generation">Source Imputation (a.k.a. partial generation)</h2>
<p>Given a source stem as input (e.g. drums), the neural model generates accompanying instruments:</p>

<table>
  <thead>
    <tr>
      <th style="text-align: center">Original Source #1</th>
      <th style="text-align: center">Generated accompaniment #1</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/inpainting/1/fixed.mp3" type="audio/mpeg" /> </audio></td>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/inpainting/1/mix.mp3" type="audio/mpeg" /> </audio></td>
    </tr>
  </tbody>
</table>

<table>
  <thead>
    <tr>
      <th style="text-align: center">Original Source #2</th>
      <th style="text-align: center">Generated accompaniment #2</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/inpainting/2/fixed.mp3" type="audio/mpeg" /> </audio></td>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/inpainting/2/mix.mp3" type="audio/mpeg" /> </audio></td>
    </tr>
  </tbody>
</table>

<table>
  <thead>
    <tr>
      <th style="text-align: center">Original Source #3</th>
      <th style="text-align: center">Generated accompaniment #3</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/inpainting/3/fixed.mp3" type="audio/mpeg" /> </audio></td>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/inpainting/3/mix.mp3" type="audio/mpeg" /> </audio></td>
    </tr>
  </tbody>
</table>

<table>
  <thead>
    <tr>
      <th style="text-align: center">Original Source #4</th>
      <th style="text-align: center">Generated accompaniment #4</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/inpainting/4/fixed.mp3" type="audio/mpeg" /> </audio></td>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/inpainting/4/mix.mp3" type="audio/mpeg" /> </audio></td>
    </tr>
  </tbody>
</table>

<table>
  <thead>
    <tr>
      <th style="text-align: center">Original Source #5</th>
      <th style="text-align: center">Generated accompaniment #5</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/inpainting/5/fixed.mp3" type="audio/mpeg" /> </audio></td>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/inpainting/5/mix.mp3" type="audio/mpeg" /> </audio></td>
    </tr>
  </tbody>
</table>

<table>
  <thead>
    <tr>
      <th style="text-align: center">Original Source #6</th>
      <th style="text-align: center">Generated accompaniment #6</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/inpainting/6/fixed.mp3" type="audio/mpeg" /> </audio></td>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/inpainting/6/mix.mp3" type="audio/mpeg" /> </audio></td>
    </tr>
  </tbody>
</table>

<table>
  <thead>
    <tr>
      <th style="text-align: center">Original Source #7</th>
      <th style="text-align: center">Generated accompaniment #7</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/inpainting/7/fixed.mp3" type="audio/mpeg" /> </audio></td>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/inpainting/7/mix.mp3" type="audio/mpeg" /> </audio></td>
    </tr>
  </tbody>
</table>

<table>
  <thead>
    <tr>
      <th style="text-align: center">Original Source #8</th>
      <th style="text-align: center">Generated accompaniment #8</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/inpainting/8/fixed.mp3" type="audio/mpeg" /> </audio></td>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/inpainting/8/mix.mp3" type="audio/mpeg" /> </audio></td>
    </tr>
  </tbody>
</table>

<p><br /><br /></p>

<h2 id="source-separation">Source Separation</h2>
<p>Finally, it is possible to use our model to extract single sources from an input mixture:</p>

<p><strong>Input Mixture 1</strong></p>
<audio controls="" preload="none"><source src="media/separation/1/mix.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio>

<table>
  <thead>
    <tr>
      <th style="text-align: center">Separated Bass</th>
      <th style="text-align: center">Separated Drums</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/separation/1/sep1.wav.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio></td>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/separation/1/sep2.wav.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio></td>
    </tr>
  </tbody>
</table>

<table>
  <thead>
    <tr>
      <th style="text-align: center">Separated Guitar</th>
      <th style="text-align: center">Separated Piano</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/separation/1/sep3.wav.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio></td>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/separation/1/sep4.wav.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio></td>
    </tr>
  </tbody>
</table>

<p><br /><br /></p>

<p><strong>Input Mixture 2</strong></p>
<audio controls="" preload="none"><source src="media/separation/2/mix.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio>

<table>
  <thead>
    <tr>
      <th style="text-align: center">Separated Bass</th>
      <th style="text-align: center">Separated Drums</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/separation/2/sep1.wav.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio></td>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/separation/2/sep2.wav.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio></td>
    </tr>
  </tbody>
</table>

<table>
  <thead>
    <tr>
      <th style="text-align: center">Separated Guitar</th>
      <th style="text-align: center">Separated Piano</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/separation/2/sep3.wav.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio></td>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/separation/2/sep4.wav.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio></td>
    </tr>
  </tbody>
</table>

<p><br /><br /></p>

<p><strong>Input Mixture 3</strong></p>
<audio controls="" preload="none"><source src="media/separation/3/mix.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio>

<table>
  <thead>
    <tr>
      <th style="text-align: center">Separated Bass</th>
      <th style="text-align: center">Separated Drums</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/separation/3/sep1.wav.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio></td>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/separation/3/sep2.wav.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio></td>
    </tr>
  </tbody>
</table>

<table>
  <thead>
    <tr>
      <th style="text-align: center">Separated Guitar</th>
      <th style="text-align: center">Separated Piano</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/separation/3/sep3.wav.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio></td>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/separation/3/sep4.wav.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio></td>
    </tr>
  </tbody>
</table>

<p><br /><br /></p>

<p><strong>Input Mixture 4</strong></p>
<audio controls="" preload="none"><source src="media/separation/4/mix.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio>

<table>
  <thead>
    <tr>
      <th style="text-align: center">Separated Bass</th>
      <th style="text-align: center">Separated Drums</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/separation/4/sep1.wav.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio></td>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/separation/4/sep2.wav.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio></td>
    </tr>
  </tbody>
</table>

<table>
  <thead>
    <tr>
      <th style="text-align: center">Separated Guitar</th>
      <th style="text-align: center">Separated Piano</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/separation/4/sep3.wav.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio></td>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/separation/4/sep4.wav.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio></td>
    </tr>
  </tbody>
</table>

<p><br /><br /></p>

<p><strong>Input Mixture 5</strong></p>
<audio controls="" preload="none"><source src="media/separation/5/mix.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio>

<table>
  <thead>
    <tr>
      <th style="text-align: center">Separated Bass</th>
      <th style="text-align: center">Separated Drums</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/separation/5/sep1.wav.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio></td>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/separation/5/sep2.wav.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio></td>
    </tr>
  </tbody>
</table>

<table>
  <thead>
    <tr>
      <th style="text-align: center">Separated Guitar</th>
      <th style="text-align: center">Separated Piano</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/separation/5/sep3.wav.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio></td>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/separation/5/sep4.wav.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio></td>
    </tr>
  </tbody>
</table>

<p><br /><br /></p>

<p><strong>Input Mixture 6</strong></p>
<audio controls="" preload="none"><source src="media/separation/6/mix.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio>

<table>
  <thead>
    <tr>
      <th style="text-align: center">Separated Bass</th>
      <th style="text-align: center">Separated Drums</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/separation/6/sep1.wav.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio></td>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/separation/6/sep2.wav.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio></td>
    </tr>
  </tbody>
</table>

<table>
  <thead>
    <tr>
      <th style="text-align: center">Separated Guitar</th>
      <th style="text-align: center">Separated Piano</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/separation/6/sep3.wav.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio></td>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/separation/6/sep4.wav.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio></td>
    </tr>
  </tbody>
</table>

<p><br /><br /></p>

<p><strong>Input Mixture 7</strong></p>
<audio controls="" preload="none"><source src="media/separation/7/mix.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio>

<table>
  <thead>
    <tr>
      <th style="text-align: center">Separated Bass</th>
      <th style="text-align: center">Separated Drums</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/separation/7/sep1.wav.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio></td>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/separation/7/sep2.wav.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio></td>
    </tr>
  </tbody>
</table>

<table>
  <thead>
    <tr>
      <th style="text-align: center">Separated Guitar</th>
      <th style="text-align: center">Separated Piano</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/separation/7/sep3.wav.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio></td>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/separation/7/sep4.wav.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio></td>
    </tr>
  </tbody>
</table>

<p><br /><br /></p>

<p><strong>Input Mixture 8</strong></p>
<audio controls="" preload="none"><source src="media/separation/8/mix.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio>

<table>
  <thead>
    <tr>
      <th style="text-align: center">Separated Bass</th>
      <th style="text-align: center">Separated Drums</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/separation/8/sep1.wav.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio></td>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/separation/8/sep2.wav.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio></td>
    </tr>
  </tbody>
</table>

<table>
  <thead>
    <tr>
      <th style="text-align: center">Separated Guitar</th>
      <th style="text-align: center">Separated Piano</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/separation/8/sep3.wav.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio></td>
      <td style="text-align: center"><audio controls="" preload="none"><source src="media/separation/8/sep4.wav.mp3" type="audio/mpeg" /> Your browser does not support the audio element.</audio></td>
    </tr>
  </tbody>
</table>

      </div>
    </main><footer class="site-footer h-card">
  <data class="u-url" href="/"></data>

  <div class="wrapper">

    <h2 class="footer-heading"></h2>

    <div class="footer-col-wrapper">
      <div class="footer-col footer-col-1">
        <ul class="contact-list">
          <li class="p-name"></li></ul>
      </div>

      <div class="footer-col footer-col-2"><ul class="social-media-list"></ul>
</div>

      <div class="footer-col footer-col-3">
        <p></p>
      </div>
    </div>

  </div>

</footer>
</body>

</html>
