<html>
<head><title>Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph</title></head>
<body>
<div class="container">
	<main role="main">
		<article itemscope="" itemtype="https://schema.org/BlogPosting">
            <h1 class="entry-title" itemprop="headline">Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph</h1>
			<h2 id="zsvc-samples">Zero-shot voice conversion samples</h2>
			<h3 id="Librispeech-samples">Notes:</h3>
			<p>1. To hear the differences between the conversion results clearly, please use headphones when listening to the samples below.</p>
			<p>2. 'Target' means that the voice conversion system extracts style information directly from the utterance in that column.</p>
			<p>3. 'Authentic target' is provided for comparison only. It is a real recording of the target speaker saying the same sentence as the source utterance. 
			The voice conversion system does not use this recording during conversion. Instead, it extracts style from other utterances spoken by the same target speaker.</p>
			<h3 id="Librispeech-samples">Librispeech samples</h3>
			<table><thead><tr>
			<th style="text-align: center">Source</th>
			<th style="text-align: center">Target</th>
			<th style="text-align: center">Converted</th>
			</tr></thead>
			<tbody><tr>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/librispeech/src/80.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/librispeech/tar/80.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/librispeech/cvt/80.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			</tr></tbody>
			<tbody><tr>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/librispeech/src/180.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/librispeech/tar/180.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/librispeech/cvt/180.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			</tr></tbody>
			<tbody><tr>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/librispeech/src/40.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/librispeech/tar/40.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/librispeech/cvt/40.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			</tr></tbody>
			<tbody><tr>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/librispeech/src/200.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/librispeech/tar/200.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/librispeech/cvt/200.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			</tr></tbody>
			<tbody><tr>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/librispeech/src/220.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/librispeech/tar/220.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/librispeech/cvt/220.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			</tr></tbody>
			</table>
			
			<h3 id="System-comparison">System comparison (style extracted from 5 target utterances)</h3>
			<table><thead><tr>
			<th style="text-align: center">Source</th>
			<th style="text-align: center">Authentic Target</th>
			<th style="text-align: center">AutoVC</th>
			<th style="text-align: center">AdaIN-VC</th>
			<th style="text-align: center">FragmentVC</th>
			<th style="text-align: center">S2VC</th>
			<th style="text-align: center">Retriever (ours)</th>
			</tr></thead>

			<tbody><tr>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/system_comparison/src/40.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/system_comparison/tar/40.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/system_comparison/AutoVC/40.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/system_comparison/AdaIN-VC/40.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/system_comparison/FragmentVC/40.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/system_comparison/S2VC/40.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/system_comparison/Retriever/40.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			</tr></tbody>

			<tbody><tr>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/system_comparison/src/340.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/system_comparison/tar/340.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/system_comparison/AutoVC/340.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/system_comparison/AdaIN-VC/340.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/system_comparison/FragmentVC/340.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/system_comparison/S2VC/340.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/system_comparison/Retriever/340.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			</tr></tbody>

			<tbody><tr>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/system_comparison/src/590.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/system_comparison/tar/590.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/system_comparison/AutoVC/590.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/system_comparison/AdaIN-VC/590.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/system_comparison/FragmentVC/590.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/system_comparison/S2VC/590.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/system_comparison/Retriever/590.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			</tr></tbody>
			
			<tbody><tr>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/system_comparison/src/830.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/system_comparison/tar/830.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/system_comparison/AutoVC/830.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/system_comparison/AdaIN-VC/830.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/system_comparison/FragmentVC/830.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/system_comparison/S2VC/830.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/system_comparison/Retriever/830.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			</tr></tbody>
			</table>
			
			<h3 id="Conversion results of using different number of style token">Conversion results with different numbers of style tokens</h3>
			<table><thead><tr>
			<th style="text-align: center">Source</th>
			<th style="text-align: center">Authentic Target</th>
			<th style="text-align: center">1 style token</th>
			<th style="text-align: center">5 style tokens</th>
			<th style="text-align: center">10 style tokens</th>
			<th style="text-align: center">60 style tokens</th>
			</tr></thead>
			<tbody><tr>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/N_style_token/src/120.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/N_style_token/tar/120.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/N_style_token/1/120.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/N_style_token/5/120.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/N_style_token/10/120.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/N_style_token/60/120.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			</tr></tbody>
			<tbody><tr>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/N_style_token/src/360.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/N_style_token/tar/360.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/N_style_token/1/360.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/N_style_token/5/360.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/N_style_token/10/360.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/N_style_token/60/360.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			</tr></tbody>
			<tbody><tr>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/N_style_token/src/440.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/N_style_token/tar/440.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/N_style_token/1/440.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/N_style_token/5/440.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/N_style_token/10/440.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/N_style_token/60/440.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			</tr></tbody>
			<tbody><tr>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/N_style_token/src/460.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/N_style_token/tar/460.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/N_style_token/1/460.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/N_style_token/5/460.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/N_style_token/10/460.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/N_style_token/60/460.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			</tr></tbody>
			</table>
			
			<h3 id="Ablation">Ablation</h3>
			<table><thead><tr>
			<th style="text-align: center">Source</th>
			<th style="text-align: center">Authentic Target</th>
			<th style="text-align: center">Too narrow bottleneck</th>
			<th style="text-align: center">Too wide bottleneck</th>
			<th style="text-align: center">AdaIN decoder</th>
			<th style="text-align: center">Retriever</th>
			</tr></thead>
			<tbody><tr>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/ablation/src/340.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/ablation/tar/340.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/ablation/too_narrow/340.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/ablation/too_wide/340.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/ablation/AdaIN_decoder/340.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/ablation/Retriever/340.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			</tr></tbody>
			<tbody><tr>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/ablation/src/360.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/ablation/tar/360.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/ablation/too_narrow/360.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/ablation/too_wide/360.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/ablation/AdaIN_decoder/360.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/ablation/Retriever/360.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			</tr></tbody>
			</table>
			
			<h3 id="Inference-scalability">Conversion results with different numbers of target utterances during inference</h3>
			<table><thead><tr>
			<th style="text-align: center">Source</th>
			<th style="text-align: center">Authentic Target</th>
			<th style="text-align: center">1 target utterance</th>
			<th style="text-align: center">3 target utterances</th>
			<th style="text-align: center">5 target utterances</th>
			<th style="text-align: center">10 target utterances</th>
			</tr></thead>
			<tbody><tr>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/inference_scalability/src/20.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/inference_scalability/tar/20.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/inference_scalability/1/20.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/inference_scalability/3/20.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/inference_scalability/5/20.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/inference_scalability/10/20.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			</tr></tbody>
			<tbody><tr>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/inference_scalability/src/180.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/inference_scalability/tar/180.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/inference_scalability/1/180.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/inference_scalability/3/180.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/inference_scalability/5/180.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			<td style="text-align: center"><audio controls="controls"><source src="./samples/inference_scalability/10/180.wav" autoplay="">Your browser does not support the audio element.</audio></td>
			</tr></tbody>
			</table>

		</article>
	</main>
</div>
</body></html>