<!DOCTYPE html>
<html>
  <head>
    <title>Style equalization, Speech synthesis examples</title>
    <link href="style.css" rel="stylesheet">
    <script src="script.js" type="text/javascript"></script>
  </head>
  <body>
    <main>
      <div>
        <h1>Speech synthesis results</h1>
        <p>We showcase synthesized speech from the baselines and the proposed method. Our goal is to mimic both the voice characteristics of a speaker and the recording conditions (such as noise and microphone response). To remove the effect of the pretrained vocoder when comparing synthesized speech with real speech, all real speech samples are converted to mel-spectrograms and reconstructed back to waveforms using the same vocoder that the generative models use.</p>
        <p><b>Our setting:</b><br></p>
        <ul>
          <li>- We only need (audio, text) pairs during training.</li>
          <li>- We do <em>not</em> use any style labels like speaker IDs or any attribute labels.</li>
          <li>- Note that our model is a sequence-to-sequence model, not an image model.</li>
        </ul>
        <p>We trained two models using the proposed method: one on the LibriTTS dataset, which contains ~500 hours of audio from ~2,300 speakers, and the other on the VCTK dataset, which contains ~40 hours of audio from 110 speakers.</p>
        <p><b>Proposed and baseline methods:</b><br></p>
        <ul>
          <li>- <b>gst-n:</b> Global style tokens with n tokens (unsupervised).</li>
          <li>- <b>proposed:</b> Our proposed style equalization (unsupervised).</li>
          <li>- <b>gst-nS:</b> A supervised variant of gst-n that uses a pretrained speaker embedding. The speaker embedding requires speaker IDs to train and was trained on 2,000 hours of audio from 7,000 speakers in the VoxCeleb dataset. Note that this information is not accessible to the proposed method.</li>
        </ul>
        <p>All models in the same comparison are trained on the same dataset (gst-nS has additional speaker information).</p>
        <section id="libritts_unseen_nonparallel">
          <h2>LibriTTS, unseen speaker, nonparallel text</h2>
          <p>We randomly select style examples from the dev-clean split of the LibriTTS dataset. The input text is fixed (shown above each table) while we vary the style inputs.</p>
          <div>
            <h5>Input text 1:</h5>
            <p>I did not see any reason to change the captain.</p>
          </div>
          <table>
            <tr>
              <th class="text_header">style text</th>
              <th>style input</th>
              <th>gst-64<br>(unsupervised)</th>
              <th>gst-192<br>(unsupervised)</th>
              <th>proposed<br>(unsupervised)</th>
              <th class="border_left">gst-64s<br>(supervised)</th>
              <th>gst-192s<br>(supervised)</th>
            </tr>
            <tr>
              <th class="text">When the candle ends sent up their conical yellow flames, all the colored figures from Austria stood out clear and full of meaning against the green boughs.</th>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text0/0/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text0/0/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text0/0/gst-192.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text0/0/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/libritts/nonparallel_text_unseen/text0/0/gst-64s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text0/0/gst-192s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">The man shrugged his broad shoulders and turned back into the arabesque chamber.</th>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text0/1/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text0/1/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text0/1/gst-192.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text0/1/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/libritts/nonparallel_text_unseen/text0/1/gst-64s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text0/1/gst-192s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">He had been a clerk in a banking house, and was transported for embezzlement, though, by some, grave doubts as to his guilt were entertained.</th>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text0/2/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text0/2/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text0/2/gst-192.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text0/2/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/libritts/nonparallel_text_unseen/text0/2/gst-64s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text0/2/gst-192s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">As it dropped it set at liberty three legs on hinges, which supported the panel when let down, and which placed themselves straight on the ground like the legs of a table, and supported it above the earth like a platform.</th>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text0/3/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text0/3/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text0/3/gst-192.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text0/3/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/libritts/nonparallel_text_unseen/text0/3/gst-64s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text0/3/gst-192s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">People came running in from all sides; they threw water in the princess's face and did all they could to restore her, but nothing would bring her to.</th>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text0/4/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text0/4/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text0/4/gst-192.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text0/4/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/libritts/nonparallel_text_unseen/text0/4/gst-64s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text0/4/gst-192s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">Some do better than others, but none build like Mother Magpie.</th>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text0/5/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text0/5/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text0/5/gst-192.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text0/5/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/libritts/nonparallel_text_unseen/text0/5/gst-64s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text0/5/gst-192s.mp3" controls preload="metadata"></audio></td>
            </tr>
          </table>
          <div>
            <h5>Input text 2:</h5>
            <p>Next year it plans to open an office in Tokyo.</p>
          </div>
          <table>
            <tr>
              <th class="text_header">style text</th>
              <th>style input</th>
              <th>gst-64<br>(unsupervised)</th>
              <th>gst-192<br>(unsupervised)</th>
              <th>proposed<br>(unsupervised)</th>
              <th class="border_left">gst-64s<br>(supervised)</th>
              <th>gst-192s<br>(supervised)</th>
            </tr>
            <tr>
              <th class="text">I had meant it to be the story of my life, but how little of my life is in it!</th>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text1/0/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text1/0/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text1/0/gst-192.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text1/0/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/libritts/nonparallel_text_unseen/text1/0/gst-64s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text1/0/gst-192s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">Every landscape, low and high, seems doomed to be trampled and harried.</th>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text1/1/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text1/1/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text1/1/gst-192.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text1/1/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/libritts/nonparallel_text_unseen/text1/1/gst-64s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text1/1/gst-192s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">As the inspiring music, the grand tramp drew near, Christie felt the old thrill and longed to fall in and follow the flag anywhere.</th>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text1/2/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text1/2/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text1/2/gst-192.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text1/2/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/libritts/nonparallel_text_unseen/text1/2/gst-64s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text1/2/gst-192s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">We saw the United States flag flying from the ramparts, and thought that Yank would probably be asleep or catching lice, or maybe engaged in a game of seven-up.</th>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text1/3/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text1/3/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text1/3/gst-192.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text1/3/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/libritts/nonparallel_text_unseen/text1/3/gst-64s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text1/3/gst-192s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">We can give this poor beggar some alms and send him away with a blessing.&quot;</th>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text1/4/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text1/4/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text1/4/gst-192.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text1/4/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/libritts/nonparallel_text_unseen/text1/4/gst-64s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text1/4/gst-192s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">The terrible office he had held for twenty-five years had succeeded in making him more or less than man.</th>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text1/5/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text1/5/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text1/5/gst-192.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text1/5/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/libritts/nonparallel_text_unseen/text1/5/gst-64s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_unseen/text1/5/gst-192s.mp3" controls preload="metadata"></audio></td>
            </tr>
          </table>
        </section>
        <section id="libritts_unseen_ablation">
          <h2>LibriTTS, ablation study</h2>
          <p>We conduct an ablation study on the effect of the proposed style equalization, comparing the proposed model trained with and without it. All style inputs are unseen, i.e., not in the training set. Please note the difference between the parallel and nonparallel settings.</p>
          <div>
            <h5>Example: 1</h5>
          </div>
          <table>
            <tr>
              <th class="parallel_header"></th>
              <th class="text_header">target text</th>
              <th>style input</th>
              <th>without style eq.</th>
              <th>proposed</th>
            </tr>
            <tr>
              <th class="text"><b>parallel text</b></th>
              <th class="text">There is a healthy bank-holiday atmosphere about this book which is extremely pleasant.</th>
              <td><audio src="assets/libritts/ablation/0/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/ablation/0/parallel_no_eq.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/ablation/0/parallel_proposed.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text"><b>nonparallel text</b></th>
              <th class="text">What is the difference between cappuccino and latte?</th>
              <td><audio src="assets/libritts/ablation/0/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/ablation/0/nonparallel_no_eq.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/ablation/0/nonparallel_proposed.mp3" controls preload="metadata"></audio></td>
            </tr>
          </table>
          <div>
            <h5>Example: 2</h5>
          </div>
          <table>
            <tr>
              <th class="parallel_header"></th>
              <th class="text_header">target text</th>
              <th>style input</th>
              <th>without style eq.</th>
              <th>proposed</th>
            </tr>
            <tr>
              <th class="text"><b>parallel text</b></th>
              <th class="text">Sheep Rock is about twenty miles from Sisson's, and is one of the principal winter pasture grounds of the wild sheep, from which it takes its name.</th>
              <td><audio src="assets/libritts/ablation/1/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/ablation/1/parallel_no_eq.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/ablation/1/parallel_proposed.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text"><b>nonparallel text</b></th>
              <th class="text">It has been a while since my last cigarette.</th>
              <td><audio src="assets/libritts/ablation/1/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/ablation/1/nonparallel_no_eq.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/ablation/1/nonparallel_proposed.mp3" controls preload="metadata"></audio></td>
            </tr>
          </table>
          <div>
            <h5>Example: 3</h5>
          </div>
          <table>
            <tr>
              <th class="parallel_header"></th>
              <th class="text_header">target text</th>
              <th>style input</th>
              <th>without style eq.</th>
              <th>proposed</th>
            </tr>
            <tr>
              <th class="text"><b>parallel text</b></th>
              <th class="text">Mrs. Bozzle, who well understood that business was business, and that wives were not business, felt no anger at this, and handed her husband his best coat.</th>
              <td><audio src="assets/libritts/ablation/2/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/ablation/2/parallel_no_eq.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/ablation/2/parallel_proposed.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text"><b>nonparallel text</b></th>
              <th class="text">The trees grow taller and taller, and finally into the sky.</th>
              <td><audio src="assets/libritts/ablation/2/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/ablation/2/nonparallel_no_eq.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/ablation/2/nonparallel_proposed.mp3" controls preload="metadata"></audio></td>
            </tr>
          </table>
        </section>
        <section id="libritts_interpolation">
          <h2>LibriTTS, unseen style interpolation</h2>
          <p>We showcase the capability of the proposed method to interpolate between two unseen styles.</p>
          <div>
            <h5>Input text 1:</h5>
            <p>In a short time, boil up the vinegar again, add pepper and ginger in the above proportion, and instantly cover them up.</p>
          </div>
          <table class="hw_table">
            <tr>
              <th class="hw_interp_text_header hw_very_first_row">style text 1</th>
              <th class="text hw_very_first_row">In a short time, boil up the vinegar again, add pepper and ginger in the above proportion, and instantly cover them up.</th>
            </tr>
            <tr>
              <th class="hw_interp_text_header">style 1</th>
              <td><audio src="assets/libritts/interpolation/0/style_from.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="hw_interp_inner_text_header">interp coeff = 0</th>
              <td><audio src="assets/libritts/interpolation/0/0.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="hw_interp_inner_text_header">interp coeff = 0.25</th>
              <td><audio src="assets/libritts/interpolation/0/0.25.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="hw_interp_inner_text_header">interp coeff = 0.5</th>
              <td><audio src="assets/libritts/interpolation/0/0.5.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="hw_interp_inner_text_header">interp coeff = 0.75</th>
              <td><audio src="assets/libritts/interpolation/0/0.75.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="hw_interp_inner_text_header">interp coeff = 1</th>
              <td><audio src="assets/libritts/interpolation/0/1.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="hw_interp_text_header">style 2</th>
              <td><audio src="assets/libritts/interpolation/0/style_to.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="hw_interp_text_header hw_last_row">style text 2</th>
              <th class="text hw_last_row">Perhaps the profession of doing good may be full, but every body should be kind at least to himself.</th>
            </tr>
          </table>
          <div>
            <h5>Input text 2:</h5>
            <p>The storm rushed in; she put up her hand to shield the light from danger.</p>
          </div>
          <table class="hw_table">
            <tr>
              <th class="hw_interp_text_header hw_very_first_row">style text 1</th>
              <th class="text hw_very_first_row">The storm rushed in; she put up her hand to shield the light from danger.</th>
            </tr>
            <tr>
              <th class="hw_interp_text_header">style 1</th>
              <td><audio src="assets/libritts/interpolation/1/style_from.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="hw_interp_inner_text_header">interp coeff = 0</th>
              <td><audio src="assets/libritts/interpolation/1/0.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="hw_interp_inner_text_header">interp coeff = 0.25</th>
              <td><audio src="assets/libritts/interpolation/1/0.25.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="hw_interp_inner_text_header">interp coeff = 0.5</th>
              <td><audio src="assets/libritts/interpolation/1/0.5.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="hw_interp_inner_text_header">interp coeff = 0.75</th>
              <td><audio src="assets/libritts/interpolation/1/0.75.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="hw_interp_inner_text_header">interp coeff = 1</th>
              <td><audio src="assets/libritts/interpolation/1/1.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="hw_interp_text_header">style 2</th>
              <td><audio src="assets/libritts/interpolation/1/style_to.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="hw_interp_text_header hw_last_row">style text 2</th>
              <th class="text hw_last_row">&quot;I've seen them do that in the wild west shows too many times not to know how myself.&quot;</th>
            </tr>
          </table>
        </section>
        <section id="libritts_seen_nonparallel">
          <h2>LibriTTS, seen speaker, nonparallel text</h2>
          <p>We randomly select style examples from the train-all-960 split (train-clean-100 + train-clean-360 + train-other-500) of the LibriTTS dataset. The input text is fixed (shown above each table) while we vary the style inputs.</p>
          <div>
            <h5>Input text 0:</h5>
            <p>Please change the channel of the television, thank you.</p>
          </div>
          <table>
            <tr>
              <th class="text_header">style text</th>
              <th>style input</th>
              <th>gst-64<br>(unsupervised)</th>
              <th>gst-192<br>(unsupervised)</th>
              <th>proposed<br>(unsupervised)</th>
              <th class="border_left">gst-64s<br>(supervised)</th>
              <th>gst-192s<br>(supervised)</th>
            </tr>
            <tr>
              <th class="text">&quot;She'll wake up fast enough when it's time to eat, and so will you,&quot; said Marie, with profound wisdom.</th>
              <td><audio src="assets/libritts/nonparallel_text_seen/text0/0/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text0/0/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text0/0/gst-192.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text0/0/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/libritts/nonparallel_text_seen/text0/0/gst-64s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text0/0/gst-192s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">The greatest general of the South was Lee, and his greatest lieutenant was Jackson.</th>
              <td><audio src="assets/libritts/nonparallel_text_seen/text0/1/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text0/1/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text0/1/gst-192.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text0/1/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/libritts/nonparallel_text_seen/text0/1/gst-64s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text0/1/gst-192s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">Then indeed his cheek turned livid, and the eye which had hitherto preserved its steadiness sought the floor.</th>
              <td><audio src="assets/libritts/nonparallel_text_seen/text0/2/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text0/2/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text0/2/gst-192.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text0/2/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/libritts/nonparallel_text_seen/text0/2/gst-64s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text0/2/gst-192s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">He's like a cat,--as sleek, and cunning, and fierce.</th>
              <td><audio src="assets/libritts/nonparallel_text_seen/text0/3/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text0/3/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text0/3/gst-192.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text0/3/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/libritts/nonparallel_text_seen/text0/3/gst-64s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text0/3/gst-192s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">&quot;Yes, the noise outside the city wall is new, but the principle is old.&quot;</th>
              <td><audio src="assets/libritts/nonparallel_text_seen/text0/4/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text0/4/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text0/4/gst-192.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text0/4/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/libritts/nonparallel_text_seen/text0/4/gst-64s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text0/4/gst-192s.mp3" controls preload="metadata"></audio></td>
            </tr>
          </table>
          <div>
            <h5>Input text 1:</h5>
            <p>What is it that you are looking for?</p>
          </div>
          <table>
            <tr>
              <th class="text_header">style text</th>
              <th>style input</th>
              <th>gst-64<br>(unsupervised)</th>
              <th>gst-192<br>(unsupervised)</th>
              <th>proposed<br>(unsupervised)</th>
              <th class="border_left">gst-64s<br>(supervised)</th>
              <th>gst-192s<br>(supervised)</th>
            </tr>
            <tr>
              <th class="text">Was ever such a view entertained of Caesar, Socrates or of any other historical character?</th>
              <td><audio src="assets/libritts/nonparallel_text_seen/text1/0/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text1/0/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text1/0/gst-192.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text1/0/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/libritts/nonparallel_text_seen/text1/0/gst-64s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text1/0/gst-192s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">The room was now in dusk, save for the bulbs which made the portrait shine forth like a wayside shrine.</th>
              <td><audio src="assets/libritts/nonparallel_text_seen/text1/1/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text1/1/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text1/1/gst-192.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text1/1/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/libritts/nonparallel_text_seen/text1/1/gst-64s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text1/1/gst-192s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">Below this diadem hung, pendent, clusters of other disks, swarmed like the globular hiving of the constellation Hercules' captured stars.</th>
              <td><audio src="assets/libritts/nonparallel_text_seen/text1/2/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text1/2/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text1/2/gst-192.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text1/2/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/libritts/nonparallel_text_seen/text1/2/gst-64s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text1/2/gst-192s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">&quot;What is his name, Miss Greeb?&quot; repeated Lucian, quite impervious to the hint.</th>
              <td><audio src="assets/libritts/nonparallel_text_seen/text1/3/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text1/3/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text1/3/gst-192.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text1/3/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/libritts/nonparallel_text_seen/text1/3/gst-64s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text1/3/gst-192s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">&quot;There, Rob, you must forgive him; we're none of us perfect.</th>
              <td><audio src="assets/libritts/nonparallel_text_seen/text1/4/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text1/4/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text1/4/gst-192.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text1/4/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/libritts/nonparallel_text_seen/text1/4/gst-64s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/nonparallel_text_seen/text1/4/gst-192s.mp3" controls preload="metadata"></audio></td>
            </tr>
          </table>
        </section>
        <section id="libritts_prior">
          <h2>LibriTTS, random styles from the prior distribution</h2>
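          <p>We showcase the proposed method's ability to sample random styles from the learned prior distribution. Each table below uses a single input text; every sample is synthesized with a different randomly sampled style.</p>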
          <div>
            <h5>Input text 1:</h5>
            <p>This is not the end, it is just the beginning.</p>
          </div>
          <table>
            <tr>
              <td><audio src="assets/libritts/prior/text0/0.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/prior/text0/1.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/prior/text0/2.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/prior/text0/3.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <td><audio src="assets/libritts/prior/text0/4.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/prior/text0/5.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/prior/text0/6.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/prior/text0/7.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <td><audio src="assets/libritts/prior/text0/8.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/prior/text0/9.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/prior/text0/10.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/prior/text0/11.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <td><audio src="assets/libritts/prior/text0/12.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/prior/text0/13.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/prior/text0/14.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/prior/text0/15.mp3" controls preload="metadata"></audio></td>
            </tr>
          </table>
          <div>
            <h5>Input text 2:</h5>
            <p>You can't always get what you want.</p>
          </div>
          <table>
            <tr>
              <td><audio src="assets/libritts/prior/text1/0.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/prior/text1/1.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/prior/text1/2.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/prior/text1/3.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <td><audio src="assets/libritts/prior/text1/4.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/prior/text1/5.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/prior/text1/6.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/prior/text1/7.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <td><audio src="assets/libritts/prior/text1/8.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/prior/text1/9.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/prior/text1/10.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/prior/text1/11.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <td><audio src="assets/libritts/prior/text1/12.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/prior/text1/13.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/prior/text1/14.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/libritts/prior/text1/15.mp3" controls preload="metadata"></audio></td>
            </tr>
          </table>
        </section>
        <section id="vctk_nonparallel">
          <h2>VCTK, seen speaker, nonparallel text</h2>
          <p>We showcase nonparallel-text speech synthesis with seen speakers. The style input is the first audio sample in each row.</p>
          <table>
            <tr>
              <th class="text_header">input text</th>
              <th>style input</th>
              <th>gst-16<br>(unsupervised)</th>
              <th>gst-64<br>(unsupervised)</th>
              <th>proposed<br>(unsupervised)</th>
              <th class="border_left">gst-16s<br>(supervised)</th>
              <th>gst-64s<br>(supervised)</th>
            </tr>
            <tr>
              <th class="text">My car is just right by the corner.</th>
              <td><audio src="assets/vctk/nonparallel_text/0/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/0/gst-16.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/0/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/0/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/vctk/nonparallel_text/0/gst-16s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/0/gst-64s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">There is a house on top of the mountain.</th>
              <td><audio src="assets/vctk/nonparallel_text/1/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/1/gst-16.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/1/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/1/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/vctk/nonparallel_text/1/gst-16s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/1/gst-64s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">Can you show me where the coffee shop is?</th>
              <td><audio src="assets/vctk/nonparallel_text/2/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/2/gst-16.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/2/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/2/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/vctk/nonparallel_text/2/gst-16s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/2/gst-64s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">How are you doing today?</th>
              <td><audio src="assets/vctk/nonparallel_text/3/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/3/gst-16.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/3/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/3/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/vctk/nonparallel_text/3/gst-16s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/3/gst-64s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">The light shining through the windows makes the room beautiful.</th>
              <td><audio src="assets/vctk/nonparallel_text/4/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/4/gst-16.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/4/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/4/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/vctk/nonparallel_text/4/gst-16s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/4/gst-64s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">Can you bring me some tea, please?</th>
              <td><audio src="assets/vctk/nonparallel_text/5/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/5/gst-16.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/5/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/5/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/vctk/nonparallel_text/5/gst-16s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/5/gst-64s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">I look at the sky and see nothing.</th>
              <td><audio src="assets/vctk/nonparallel_text/6/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/6/gst-16.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/6/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/6/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/vctk/nonparallel_text/6/gst-16s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/6/gst-64s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">I think I am going to find my keys soon.</th>
              <td><audio src="assets/vctk/nonparallel_text/7/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/7/gst-16.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/7/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/7/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/vctk/nonparallel_text/7/gst-16s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/7/gst-64s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">Please teach me calculus.</th>
              <td><audio src="assets/vctk/nonparallel_text/8/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/8/gst-16.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/8/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/8/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/vctk/nonparallel_text/8/gst-16s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/8/gst-64s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">Am I dreaming, or is it really you?</th>
              <td><audio src="assets/vctk/nonparallel_text/9/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/9/gst-16.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/9/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/9/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/vctk/nonparallel_text/9/gst-16s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/nonparallel_text/9/gst-64s.mp3" controls preload="metadata"></audio></td>
            </tr>
          </table>
        </section>
        <section id="vctk_parallel">
          <h2>VCTK, seen speaker, parallel text</h2>
          <p>We showcase parallel-text speech synthesis with seen speakers. The style input is the first audio sample in each row.</p>
          <table>
            <tr>
              <th class="text_header">input text</th>
              <th>style input</th>
              <th>gst-16<br>(unsupervised)</th>
              <th>gst-64<br>(unsupervised)</th>
              <th>proposed<br>(unsupervised)</th>
              <th class="border_left">gst-16s<br>(supervised)</th>
              <th>gst-64s<br>(supervised)</th>
            </tr>
            <tr>
              <th class="text">What kind of person is he?</th>
              <td><audio src="assets/vctk/parallel_text/0/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/0/gst-16.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/0/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/0/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/vctk/parallel_text/0/gst-16s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/0/gst-64s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">It is still too early for any likely contenders to have emerged.</th>
              <td><audio src="assets/vctk/parallel_text/1/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/1/gst-16.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/1/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/1/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/vctk/parallel_text/1/gst-16s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/1/gst-64s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">If you're going to do it, do it right.</th>
              <td><audio src="assets/vctk/parallel_text/2/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/2/gst-16.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/2/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/2/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/vctk/parallel_text/2/gst-16s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/2/gst-64s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">On the front line beyond the bridge the scene was utter chaos.</th>
              <td><audio src="assets/vctk/parallel_text/3/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/3/gst-16.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/3/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/3/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/vctk/parallel_text/3/gst-16s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/3/gst-64s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">Still, in the end, it was a fair result.</th>
              <td><audio src="assets/vctk/parallel_text/4/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/4/gst-16.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/4/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/4/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/vctk/parallel_text/4/gst-16s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/4/gst-64s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">I think it is a sensible change.</th>
              <td><audio src="assets/vctk/parallel_text/5/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/5/gst-16.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/5/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/5/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/vctk/parallel_text/5/gst-16s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/5/gst-64s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">When a man looks for something beyond his reach, his friends say he is looking for the pot of gold at the end of the rainbow.</th>
              <td><audio src="assets/vctk/parallel_text/6/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/6/gst-16.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/6/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/6/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/vctk/parallel_text/6/gst-16s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/6/gst-64s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">Mexico City was a wonderful experience.</th>
              <td><audio src="assets/vctk/parallel_text/7/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/7/gst-16.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/7/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/7/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/vctk/parallel_text/7/gst-16s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/7/gst-64s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">If the red of the second bow falls upon the green of the first, the result is to give a bow with an abnormally wide yellow band, since red and green light when mixed form yellow.</th>
              <td><audio src="assets/vctk/parallel_text/8/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/8/gst-16.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/8/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/8/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/vctk/parallel_text/8/gst-16s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/8/gst-64s.mp3" controls preload="metadata"></audio></td>
            </tr>
            <tr>
              <th class="text">The allegations were still under investigation, he added.</th>
              <td><audio src="assets/vctk/parallel_text/9/style_from.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/9/gst-16.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/9/gst-64.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/9/proposed.mp3" controls preload="metadata"></audio></td>
              <td class="border_left"><audio src="assets/vctk/parallel_text/9/gst-16s.mp3" controls preload="metadata"></audio></td>
              <td><audio src="assets/vctk/parallel_text/9/gst-64s.mp3" controls preload="metadata"></audio></td>
            </tr>
          </table>
        </section>
      </div>
      <nav class="section-nav">
        <ol>
          <li>
            <a href="index.html#style_eq">Style Equalization</a>
          </li>
          <li>
            <a href="index.html#speech_synthesis">Speech synthesis demo video</a>
            <ul>
              <li>
                <a href="#libritts_unseen_nonparallel">LibriTTS unseen speaker</a>
              </li>
              <li>
                <a href="#libritts_unseen_ablation">LibriTTS ablation study</a>
              </li>
              <li>
                <a href="#libritts_interpolation">LibriTTS unseen style interpolation</a>
              </li>
              <li>
                <a href="#libritts_seen_nonparallel">LibriTTS seen speaker</a>
              </li>
              <li>
                <a href="#libritts_prior">LibriTTS random styles from prior distribution</a>
              </li>
              <li>
                <a href="#vctk_nonparallel">VCTK nonparallel text</a>
              </li>
              <li>
                <a href="#vctk_parallel">VCTK parallel text</a>
              </li>
            </ul>
          </li>
          <li>
            <a href="index.html#handwriting_synthesis">Handwriting synthesis demo video</a>
            <ul>
              <li>
                <a href="_handwriting.html#handwriting_nonparallel">Nonparallel text</a>
              </li>
              <li>
                <a href="_handwriting.html#handwriting_parallel">Parallel text</a>
              </li>
              <li>
                <a href="_handwriting.html#handwriting_prior">Random samples from prior</a>
              </li>
              <li>
                <a href="_handwriting.html#handwriting_interpolation">Style interpolation</a>
              </li>
            </ul>
          </li>
          <li>
            <a href="index.html#intro">Quick introduction video</a>
          </li>
        </ol>
      </nav>
    </main>
  </body>
</html>