<!DOCTYPE html>
<html>
<head>

  <style> .results-carousel .item img { width: 512px; height: 512px; } </style>
  <meta charset="utf-8">
  <meta name="description"
        content="AnyAvatar: High-Fidelity Audio-Driven Human Animation for Multiple Characters">
  <meta name="keywords" content="Video Generation, Camera Motion transfer, Camera-Object Motion, Disentanglement">
  <meta name="viewport" content="width=device-width, initial-scale=1">
  <title style="color: white;">AnyAvatar: High-Fidelity Audio-Driven Human Animation for Multiple Characters</title>

  <script async src="https://www.googletagmanager.com/gtag/js?id=G-PYVRSFMDRL"></script>
  <script>
    window.dataLayer = window.dataLayer || [];

    function gtag() {
      dataLayer.push(arguments);
    }

    gtag('js', new Date());

    gtag('config', 'G-PYVRSFMDRL');
  </script>

  <link href="https://fonts.googleapis.com/css?family=Google+Sans|Noto+Sans|Castoro"
        rel="stylesheet">

  <link rel="stylesheet" href="./static/css/bulma.min.css">
  <link rel="stylesheet" href="./static/css/bulma-carousel.min.css">
  <link rel="stylesheet" href="./static/css/bulma-slider.min.css">
  <link rel="stylesheet" href="./static/css/fontawesome.all.min.css">
  <link rel="stylesheet"
        href="https://cdn.jsdelivr.net/gh/jpswalsh/academicons@1/css/academicons.min.css">
  <link rel="stylesheet" href="./static/css/index.css">
  <link rel="icon" href="./static/images/favicon.svg">

  <script src="https://ajax.googleapis.com/ajax/libs/jquery/3.5.1/jquery.min.js"></script>
  <script defer src="./static/js/fontawesome.all.min.js"></script>
  <script src="./static/js/bulma-carousel.min.js"></script>
  <script src="./static/js/bulma-slider.min.js"></script>
  <script src="./static/js/index.js"></script>
</head>
<body style="background-color: black;">

  <section class="hero">
    <div class="hero-body">
      <div class="container is-max-desktop">
        <div class="columns is-centered">
          <div class="column has-text-centered">
            <h2 class="title is-1 publication-title" style="color: white;font-size: 2.5rem;">AnyAvatar: High-Fidelity Audio-Driven Human Animation for Multiple Characters</h1>
            <div class="is-size-5 publication-authors">
              <span class="author-block" >
            </div>
  
            </div>
          </div>
        </div>
      </div>
    </div>
  </section>

<!-- 单独放视频部分 -->
<section class="teaser1-section">
  <div class="container">
    <div class="columns is-centered">
      <div class="column is-full">
        <style>
          .videos-container {
            display: flex;
            flex-wrap: wrap;
            justify-content: center;
            gap: 10px;
          }
          .videos-container-top .video-item {
            flex: 0 0 100%;
            max-width: 80%;
            box-sizing: border-box;
          }
          .video-item video {
            width: 100%;
            height: auto;
            border-radius: 8px;
            background-color: #000;
            display: block;
          }
        </style>
        <div class="videos-container videos-container-top">
          <div class="video-item">
            <video autoplay controls muted loop playsinline>
              <source src="./static/teaser/AnyAvatar.mp4" type="video/mp4">
            </video>
          </div>
        </div>
      </div>
    </div>
  </div>
</section>



<section class="section">
  <div class="container is-max-desktop">
    <div class="columns is-centered has-text-centered">
      <div class="column is-four-fifths">
        <div class="columns is-centered">
        </div>
        <h2 class="title is-3" style="color: white;">Abstract</h2>
        <div class="content has-text-justified">
          <p style="color: white;">
            Recent years have witnessed significant progress in audio-driven human animation. 
            However, critical challenges remain in (i) generating highly dynamic videos while 
            preserving character consistency, (ii) achieving precise emotion alignment between 
            characters and audio, and (iii) enabling multi-character audio-driven animation. 
            To address these challenges, we propose HunyuanVideo-Avatar, a multimodal diffusion 
            transformer (MM-DiT)-based model capable of simultaneously generating dynamic, 
            emotion-controllable, and multi-character dialogue videos. Concretely, HunyuanVideo-Avatar 
            introduces three key innovations: (i) A character image injection module is designed 
            to replace the conventional addition-based character conditioning scheme, eliminating 
            the inherent condition mismatch between training and inference. This ensures the 
            dynamic motion and strong character consistency; (ii) An Audio Emotion Module (AEM) 
            is introduced to extract and transfer the emotional cues from an emotion reference 
            image to the target generated video, enabling fine-grained and accurate emotion style 
            control; (iii) A Face-Aware Audio Adapter (FAA) is proposed to isolate the audio-driven 
            character with latent-level face mask, enabling independent audio injection via 
            cross-attention for multi-character scenarios. These innovations empower HunyuanVideo-Avatar 
            to surpass state-of-the-art methods on benchmark datasets and a newly proposed wild dataset, 
            generating realistic avatars in dynamic, immersive scenarios.
        </div>
      </div>
    </div>
  </div>
</section>


<section class="section">
  <div class="container is-max-desktop">
    <div class="columns is-centered has-text-centered">
      <div class="column is-four-fifths">
        <h2 class="title is-3" style="color: white;">Method FrameWork</h2>
        <div class="has-text-centered" style="margin-top: 2em;">
          <figure class="image" style="display: block; position: relative; width: 100%; overflow: visible;">
            <img src="./static/fig/method.jpg" alt="Descriptive alt text"
                 style="width: 140%; max-width: 900px; position: relative; left: 50%; transform: translateX(-50%);">
          </figure>
        </div>
      </div>
    </div>
  </div>
</section>


<section class="demo1-section">
  <div class="container is-max-desktop">
    <div class="columns is-centered">

      <div class="column is-full">

        <div class="has-text-centered">
          <h2 class="title is-4" style="color: white;">
            Audio-Driven Human Animation
          </h2>
        </div>

        <div style="height: 20px;"></div>

        <p style="color: white;">
        </p>

        <div style="height: 20px;"></div>

        <style class="video-style2">
          .videos-container {
            display: flex;
            flex-wrap: wrap;
            justify-content: center;
            gap: 10px;
          }

          .videos-container-4x2 .video-item {
            flex: 0 0 100%;
            max-width: 100%;
            box-sizing: border-box;
          }

          .video-item video {
            width: 100%;
            height: auto;
            border-radius: 8px;
            background-color: #000;
          }
        </style>

        <div class="videos-container videos-container-4x2">
          <div class="video-item">
            <video autoplay controls muted loop playsinline>
              <source src="./static/duojingbie/3.mp4" type="video/mp4">
            </video>
          </div>
          <div class="video-item">
            <video autoplay controls muted loop playsinline>
              <source src="./static/duojingbie/4.mp4" type="video/mp4">
            </video>
          </div>
          <div class="video-item">
            <video autoplay controls muted loop playsinline>
              <source src="./static/duojingbie/2.mp4" type="video/mp4">
            </video>
          </div>
          <div class="video-item">
            <video autoplay controls muted loop playsinline>
              <source src="./static/duojingbie/36_audio_2.mp4" type="video/mp4">
            </video>
          </div>
          <div class="video-item">
            <video autoplay controls muted loop playsinline>
              <source src="./static/duojingbie/9_audio_1.mp4" type="video/mp4">
            </video>
          </div>
          <div class="video-item">
            <video autoplay controls muted loop playsinline>
              <source src="./static/duojingbie/25_audio_3.mp4" type="video/mp4">
            </video>
          </div>
          <div class="video-item">
            <video autoplay controls muted loop playsinline>
              <source src="./static/duojingbie/32_audio_5.mp4" type="video/mp4">
            </video>
          </div>
          <div class="video-item">
            <video autoplay controls muted loop playsinline>
              <source src="./static/duojingbie/1.mp4" type="video/mp4">
            </video>
          </div>

        </div>
      </div>
    </div>
  </div>
</section>




<section class="demo4-section">
  <div class="container is-max-desktop">
    <div class="columns is-centered">

      <div class="column is-full">

        <div class="has-text-centered">
          <h2 class="title is-4" style="color: white;">
            Multiple Characters
          </h2>
        </div>

        <!-- 加个间距 -->
        <div style="height: 20px;"></div>

        <p style="color: white;">
        </p>

        <!-- 加个间距 -->
        <div style="height: 20px;"></div>

        <style class="video-style2">
          .videos-container {
            display: flex;
            flex-wrap: wrap;
            justify-content: center;
            gap: 10px;
          }

          .videos-container-3x2 .video-item {
            flex: 0 0 calc(33.33% - 7px);
            box-sizing: border-box;
          }

          .video-item video {
            width: 100%;
            height: auto;
            border-radius: 8px;
            background-color: #000;
          }
        </style>

        <div class="videos-container videos-container-3x2">
          <div class="video-item">
            <video autoplay controls muted loop playsinline>
              <source src="./static/multi/1.mp4" type="video/mp4">
            </video>
          </div>
          <div class="video-item">
            <video autoplay controls muted loop playsinline>
              <source src="./static/multi/6.mp4" type="video/mp4">
            </video>
          </div>
          <div class="video-item">
            <video autoplay controls muted loop playsinline>
              <source src="./static/multi/3.mp4" type="video/mp4">
            </video>
          </div>
          <div class="video-item">
            <video autoplay controls muted loop playsinline>
              <source src="./static/multi/5.mp4" type="video/mp4">
            </video>
          </div>

          <div class="video-item">
            <video autoplay controls muted loop playsinline>
              <source src="./static/multi/4.mp4" type="video/mp4">
            </video>
          </div>


          <div class="video-item">
            <video autoplay controls muted loop playsinline>
              <source src="./static/multi/2.mp4" type="video/mp4">
            </video>
          </div>
        </div>
      </div>
    </div>
  </div>
</section>


<section class="demo3-section">
  <div class="container is-max-desktop">
    <div class="columns is-centered">

      <div class="column is-full">

        <div class="has-text-centered">
          <h2 class="title is-4" style="color: white;">
            Diverse Character Styles
          </h2>
        </div>

        <div style="height: 20px;"></div>

        <p style="color: white;">
        </p>
        <div style="height: 20px;"></div>

        <style class="video-style2">
          .videos-container {
            display: flex;
            flex-wrap: wrap;
            justify-content: center;
            gap: 10px;
          }

          .videos-container-4x2 .video-item {
            flex: 0 0 calc(25% - 7.5px);
            box-sizing: border-box;
          }

          .video-item video {
            width: 100%;
            height: auto;
            border-radius: 8px;
            background-color: #000;
          }
        </style>

        <div class="videos-container videos-container-4x2">
          <div class="video-item">
            <video autoplay controls muted loop playsinline>
              <source src="./static/style/54_audio_3.mp4" type="video/mp4">
            </video>
          </div>
          <div class="video-item">
            <video autoplay controls muted loop playsinline>
              <source src="./static/style/46_audio_5.mp4" type="video/mp4">
            </video>
          </div>
          <div class="video-item">
            <video autoplay controls muted loop playsinline>
              <source src="./static/style/111.mov" type="video/mp4">
            </video>
          </div>
          <div class="video-item">
            <video autoplay controls muted loop playsinline>
              <source src="./static/style/75_audio_1.mp4" type="video/mp4">
            </video>
          </div>
          <div class="video-item">
            <video autoplay controls muted loop playsinline>
              <source src="./static/style/49_audio_9.mp4" type="video/mp4">
            </video>
          </div>
          <div class="video-item">
            <video autoplay controls muted loop playsinline>
              <source src="./static/style/50_audio_2.mp4" type="video/mp4">
            </video>
          </div>
          <div class="video-item">
            <video autoplay controls muted loop playsinline>
              <source src="./static/style/60_audio_3.mp4" type="video/mp4">
            </video>
          </div>
          <div class="video-item">
            <video autoplay controls muted loop playsinline>
              <source src="./static/style/73_audio_6.mp4" type="video/mp4">
            </video>
          </div>
        </div>
      </div>
    </div>
  </div>
</section>


<section class="teaser-section">
  <div class="container is-max-desktop">
    <div class="columns is-centered">
      <div class="column is-full">
        <style class="video-style1">
          .videos-container {
            display: flex;
            flex-wrap: wrap;
            justify-content: center;
            gap: 10px;
          }

          .videos-container-3x2 .video-item {
            flex: 0 0 calc(33.333% - 7px);
            box-sizing: border-box;
          }

          .video-item video {
            width: 100%;
            height: auto;
            border-radius: 8px;
            background-color: #000;
          }
        </style>

        <div class="videos-container videos-container-3x2">

        </div>
      </div>
    </div>
  </div>
</section> 



<style>
.bibtex-title {
  color: #fff !important;
}
.bibtex-block {
  background: #111 !important;
  color: #fff !important;
  border-radius: 8px;
  padding: 1em;
  font-size: 1em;
  overflow-x: auto;
}
.bibtex-block code {
  background: none !important;
  color: inherit !important;
}
</style>



</body>
</html>
