<!doctype html>
<html lang="en">
<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<style>
	@import url('https://fonts.cdnfonts.com/css/chalkduster');
</style>
	<!-- Required meta tags -->
	<meta charset="utf-8">
	<meta name="viewport" content="width=device-width, initial-scale=1, shrink-to-fit=no">

	<!-- Bootstrap CSS -->
	<link href="./resources/bootstrap.min.css" rel="stylesheet">
	<link href="./resources/stylesheet.css" rel="stylesheet">

	<title>Supplementary Website</title>
</head>


<body>

	<section class="jumbotron text-center pb-2">
		<div class="container">
			<h1 class="jumbotron-heading">Weakly Supervised Motion Learning for <br> Co-speech Gesture Video Generation</h1>

			<h4 class="font-italic pt-2" style="font-weight: normal">ICLR 2026 [Submission ID: 7934]</h4>

		</div>
	</section>

	<div class="container">


		<div class="row pt-3 text-center">
			  

		</div>		
		<div class="row justify-content-sm-center">
			  

			<a class="sm-1 mx-1 btn btn-primary" href="./comparisons.html" role="button">Videos for Comparisons</a>
			<a class="sm-1 mx-1 btn btn-primary" href="./ablations.html" role="button">Videos for Ablation Studies</a>
			<a class="sm-1 mx-1 btn btn-primary" href="./indentities.html" role="button">Videos for Other Identities</a>
			<a class="sm-1 mx-1 btn btn-primary" href="./long.html" role="button">Long Videos</a>

		</div>		



			<hr class="mt-5">


			
			<h2 class="pt-4 text-center">Videos for Comparisons</h2>

			<p class="lead">On this page, we present video comparisons of our method against three baselines: S2G, MYA, and EchoMimicV2.
				<br><br>
				Our approach generates high-fidelity videos with clear hand details, realistic finger articulation, and stable backgrounds. In contrast, S2G, MYA, and EchoMimicV2 struggle with visual consistency, exhibiting background flickering, hand blurring, and finger distortions. Furthermore, MYA tends to overfit to appearance features seen during training: it reproduces memorized attributes instead of adhering to the provided reference image, so its output is visibly inconsistent with the reference.

			</p>

			<table width="1200" style="margin-left: -55px;" align="center">
				<tbody>

						<tr>
	
	
							<th style="text-align: center; padding: 10px;">

								<div style="width: 1000px; margin: auto; border-bottom: 1px solid #000;">
									<div class="row justify-content-sm-center" style="display: flex; width: 1000px; height: 40px; margin-left: 0px">
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">First Frame</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">S2G</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">MYA</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">EchoMimicV2</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">Ours</p>
										</div>
									</div>
									<video autoplay="autoplay" controls="controls" loop="loop" muted="muted" src="sota/10_stack.mp4" style="width:1000px; border: 5px solid #000;" type="video/mp4"></video>
								</div>
							</th>
						</tr>
						
						
						<tr>
						<th style="text-align: center; padding: 10px;">

							<div style="width: 1000px; margin: auto; border-bottom: 1px solid #000;">
								<div class="row justify-content-sm-center" style="display: flex; width: 1000px; height: 40px; margin-left: 0px">
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">First Frame</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">S2G</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">MYA</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">EchoMimicV2</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">Ours</p>
										</div>
								</div>
								<video autoplay="autoplay" controls="controls" loop="loop" muted="muted" src="sota/11_stack.mp4" style="width:1000px; border: 5px solid #000;" type="video/mp4"></video>
							</div>
						</th>
						</tr>
						
						
						<tr>
						<th style="text-align: center; padding: 10px;">

							<div style="width: 1000px; margin: auto; border-bottom: 1px solid #000;">
								<div class="row justify-content-sm-center" style="display: flex; width: 1000px; height: 40px; margin-left: 0px">
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">First Frame</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">S2G</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">MYA</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">EchoMimicV2</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">Ours</p>
										</div>
								</div>
								<video autoplay="autoplay" controls="controls" loop="loop" muted="muted" src="sota/12_stack.mp4" style="width:1000px; border: 5px solid #000;" type="video/mp4"></video>
							</div>
						</th>
						</tr>
						
						
						
						<tr>
						<th style="text-align: center; padding: 10px;">

							<div style="width: 1000px; margin: auto; border-bottom: 1px solid #000;">
								<div class="row justify-content-sm-center" style="display: flex; width: 1000px; height: 40px; margin-left: 0px">
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">First Frame</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">S2G</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">MYA</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">EchoMimicV2</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">Ours</p>
										</div>
								</div>
								<video autoplay="autoplay" controls="controls" loop="loop" muted="muted" src="sota/13_stack.mp4" style="width:1000px; border: 5px solid #000;" type="video/mp4"></video>
							</div>
						</th>
						</tr>
						
						
						<tr>
						<th style="text-align: center; padding: 10px;">

							<div style="width: 1000px; margin: auto; border-bottom: 1px solid #000;">
								<div class="row justify-content-sm-center" style="display: flex; width: 1000px; height: 40px; margin-left: 0px">
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">First Frame</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">S2G</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">MYA</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">EchoMimicV2</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">Ours</p>
										</div>
								</div>
								<video autoplay="autoplay" controls="controls" loop="loop" muted="muted" src="sota/15_stack.mp4" style="width:1000px; border: 5px solid #000;" type="video/mp4"></video>
							</div>
						</th>
						</tr>
						
						
						
						<tr>
						<th style="text-align: center; padding: 10px;">

							<div style="width: 1000px; margin: auto; border-bottom: 1px solid #000;">
								<div class="row justify-content-sm-center" style="display: flex; width: 1000px; height: 40px; margin-left: 0px">
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">First Frame</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">S2G</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">MYA</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">EchoMimicV2</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">Ours</p>
										</div>
								</div>
								<video autoplay="autoplay" controls="controls" loop="loop" muted="muted" src="sota/16_stack.mp4" style="width:1000px; border: 5px solid #000;" type="video/mp4"></video>
							</div>
						</th>
						</tr>
						
						
						<tr>
						<th style="text-align: center; padding: 10px;">

							<div style="width: 1000px; margin: auto; border-bottom: 1px solid #000;">
								<div class="row justify-content-sm-center" style="display: flex; width: 1000px; height: 40px; margin-left: 0px">
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">First Frame</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">S2G</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">MYA</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">EchoMimicV2</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">Ours</p>
										</div>
								</div>
								<video autoplay="autoplay" controls="controls" loop="loop" muted="muted" src="sota/1_stack.mp4" style="width:1000px; border: 5px solid #000;" type="video/mp4"></video>
							</div>
						</th>
						</tr>
						

					
					
					<tr>
					<th style="text-align: center; padding: 10px;">

						<div style="width: 1000px; margin: auto; border-bottom: 1px solid #000;">
							<div class="row justify-content-sm-center" style="display: flex; width: 1000px; height: 40px; margin-left: 0px">
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">First Frame</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">S2G</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">MYA</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">EchoMimicV2</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">Ours</p>
										</div>
							</div>
							<video autoplay="autoplay" controls="controls" loop="loop" muted="muted" src="sota/4_stack.mp4" style="width:1000px; border: 5px solid #000;" type="video/mp4"></video>
						</div>
					</th>
				</tr>
				
				
				<tr>
				<th style="text-align: center; padding: 10px;">

					<div style="width: 1000px; margin: auto; border-bottom: 1px solid #000;">
						<div class="row justify-content-sm-center" style="display: flex; width: 1000px; height: 40px; margin-left: 0px">
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">First Frame</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">S2G</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">MYA</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">EchoMimicV2</p>
										</div>
										<div style="flex: 0 0 20%; text-align: center;">
											<p style="font-family: Chalkduster; font-size: 16px; margin: 0;">Ours</p>
										</div>
						</div>
						<video autoplay="autoplay" controls="controls" loop="loop" muted="muted" src="sota/5_stack.mp4" style="width:1000px; border: 5px solid #000;" type="video/mp4"></video>
					</div>
				</th>
			</tr>
	
	



						
				</tbody>
			</table>



		</div>


		<script src="./resources/jquery-3.4.1.slim.min.js"></script>
		<script src="./resources/popper.min.js"></script>
		<script src="./resources/bootstrap.min.js"></script>

	</body>


</html>

	</html>
