<!doctype html><meta charset=utf-8>

<head>
    <!-- Bootstrap -->
    <link href="css/bootstrap-4.4.1.css" rel="stylesheet">
    <link href="https://fonts.googleapis.com/css?family=Open+Sans" rel="stylesheet" type="text/css">
    <link href="css/misc.css" rel="stylesheet" type="text/css">
    <link href="https://fonts.googleapis.com/css2?family=Inter&display=swap" rel="stylesheet">
    <script src="https://polyfill.io/v3/polyfill.min.js?features=es6"></script>
    <script id="MathJax-script" async
    src="https://cdn.jsdelivr.net/npm/mathjax@3/es5/tex-mml-chtml.js">
    </script>

    <script>
    window.dataLayer = window.dataLayer || [];
    function gtag(){dataLayer.push(arguments);}
    gtag('js', new Date());

    gtag('config', '');
    </script>
</head>

<title>CoRL 2025 Submission #39</title>

<div class="col-sm-12" style="text-align: center;">
    <h1 class="name" style="font-family:'Inter';font-weight: 500;"><span>Elucidating the Design Space of </span><span>
Torque-aware Vision-Language-Action Models</span><br>
</div>
<!-- Table of Contents -->
<div class="col-sm-12" style="font-family:'Helvetica';text-align: center;">
    <a style="font-family:'Inter';font-size:1.25em;">Paper Submission #39</a>
    <br>
    <a style="font-family:'Inter';font-size:1.15em;color:#4B9AE7">Anonymous Author(s)</a><br>

    <a style="font-family:'Inter';font-size:1.05em;">Affiliation</a><br>

    <!-- <h3>Table of Contents</h3>
    <ul style="list-style-type: none; padding: 0;">
        <li><a href="#baseline-comparisons">Baseline Comparisons</a></li>
        <li><a href="#dynamics-effects">Dynamics Effects</a></li>
        <li><a href="#applications">Applications</a></li>
    </ul> -->
</div>

<div class="main">
    <section class="section" id="Torque Demonstration">
        <div class="container text-center">
        <div class="container">
          
            <hr>
            <h1 style="text-align:center; margin-top: 0pt; margin-bottom: 10pt;font-family:'Inter';">Torque Demonstration</h1>

            <div class="col-12 text-center">
                <p style="text-align:left">
                    In this section, we present a video demonstrating torque variations in real-world scenarios. The video showcases a robot arm performing tasks with varying joint torques. From left to right, we present three tasks: Charger Plugging, USB Plugging, and Button Pushing. For each task, the first video provides a top-down view, the second video offers a front view, and the third video visualizes the torque variations. These three videos are time-synchronized and played at 1x speed.
                </p>
            </div>
<style>
  .no-gutters {
    margin-right: 0;
    margin-left: 0;
  }
  .no-gutters > [class*="col-"] {
    padding-right: 0;
    padding-left: 0;
  }
  video {
    width: 100% !important;
    height: auto !important;
    display: block;
  }
  .video-title {
    text-align: center;   /* 居中题目 */
    margin-top: 0.5rem;   /* 跟视频留一点空隙 */
    font-size: 1rem;
    color: #333;
  }
</style>

<div class="container text-center">
  <div class="row no-gutters my-4">
    <div class="col-4">
      <video controls loop autoplay muted src="./videos/torque_demonstration/Charger_Plugging.mp4"></video>
      <p class="video-title">Charger Plugging</p>
    </div>
    <div class="col-4">
      <video controls loop autoplay muted src="./videos/torque_demonstration/USB_Plugging.mp4"></video>
      <p class="video-title">USB Plugging</p>
    </div>
    <div class="col-4">
      <video controls loop autoplay muted src="./videos/torque_demonstration/Button_Pushing.mp4"></video>
      <p class="video-title">Button Pushing</p>
    </div>
  </div>
</div>

<section class="section" id="Contact-Rich Tasks">
        <div class="container text-center">
        <div class="container">
          
            <hr>
            <h1 style="text-align:center; margin-top: 0pt; margin-bottom: 10pt;font-family:'Inter';">Contact-Rich Tasks</h1>

            <div class="col-12 text-center">
                <p style="text-align:left">
                    In this section, we present five videos showcasing the performance of the torque-aware model in the following five contact-rich tasks: Button Pushing, Charger Plugging, USB Plugging, Door Opening, and Drawer Opening. These videos are played at 1x speed.
                </p>
            </div>

            <div class="container text-center">
                <div class="row align-items-center my-4">
                    <div class="col-lg-2 col-md-2 col-sm-2 col-2 text-center">
                        <h5 class="mt-2">Button Pushing</h5>
                    </div>
                    <div class="col-lg-1 col-md-1 col-sm-1 col-1 text-center"></div>

                    <div class="col-lg-9 col-md-9 col-sm-9 col-9 text-center">
                                <div class="embed-responsive embed-responsive-16by9">
                                    <video controls loop autoplay muted>
                                        <source src="./videos/contact_rich/Button_Pushing.mp4" type="video/mp4">
                                    </video>
                                </div>
                    </div>
                </div>
            </div>



            <div class="container text-center">
                <div class="row align-items-center my-4">
                    <div class="col-lg-2 col-md-2 col-sm-2 col-2 text-center">
                        <h5 class="mt-2">Charger Plugging</h5>
                    </div>
                    <div class="col-lg-1 col-md-1 col-sm-1 col-1 text-center"></div>

                    <div class="col-lg-9 col-md-9 col-sm-9 col-9 text-center">
                                <div class="embed-responsive embed-responsive-16by9">
                                    <video controls loop autoplay muted>
                                        <source src="./videos/contact_rich/Charger_Plugging.mp4" type="video/mp4">
                                    </video>
                                </div>
                    </div>
                </div>
            </div><br>


            <div class="container text-center">
                <div class="row align-items-center my-4">
                    <div class="col-lg-2 col-md-2 col-sm-2 col-2 text-center">
                        <h5 class="mt-2">USB Plugging</h5>
                    </div>
                    <div class="col-lg-1 col-md-1 col-sm-1 col-1 text-center"></div>

                    <div class="col-lg-9 col-md-9 col-sm-9 col-9 text-center">
                                <div class="embed-responsive embed-responsive-16by9">
                                    <video controls loop autoplay muted>
                                        <source src="./videos/contact_rich/USB_Plugging.mp4" type="video/mp4">
                                    </video>
                                </div>
                    </div>
                </div>
            </div><br>

            <div class="container text-center">
                <div class="row align-items-center my-4">
                    <div class="col-lg-2 col-md-2 col-sm-2 col-2 text-center">
                        <h5 class="mt-2">Socket Unplugging</h5>
                    </div>
                    <div class="col-lg-1 col-md-1 col-sm-1 col-1 text-center"></div>

                    <div class="col-lg-9 col-md-9 col-sm-9 col-9 text-center">
                                <div class="embed-responsive embed-responsive-16by9">
                                    <video controls loop autoplay muted>
                                        <source src="./videos/contact_rich/Socket_Unplugging.mp4" type="video/mp4">
                                    </video>
                                </div>
                    </div>
                </div>
            </div><br>
            <div class="container text-center">
                <div class="row align-items-center my-4">
                    <div class="col-lg-2 col-md-2 col-sm-2 col-2 text-center">
                        <h5 class="mt-2">Door Handle Turning</h5>
                    </div>
                    <div class="col-lg-1 col-md-1 col-sm-1 col-1 text-center"></div>

                    <div class="col-lg-9 col-md-9 col-sm-9 col-9 text-center">
                                <div class="embed-responsive embed-responsive-16by9">
                                    <video controls loop autoplay muted>
                                        <source src="./videos/contact_rich/Door_Handle_Turning.mp4" type="video/mp4">
                                    </video>
                                </div>
                    </div>
                </div>
            </div><br>
        </div>
</section><br>

<section class="section" id="Regular Tasks">
        <div class="container text-center">
        <div class="container">
          
            <hr>
            <h1 style="text-align:center; margin-top: 0pt; margin-bottom: 10pt;font-family:'Inter';">Regular Tasks</h1>

            <div class="col-12 text-center">
                <p style="text-align:left">
                    In this section, we present five videos showcasing the performance of the torque-aware model in the following five regular tasks: Bottle Pick and Place, Liquid Pouring, Stacking Cubes, Push-to-Position, and Opening a Drawer. These videos are played at 1x speed.
                </p>
            </div>

            <div class="container text-center">
                <div class="row align-items-center my-4">
                    <div class="col-lg-2 col-md-2 col-sm-2 col-2 text-center">
                        <h5 class="mt-2">Bottle Pick and Place</h5>
                    </div>
                    <div class="col-lg-1 col-md-1 col-sm-1 col-1 text-center"></div>

                    <div class="col-lg-9 col-md-9 col-sm-9 col-9 text-center">
                                <div class="embed-responsive embed-responsive-16by9">
                                    <video controls loop autoplay muted>
                                        <source src="./videos/regular/Bottle_Pick_and_Place.mp4" type="video/mp4">
                                    </video>
                                </div>
                    </div>
                </div>
            </div>



            <div class="container text-center">
                <div class="row align-items-center my-4">
                    <div class="col-lg-2 col-md-2 col-sm-2 col-2 text-center">
                        <h5 class="mt-2">Liquid Pouring</h5>
                    </div>
                    <div class="col-lg-1 col-md-1 col-sm-1 col-1 text-center"></div>

                    <div class="col-lg-9 col-md-9 col-sm-9 col-9 text-center">
                                <div class="embed-responsive embed-responsive-16by9">
                                    <video controls loop autoplay muted>
                                        <source src="./videos/regular/Liquid_Pouring.mp4" type="video/mp4">
                                    </video>
                                </div>
                    </div>
                </div>
            </div><br>


            <div class="container text-center">
                <div class="row align-items-center my-4">
                    <div class="col-lg-2 col-md-2 col-sm-2 col-2 text-center">
                        <h5 class="mt-2">Stacking Cubes</h5>
                    </div>
                    <div class="col-lg-1 col-md-1 col-sm-1 col-1 text-center"></div>

                    <div class="col-lg-9 col-md-9 col-sm-9 col-9 text-center">
                                <div class="embed-responsive embed-responsive-16by9">
                                    <video controls loop autoplay muted>
                                        <source src="./videos/regular/Stacking_Cubes.mp4" type="video/mp4">
                                    </video>
                                </div>
                    </div>
                </div>
            </div><br>
            <div class="container text-center">
                <div class="row align-items-center my-4">
                    <div class="col-lg-2 col-md-2 col-sm-2 col-2 text-center">
                        <h5 class="mt-2">Push-to-Position</h5>
                    </div>
                    <div class="col-lg-1 col-md-1 col-sm-1 col-1 text-center"></div>

                    <div class="col-lg-9 col-md-9 col-sm-9 col-9 text-center">
                                <div class="embed-responsive embed-responsive-16by9">
                                    <video controls loop autoplay muted>
                                        <source src="./videos/regular/Push_to_Position.mp4" type="video/mp4">
                                    </video>
                                </div>
                    </div>
                </div>

            

            </div><br>
            <div class="container text-center">
                <div class="row align-items-center my-4">
                    <div class="col-lg-2 col-md-2 col-sm-2 col-2 text-center">
                        <h5 class="mt-2">Opening a Drawer</h5>
                    </div>
                    <div class="col-lg-1 col-md-1 col-sm-1 col-1 text-center"></div>

                    <div class="col-lg-9 col-md-9 col-sm-9 col-9 text-center">
                                <div class="embed-responsive embed-responsive-16by9">
                                    <video controls loop autoplay muted>
                                        <source src="./videos/regular/Opening_a_Drawer.mp4" type="video/mp4">
                                    </video>
                                </div>
                    </div>
                </div>
            </div><br>
        </div>
</section><br>

<section class="section" id="Cross Embodiment">
        <div class="container text-center">
        <div class="container">
          
            <hr>
            <h1 style="text-align:center; margin-top: 0pt; margin-bottom: 10pt;font-family:'Inter';">Cross Embodiment</h1>

            <div class="col-12 text-center">
                <p style="text-align:left">
                    In this section, we present a video demonstrating the performance of cross embodiment performance using the ROKAE SR robotic arm. The tasks include inserting a fast-charging connector and a slow-charging connector. The video is played at 1x speed.
                </p>
            </div>

            <div class="container text-center">
                <div class="row align-items-center my-4">
                    <div class="col-lg-2 col-md-2 col-sm-2 col-2 text-center">
                        <h5 class="mt-2">Inserting a Fast-Charging Connector</h5>
                    </div>
                    <div class="col-lg-1 col-md-1 col-sm-1 col-1 text-center"></div>

                    <div class="col-lg-9 col-md-9 col-sm-9 col-9 text-center">
                                <div class="embed-responsive embed-responsive-16by9">
                                    <video controls loop autoplay muted>
                                        <source src="./videos/cross_embodiment/Fast_Charging.mp4" type="video/mp4">
                                    </video>
                                </div>
                    </div>
                </div>
            </div>



            <div class="container text-center">
                <div class="row align-items-center my-4">
                    <div class="col-lg-2 col-md-2 col-sm-2 col-2 text-center">
                        <h5 class="mt-2">Inserting a Slow-Charging Connector</h5>
                    </div>
                    <div class="col-lg-1 col-md-1 col-sm-1 col-1 text-center"></div>

                    <div class="col-lg-9 col-md-9 col-sm-9 col-9 text-center">
                                <div class="embed-responsive embed-responsive-16by9">
                                    <video controls loop autoplay muted>
                                        <source src="./videos/cross_embodiment/Slow_Charging.mp4" type="video/mp4">
                                    </video>
                                </div>
                    </div>
                </div>
            </div><br>

        </div>
</section><br>

</div>
