<!DOCTYPE html>
<html>
<head>
  <meta charset="utf-8">
  <meta name="description"
        content="Effectively combining tactile sensing and reinforcement learning (RL) creates
        powerful new pathways for sophisticated robot manipulation. However, tactile
        information is not always fully exploited by neural network-based approaches
        in deep RL due to its unique characteristics (e.g. sparsity). Departing from
        conventional reliance on idealised state representations, we present a new approach
        to strengthen the performance of sensory-driven agents for complex manipulation
        tasks. We provide a novel application and analysis of tailored reconstruction and
        multi-step dynamics objectives that help the agent more effectively leverage its
        tactile observations, and propose training these objectives on a separated auxiliary
        memory. We find that dynamics-based objectives unlock higher-performing agents
        that are able to predict future contacts with high precision. Experimental results
        show the efficacy of our approach through a simulated robotic agent on three
        complex control tasks with touch and proprioception alone. ">
  <meta name="keywords" content="Reinforcement learning, representation learning, tactile sensing">
  <meta name="viewport" content="width=device-width, initial-scale=1">
  <title>Enhancing Tactile-based Reinforcement Learning for Robotic Control</title>

  <!-- Global site tag (gtag.js) - Google Analytics -->
  <script async src="https://www.googletagmanager.com/gtag/js?id=G-PYVRSFMDRL"></script>
  <script>
    window.dataLayer = window.dataLayer || [];

    function gtag() {
      dataLayer.push(arguments);
    }

    gtag('js', new Date());

    gtag('config', 'G-PYVRSFMDRL');
  </script>

  <link href="https://fonts.googleapis.com/css?family=Google+Sans|Noto+Sans|Castoro"
        rel="stylesheet">

  <link rel="stylesheet" href="./static/css/bulma.min.css">
  <link rel="stylesheet" href="./static/css/bulma-carousel.min.css">
  <link rel="stylesheet" href="./static/css/bulma-slider.min.css">
  <link rel="stylesheet" href="./static/css/fontawesome.all.min.css">
  <link rel="stylesheet"
        href="https://cdn.jsdelivr.net/gh/jpswalsh/academicons@1/css/academicons.min.css">
  <link rel="stylesheet" href="./static/css/index.css">
  <link rel="icon" href="./static/images/icon.png">

  <script src="https://ajax.googleapis.com/ajax/libs/jquery/3.5.1/jquery.min.js"></script>
  <script defer src="./static/js/fontawesome.all.min.js"></script>
  <script src="./static/js/bulma-carousel.min.js"></script>
  <script src="./static/js/bulma-slider.min.js"></script>
  <script src="./static/js/index.js"></script>
</head>
<body>


<section class="hero">
  <div class="hero-body">
    <div class="container is-max-desktop">
      <div class="columns is-centered">
        <div class="column has-text-centered">
          <h1 class="title is-1 publication-title">Enhancing Tactile-based Reinforcement Learning for Robotic Control</h1>

          <div class="column has-text-centered">
            <div class="publication-links">

            </div>

          </div>
        </div>
      </div>
    </div>
  </div>
</section>


<section class="section">
  <div class="container is-max-desktop">
    <!-- Abstract. -->
    <div class="columns is-centered has-text-centered">
      <div class="column is-four-fifths">
        <h2 class="title is-3">Abstract</h2>
        <div class="content has-text-justified">
          <p>
            Effectively combining tactile sensing and reinforcement learning (RL) creates
            powerful new pathways for sophisticated robot manipulation. However, tactile
            information is not always fully exploited by neural network-based approaches
            in deep RL due to its unique characteristics (e.g. sparsity). Departing from
            conventional reliance on idealised state representations, we present a new approach
            to strengthen the performance of sensory-driven agents for complex manipulation
            tasks. We provide a novel application and analysis of tailored reconstruction and
            multi-step dynamics objectives that help the agent more effectively leverage its
            tactile observations, and propose training these objectives on a separated auxiliary
            memory. We find that dynamics-based objectives unlock higher-performing agents
            that are able to predict future contacts with high precision. Experimental results
            show the efficacy of our approach through a simulated robotic agent on three
            complex control tasks with touch and proprioception alone.            
          </p>
        </div>
      </div>
    </div>

  </div>
</section>

<section class="section">
  <div class="container is-max-desktop">

    <div class="columns is-centered">

      <!-- Visual Effects. -->
      <div class="column">
        <div class="content">
          <h2 class="title is-3">Best PPO agent</h2>
          <p>
            Trained end-to-end
          </p>
          <video id="manual" autoplay controls muted loop playsinline height="100%">
            <source src="./static/videos/baoding/best_ppo.mp4"
                    type="video/mp4">
          </video>
          <script>
            const video = document.getElementById("shared");
            video.playbackRate = 0.5;
          </script>
        </div>
      </div>
      <!--/ Visual Effects. -->

      <!-- Matting. -->
      <div class="column">
        <h2 class="title is-3">Our best agent</h2>
        <div class="columns is-centered">
          <div class="column content">
            <p>
              Trained with self-supervised dynamics + an auxiliary memory
            </p>
            <video id="shared" autoplay controls muted loop playsinline height="100%">
              <source src="./static/videos/baoding/best_dynamics_memory.mp4"
                      type="video/mp4">
            </video>
            <script>
              const video = document.getElementById("shared");
              video.playbackRate = 0.5;
            </script>
          </div>

        </div>
      </div>
    </div>
    <!--/ Matting. -->
  </div>
</section>

<section class="section">
  <div class="container is-max-desktop">

    <div class="columns is-centered">

      <!-- Visual Effects. -->
      <div class="column">
        <div class="content">
          <h2 class="title is-3">Best PPO agent</h2>
          <p>
            Trained end-to-end
          </p>
          <video id="manual" autoplay controls muted loop playsinline height="100%">
            <source src="./static/videos/bounce/best_ppo.mp4"
                    type="video/mp4">
          </video>
          <script>
            const video = document.getElementById("shared");
            video.playbackRate = 0.5;
          </script>
        </div>
      </div>
      <!--/ Visual Effects. -->

      <!-- Matting. -->
      <div class="column">
        <h2 class="title is-3">Our best agent</h2>
        <div class="columns is-centered">
          <div class="column content">
            <p>
              Trained with self-supervised dynamics
            </p>
            <video id="shared" autoplay controls muted loop playsinline height="100%">
              <source src="./static/videos/bounce/best_dynamics.mp4"
                      type="video/mp4">
            </video>
            <script>
              const video = document.getElementById("shared");
              video.playbackRate = 0.5;
            </script>
          </div>

        </div>
      </div>
    </div>
    <!--/ Matting. -->
  </div>
</section>

<section class="section">
  <div class="container is-max-desktop">

    <div class="columns is-centered">

      <!-- Visual Effects. -->
      <div class="column">
        <div class="content">
          <h2 class="title is-3">Best PPO agent</h2>
          <p>
            Trained end-to-en
          </p>
          <video id="manual" autoplay controls muted loop playsinline height="100%">
            <source src="./static/videos/find/best_ppo.mp4"
                    type="video/mp4">
          </video>
        </div>
      </div>
      <!--/ Visual Effects. -->

      <!-- Matting. -->
      <div class="column">
        <h2 class="title is-3">Our best agent</h2>
        <div class="columns is-centered">
          <div class="column content">
            <p>
              Trained with self-supervised dynamics
            </p>
            <video id="shared" autoplay controls muted loop playsinline height="100%">
              <source src="./static/videos/find/best_dynamics.mp4"
                      type="video/mp4">
            </video>

  
          </div>
        </div>
      </div>
    </div>
    <!--/ Matting. -->
  </div>
</section>

 
<footer class="footer">
  <div class="container">
    <div class="content has-text-centered">
      <a class="icon-link"
         href="./static/videos/nerfies_paper.pdf">
        <i class="fas fa-file-pdf"></i>
      </a>
      <a class="icon-link" href="https://github.com/keunhong" class="external-link" disabled>
        <i class="fab fa-github"></i>
      </a>
    </div>
    <div class="columns is-centered">
      <div class="column is-8">
        <div class="content">
          <p>
            Website template borrowed from <a
              href="https://github.com/nerfies/nerfies.github.io">NeRFies</a> 
          </p>
        </div>
      </div>
    </div>
  </div>
</footer>

</body>
</html>
