<!DOCTYPE html>
<html lang="en-us">

  <head>
  <link href="http://gmpg.org/xfn/11" rel="profile">
  <meta http-equiv="content-type" content="text/html; charset=utf-8">

  <meta name="viewport" content="width=device-width, initial-scale=1.0, maximum-scale=1">

  <title>
    
      Moving Beyond Navigation with Active Neural SLAM &middot; The ICLR Blog Track
    
  </title>

  
  <link rel="canonical" href="https://iclr.iro.umontreal.ca/a1a65e04-3367-42a7-ad60-1697ed83901d_1642017451/2021/12/01/active-neural-slam/">
  

  <link rel="stylesheet" href="https://iclr.iro.umontreal.ca/a1a65e04-3367-42a7-ad60-1697ed83901d_1642017451/public/css/poole.css">
  <link rel="stylesheet" href="https://iclr.iro.umontreal.ca/a1a65e04-3367-42a7-ad60-1697ed83901d_1642017451/public/css/syntax.css">
  <link rel="stylesheet" href="https://iclr.iro.umontreal.ca/a1a65e04-3367-42a7-ad60-1697ed83901d_1642017451/public/css/lanyon.css">
  <link rel="stylesheet" href="https://iclr.iro.umontreal.ca/a1a65e04-3367-42a7-ad60-1697ed83901d_1642017451/public/css/custom.css">
  <link rel="stylesheet" href="https://fonts.googleapis.com/css?family=PT+Serif:400,400italic,700%7CPT+Sans:400">

  <link rel="apple-touch-icon-precomposed" sizes="144x144" href="https://iclr.iro.umontreal.ca/a1a65e04-3367-42a7-ad60-1697ed83901d_1642017451/public/apple-touch-icon-precomposed.png">
  <link rel="shortcut icon" href="https://iclr.iro.umontreal.ca/a1a65e04-3367-42a7-ad60-1697ed83901d_1642017451/public/favicon.ico">

  <link rel="alternate" type="application/rss+xml" title="RSS" href="https://iclr.iro.umontreal.ca/a1a65e04-3367-42a7-ad60-1697ed83901d_1642017451/atom.xml">

  

  <script src="https://cdnjs.cloudflare.com/ajax/libs/mathjax/2.7.1/MathJax.js?config=TeX-AMS-MML_HTMLorMML" type="text/javascript" ></script>
 <!-- <script type="text/x-mathjax-config"> MathJax.Hub.Config({ TeX: { equationNumbers: { autoNumber: "AMS" } } }); </script> -->
  <script type="text/x-mathjax-config">
      MathJax.Hub.Config({
        tex2jax: { inlineMath: [ ['$','$'], ["\\(","\\)"] ],
         processEscapes: false
        }
      });
</script>
</head>


  <body>

    <!-- Target for toggling the sidebar `.sidebar-checkbox` is for regular
     styles, `#sidebar-checkbox` for behavior. -->
<input type="checkbox" class="sidebar-checkbox" id="sidebar-checkbox">
<!-- <input type="checkbox" class="sidebar-checkbox" id="sidebar-checkbox" > -->

<!-- Toggleable sidebar -->
<div class="sidebar" id="sidebar">
  <div class="sidebar-item">
    <p>For short-term, peer-sourced tests of time, generalizations, specializations, reproductions, etc.!</p>
  </div>

  <nav class="sidebar-nav">

    

    
    
      
        
          <a class="sidebar-nav-item" href="https://iclr.iro.umontreal.ca/a1a65e04-3367-42a7-ad60-1697ed83901d_1642017451/">ICLR 2022 Blog Track</a>
        
      
    
      
        
      
    
      
        
          <a class="sidebar-nav-item" href="https://iclr.iro.umontreal.ca/a1a65e04-3367-42a7-ad60-1697ed83901d_1642017451/about/">About</a>
        
      
    
      
    
      
        
      
    
      
        
          <a class="sidebar-nav-item" href="https://iclr.iro.umontreal.ca/a1a65e04-3367-42a7-ad60-1697ed83901d_1642017451/submitting/">Submitting</a>
        
      
    
      
        
          <a class="sidebar-nav-item" href="https://iclr.iro.umontreal.ca/a1a65e04-3367-42a7-ad60-1697ed83901d_1642017451/tags/">Tags</a>
        
      
    

    <a class="sidebar-nav-item" href="https://github.com/iclr-blog-track/iclr-blog-track.github.io">GitHub project</a>
    <span class="sidebar-nav-item">Currently vICLR Spring 2021</span>
  </nav>

  <div class="sidebar-item">
    <p>
      &copy; 2022. All rights reserved.
    </p>
  </div>
</div>


    <!-- Wrap is the content to shift when toggling the sidebar. We wrap the
         content to avoid any CSS collisions with our real content. -->
    <div class="wrap">
      <div class="masthead">
        <div class="container">
          <h3 class="masthead-title">
            <a href="/" title="Home">The ICLR Blog Track</a>
            <small></small>
          </h3>
        </div>
      </div>

      <div class="container content">
        <div class="post">
  <h1 id="iclr-post-title" class="post-title">Moving Beyond Navigation with Active Neural SLAM</h1>
  <span class="post-date">01 Dec 2021 | 
    <a class="content-tag" href="/tags/#navigation"> navigation </a>
  
    <a class="content-tag" href="/tags/#reinforcement-learning"> reinforcement learning </a>
  
    <a class="content-tag" href="/tags/#domain-generalization"> domain generalization </a>
  </span>

  <span id="iclr-post-authors" class="post-date">Anonymous</span>
  <h1 id="moving-beyond-navigation-with-active-neural-slam">Moving Beyond Navigation with Active Neural SLAM</h1>

<h2 id="introduction">Introduction</h2>

<p>The ability to effectively harness <code class="language-plaintext highlighter-rouge">autonomous control in real-world 3D environments</code> largely depends on learning realistic navigation techniques for embodied agents. Advances to classical robotics call for efficient methods to navigate, viz., explore unseen simulations while transferring the attained performance to the real-world domain. Any autonomous agent in the wild should have the ability to make decisions on the fly, having explored its surroundings and being cognizant of sudden changes, as in the real world.</p>

<div align="center">
<img src="/public/images/2021-12-01-active-neural-slam/ANS_blog-Scene.png" width="600" alt="Scene" />
</div>

<p>Apart from understanding the 3D scene, intelligent agents must explore new environments effectively that can prove pivotal to downstream tasks. Some of these include <strong>PointNav</strong> (PointGoal Navigation), wherein an agent has to reach a given target coordinate within a fixed time budget, and <strong>ObjectNav</strong> (ObjectGoal Navigation), which requires semantic knowledge of the scene, as it focuses on recognition of instances/classes in the environment. On a high-level, exploration for autonomous navigation consists of three sub-tasks [1]:</p>

<ol>
  <li><strong>Mapping</strong>, with long-range memory for long-horizon tasks</li>
  <li><strong>State Estimation</strong>, including position, orientation, velocity.</li>
  <li><strong>Path-Planning</strong>, devising a possible path to the goal location.</li>
</ol>

<p>The breakthroughs in classical robotics relied only on Mapping [2] and Path-Planning [3], further inspiring learning-based methods [1, 4] to learn exploration policies using deep neural networks. A recent work [4] learns exploration policies for navigation in an <code class="language-plaintext highlighter-rouge">end-to-end</code> setting, focusing primarily on the architecture, reward function design, and optimization procedure to overcome classical methods which heavily rely on geometry from sensor data, hence being susceptible to drift. They also extend beyond prior learning-based techniques by focusing on task-agnostic autonomous exploration with sensor-driven reward signals and learned maps instead of goal-oriented navigation, making this approach realizable in a real-world setup. Although the proposed learning policy tackles complex-scene navigation by dynamic updates to determine the agent’s immediate action, this end-to-end framework that employs <code class="language-plaintext highlighter-rouge">imitation learning</code> depends on having millions of frames of experience. As a result, it has <code class="language-plaintext highlighter-rouge">high sample complexity</code>, which explodes with the size of the scene (area, number of floors).</p>

<h2 id="motivation">Motivation</h2>

<p>The work from Chen et al. [4] proved to be highly beneficial for realistic exploration, motivated primarily by the following claims: The use of learning:</p>

<ul>
  <li>From RGB input streams to allow <strong>flexibility w.r.t input modalities</strong> that removed the necessity for geometry data from custom sensors.</li>
  <li>To ensure <strong>robustness to state estimation errors</strong> while determining agent pose.</li>
  <li>To use <strong>structural regularities</strong> in training scenes to adapt to unseen environments.</li>
</ul>

<p>in an end-to-end manner. <strong>Active Neural SLAM</strong> (ANS) [1] banks on the observation that these stages of learning an effective navigation pipeline occur at <code class="language-plaintext highlighter-rouge">different timesteps</code>, breaking this approach down in a <code class="language-plaintext highlighter-rouge">modular and hierarchical</code> manner.</p>

<p>Active Neural SLAM (ANS) [1] consists of three main modules: a learned <code class="language-plaintext highlighter-rouge">Neural SLAM module</code>, a <code class="language-plaintext highlighter-rouge">global policy</code>, and a <code class="language-plaintext highlighter-rouge">local policy</code>, which is guided through an <code class="language-plaintext highlighter-rouge">intermediate map representation</code>, and an <code class="language-plaintext highlighter-rouge">analytical path planner</code>. In this way, the learning-based techniques are utilized best while receiving aid from the analytical path planner to improve performance and sample efficiency.</p>

<ol>
  <li>
    <p><strong>Learned Neural SLAM Module</strong>: It consists of a <code class="language-plaintext highlighter-rouge">Mapper</code> and a <code class="language-plaintext highlighter-rouge">Pose Estimator</code>. The Mapper is trained with ground-truth supervision (<code class="language-plaintext highlighter-rouge">Cross-entropy Loss</code>) to predict a free-space egocentric (first-person point-of-view) spatial map of the explored 3D environment. It encodes the visual RGB stream and decodes the embedding to create the required map, determining the probabilities of obstacles in the scene and the area explored by the agent. The Pose Estimator determines the agent pose in a geocentric frame (earth-centered point-of-view) with supervision (<code class="language-plaintext highlighter-rouge">Mean Squared Error Loss</code>), using past sensor estimates and the egocentric map prediction.</p>
  </li>
  <li>
    <p><strong>Global Policy</strong>: Using the predicted agent pose and spatial map in the previous step, a policy is learned using <code class="language-plaintext highlighter-rouge">PPO</code> (Proximal Policy Optimisation, a reinforcement learning method to learn optimal policy) to obtain a long-term goal, with an ambitious intent of traversing larger geodesic distances to reach distant goals. The agent attempts to maximize the <code class="language-plaintext highlighter-rouge">coverage area</code> while exploring the 3D environment.</p>
  </li>
</ol>

<p>The determined long-term goal is then broken down to a short-term goal, using an analytical path planner, such as <code class="language-plaintext highlighter-rouge">Dijkstra’s Algorithm</code>. Along the shortest path to the goal, the short-term goal point is sampled at a distance of <code class="language-plaintext highlighter-rouge">0.25m</code> away from the agent’s current position.</p>

<ol>
  <li><strong>Local Policy</strong>: Trained to mimic via Imitation Learning, the local policy uses the input RGB data and binned relative headings (distance and angle) to the short-term goal to decide the most likely action to reach the short-term destination.</li>
</ol>

<p>Hence, the three-fold motivation of Chen et al. [4] is crafted in a modular, sample-efficient manner with both learning and classical methods. More specifically,</p>

<ul>
  <li>learning in the Neural SLAM module helps leverage any input modality,</li>
  <li>the global policy intelligently learns from structural regularities in the environment, and</li>
  <li>the local policy uses additional visual inputs to ensure robustness to state estimation errors.</li>
</ul>

<p>Previous works failed to focus on visual and physical realism of both the simulation and agent motion, using discrete 90-degree rotations, etc. [5]. To make motion even more realistic, ANS implements actuation and sensor noise models in the <code class="language-plaintext highlighter-rouge">Habitat simulator</code> after using a <strong>LoCoBot</strong>: a ROS research rover to collect navigational data and fit <code class="language-plaintext highlighter-rouge">Gaussian Mixture Models</code> to be implemented in the simulator for all experiments.</p>

<h2 id="impact">Impact</h2>

<p>While the results obtained may not be as substantial as what qualifies for “ground-breaking research,” the notion of breaking down the process of exploring complex 3D scenes into a modular hypothesis enables:</p>

<ol>
  <li>
    <p><strong>Doing away with RL’s <code class="language-plaintext highlighter-rouge">high sample complexity</code></strong>: ensures sample efficiency, achieving a speedup of <code class="language-plaintext highlighter-rouge">over 75x</code> against the best RL baseline, with just 1 million data samples required to produce competitive performance.</p>
  </li>
  <li>
    <p><strong>Domain Generalization</strong>: Active Neural SLAM (ANS) [1] is trained on the <code class="language-plaintext highlighter-rouge">Gibson</code> [6] domain and is transferred to the <code class="language-plaintext highlighter-rouge">Matterport3D (MP3D)</code> [7] domain. This method trained on the Gibson training set generalizes well in terms of <code class="language-plaintext highlighter-rouge">Coverage Area</code> (in m<sup>2</sup>) and <code class="language-plaintext highlighter-rouge">% Coverage</code> in the total scene area on both small and large scenes from the Gibson [6] validation set. Although performance drops on large MP3D [7] scenes (significantly larger), the method transfers well to smaller MP3D [7] scenes of comparative size, completely unseen to the model. The following observation holds for both training objectives of Exploration and PointGoal Navigation. Finally, the proposed method is also tested on a LoCoBot in the real world. It effectively explores the scene upon modifying camera intrinsics to match the Habitat [8] simulator, generalizing it to the real-world domain.</p>
  </li>
  <li>
    <p><strong>Task Generalization</strong>: The exploration models can be retrained or fine-tuned on the PointGoal navigation task by fixing the long-term goal output of the Global Policy, as the goal coordinates to be reached within the defined period. Experiments revealed that ANS transferred well to the task, outperforming the best RL and Imitation Learning baselines, even to more challenging goal locations (with larger geodesic distance) and the MP3D [7] domain. Since the learned Neural SLAM module and local policy are <code class="language-plaintext highlighter-rouge">invariant to the target task</code>, knowledge can be effortlessly distilled for low-level navigational control and obstacle-aware movement in new environments.</p>
  </li>
</ol>

<h2 id="future-extensions">Future Extensions</h2>

<p>Active Neural SLAM (ANS) [1] opens a door for a wide range of possibilities:</p>

<ul>
  <li><strong>Scene understanding using ANS</strong> [1]: Previous works such as <code class="language-plaintext highlighter-rouge">Semantic Curiosity</code> [9] and <code class="language-plaintext highlighter-rouge">Semantic Exploration</code> [10] focus on detecting surrounding object semantics and exploring unseen environments based on a target instance or class. However, the former [9], which pushes detection and segmentation on static datasets to an active learning framework, assumes labeled trajectories and does not demonstrate real-world domain transfer, proving pivotal to autonomous navigational control. Furthermore, there is scope for improving the Semantic Exploration [10] method that incurs errors for accurate localization of goal objects and discrepancies in segmentation predictions. Instead, incorporating <code class="language-plaintext highlighter-rouge">uncertainty in mapping</code> for the class-specific channels of the intermediate semantic map [11] to select the goal. An uncertainty-driven navigation policy exploiting <code class="language-plaintext highlighter-rouge">semantic hallucination</code> rather than target-specific objectives could remove bias from an information deficit in unseen environments.</li>
</ul>

<div align="center">
<img src="/public/images/2021-12-01-active-neural-slam/ANS_blog-Nav.png" width="750" alt="Navigation" />
</div>

<ul>
  <li><strong>Moving beyond just navigation using ANS</strong> [1]: Embodied agents in simulated 3D environments are endowed with the flexibility to learn regularities in their surroundings, to be transferred across domains and tasks. Using <code class="language-plaintext highlighter-rouge">ANS</code> [1] for navigation to a goal coordinate or Semantic Exploration [10] to navigate to a given class instance could be interleaved with the task of <code class="language-plaintext highlighter-rouge">motion prediction</code> (or) <code class="language-plaintext highlighter-rouge">synthesis</code> on the fly. Recent work on human motion synthesis [12] generates realistic motion between two points, using <code class="language-plaintext highlighter-rouge">scene</code> and <code class="language-plaintext highlighter-rouge">agent pose</code> information as inputs. The long-term or short-term goal in conjunction with the map and pose from the learned Neural SLAM module could be leveraged for this purpose. Navigation and motion synthesis for long-horizon tasks will pose heavy compute dependence with unbounded scenes in the wild. Current approaches use the <code class="language-plaintext highlighter-rouge">Conditional Variational Auto-Encoder (CVAE)</code> framework, generating motion of human agents in an auto-regressive setting using <code class="language-plaintext highlighter-rouge">Long short-term memory (LSTM)</code> with <code class="language-plaintext highlighter-rouge">Fully-connected layers</code> [12], and <code class="language-plaintext highlighter-rouge">Transformer-based VAE</code> [13] for action-conditioned motion. <code class="language-plaintext highlighter-rouge">Structured State Spaces</code> [14] can be studied to model longer sequences efficiently for long-range motion synthesis to model unbounded motion sequences with varying lengths up to <code class="language-plaintext highlighter-rouge">60x faster</code>, in linear time and memory with substantially lesser parameters. In theory, this can aid applications in Graphics, such as Virtual Reality (VR) and video games, to learn holistic agent-scene interaction. For example, a learned agent to navigate &gt;=2 floors in a complex environment to reach a specific object instance in a given room, perform a user-specified action such as <code class="language-plaintext highlighter-rouge">“sit on the sofa in the living room of the ground floor”</code>, while synthesizing motion dynamically). Additional agents could also be incorporated for realistic gaming and VR experiences, using methods such as in [15]. The figures below serve as a high-level overview of a plausible pipeline that could be built upon ANS [1], solely for illustrative purposes. We also create a 3D scene with additionally generated agents [15] for a visual representation.</li>
</ul>

<div align="center">
<img src="/public/images/2021-12-01-active-neural-slam/ANS_blog-combined_synth_scene.png" width="900" alt="Synthetic Scene." />
</div>

<div align="center">
<img src="/public/images/2021-12-01-active-neural-slam/ANS_blog-RL.png" width="1000" alt="RL" />
</div>

<ul>
  <li>ANS [1] is trained and tested on <strong>closed 3D scenes</strong> in Habitat. However, the <strong>generalization of this method using Habitat to open scenes</strong>, such as ones from the <code class="language-plaintext highlighter-rouge">PROX</code> [16] and selective scenes from <code class="language-plaintext highlighter-rouge">Replica</code> [17], remains in doubt. In such cases, the 3D scene could be manipulated, such as closing the meshes by constructing a ceiling, etc. Further study is required, concerning this methods’ dependence on scene characteristics by moving beyond <code class="language-plaintext highlighter-rouge">Gibson</code> [6] and <code class="language-plaintext highlighter-rouge">MP3D</code> [7] scenes. Sample 3D scenes from these datasets are illustrated below, highlighting the difference between closed and open scenes.</li>
</ul>

<div align="center">
<img src="/public/images/2021-12-01-active-neural-slam/ANS_blog-closed_scenes.png" width="1000" alt="Closed Scene" />
</div>

<div align="center">
<img src="/public/images/2021-12-01-active-neural-slam/ANS_blog-open_scenes.png" width="1000" alt="Open Scene" />
</div>

<h2 id="references">References</h2>

<p>[1] Chaplot, D.S., Gandhi, D., Gupta, S., Gupta, A. and Salakhutdinov, R., 2020. Learning To Explore Using Active Neural SLAM. In International Conference on Learning Representations (ICLR)</p>

<p>[2] Hartley, R.I., &amp; Zisserman, A. (2009). Multiple View Geometry. Encyclopedia of Biometrics.</p>

<p>[3] Faverjon, B., &amp; Tournassoud, P. (1996). Probabilistic Roadmaps for Path Planning in High-Dimensional Configuration Spaces.</p>

<p>[4] Chen, T., Gupta, S. and Gupta, A., 2019. Learning Exploration Policies for Navigation. In International Conference on Learning Representations (ICLR)</p>

<p>[5] Zhu, Y., Mottaghi, R., Kolve, E., Lim, J.J., Gupta, A.K., Fei-Fei, L., &amp; Farhadi, A. (2017). Target-driven visual navigation in indoor scenes using deep reinforcement learning. 2017 IEEE International Conference on Robotics and Automation (ICRA), 3357-3364.</p>

<p>[6] F. Xia, A. R. Zamir, Z. He, A. Sax, J. Malik and S. Savarese, “Gibson Env: Real-World Perception for Embodied Agents,” 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018, pp. 9068-9079, doi: 10.1109/CVPR.2018.00945.</p>

<p>[7] Chang, A.X., Dai, A., Funkhouser, T.A., Halber, M., Nießner, M., Savva, M., Song, S., Zeng, A., &amp; Zhang, Y. (2017). Matterport3D: Learning from RGB-D Data in Indoor Environments. 2017 International Conference on 3D Vision (3DV), 667-676.</p>

<p>[8] Savva, M., Kadian, A., Maksymets, O., Zhao, Y., Wijmans, E., Jain, B., Straub, J., Liu, J., Koltun, V., Malik, J., Parikh, D., &amp; Batra, D. (2019). Habitat: A Platform for Embodied AI Research. 2019 IEEE/CVF International Conference on Computer Vision (ICCV), 9338-9346.</p>

<p>[9] Chaplot, D.S., Jiang, H., Gupta, S., &amp; Gupta, A.K. (2020). Semantic Curiosity for Active Visual Learning. ECCV.</p>

<p>[10] Chaplot, D.S., Gandhi, D., Gupta, A. and Salakhutdinov, R. 2020. Object Goal Navigation using Goal-Oriented Semantic Exploration. In Neural Information Processing Systems (NeurIPS-20).</p>

<p>[11] Georgakis, G., Bucher, B., Schmeckpeper, K., Singh, S., &amp; Daniilidis, K. (2021). Learning to Map for Active Semantic Goal Navigation. ArXiv, abs/2106.15648.</p>

<p>[12] Wang, J., Xu, H., Xu, J., Liu, S., &amp; Wang, X. (2021). Synthesizing Long-Term 3D Human Motion and Interaction in 3D Scenes. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 9396-9406.</p>

<p>[13] Petrovich, M., Black, M.J., and Varol, G. (2021). Action-Conditioned 3{D} Human Motion Synthesis with Transformer VAE. 2021 International Conference on Computer Vision (ICCV)</p>

<p>[14] Gu, A., Goel, K., &amp; R’e, C. (2021). Efficiently Modeling Long Sequences with Structured State Spaces. ArXiv, abs/2111.00396.</p>

<p>[15] Zhang, Y., Hassan, M., Neumann, H., Black, M.J., &amp; Tang, S. (2020). Generating 3D People in Scenes Without People. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 6193-6203.</p>

<p>[16] Hassan, M., Choutas, V., Tzionas, D., &amp; Black, M.J. (2019). Resolving 3D Human Pose Ambiguities With 3D Scene Constraints. 2019 IEEE/CVF International Conference on Computer Vision (ICCV), 2282-2292.</p>

<p>[17] Straub, J., Whelan, T., Ma, L., Chen, Y., Wijmans, E., Green, S., Engel, J., Mur-Artal, R., Ren, C.Y., Verma, S., Clarkson, A., Yan, M., Budge, B., Yan, Y., Pan, X., Yon, J., Zou, Y., Leon, K., Carter, N., Briales, J., Gillingham, T., Mueggler, E., Pesqueira, L., Savva, M., Batra, D., Strasdat, H.M., Nardi, R.D., Goesele, M., Lovegrove, S., &amp; Newcombe, R.A. (2019). The Replica Dataset: A Digital Replica of Indoor Spaces. ArXiv, abs/1906.05797.</p>

</div>

<div id="bibtex-container" class="related">
  For attribution in academic contexts, please cite this work as
  <pre id="bibtex-academic-attribution">

  </pre>

  BibTeX citation
  <pre id="bibtex-box">

  </pre>
</div>
<script>
  let authorsSpan = document.getElementById("iclr-post-authors");
  let authorsText = authorsSpan.textContent;
  let lnameFnameInstitution = authorsText.split(";");
  let lfiList = lnameFnameInstitution.map(lfi => lfi.split(",").map(item => item.trim()));
  let bibtexLFI = lfiList.map(lfi => lfi[0] + ", " + lfi[1]).join(" and ")
  let academicLFI = lfiList.map(lfi => lfi[0]);
  {
    if(academicLFI.length > 2) academicLFI = academicLFI[0] + ", et al.";
    else if(academicLFI.length == 2) academicLFI = academicLFI[0] + " & " + academicLFI[1];
    else academicLFI = academicLFI[0];
  }

  let titleSpan = document.getElementById("iclr-post-title");
  let titleText = titleSpan.textContent.trim();
  let bibtexTitleShorthand = (lfiList[0][1]+
    "2022"+
    titleText.split(" ").slice(0, 3).join("")
  ).replace(" ", "").replace(/[\p{P}$+<=>^`|~]/gu, '').toLowerCase().trim();

  let bibtexTemplate = `
@inproceedings{${bibtexTitleShorthand}},
  author = {${bibtexLFI}},
  title = {${titleText}},
  booktitle = {ICLR Blog Track},
  year = {2022},
  note = {${window.location.href}},
  url  = {${window.location.href}}
}
  `.trim();
  document.getElementById("bibtex-box").innerText = bibtexTemplate;

  let academicTemplate = `
${academicLFI}, "${titleText}", ICLR Blog Track, 2022.
`.trim();
  document.getElementById("bibtex-academic-attribution").innerText = academicTemplate;

</script>


<div class="related">
  <h2>Related posts</h2>
  <ul class="related-posts">
    
      <li>
        <h3>
          <a href="/2021/09/01/sample-submission/">
            Sample Submission
            <small>01 Sep 2021 | 
    <a class="content-tag" href="/tags/#navigation"> navigation </a>
  
    <a class="content-tag" href="/tags/#reinforcement-learning"> reinforcement learning </a>
  
    <a class="content-tag" href="/tags/#domain-generalization"> domain generalization </a>
  </small>
          </a>
        </h3>
      </li>
    
      <li>
        <h3>
          <a href="/2020/04/02/example-content/">
            Example content (Basic Markdown)
            <small>02 Apr 2020 | 
    <a class="content-tag" href="/tags/#navigation"> navigation </a>
  
    <a class="content-tag" href="/tags/#reinforcement-learning"> reinforcement learning </a>
  
    <a class="content-tag" href="/tags/#domain-generalization"> domain generalization </a>
  </small>
          </a>
        </h3>
      </li>
    
  </ul>
</div>


<script src="https://utteranc.es/client.js"
        repo="iclr-blog-track/iclr-blog-track.github.io"
        issue-term="pathname"
        label="utterance"
        theme="boxy-light"
        crossorigin="anonymous"
        >
</script>


      </div>
    </div>

    <label for="sidebar-checkbox" class="sidebar-toggle"></label>

    <script src='/public/js/script.js'></script>
  </body>
</html>
