
<!doctype html>
<html lang="en">
    <head>
        <!-- Required meta tags -->
        <meta charset="utf-8">
        <meta name="viewport" content="width=device-width, initial-scale=1, shrink-to-fit=no">

        <!-- Bootstrap CSS -->
        <link rel="stylesheet" href="https://stackpath.bootstrapcdn.com/bootstrap/4.3.1/css/bootstrap.min.css" integrity="sha384-ggOyR0iXCbMQv3Xipma34MD+dH/1fQ784/j6cY/iJTQUOhcWr7x9JvoRxT2MZw1T" crossorigin="anonymous">

        <!-- Other -->
        <script src="https://cdnjs.cloudflare.com/ajax/libs/handlebars.js/4.4.2/handlebars.min.js"></script>

        <title>Social Driving</title>
    </head>

    <body>

        <!-- header -->
        <div class='jumbotron'>
            <div class="container">
                <h1 class="text-center">EMERGENT ROAD RULES IN MULTI-AGENT DRIVING ENVIRONMENTS</h1>
                <p class="text-center">
                    ICLR 2021 Submission
                </p>
            </div>
        </div>
    
        <div class="container">
            <h2 class="text-center">Abstract</h2>
            <p class="text-justify">
                In order for autonomous vehicles to share the road safely with human drivers, autonomous vehicles must abide by certain "road rules" that human drivers have agreed all road users must follow. "Road rules" include rules that drivers are required to follow by law -- such as the requirement that vehicles stop at red lights -- as well as more subtle social rules -- such as the implicit designation of "fast lanes" on the highway. In this paper, we provide empirical evidence that suggests that -- instead of hard-coding these road rules into self-driving algorithms -- a scalable alternative may be to design multi-agent environments such that agents within the environments ``discover for themselves'' that these road rules are mutually beneficial to follow. We analyze what components of our chosen multi-agent environment cause the emergence of such behavior and find that two crucial factors are noisy perception and the spatial density of agents. We provide qualitative and quantitative evidence of the emergence of seven social driving behaviors, ranging from stopping at a traffic signal to following lanes. Our results add empirical support for the social road rules that countries around the world have agreed on for safe driving.
            </p>

            <h2 class="text-center">Emergent Social Driving Rules</h2>
            <span class="border border-white">
                <h4 class="text-left">1. Stopping at a Traffic Signal</h4>
                <br>

                <div class="row">
                    <div class="col-md-12 text-justify">
                        In a 4-way intersection, agents learn to obey traffic signals to safely navigate to the opposite road in minimum time.
    
                        Note that the agents merely observe a ternary value representing the traffic light’s state, not color. To make the visualizations, we visually inspect rollouts for each converged policy to find a permutation of the ternary states that align with human red/yellow/green traffic light conventions
                    </div>
                </div>
                
                <br><br>

                <div class="row">
                    <div class="col-md-3">
                        <h6 class="text-center">Lidar Noise = 0%</h6>
                    </div>
                    <div class="col-md-3">
                        <h6 class="text-center">Lidar Noise = 25%</h6>
                    </div>
                    <div class="col-md-3">
                        <h6 class="text-center">Lidar Noise = 50%</h6>
                    </div>
                    <div class="col-md-3">
                        <h6 class="text-center">Lidar Noise = 75%</h6>
                    </div>
                </div>

                <div class="row">
                    <div class="col-md-3">
                        <img class="img-fluid" src="videos/ts000_1.gif"></img>
                    </div>
                    <div class="col-md-3">
                        <img class="img-fluid" src="videos/ts025_1.gif"></img>
                    </div>
                    <div class="col-md-3">
                        <img class="img-fluid" src="videos/ts050_1.gif"></img>
                    </div>
                    <div class="col-md-3">
                        <img class="img-fluid" src="videos/ts075_1.gif"></img>
                    </div>
                </div>

                <div class="row">
                    <div class="col-md-3">
                        <img class="img-fluid" src="videos/ts000_2.gif"></img>
                    </div>
                    <div class="col-md-3">
                        <img class="img-fluid" src="videos/ts025_2.gif"></img>
                    </div>
                    <div class="col-md-3">
                        <img class="img-fluid" src="videos/ts050_2.gif"></img>
                    </div>
                    <div class="col-md-3">
                        <img class="img-fluid" src="videos/ts075_2.gif"></img>
                    </div>
                </div>

                <div class="row">
                    <div class="col-md-3">
                        <img class="img-fluid" src="videos/ts000_3.gif"></img>
                    </div>
                    <div class="col-md-3">
                        <img class="img-fluid" src="videos/ts025_3.gif"></img>
                    </div>
                    <div class="col-md-3">
                        <img class="img-fluid" src="videos/ts050_3.gif"></img>
                    </div>
                    <div class="col-md-3">
                        <img class="img-fluid" src="videos/ts075_3.gif"></img>
                    </div>
                </div>

                <div class="row">
                    <div class="col-md-3">
                        <img class="img-fluid" src="videos/ts000_4.gif"></img>
                    </div>
                    <div class="col-md-3">
                        <img class="img-fluid" src="videos/ts025_4.gif"></img>
                    </div>
                    <div class="col-md-3">
                        <img class="img-fluid" src="videos/ts050_4.gif"></img>
                    </div>
                    <div class="col-md-3">
                        <img class="img-fluid" src="videos/ts075_4.gif"></img>
                    </div>
                </div>
                
                <br>
                <h5 class="text-left">Transfer from Synthetic Map to a Real World Map</h5>
                <br>

                In this experiment we show that policies trained on the synthetic intersection above transfer to real-world intersections found in the nuScenes dataset.
                <br><br>
                <div class="row">
                    <div class="col-md-6">
                        <img class="img-fluid" src="videos/r1_gen.gif"></img>
                    </div>
                    <div class="col-md-6">
                        <img class="img-fluid" src="videos/r2_gen.gif"></img>
                    </div>
                </div>
            </span>

            <span class="border border-white">
                <h4 class="text-left">2. Emergence of Lanes</h4>
                <br>

                When the agents are trained in an environment contraining 4 agents they follow a consistent lane till the time they cross the intersection.

                <br><br>

                <div class="row">
                    <div class="col-md-4">
                        <h6 class="text-center">Lidar Noise = 25%</h6>
                    </div>
                    <div class="col-md-4">
                        <h6 class="text-center">Lidar Noise = 50%</h6>
                    </div>
                    <div class="col-md-4">
                        <h6 class="text-center">Lidar Noise = 75%</h6>
                    </div>
                </div>

                <div class="row">
                    <div class="col-md-4">
                        <img class="img-fluid" src="videos/le4_025_1.gif"></img>
                    </div>
                    <div class="col-md-4">
                        <img class="img-fluid" src="videos/le4_050_1.gif"></img>
                    </div>
                    <div class="col-md-4">
                        <img class="img-fluid" src="videos/le4_075_1.gif"></img>
                    </div>
                </div>
                <br><br>
                <div class="row">
                    <div class="col-md-6">
                        <h6 class="text-center">Lidar Noise = 0%</h6>
                    </div>
                    <div class="col-md-6">
                        <h6 class="text-center">Lidar Noise = 100%</h6>
                    </div>
                </div>

                <div class="row">
                    <div class="col-md-6">
                        <img class="img-fluid" src="videos/le4_2.gif"></img>
                    </div>
                    <div class="col-md-6">
                        <img class="img-fluid" src="videos/le4_100_1.gif"></img>
                    </div>
                </div>

                <br>
                When the number of agents during training are increased, the agents tend to follow the lanes consistently till they reach their destination. Additionally in case of 8 agents, we see the formation of multi-lane tracks. Qualitatively we can see that agents starting from the left side of the road tend to take the lane closer to the center and the ones starting on the right side take the extreme right lane. This additionally allows a smoother traffic flow.
                <br>
                <br>
                <div class="row">
                    <div class="col-md-4">
                        <h6 class="text-center">Number of Training Agents = 4</h6>
                    </div>
                    <div class="col-md-4">
                        <h6 class="text-center">Number of Training Agents = 8</h6>
                    </div>
                    <div class="col-md-4">
                        <h6 class="text-center">Number of Training Agents = 8</h6>
                    </div>
                </div>

                <div class="row">
                    <div class="col-md-4">
                        <img class="img-fluid" src="videos/le4_2.gif"></img>
                    </div>
                    <div class="col-md-4">
                        <img class="img-fluid" src="videos/le8_1.gif"></img>
                    </div>
                    <div class="col-md-4">
                        <img class="img-fluid" src="videos/le8_2.gif"></img>
                    </div>
                </div>

                <br><br>

                Spatial Locations of the Agents when trained on 8 agent environment

                <div class="col-md-6 offset-md-3 justify-content-center">
                    <img class="img-fluid" src="videos/8agents_lanepos.gif"></img>
                </div>
            </span>

            <span class="border border-white">
                <h4 class="text-left">3. Fast Lanes on a Highway</h4>

                <br>
                We observe that depending on how fast the agents are they choose to travel along the left hand side or right hand side of the road. This is similar to fast lanes which are present on the highways. (Darker shades denote faster cars)

                <br>
                <br>

                <div class="col-md-12">
                    <img class="img-fluid" src="videos/fastlanes.gif"></img>
                </div>

                <div class="col-md-12">
                    <img class="img-fluid" src="videos/hway1.gif"></img>
                </div>

                <div class="col-md-12">
                    <img class="img-fluid" src="videos/hway2.gif"></img>
                </div>
            </span>

            <span class="border border-white">
                <h4 class="text-left">4. Stopping at a Crosswalk</h4>

                <br>
                The agents detect the pedestrians (small green boxes) walking along the crosswalk, and slow down once they approach the crosswalk.
                <br>

                <div class="row">
                    <div class="col-md-12">
                        <img class="img-fluid" src="videos/ped1.gif"></img>
                    </div>
                </div>
                <div class="row">
                    <div class="col-md-12">
                        <img class="img-fluid" src="videos/ped2.gif"></img>
                    </div>
                </div>
                <div class="row">
                    <div class="col-md-12">
                        <img class="img-fluid" src="videos/ped3.gif"></img>
                    </div>
                </div>
            </span>

            <span class="border border-white">
                <h4 class="text-left">5. Communication</h4>

                <br>
                We denote the signals sent by the agents with the colors of the agents. In the left video where there is no perception noise, the agents' signals are not correlated with their actions/heading. In the right hand side video, we do see that agents that turn right, tend to be colored black while the ones turning right are colored red.
                <br><br>
                <div class="row">
                    <div class="col-md-6">
                        <h6 class="text-center">Lidar Noise = 0%</h6>
                    </div>
                    <div class="col-md-6">
                        <h6 class="text-center">Lidar Noise = 100%</h6>
                    </div>
                </div>

                <div class="row">
                    <div class="col-md-6 embed-responsive embed-responsive-1by1">
                        <video class="embed-responsive-item" src="./videos/nuscenes_comm_1.mp4" allowfullscreen controls></video>
                    </div>
                    <div class="col-md-6 embed-responsive embed-responsive-1by1">
                        <video class="embed-responsive-item" src="./videos/nuscenes_comm_2.mp4" allowfullscreen controls></video>
                    </div>
                </div>
            </span>

            <span class="border border-white">
                <h4 class="text-left">6. Rollouts on nuScenes</h4>
                <br>
                In these experiments, we attempt to show that agents learn to maintain a minimum distance between themselves as a function of their relative velocity. Additionally we observe the emergence of right of way where the agent which arrives first at the intersection gets to leave it first.

                <div class="row">
                    <div class="col-md-4">
                        <img class="img-fluid" src="videos/r1.gif"></img>
                    </div>
                    <div class="col-md-4">
                        <img class="img-fluid" src="videos/r2.gif"></img>
                    </div>
                    <div class="col-md-4">
                        <img class="img-fluid" src="videos/r4.gif"></img>
                    </div>
                </div>

                <br>
                Rollouts from more nuScenes Intersections

                <div class="row">
                    <div class="col-md-6">
                        <img class="img-fluid" src="videos/r5.gif"></img>
                    </div>
                    <div class="col-md-6">
                        <img class="img-fluid" src="videos/r7.gif"></img>
                    </div>
                </div>

                <div class="row">
                    <div class="col-md-6">
                        <img class="img-fluid" src="videos/r8.gif"></img>
                    </div>
                    <div class="col-md-6">
                        <img class="img-fluid" src="videos/r9.gif"></img>
                    </div>
                </div>

                <div class="row">
                    <div class="col-md-6">
                        <img class="img-fluid" src="videos/r10.gif"></img>
                    </div>
                    <div class="col-md-6">
                        <img class="img-fluid" src="videos/r11.gif"></img>
                    </div>
                </div>

                <div class="row">
                    <div class="col-md-6">
                        <img class="img-fluid" src="videos/r1_gen.gif"></img>
                    </div>
                    <div class="col-md-6">
                        <img class="img-fluid" src="videos/r2_gen.gif"></img>
                    </div>
                </div>
            </span>

            <br><br>

            <h2 class="text-center">Episode Return over Time</h2>

            <br><br>

            <h4 class="text-left">Obeying Traffic Signal</h4>

            <br>

            Episode Return over time for the Toy Intersection with 4 agents.

            <br><br>

            <div class="col-md-6">
                <img class="img-fluid" src="videos/episode-return.png"></img>
            </div>

            <br><br>
            <br><br>
        </div>

        <!-- Optional JavaScript -->
        <!-- jQuery first, then Popper.js, then Bootstrap JS -->
        <script src="https://code.jquery.com/jquery-3.3.1.slim.min.js" integrity="sha384-q8i/X+965DzO0rT7abK41JStQIAqVgRVzpbzo5smXKp4YfRvH+8abtTE1Pi6jizo" crossorigin="anonymous"></script>
        <script src="https://cdnjs.cloudflare.com/ajax/libs/popper.js/1.14.7/umd/popper.min.js" integrity="sha384-UO2eT0CpHqdSJQ6hJty5KVphtPhzWj9WO1clHTMGa3JDZwrnQq4sF86dIHNDz0W1" crossorigin="anonymous"></script>
        <script src="https://stackpath.bootstrapcdn.com/bootstrap/4.3.1/js/bootstrap.min.js" integrity="sha384-JjSmVgyd0p3pXB1rRibZUAYoIIy6OrQ6VrjIEaFf/nJGzIxFDsf4x0xIM+B07jRM" crossorigin="anonymous"></script>
    </body>
</html>