Keywords: natural language processing, graph neural network, graph-to-text, geographical navigation
Abstract: Car-focused navigation services are based on turns and distances of named streets, whereas navigation instructions naturally used by humans are centered around physical objects called landmarks. We present a neural model that takes OpenStreetMap representations as input and learns to generate navigation instructions that contain visible and salient landmarks from human natural language instructions. Routes on the map are encoded in a location- and rotation-invariant graph representation that is decoded into natural language instructions. Our work is based on a novel dataset of 7,672 crowd-sourced instances that have been verified by human navigation in Street View. Our evaluation shows that the navigation instructions generated by our system have similar properties as human-generated instructions, and lead to successful human navigation in Street View.
Code Of Ethics: I acknowledge that I and all co-authors of this work have read and commit to adhering to the ICLR Code of Ethics
One-sentence Summary: We present a neural model that takes OpenStreetMap representations as input and learns to generate landmark navigation instructions from human natural language demonstrations.
Supplementary Material: zip
Reviewed Version (pdf): https://openreview.net/references/pdf?id=QIFF1iyKJw
12 Replies
Loading