Demonstrating Multi-modal Human Instruction Comprehension with AR Smart GlassDownload PDFOpen Website

Published: 01 Jan 2023, Last Modified: 15 May 2023COMSNETS 2023Readers: Everyone
Abstract: We present a multi-modal human instruction comprehension prototype for object acquisition tasks that involve verbal, visual and pointing gesture cues. Our prototype includes an AR smart-glass for issuing the instructions and a Jetson TX2 pervasive device for executing comprehension algorithms. With this setup, we enable on-device, computationally efficient object acquisition task comprehension with an average latency in the range of 150-330msec.
0 Replies

Loading