Conversational Voice Interfaces: Translating Research Into Actionable Design

Published: 01 Jan 2024, Last Modified: 27 Sept 2024CHI Extended Abstracts 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: HCI research has for long been dedicated to better and more naturally facilitating information transfer between humans and machines. Unfortunately, humans’ most natural form of communication, speech, is also one of the most difficult modalities to be understood by machines – despite, and perhaps, because it is the highest-bandwidth communication channel we possess. As significant research efforts in engineering have been spent on improving machines’ ability to understand speech, research is only beginning to make the same improvements in understanding how to appropriately design these speech interfaces to be user-friendly and adoptable. Issues such as variations in error rates when processing speech, and difficulties in learnability and explainability (to name a few), are often in contrast with claims of success from industry. Along with this, designers themselves are making the transition to designing for speech and voice-enabled interfaces. Recent research has demonstrated the struggle for designers to translate their current experiences in graphical user interface design into speech interface design. Research has also noted the lack of any user-centered design principles or consideration for usability or usefulness in the same ways as graphical user interfaces have benefited from heuristic design guidelines.The goal of this course is to inform the CHI community of the current state of speech and natural language research, to dispel some of the myths surrounding speech-based interaction, as well as to inform participants about currently existing design tools, methods and resources for speech interfaces (and provide hands-on experience with working with them). Through this, we hope that HCI researchers and practitioners will learn how to combine recent advances in speech processing with user-centred principles in designing more usable and useful speech-based interactive systems.
Loading