Unsupervised speech representation learning for behavior modeling using triplet enhanced contextualized networks

Published: 2021, Last Modified: 17 Jul 2025Comput. Speech Lang. 2021EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•A manifold representation is derived from human speech to capture behavioral information.•The feasibility of unsupervised cross-domain behavioral modeling is verified.•Context information and metric learning are complementary in behavior modeling.•Unsupervised training with large-scale data helps domain behavior modeling.
Loading