A multi-encoder neural conversation model

Neurocomputing, 2019 (modified: 14 Nov 2024)
Abstract: With the development of deep neural networks, sequence-to-sequence (Seq2Seq) models have become a popular technique for building conversation models. Current Seq2Seq models with a single encoder-decoder structure tend to generate responses that contain the high-frequency patterns of the training dataset. These patterns, however, are always generic and meaningless, and such responses quickly bring a conversation between a human and a computer to an end. According to our observations, human conversations are always topic related. If the conversation data can be divided into clusters according to topic, the high-frequency patterns within each cluster will be topic related rather than generic. We therefore expect that a model trained on different clusters can generate more topic-related and meaningful responses. Inspired by this idea, we propose a Multi-Encoder Neural Conversation (MENC) model, which makes use of topic information through its multi-encoder structure. To the best of our knowledge, this is the first work to apply multi-encoder structures to conversation models. We conduct experiments on two daily conversation datasets. The results show that MENC outperforms other mainstream models on both subjective and objective evaluation metrics.