Fast Inference and Learning for Modeling Documents with a Deep Boltzmann Machine

Nitish Srivastava; Ruslan Salakhutdinov; Geoffrey Hinton

Fast Inference and Learning for Modeling Documents with a Deep Boltzmann Machine

Nitish Srivastava, Ruslan Salakhutdinov, Geoffrey Hinton

18 Apr 2024 (modified: 26 Apr 2013)ICML 2013 Inferning submissionReaders: Everyone

Decision: conferencePoster

Abstract: We introduce a type of Deep Boltzmann Machine (DBM) that is suitable for extracting distributed semantic representations from a large unstructured collection of documents. We propose an approximate inference method that interacts with learning in a way that makes it possible to train the DBM more efficiently than previously proposed methods. Even though the model has two hidden layers, it can be trained just as efficiently as a standard Restricted Boltzmann Machine. Our experiments show that the model assigns better log probability to unseen data than the Replicated Softmax model. Features extracted from our model outperform LDA, Replicated Softmax, and DocNADE models on document retrieval and document classification tasks.

2 Replies

Loading