Community Detection: Exact Inference with Latent Variables in an Arbitrary Domain

TMLR Paper2341 Authors

06 Mar 2024 (modified: 02 Apr 2024) · Under review for TMLR
Abstract: We analyze the necessary and sufficient conditions for exact inference of a latent model in the context of community detection. In latent models, each entity is associated with a latent variable following some probability distribution. The challenging question we try to solve is: can we perform exact inference without observing the latent variables, and even without knowing the domain of the latent variables? We show that exact inference can be achieved using a semidefinite programming (SDP) approach without knowing either the latent variables or their domain. Our analysis predicts the experimental correctness of SDP with high accuracy, showing the suitability of our focus on the Karush-Kuhn-Tucker conditions and the spectrum of a properly defined matrix. Running on laptop-class hardware, our method efficiently achieves exact inference in models with over 10000 entities. As a byproduct of our analysis, we also provide concentration inequalities with dependence on latent variables, both for bounded moment generating functions and for the spectra of matrices. To the best of our knowledge, these results are novel and could be useful for many other problems.
Submission Length: Long submission (more than 12 pages of main content)
Previous TMLR Submission Url: https://openreview.net/forum?id=1R7spWLnpR
Changes Since Last Submission: We revised the manuscript in response to the relevant comments raised in the previous review. (Removed text is in struck-through red; added text is in blue.) Comments that were not relevant were addressed in the previous review but did not lead to changes in the manuscript.
Assigned Action Editor: ~Bryon_Aragam1
Submission Number: 2341