Abstract: Highlights•A document clustering framework that leverages contextualized vectors is proposed.•Informative representations for documents are extracted from pre-trained models.•A partial optimization and centroid update is proposed in the clustering module.•The proposed method outperforms the baselines in several datasets for clustering.•The effect of clustering method and embeddings are explored in various experiments.
Loading