Pull-back Geometry of Persistent Homology Encodings

Published: 04 Mar 2024, Last Modified: 04 Mar 2024Accepted by TMLREveryoneRevisionsBibTeX
Abstract: Persistent homology (PH) is a method for generating topology-inspired representations of data. Empirical studies that investigate the properties of PH, such as its sensitivity to perturbations or ability to detect a feature of interest, commonly rely on training and testing an additional model on the basis of the PH representation. To gain more intrinsic insights about PH, independently of the choice of such a model, we propose a novel methodology based on the pull-back geometry that a PH encoding induces on the data manifold. The spectrum and eigenvectors of the induced metric help to identify the most and least significant information captured by PH. Furthermore, the pull-back norm of tangent vectors provides insights about the sensitivity of PH to a given perturbation, or its potential to detect a given feature of interest, and in turn its ability to solve a given classification or regression problem. Experimentally, the insights gained through our methodology align well with the existing knowledge about PH. Moreover, we show that the pull-back norm correlates with the performance on downstream tasks, and can therefore guide the choice of a suitable PH encoding.
Submission Length: Long submission (more than 12 pages of main content)
Video: https://youtu.be/q4DctLOZZTs?si=DzVqhRqx3C8zycnW
Code: https://github.com/shuangliang15/pullback-geometry-persistent-homology
Assigned Action Editor: ~Bamdev_Mishra1
License: Creative Commons Attribution 4.0 International (CC BY 4.0)
Submission Number: 1668