Keywords: Distance, Distance function, Tree-structured data, Heterogenous Graphs, JSONs, Multiple Instance Learning
Abstract: Many machine learning algorithms solving various problems are available for
metric spaces. While there are plenty of distances for vector spaces, much
less exists for structured data (rooted heterogeneous trees) stored in popular
formats like JSON, XML, ProtoBuffer, MessagePack, etc. This paper
introduces the Hierarchically-structured Tree Distance (HTD) designed
especially for these data. The HTD distance is modular with differentiable
parameters weighting the importance of different sub-spaces. This allows
the distance to be tailored to a given dataset and task, such as classification,
clustering, and anomaly detection. The extensive experimental comparison
shows that distance-based algorithms with the proposed HTD distance
are competitive to state-of-the-art methods based on neural networks with
orders of magnitude more parameters. Furthermore, we show that HTD is
more suited to analyze heterogeneous Graph Neural Networks than Tree
Mover’s Distance.
Primary Area: learning on graphs and other geometries & topologies
Code Of Ethics: I acknowledge that I and all co-authors of this work have read and commit to adhering to the ICLR Code of Ethics.
Submission Guidelines: I certify that this submission complies with the submission instructions as described on https://iclr.cc/Conferences/2025/AuthorGuide.
Anonymous Url: I certify that there is no URL (e.g., github page) that could be used to find authors’ identity.
No Acknowledgement Section: I certify that there is no acknowledgement section in this submission for double blind review.
Submission Number: 7973
Loading