LifeGraph 4 - Lifelog Retrieval using Multimodal Knowledge Graphs and Vision-Language Models

Published: 01 Jan 2024, Last Modified: 15 Jul 2024
Venue: LSC@ICMR 2024
License: CC BY-SA 4.0
Abstract: For the 7th Lifelog Search Challenge (LSC'24), we present the fourth iteration of LifeGraph, a multimodal knowledge-graph approach augmented with data from Vision-Language Models (VLMs). We extend the LifeGraph model presented at previous editions of the LSC with event-based clustering that uses temporal and spatial relations, as well as information extracted from VLM-generated captions of the lifelog images.
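
The abstract does not specify how the event-based clustering is carried out. As a purely illustrative sketch, and not the authors' method, one simple way to group lifelog records into events from temporal and spatial relations is to start a new event whenever consecutive records are too far apart in time or in distance. The `Record` type, the `cluster_events` function, and both thresholds below are assumptions made for this example.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from math import asin, cos, radians, sin, sqrt


@dataclass
class Record:
    """Hypothetical lifelog record: one image with a timestamp and GPS fix."""
    image_id: str
    timestamp: datetime
    lat: float
    lon: float


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in metres between two latitude/longitude points."""
    earth_radius_m = 6371000.0
    dlat = radians(lat2 - lat1)
    dlon = radians(lon2 - lon1)
    a = sin(dlat / 2) ** 2 + cos(radians(lat1)) * cos(radians(lat2)) * sin(dlon / 2) ** 2
    return 2 * earth_radius_m * asin(sqrt(a))


def cluster_events(records, max_gap=timedelta(minutes=10), max_dist_m=200.0):
    """Group records into events; both thresholds are illustrative assumptions."""
    events = []
    for rec in sorted(records, key=lambda r: r.timestamp):
        if events:
            prev = events[-1][-1]  # last record of the current event
            close_in_time = rec.timestamp - prev.timestamp <= max_gap
            close_in_space = haversine_m(prev.lat, prev.lon, rec.lat, rec.lon) <= max_dist_m
            if close_in_time and close_in_space:
                events[-1].append(rec)
                continue
        events.append([rec])  # start a new event
    return events


# Made-up sample data for illustration only.
sample = [
    Record("a", datetime(2024, 1, 1, 9, 0), 53.0, -6.0),
    Record("b", datetime(2024, 1, 1, 9, 5), 53.0, -6.0),
    Record("c", datetime(2024, 1, 1, 9, 7), 53.1, -6.0),   # far away in space
    Record("d", datetime(2024, 1, 1, 12, 0), 53.1, -6.0),  # far away in time
]
event_ids = [[r.image_id for r in event] for event in cluster_events(sample)]
```

In a knowledge-graph setting, each resulting event could then become a node linked to its images, with caption-derived information attached as further properties; that mapping is likewise an assumption and is not described in the abstract.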
