LifeGraph 4 - Lifelog Retrieval using Multimodal Knowledge Graphs and Vision-Language Models

Published: 01 Jan 2024, Last Modified: 13 Jun 2025LSC@ICMR 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: In the scope of the 7th Lifelog Search Challenge (LSC'24), we present the 4th iteration of LifeGraph, a multimodal knowledge-graph approach with data augmentations using Vision-Language Models (VLM). We extend the LifeGraph model presented in former LSC challenges by event-based clustering using temporal and spatial relations as well as information extracted from descriptions of Lifelog image captions produced by VLMs.
Loading