Multi-agentic Framework for Developing and Evaluating Biometric-based Multimodal Synthetic Personas: A Case Study on Pregnancy and Postpartum Users

Published: 01 Mar 2026, Last Modified: 01 Mar 2026ICLR 2026 TSALM Workshop PosterEveryoneRevisionsBibTeXCC BY 4.0
Keywords: Synthetic Data Generation, TimeGAN, Multimodal agentic framework, Multimodal Personas, LLM-as-a-Judge, Biometric Time-Series
TL;DR: This framework generates and uses synthetic biometric time series data and multi-modal personas to test LLM health agents for conversational accuracy and factual correctness.
Abstract: Despite the rise of LLM-based health agents, robust and privacy-preserving evaluation frameworks remain scarce. We present a framework using a Transformer-based TimeGAN to generate synthetic biometric time-series from 3,497 real participant data. These time series are integrated with LLM-generated narrative vignettes to create biometrically-aligned multimodal personas for realistic user-agent simulations. Employing an LLM-as-a-Judge methodology, we evaluate agent performance across Passive and Proactive modes. Our results demonstrate that this framework effectively identifies critical gaps in current LLM capabilities regarding factual correctness, data usage, and clinical plausibility. While focused on the postpartum period, our methodology provides a scalable tool for optimizing and evaluating personal health agents across diverse biometric and clinical applications.
Track: Research Track (max 4 pages)
Submission Number: 56
Loading