Valuing time in silicon: Can large language models replicate human value of travel time

Published: 17 Dec 2025, Last Modified: 07 May 2026Travel Behavior and SocietyEveryonearXiv.org perpetual, non-exclusive license
Abstract: As a key advancement in artificial intelligence, large language models (LLMs) are set to transform transportation systems. While LLMs offer the potential to simulate human travelers in future mixed-autonomy transportation systems, their behavioral fidelity in complex scenarios remains largely unconfirmed by existing research. This study addresses this gap by conducting a comprehensive analysis of the value of travel time (VOT) of three popular LLMs. We employ a full factorial experimental design to systematically examine LLMs’ sensitivities to various transportation contexts, including the choice setting, travel purpose, and socio-demographic factors. Our results reveal a high degree of behavioral similarity between LLMs and humans. Some LLMs exhibit an aggregate VOT similar to that of humans, and all tested models demonstrate human-like sensitivity to travel purpose, income, and the time-cost trade-off ratios of the alternatives. Furthermore, the behavioral patterns of LLMs are highly consistent across varied contexts. However, while the behavior of every single model is highly robust, we also find some heterogeneity across models regarding the magnitude and direction of sensitivity to travel contexts and income elasticity. Overall, this study provides a foundational benchmark for the future development of LLMs as proxies for human travelers, demonstrating their robust decision-making capabilities while cautioning that misaligned magnitudes of economic trade-offs between humans and LLMs necessitate rigorous validation and additional conditioning of LLMs before their application.
Loading