Keywords: irregular time series, multimodal time series, multivariate time series, time series forecasting
TL;DR: A dataset and benchmark for forecasting real-world time series with cause-driven irregularities and multimodal observations.
Abstract: Time series data in real-world applications such as healthcare, climate modeling, and finance are often irregular, multimodal, and messy, with varying sampling rates, asynchronous modalities, and pervasive missingness.
However, existing benchmarks typically assume clean, regularly sampled, unimodal data, creating a significant gap between research and real-world deployment.
We introduce Time-IMM, a dataset specifically designed to capture cause-driven irregularity in multimodal multivariate time series.
Time-IMM represents nine distinct types of time series irregularity, categorized into trigger-based, constraint-based, and artifact-based mechanisms.
Complementing the dataset, we introduce IMM-TSF, a benchmark library for forecasting on irregular multimodal time series, enabling asynchronous integration and realistic evaluation.
IMM-TSF includes specialized fusion modules, including a timestamp-to-text fusion module and a multimodality fusion module, which support both recency-aware averaging and attention-based integration strategies.
Empirical results demonstrate that explicitly modeling multimodality on irregular time series data leads to substantial gains in forecasting performance.
Time-IMM and IMM-TSF provide a foundation for advancing time series analysis under real-world conditions.
The dataset is publicly available at \url{https://github.com/blacksnail789521/Time-IMM}, and the benchmark library can be accessed at \url{https://github.com/blacksnail789521/IMM-TSF}.
Croissant File: json
Dataset URL: https://www.kaggle.com/datasets/blacksnail789521/time-imm/
Code URL: https://github.com/blacksnail789521/IMM-TSF
Primary Area: Evaluation (e.g., data collection methodology, data processing methodology, data analysis methodology, meta studies on data sources, extracting signals from data, replicability of data collection and data analysis and validity of metrics, validity of data collection experiments, human-in-the-loop for data collection, human-in-the-loop for data evaluation)
Submission Number: 1020
Loading