In-Context Adaptation to Concept Drift for Learned Database Operations

Jiaqi Zhu; Shaofeng Cai; Yanyan Shen; Gang Chen; Fang Deng; Beng Chin Ooi

In-Context Adaptation to Concept Drift for Learned Database Operations

Jiaqi Zhu, Shaofeng Cai, Yanyan Shen, Gang Chen, Fang Deng, Beng Chin Ooi

Published: 01 May 2025, Last Modified: 23 Jul 2025ICML 2025 posterEveryoneRevisionsBibTeXCC BY 4.0

TL;DR: An efficient and effective online adaptation framework for learned database operations

Abstract:

Machine learning has demonstrated transformative potential for database operations, such as query optimization and in-database data analytics. However, dynamic database environments, characterized by frequent updates and evolving data distributions, introduce concept drift, which leads to performance degradation for learned models and limits their practical applicability. Addressing this challenge requires efficient frameworks capable of adapting to shifting concepts while minimizing the overhead of retraining or fine-tuning.

In this paper, we propose FLAIR, an online adaptation framework that introduces a new paradigm called \textit{in-context adaptation} for learned database operations. FLAIR leverages the inherent property of data systems, i.e., immediate availability of execution results for predictions, to enable dynamic context construction. By formalizing adaptation as $f:(\mathbf{x} | \mathcal{C}_t) \to \mathbf{y}$, with $\mathcal{C}_t$ representing a dynamic context memory, FLAIR delivers predictions aligned with the current concept, eliminating the need for runtime parameter optimization. To achieve this, FLAIR integrates two key modules: a Task Featurization Module for encoding task-specific features into standardized representations, and a Dynamic Decision Engine, pre-trained via Bayesian meta-training, to adapt seamlessly using contextual information at runtime. Extensive experiments across key database tasks demonstrate that FLAIR outperforms state-of-the-art baselines, achieving up to $5.2\times$ faster adaptation and reducing error by 22.5% for cardinality estimation.

Lay Summary:

Modern databases often rely on machine learning to speed up tasks like finding and analyzing data. However, these systems struggle to keep up when the data changes over time, which can lead to slower performance and incorrect results. We developed a new framework called FLAIR that helps models adapt more quickly to changes in the data without needing to be retrained every time. FLAIR uses the results of recent database tasks to create a “context” that guides future predictions, much like how humans use recent experiences to make decisions. FLAIR helps databases stay accurate and efficient even as their data evolves, making them more reliable in real-world situations. In our tests, it adapted up to five times faster than current methods and significantly improved accuracy, offering a promising step toward smarter, self-adjusting data systems.

Primary Area: Applications->Everything Else

Keywords: Concept Drift, Learned Database Operation, Online Adaptation

Submission Number: 3541

Loading