Shapelets-based Data Augmentation for Time Series ClassificationDownload PDFOpen Website

Published: 2021, Last Modified: 12 Feb 2024ICMLA 2021Readers: Everyone
Abstract: Data augmentation is an important data mining task that has been highly adopted to resolve class imbalance problems and provide more input to data-hungry models. For the case of time series data, the data augmentation method needs to take into consideration the dependence of the variables. In this paper, we propose a new model that preserves important relations between variables while performing time series data augmentation. In particular, we combine shapelets transform and Synthetic Minority Oversampling Technique (SMOTE) models to achieve the aforementioned goal. By using shapelets transform, the most prominent shapelets are extracted from the training set and used during the oversampling process. To make the most use of important shapelets, our proposed method preserves the extracted shapelets as the key part in the synthetic data sample. Then for the other parts of each synthetic data sample, we use SMOTE to generate the remaining data points. Compared with pure SMOTE, our method makes full use of important shapelets to maintain the important correlations between interdependent variables, which also can provide more interpretive information.
0 Replies

Loading