Predicting Hot-Spots in Distributed Cloud Databases Using Association Rule Mining

Published: 01 Jan 2014, Last Modified: 13 Nov 2024UCC 2014EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Data partitioning is a popular technique to horizontally or vertically split table attributes of a Cloud database cluster to evenly distribute increasing workloads. However, hot-spots can be created due to inappropriate partitioning scheme and static partition management without considering the dynamic workload characteristics. In this paper, an automatic database partition management scheme - APM - is proposed which periodically analyses workload logs to predict the formation of any potential hot-spot using association rule mining. A detailed illustration of the proposed scheme is presented with examples along with a cost model following by experimental observations from running a HBase cluster with YCSB workloads in AWS.
Loading