Abstract: Highlights•We generalize a class of big data analytics workload (Re-Org) on ordered datasets.•We propose a novel distributed mechanism for efficiently executing Re-Org tasks.•The proposed mechanism is implemented in a distributed framework by extending Hadoop.•A model is presented to formally study the proposed framework.•Experiments show that our framework is 6.3x faster than vanilla Hadoop.
Loading