Selection and replacement algorithms for memory performance improvement in Spark

Mingxing Duan, Kenli Li, Zhuo Tang, Guoqing Xiao, Keqin Li

Published: 2016, Last Modified: 12 May 2023. Concurr. Comput. Pract. Exp. 2016. Readers: Everyone

Abstract: As a parallel computation framework, Spark can cache the partitions of repeatedly used resilient distributed datasets (RDDs) on different nodes to speed up computation. However, Spark does not ha...
