Performance study of HPC applications on an Arm-based cluster using a generic efficiency modelDownload PDFOpen Website

Published: 01 Jan 2020, Last Modified: 15 May 2023PDP 2020Readers: Everyone
Abstract: HPC systems and parallel applications are increasing their complexity. Therefore the possibility of easily study and project at large scale the performance of scientific applications is of paramount importance. In this paper we describe a performance analysis method and we apply it to four complex HPC applications. We perform our study on a pre-production HPC system powered by the latest Arm-based CPUs for HPC, the Marvell ThunderX2. For each application we spot inefficiencies and factors that limit their scalability. The results show that in several cases the bottlenecks do not come from the hardware but from the way applications are programmed or the way the system software is configured.
0 Replies

Loading