Mer\underline{K}ury: Adaptive Resource Allocation to Enhance the \underline{K}ubernetes Performance for Large-Scale Clusters

Jiayin Luo; Xinkui Zhao; 马宇鑫; Shengye Pang; Jianwei Yin

Mer\underline{K}ury: Adaptive Resource Allocation to Enhance the \underline{K}ubernetes Performance for Large-Scale Clusters

Jiayin Luo, Xinkui Zhao, 马宇鑫, Shengye Pang, Jianwei Yin

Published: 29 Jan 2025, Last Modified: 29 Jan 2025WWW 2025 PosterEveryoneRevisionsBibTeXCC BY 4.0

Track: Systems and infrastructure for Web, mobile, and WoT

Keywords: resource allocation, Kubernetes, large-scale

TL;DR: In this paper, we introduce MerKury, a lightweight framework to enhance the Kubernetes performance for large-scale clusters.

Abstract: As a prevalent paradigm of modern web applications, cloud computing has experienced a surge in adoption. The deployment of vast and various workloads encapsulated within containers has become ubiquitous across cloud platforms, imposing substantial demands on the supporting infrastructure. However, Kubernetes (k8s), the de-facto standard for container orchestration, struggles with low scheduling throughput and high latency in large-scale clusters. The primary challenges are identified as excessive loads of read requests and resource contention among co-located components. In response to these challenges, in this paper, we present MerKury, a lightweight framework to enhance the Kubernetes performance for large-scale clusters. It employs a dual strategy: first, it preprocesses specific requests to alleviate unnecessary load, and second, it introduces an adaptive resource allocation algorithm to mitigate resource contention. Evaluations under different scenarios of varying cluster scale have demonstrated that MerKury notably augments cluster scheduling throughput up to 16.4$\times$ and reduces request latency by up to 39.3\%, outperforming vanilla Kubernetes and baseline resource allocation methods.

Submission Number: 338

Loading