Understanding the IO Performance Gap Between OS-Level and VM-Level Containers in High-Density Deployment

Wentai Li, Kaijun Zhou, Jiacheng Shi, Xingman Chen, Yadong Liu, Luyuan Wang, Jinyu Gu

Published: 01 Jan 2024, Last Modified: 16 May 2025ICDCS 2024EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: Containers are widely deployed in clouds. There are two common container architectures: operating system-level (OS-level) container and virtual machine-level (VM-level) container. Typical examples are runc and Kata. It is well known that VM- level containers provide better isolation than OS-level containers, but at a higher overhead. Although there are quantitative analyses of the performance gap between these two container architectures, they rarely discuss the performance gap under the constrained resources nrovisioned to containers. Since the high-density deployment of containers is demanding in the cloud, each container is provisioned with limited resources specified by the cgroup mechanism. In this paper, we provide an in-depth analysis of the storage and network (two key aspects) performance differences between runc and Kata under varying resource constraints. We identify configuration implications that are crucial to performance and find that some of them are not exposed by the Kata interfaces. Based on that, we propose a profiling tool to automatically offer configuration suggestions for optimizing container performance. Our evaluation shows that the auto-generated configuration can improve the performance of MySQL by up to 107% in the TPCC benchmark compared with the default Kata setup.