Abstract: Microservices deployment in the cloud often faces a prevalent challenge: how to maximize resource utilization while maintaining high quality-of-service (QoS). Existing automatic scaling tools frequently exhibit limited adaptability, particularly when handling frequent request load fluctuations, which exacerbates the challenge. To address this issue, we introduce a proactive runtime deployment optimization method for multi-stage microservices, aiming to ensure both resource efficiency and QoS.
Loading