Parameter server for distributed machine learning

Mu Li, Li Zhou, Zichao Yang, Aaron Li, Fei Xia, David G Andersen, Alex Smola

Published: 09 Dec 2013, Last Modified: 16 May 2025OpenReview Archive Direct UploadEveryoneWM2024 Conference

Abstract: We propose a parameter server framework to solve distributed machine learning problems. Both data and workload are distributed into client nodes, while server nodes maintain globally shared parameters, which are represented as sparse vectors and matrices. The framework manages asynchronous data communications between clients and servers. Flexible consistency models, elastic scalability and fault tolerance are supported by this framework. We present algorithms and theoretical analysis for challenging nonconvex and nonsmooth problems. To demonstrate the scalability of the proposed framework, we show experimental results on real data with billions of parameters.