
\section{Related Works}
\label{sec:related_work}

\paragraph{Discrete differential privacy.} Various discrete DP mechanisms have been proposed for discrete inputs to make them differentially private. For example, both \textit{Discrete Laplace Mechanism}~\citep{discrete-laplace} and \textit{Discrete Gaussian Mechanism}~\citep{discrete-gaussian} add noises to the inputs sampled from discrete distributions, which are commonly used for tasks when with discrete inputs~\citep{abowd}. The \textit{Snapping Mechanism}~\citep{least_bit} truncates and rounds the inputs and Laplace noises based on floating-point arithmetic, but it inevitably diminishes accuracy~\citep{discrete-gaussian}. \textit{Communication-limited Local Differential Privacy} (CLDP) mechanism~\citep{shuffled} works with a trusted shuffler in federated learning to generate compressed and private updates from clients. However, it cannot be tuned to adopt different communication budgets. \textit{Skellam Mechanism}~\citep{skellam} add noises sampled from Skellam distribution to achieve performance comparable with the continuous Gaussian mechanism, but is subject to biased output when combined with privacy-protection protocols in federated learning such as secure aggregation. In contrast, \textit{Poisson Binomial Mechanism}~\citep{pbm} encodes the inputs inside the Binomial distribution to generate unbiased outputs, and it can achieve better privacy while decreasing communication costs, and is also compatible with secure aggregation.

%\smallskip
\paragraph{Private quantization.} %Besides of privacy concerns, macine learning applications like federated learning are also confronted with bandwidth bottlenecks. In Federated learning, a parameter server needs to collect updates from many distributed devices, hence is subject to communication bottleneck. 
Previous works have utilized data compression methods such as quantization to compress the data in applications with communication or bandwidth bottlenecks. One example is federated learning where a central server needs to repeatedly collect local model updates from distributed clients for training the global model~\citep{fedpaq, dadaquant}. Another example is large language models where the computation overheads may be reduced by compressing the model parameters~\citep{compress-model, post-train-quant}.
By mapping the continuous inputs to the closest discrete outputs within a finite set, the quantization process can effectively represent the data with reduced communication overhead. 

While methods were proposed in prior works to quantize data under a certain privacy constraint, they often treat privacy and quantization separately~\citep{vqsgd, dis_gau_fed}, i.e., privatizing the data first and then quantize the private data. %However, they neglect the additional privacy amplification brought by compression itself.
Recent works attempt to design discrete DP mechanisms leveraging quantization to simultaneously compress data and protect privacy. For instance, \cite{mvu} proposed \textit{Minimum Variance Mechanism} (\textsf{MVU}), which first quantizes inputs with discrete bins and then maps the unbiased quantization results to output alphabets according to a probability matrix. \textsf{MVU} optimizes the probability matrix to minimize accuracy loss while preserving privacy. \textsf{I-MVU}~\citep{i-mvu} extends \textsf{MVU} by 
designing a new interpolation procedure to attain better privacy for high-dimensional vectors. \cite{rqm} proposed \textit{Randomized Quantization Mechanism} (\textsf{RQM}), which subsamples from uniformly distributed bins and performs randomized quantization to output an unbiased result. 

Compared to prior works, we propose a more general family of quantization DP mechanisms that enables non-uniform quantization. It has a high degree of freedom and the optimal mechanism can be found efficiently by linear programming tools. We also show theoretically and empirically that our mechanism can attain a better privacy-accuracy trade-off. 

%numerically optimized probability to have more degree of freedom in order to achieve better privacy-utility trade-off.

