A gradient estimator via L1-randomization for online zero-order optimization with two point feedback

Arya Akhavan; Evgenii E Chzhen; Massimiliano Pontil; Alexandre Tsybakov

A gradient estimator via L1-randomization for online zero-order optimization with two point feedback

Arya Akhavan, Evgenii E Chzhen, Massimiliano Pontil, Alexandre Tsybakov

Published: 31 Oct 2022, Last Modified: 28 Feb 2025NeurIPS 2022 AcceptReaders: Everyone

Keywords: zero-order optimization, online learning

TL;DR: We propose a new gradient estimator for zero-order optimisation and study its theoretical and practical aspects

Abstract: This work studies online zero-order optimization of convex and Lipschitz functions. We present a novel gradient estimator based on two function evaluations and randomization on the $\ell_1$-sphere. Considering different geometries of feasible sets and Lipschitz assumptions we analyse online dual averaging algorithm with our estimator in place of the usual gradient. We consider two types of assumptions on the noise of the zero-order oracle: canceling noise and adversarial noise. We provide an anytime and completely data-driven algorithm, which is adaptive to all parameters of the problem. In the case of canceling noise that was previously studied in the literature, our guarantees are either comparable or better than state-of-the-art bounds obtained by~\citet{duchi2015} and \citet{Shamir17} for non-adaptive algorithms. Our analysis is based on deriving a new weighted Poincaré type inequality for the uniform measure on the $\ell_1$-sphere with explicit constants, which may be of independent interest.

Supplementary Material: pdf

10 Replies

Loading