\section{Introduction}

In modern machine learning applications, 
an important societal concern is fairness 
of the machine-learned models. 
One may think that model-made decisions 
have no discrimination against minority 
people, but many case studies show 
otherwise e.g., \cite{feller2016computer,chan2018hiring}.\\  
As a result, how to attain fairness in 
machine-learned models becomes an important 
research problem and the literature has 
exploded in recent years. See e.g., some latest 
surveys \cite{chouldechova2018frontiers, mehrabi2021survey, pessach2022review} and references therein. 

Model fairness has been studied at both group 
level and individual level. 
Roughly speaking, group fairness requires  
model outputs to have small disparity across 
different groups of people, while individual 
fairness requires model outputs to be similar 
on similar individuals. This paper focuses on 
individual fairness. 

Individual fairness was initially formalized 
as the Lipschitz condition of a prediction model 
\cite{dwork2012fairness}, and later relaxed 
to a probabilistic and almost Lipschitz condition 
called approximate metric-fairness 
\cite{yona2018probably}. 
There are many studies on 
different aspects of individual fairness 
such as how to design the fairness metric 
\cite{ilvento2020metric,mukherjee2020two},
how to achieve fairness with limited 
resources \cite{kim2018fairness,bechavod2020metric}, 
and how to combine individual fairness 
with group fairness 
\cite{zemel2013learning,sharifi2019average}. 
The sample complexity for achieving individual 
fairness in passive learning is studied in \cite{balashankar2019fairness,shabat2020sample}.

This paper studies a new aspect of 
individual fairness. We ask \textit{can 
one obtain a more efficient sample complexity 
for achieving individual fairness through 
active learning?} To our knowledge, all 
prior studies focus on the passive setting 
and maintain an $O(\frac{1}{\varepsilon^2})$ 
sample complexity for bounding (properly 
defined) individual bias by $\varepsilon$. 
See \cite{yona2018probably,balashankar2019fairness,shabat2020sample} for example. In this paper, we show it 
can be improved to $O(\log \frac{1}{\varepsilon})$ 
through active learning. 

To facilitate algorithm design and analysis, 
we first present a new form of approximate 
metric-fairness (AMF) based on 
uniform continuity and prove its equivalence to the 
original form using an interesting connection 
between uniform continuity and \textit{almost} 
Lipschitz continuity \cite{vanderbei1991uniform}. 
Based on this, we present a passive AMF learner 
and prove the generalization ability of its 
achieve fairness. 

Building on the above, we design an active AMF 
learner which labels instances that 
are fairly close to their neighbors but receive 
fairly different predictions. Under proper conditions, 
we prove this learner only takes $O(\log
\frac{1}{\varepsilon})$ labeling to bound the bias 
of its returned model by $\varepsilon$ with high probability. Our analysis assumes boundness of a 
specially designed counter approximate metric-fairness 
coefficient, and we exemplify the calculation of it. 

At the end, we experiment the proposed active AMF 
learner on three real-world data sets. 
We observe it improves individual fairness of both 
linear and non-linear models more efficiently than 
its passive counterpart as well as state-of-the-art 
active learners while maintaining a comparable accuracy, 
achieving a more efficient fairness-accuracy trade-off.  

The rest of this paper is organized as follows. 
We review related topics in Section 2 and present 
the proposed AMF form and passive learner in 
Section 3. In section 4, we present the proposed 
active AMF learner and prove its sample complexity. 
We discuss several implementation issues in Section 5, present experimental results in Section 6, and 
conclude the study in Section 7. Proofs of certain 
theoretical results are elaborated in the 
supplementary material. 

