\section{Background}

\subsection{Fairness in Machine Learning}

Today, machine-learned models are widely used in sensitive 
domains like healthcare and hiring, and it is imperative 
for them to give fair assessment on human candidates. 
Take hiring as an example, when a model is used to 
score the qualification of job candidates, for fairness 
it should give similar scores to similar candidates 
disregarding their race or gender (i.e. no racial or 
gender discrimination). In reality, however, many 
model assessments are considered unfair  \cite{feller2016computer,chan2018hiring}. 
This has motivated intensive research on how to 
attain fairness in machine-learned models e.g.  \cite{grgic2016case,alabi2018unleashing,grgic2018beyond,rothblum2018probably,mozannar2020fair}, to name a few. 

Model fairness has been studied at both group 
and individual levels \cite{dwork2012fairness}. 
In this paper, we focus on individual fairness 
which, roughly speaking, requires model output 
to be similar on similar individuals. 

Individual fairness is first formalized as the 
Lipschitz condition of the model \cite{dwork2012fairness}, 
and later relaxed to a probabilistic and almost 
Lipschitz condition called Approximate 
Metric-Fairness (AMF) \cite{yona2018probably}\footnote{In 
many following discussions, we will use AMF 
to represent `approximate metric-fairness' 
or `approximately metric-fair'.} 
Many later studies are built on AMF \cite{balashankar2019fairness,bechavod2020metric,kim2018fairness}. The active fair learner proposed 
in this paper is also built on AMF, but 
we present a new and provably equivalent 
form based on uniform continuity instead 
of almost Lipschitz. 

One research direction in individual fairness 
is to attain a proper metric for evaluating 
individual similarity \cite{ilvento2020metric,mukherjee2020two}. In this paper, we assume a metric is given 
and focus on how to achieve individual fairness 
\textit{efficiently} through active learning. 

Finally, to our knowledge, existing studies 
on individual fairness focus on the passive 
setting, where training data are randomly labeled. 
Their typical sample complexity for achieving 
individual fairness is $O(\frac{1}{\varepsilon^2})$ \cite{yona2018probably,balashankar2019fairness,shabat2020sample}. 
In this paper, we focus on the active setting, 
where training data are strategically labeled. 
We show the proposed active AMF learner admits 
an $O(\log 
\frac{1}{\varepsilon})$ 
sample complexity, which substantially improves 
the state-of-the-art result. 

\subsection{Active Learning}

Active learning has been extensively studied 
in the literature \cite{settles2009active, aggarwal2014active, hanneke2014theory}. 
Given a supervised learner, active learning assumes 
labels of the training data are expensive to query 
and aims to minimize the query cost by strategically 
labeling a few data for efficiently improving model 
accuracy. For example, the uncertainty-based strategy 
labels data with uncertain model predictions, and 
the query-by-committee strategy labels data receiving 
disagreed predictions from a committee of models. 
Active labeling strategies have been successfully 
applied in many domains and shown to improve model 
accuracy more efficiently than random 
labeling \cite{thompson1999active,warmuth2003active,liu2004active,hoi2006batch,abe2006outlier,zhao2013cost}. 

On the theory side, active labeling can allows one 
to learn a model with $\varepsilon$ error by labeling 
only $O(\log \frac{1}{\varepsilon})$ instances, and 
this is  more efficient than random labeling which 
requires $O(\frac{1}{\varepsilon})$ instances to 
achieve the same error guarantee 
\cite{dasgupta2005coarse,hanneke2007bound,balcan2010true}.  

We notice that most active labeling strategies 
are designed for classification model but very 
few are for regression model \cite{burbidge2007active,sugiyama2009pool,
cai2013maximizing,yu2010passive}. To our knowledge, 
the state-of-the-art strategy for regression model 
is greedy sampling \cite{wu2019active}, which 
labels data that are most different from the 
already labeled training data in both feature 
space and label space. 

Our study is related to active learning but 
differs in that they focus on improving accuracy 
for traditional learners, while we focus on improving individual fairness for 
AMF learners. Despite the difference, 
our work is inspired by disagreement-based 
active learning \cite{hanneke2007bound}. 

\subsection{Fairness in Active Learning}

The intersection of fairness and active learning is 
a fairly new research direction, and existing studies 
can be roughly grouped into active labeling 
\cite{anahideh2020fair,sharaf2020promoting} 
and adaptive sampling 
\cite{abernethy2020active,shekhar2021adaptive}. 
This paper considers the active labeling setting, 
but differs from the existing study as they focus 
on improving group fairness for standard learner 
while we focus on improving individual fairness for 
AMF learner. Besides, we are the first work that
shows active learning can improve the sample 
complexity for individual fairness to 
$O(\log \frac{1}{\varepsilon})$, which 
is not presented in prior studies. 
