%%%%%%%%%%% Needs to be more technical %%%%%%%%%%%%
%%%%%%%%%%% Introduce notations and nature of the results %%%%%%%%%%%%
\vspace*{-15pt}
\section{Introduction}
\label{sec:intro}
\vspace*{-10pt}
%We consider the problem of ranking a set of items from pairwise comparisons when some side information about the items is available. 
Given a set of $n$ items and $m$ pairwise comparisons among them, the problem of \emph{ranking from pairwise preferences} is to recover an underlying ranking among the $n$ items. This is a well-studied problem in several disciplines including statistics, operations research, theoretical computer science, social choice theory, machine learning, decision systems etc \cite{thurstone1927,bradleyTe1952,luce05,saaty08,ailon08}, \cite{bravermanMo08,gleichLek11,jamiesonNo11,negahban+12}, \cite{wauthier+13,busaHu14,rajkumarAg14,shahWa15,borkarNi16,chenJo16,rajkumarAg16,shah+16,niranjanRa17}. A typical approach to solve this problem is to assume that the comparisons are generated in a stochastic fashion according to a score based pairwise probability model, e.g. Bradley-Terry-Luce model \cite{bradleyTe1952} \cite{luce05} or the Thurstone model \cite{thurstone1927} and develop algorithms \cite{gleichLek11}, \cite{negahban+12}, \cite{rajkumarAg14}, \cite{borkarNi16}  that first estimate the score vector from the given comparisons and obtain the final ranking by simply sorting their estimated scores. 

However in practice they suffer from several shortcomings: Firstly, often times side information such as features or relationships among items are available, e.g. to rank a set of mobile phones, it is natural to use features such as cost, battery life, size etc., which influence the pairwise preferences of users in preferring one mobile over other. However, most algorithms do not take this additional information into account. Secondly, they fail to handle the case when new items get added as one cannot find the position of a new item in an already estimated ranking without collecting at least few pairwise preferences of it. Finally, the sample complexity of previous approaches scale as $O(n\log n)$ which can proved to be sub-optimal when item preferences are based on their feature similarities.

In this work, we introduce the \emph{feature-Bradley--Terry--Luce (f-BTL)} model of pairwise comparisons to tackle the problems listed above. The f-BTL model is a generalization of the standard BTL model where the probability of preferring one item over the other explicitly depends on their associated features such that similar items get similar ranks. 
We next propose a least squares-based algorithm \emph{fBTL-LS} -- the novelty of our approach lies in the sample complexity analysis (i.e. the number of comparisons needed to achieve a fixed error) for recovering a `near-optimal' ranking. 
%which is shown to be much smaller than $O(n\log n)$, based on the degree of item correlations. 
The key ingredient used here is a \emph{relation graph} that we define on the items based on their features correlation and apply ideas from classical graph matching theory on the relation graph. Precisely, our sample complexity bound is of $O(\alpha \log \alpha)$, where on an intuitive level, $\alpha$ denotes the number of the main (independent) items that influence the preference structures of the rest of $n - \alpha$ items in the set---This shows a significant reduction in the number of comparisons needed, compared to the earlier known bound $O(n \log n)$, especially when $\alpha << n$, which often is the case in many applications. 
%By explicitly modelling dependencies among features (rather than just considering them to lie in some low dimensional space), our bounds reveal the true complexity of the problem. We believe that the graph theory based approach used in this work would be of wider use to the learning theory community. 
Furthermore, we also give a matching sample complexity lower bound analyzing the minimal number of pairwise preferences required, establishing the optimality of our algorithm.
Our experimental evaluation shows the proposed algorithm significantly outperforms existing algorithms, demonstrating its usefulness on various special types of relation graphs including union of cliques, disconnected graphs, trees, stars, cycles, etc. %\textbf{Summary of Contributions} 
Our \emph{contributions} are listed below:

%The main contributions of this work are as follows:
%\begin{itemize}

\textbf{1.} We introduce a new probabilistic model, f-BTL,  for ranking from pairwise comparisons which explicitly uses features associated with items (Sec. \ref{sec:prb_setup}).

\textbf{2.} We give a novel sample complexity analysis using ideas from graph matching theory that captures the dependencies among features explicitly in terms of structural properties of the graph, unlike previous approaches (Sec. \ref{sec:no_noise}).
    
\textbf{3.} We propose an algorithm, \fbtl\, and provide its sample complexity guarantees for recovering a `good estimate' of the score vector under f-BTL (Sec. \ref{sec:noise}).
    
\textbf{4.} We finally show our sample complexity guarantee is tight proving a matching lower bound (Sec. \ref{sec:lb}).
    
\textbf{5. } Our experimental results support our theoretical findings showing the superiority of our algorithm on both synthetic data and real datasets (Sec. \ref{sec:expts}).  
%\end{itemize}

%\emph{Related Works} are detailed in Appendix \ref{app:rel}. Our work has no societal impact.

%\textbf{Organization.}
%We give the necessary preliminaries in Section \ref{sec:prb_set} and define the problem formally. In Section \ref{sec:no_noise}, we analyze the case when the probability values for the sampled pairs are known exactly and derive a graph matching theory-based sample complexity bound.  In Section \ref{sec:noise}, we propose our least squares-based algorithm and show theoretical guarantees of its performance. Section \ref{sec:lb} proves a matching lower bound guarantee for the problem. In section \ref{sec:expts}, we experimentally evaluate our algorithm on various synthetic and real-world data sets. We conclude in Section \ref{sec:concl} with directions for future work. All proofs are presented in the appendix.
