% \documentclass{uai2022} % for initial submission
\documentclass[accepted]{uai2022} % after acceptance, for a revised
                                    % version; also before submission to
                                    % see how the non-anonymous paper
                                    % would look like
%% There is a class option to choose the math font
% \documentclass[mathfont=ptmx]{uai2022} % ptmx math instead of Computer
                                         % Modern (has noticable issues)
% \documentclass[mathfont=newtx]{uai2022} % newtx fonts (improves upon
                                          % ptmx; less tested, no support)
% NOTE: Only keep *one* line above as appropriate, as it will be replaced
%       automatically for papers to be published. Do not make any other
%       change above this note for an accepted version.

%% Choose your variant of English; be consistent
\usepackage[american]{babel}
% \usepackage[british]{babel}

%% Some suggested packages, as needed:
\usepackage{natbib} % has a nice set of citation styles and commands
    \bibliographystyle{plainnat}
    \renewcommand{\bibsection}{\subsubsection*{References}}
\usepackage{mathtools} % amsmath with fixes and additions
% \usepackage{siunitx} % for proper typesetting of numbers and units
\usepackage{booktabs} % commands to create good-looking tables
\usepackage{tikz} % nice language for creating drawings and diagrams


%for corresponding author
\newcommand*\samethanks[1][\value{footnote}]{\footnotemark[#1]}

%% Provided macros
% \smaller: Because the class footnote size is essentially LaTeX's \small,
%           redefining \footnotesize, we provide the original \footnotesize
%           using this macro.
%           (Use only sparingly, e.g., in drawings, as it is quite small.)

%% Self-defined macros
\newcommand{\swap}[3][-]{#3#1#2} % just an example

\usepackage{amsmath}
\usepackage{amsthm}
\usepackage{amssymb}%for mathbb
\usepackage{algorithm}
\usepackage{algorithmic}
\newtheorem{definition}{Definition}
\newtheorem{example}{Example}
\newtheorem{theorem}{Theorem}
\newtheorem{lemma}{Lemma}
\newtheorem{corollary}{Corollary}
\newtheorem{proposition}{Proposition}
\title{A Competitive Analysis of Online Failure-Aware Assignment}

% The standard author block has changed for UAI 2022 to provide
% more space for long author lists and allow for complex affiliations
%
% All author information is automatically removed by the class for the
% anonymous submission version of your paper, so you can already add your
% information below.
%
% Add authors
\author[1]{Mengjing Chen}
\author[1]{Pingzhong Tang\thanks{corresponding author}}
\author[2,3]{Zihe Wang\samethanks[1]}
\author[1]{Shenke Xiao}
\author[4]{Xiwang Yang}

% Add affiliations after the authors
\affil[1]{%
    Institute for Interdisciplinary Information Sciences\\
    Tsinghua University\\
    Beijing, China
}
\affil[2]{%
    Gaoling School of Artificial Intelligence\\
    Renmin University of China \\
    Beijing, China
}
\affil[3]{%
    Beijing Key Laboratory of Big Data Management and Analysis Methods\\
	Beijing, China
  }
\affil[4]{%
	Bytedance\\
	Beijing, China
}
  
  \begin{document}
\maketitle

\begin{abstract}
Motivated by a new generation of Internet advertising that has emerged in live streaming e-commerce markets (e.g., TikTok) over the past five years, we study a variant of the online bipartite matching problem: advertisers send ad requests to influencers (a.k.a. key opinion leaders) on a social media platform. Each influencer has a maximum number of ad requests she can accommodate. We assign a fixed number of influencers to an advertiser when she enters the platform. The advertiser then matches with each of the assigned influencers with a given probability, which can be thought of as a set of negotiations between the advertiser and the set of assigned influencers. Unlike in standard online assignment problems, the outcome of any of these matches is not revealed throughout the session (negotiations take time). Our goal is to maximize the expected number of matches between advertisers and influencers.

We put forward a new deterministic algorithm with a competitive ratio of $1/2$ and prove that no deterministic algorithm can achieve a better competitive ratio. We also show that the competitive ratio can be improved when randomness is allowed. We then study a setting where a match is successful with either probability 0 or a fixed $p$. We present an optimal randomized algorithm that achieves a competitive ratio of $1-1/e$ in this setting.
\end{abstract}

\section{Introduction}\label{sec:intro}
Live streaming e-commerce promotes and sells products through live webcasts on social media platforms. Over the past few years, such markets have grown rapidly. To put the numbers in perspective, in China in 2019 alone, the total Gross Merchandise Volume (GMV) of transactions in live streaming e-commerce was over \$63 billion, and the GMV doubled in 2020~\cite{statislivestream}. In these markets, advertisers seek online \emph{influencers} to present their products on their live webcasts. Similar to traditional TV shopping channels, such demonstrations are more vivid than text and picture ads. Unlike TV shopping channels, such ads are personalized and displayed only to a group of buyers known to have higher conversion rates. It is reported that the conversion rate of such advertisements is at least $21.1\%$, while that of an ordinary ad is less than $5\%$~\cite{conversionlivestream,reportlivestream}.

Advertising is an important way for influencers to monetize their fame and traffic, so social media giants such as TikTok and Instagram provide a matching market that facilitates cooperation between advertisers and influencers. When an advertiser enters such a matching market, she can see the information of all available influencers on the platform and send advertisement requests to them. Despite having full information about the influencers, it is difficult for an advertiser to find the most suitable one. On the one hand, the sheer number of influencers makes it impossible for the advertiser to look through all the information. On the other hand, a famous influencer can receive many advertisement requests and must reject some due to limited time. To maximize the number of successful matches between advertisers and influencers, the platform needs a centralized matching algorithm that recommends several selected influencers to each advertiser once she appears online, without knowing any information about future advertisers. The advertiser then negotiates with each of these influencers and matches with a given probability (whether any of these matches is successful is revealed only after the whole assignment is made). In this paper, we model this problem as a new variant of the online assignment problem and provide algorithmic solutions. The platform's objective is to maximize the expected number of successful matches, subject to each influencer's service capacity.   %Note the platform only cares the total number of successful deals, so it does not matter, for an influencer, which advertisers are rejected. 

%We formulate this problem as an assignment problem, that is, we model the recommendation as assigning Internet sensations to advertisers. 
%We assume every advertiser can only send one request to each sensation and the success of each deal has probability. 
%Such deals may fail by many external reasons. For example, the advertiser has a better choice on other websites, or he is not satisfied with the advertisement that the Internet sensation produces, etc. We regard such external reasons as a probability that the advertiser successfully deals with an Internet sensation. In this paper, we do not focus on the problem of how to evaluate such probabilities. Instead, we assume these probabilities are all given. Additionally, each Internet sensations has a capacity that describes the maximum number of advertisement requests she can accept. For an Internet sensation with capacity $c$, if the number of advertisement requests that successfully cooperate with her exceeds $c$, only $c$ advertisers are counted. The objective of the  platform is to maximize the number of successful deals.

The problem we focus on can be formulated as the online assignment problem with stochastic rewards in the setting where the outcome of any match is not revealed before the whole assignment process ends. We call this assignment problem the \emph{failure-aware assignment problem}.
Despite the vast literature on online bipartite matching and assignment problems in the past~\cite{karp1990optimal,goel2008online,feldman2009online,feldman2010online}, this problem has not been investigated to the best of our knowledge. %We formally state our model in Section~\ref{Problem Formulation}.


%%%
% the simple version
To compare approaches, we define the competitive ratio of an algorithm to be the ratio of the objective value produced by the algorithm to that produced by an optimal solution in the \emph{worst case}. In Section~\ref{A Deterministic Algorithm}, we propose a greedy deterministic algorithm whose competitive ratio is at least $1/2$ and prove that no deterministic algorithm can achieve a better competitive ratio. This implies that our algorithm is optimal among all deterministic algorithms. 

%%%
% the complete version
%To measure algorithms, we define the competitive ratio of an algorithm to be the expected (here we use ``expected" in case that the algorithm is a randomized algorithm) ratio of the objective value produced by the algorithm to that produced by an optimal solution under the \emph{worst-case} instance. In Section~\ref{A Deterministic Algorithm}, we propose a greedy deterministic algorithm whose competitive ratio is at least $1/2$ and prove that no deterministic algorithm can achieve a better competitive ratio even if there are 2 advertisers and only 1 influencer is assigned to each advertiser. It implies that our algorithm is optimal among all deterministic algorithms. 

In Section~\ref{Randomized Algorithms}, we show that randomized algorithms can achieve higher competitive ratios even in constrained cases, and we propose a randomized algorithm with a tight competitive ratio in such cases. This is a theoretical improvement over the greedy deterministic algorithm proposed in Section~\ref{A Deterministic Algorithm}. In Section~\ref{Ranking Algorithm}, we study the setting where the success probability of each match is either 0 or a fixed value $p$. We propose an optimal randomized algorithm that achieves a competitive ratio of $1-1/e$ in this setting.

\subsection{Related Works}
Our work is closely related to the \emph{AdWords problem}~\cite{mehta2005adwords,devanur2009adwords,buchbinder2007online}, which is a generalization of the online bipartite matching problem. In the AdWords problem, an Internet search engine company selects an advertisement to display when each query arrives, given the advertisers' budgets and bids. The company's goal is to design the allocation rule to maximize its revenue. The AdWords model looks similar to our model when only one influencer is recommended to each advertiser. However, the two problems evaluate the objective value differently. For example, if 1 advertiser with budget 1 is assigned to 2 queries, each with bid 0.5, then the advertiser contributes 1 to the revenue of the company, but it only contributes $\left(1-0.5\right)^2\times 0+\left(1-\left(1-0.5\right)^2\right)\times 1=0.75$ to the objective value of our problem (see Section~\ref{Problem Formulation}). The AdWords problem can be formulated as a linear program while our problem cannot. This difference makes the methods for solving the AdWords problem~\cite{mehta2005adwords,buchbinder2007online} unsuitable for our problem.

The AdWords problem, as well as the online bipartite matching problem, is a special case of the online submodular welfare maximization problem~\cite{nemhauser1978analysis1,nemhauser1978analysis2,nemhauser1978best,kapralov2013online} where the objective function is budget-additive. The objective function is indeed submodular in our problem but not budget-additive. In the classical submodular welfare maximization problem, each item (advertiser) is only allowed to be allocated to one agent (influencer). We explore the more general setting where multiple influencers are recommended to each advertiser in our problem. Moreover, we study the worst-case competitive ratios of randomized algorithms.

Another similar setting is studied in the literature~\cite{mehta2014online,mehta2012online,goyal2020online}. These works recognize the problem of assignment failure and initiate the study of the online stochastic assignment problem. There are some fundamental differences from our setting: (1) They focus on the matching problem, where each node cannot match with more than one other node, whereas in our setting both the advertisers and the influencers can have multiple partners. The matching problem is a special case of our problem where each influencer's capacity is 1 and the platform only recommends one available influencer to each advertiser. (2) In their setting, the outcomes of the matches of previous nodes are known when a new node arrives, while in our setting, the realizations remain unknown from beginning to end. 

\section{Problem Formulation} \label{Problem Formulation}
Because our model can be widely used in many scenarios, we will strip the Internet-advertising background from our model and describe it as an assigning-node-to-arrival problem as follows.
There are $n$ nodes (corresponding to influencers) and $m$ arrivals (corresponding to advertisers). These arrivals arrive one by one. For ease of presentation, we number the arrivals $1,2,\ldots,m$ in the order they arrive, that is, arrival 2 arrives after arrival 1, arrival 3 arrives after arrival 2, etc. When arrival $i$ arrives, we are required to immediately assign $s$ different nodes to it. For each arrival $i$ and each node $j$, if we assign node $j$ to arrival $i$, arrival $i$ will \emph{accept} node $j$ with probability $p_{ij}$. Acceptances are mutually independent and remain unknown to us throughout the whole process. The probabilities $p_{i1},p_{i2},\ldots,p_{in}$ are revealed to us immediately after arrival $i$ arrives. Moreover, each node $j$ has a capacity $c_j$, meaning the maximum number of arrivals to which it is able to be \emph{successfully} assigned. More precisely, let $P_{ij}$ denote a random variable whose value is 1 with probability $p_{ij}$ and 0 with probability $1-p_{ij}$ (these random variables are mutually independent), then for node $j$, if we assign it to arrivals $i_1,i_2,\ldots,i_k$, it will be \emph{successfully} assigned to $\mathbb{E}\left(\min\left\{P_{i_1j}+\cdots+P_{i_kj},c_j\right\}\right)$ arrivals in expectation\footnote{Since we only care about the expected number of successful assignments, it doesn't matter to which arrivals this node is successfully assigned.}. Our target is to maximize the expected number of successful assignments, or formally, to solve the following program in an online fashion. 

\begin{align*}
\begin{aligned}
\text{max} &&& \sum_j\mathbb{E}\left(\min\left\{\sum_ix_{ij}P_{ij},c_j\right\}\right) && \\
\text{s.t.}     &&& \sum_j x_{ij} = s,                                                     && \text{for all }i, \\
&&& x_{ij} \in \left\{0,1\right\},                                         && \text{for all }i,j.
\end{aligned} \label{programming problem}
\end{align*}
In the online setting, the values of $x_{1j}\text{'s},x_{2j}\text{'s},\ldots$ are determined in order, and when determining the value of $x_{ij}$, we don't know the values of $p_{i'j'}$'s for $i'>i$.

\paragraph{Remark} Note that assigning a node to an arrival is never worse than not assigning it, so we assume that exactly $s$ nodes, rather than at most $s$ nodes, are assigned to each arrival. This requires $s\le n$; if $s>n$, we can add hypothetical nodes that are not accepted by any arrival (i.e., accepted with probability 0) to make $n\ge s$.

\begin{example}
	Consider an instance with 2 arrivals and 2 nodes, and $s=1$, i.e., each time an arrival arrives, we only assign one node to it. In this instance, $c_1=c_2=1$, $p_{11}=p_{12}=p_{21}=0.5$ and $p_{22}=0$. Suppose an algorithm $\mathcal{A}$ assigns node 1 to arrival 1. When arrival 2 arrives, since $p_{22}=0$, it is optimal for $\mathcal{A}$ to assign node 1 to arrival 2. The objective value produced by $\mathcal{A}$ is exactly $\mathbb{E}\left(\min\left\{P_{11}+P_{21},1\right\}\right)=1-\left(1-0.5\right)^2=0.75$ (recall that $P_{11}$ and $P_{21}$ are i.i.d. random variables which take value 1 with probability 0.5 and 0 with probability 0.5). Note that the optimal assignment for this example assigns node 2 to arrival 1 and node 1 to arrival 2, and the optimal objective value is $\mathbb{E}\left(\min\left\{P_{12},1\right\}\right)+\mathbb{E}\left(\min\left\{P_{21},1\right\}\right)=1$.
\end{example}

\paragraph{Measurement} Given an algorithm $\mathcal{A}$ and an instance $\mathsf{ins}$ of this problem, we define $\mathcal{A}\left(\mathsf{ins}\right)$ as the expected objective value produced by $\mathcal{A}$, i.e., the expected value of $\sum_j\mathbb{E}\left(\min\left\{\sum_ix_{ij}P_{ij},c_j\right\}\right)$ where $x_{ij}$'s are outputted by $\mathcal{A}$ when running on the instance $\mathsf{ins}$. Here we say ``expected value'' because $\mathcal{A}$ may be a randomized algorithm. Furthermore, we define the \emph{competitive ratio} of an algorithm $\mathcal{A}$ as:
\begin{displaymath}
\inf_{\mathsf{ins}}\frac{\mathcal{A}\left(\mathsf{ins}\right)}{\max_{\mathcal{A}'}\mathcal{A}'\left(\mathsf{ins}\right)}.
\end{displaymath}
%Roughly speaking, the competitive ratio of an algorithm is the ratio of the expected objective value produced by the algorithm to that produced by the optimal assignment under the \emph{worst-case} instance. 
The competitive ratio of an algorithm is the ratio of the expected objective value produced by the algorithm to that produced by an optimal assignment under the \emph{worst-case} instance. 


%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

\section{Deterministic Algorithm} \label{A Deterministic Algorithm}
In this section, we turn our attention to deterministic algorithms. We first propose a deterministic algorithm with a competitive ratio of at least $1/2$; we then prove that $1/2$ is an upper bound on the competitive ratio that any deterministic algorithm can achieve. 

Our algorithm is a greedy algorithm whose greedy policy is to assign nodes to increase the current objective value as much as possible whenever an arrival arrives. Formally, we define
\begin{align*}
w_{ij}={}&\mathbb{E}\left(\min\left\{P_{ij}+\sum_{i':i'<i}x_{i'j}P_{i'j},c_j\right\}\right)\\
&-\mathbb{E}\left(\min\left\{\sum_{i':i'<i}x_{i'j}P_{i'j},c_j\right\}\right). 
\end{align*}
The greedy algorithm assigns $x_{ij_1},\ldots,x_{ij_s}$ to 1 (and assigns $x_{ij'}$'s to 0 for $j'\notin \left\{j_1,\ldots,j_s\right\}$) where $j_1,\ldots,j_s$ are indices that maximize $\sum_{k=1}^s w_{ij_k}$.

To simplify the representation, we denote by $p_t$ the probability that the random variable $\min\left\{\sum_{i':i'<i}x_{i'j}P_{i'j},c_j\right\}$ takes value $t$, then we have
\begin{align*}
&\mathbb{E}\left(\min\left\{P_{ij}+\sum_{i':i'<i}x_{i'j}P_{i'j},c_j\right\}\right)\\ &=\sum_{t=1}^{c_j-1}t\left(p_t\left(1-p_{ij}\right)+p_{t-1}p_{ij}\right)+c_j\left(p_{c_j}+p_{c_j-1}p_{ij}\right) \nonumber \\
&=\sum_{t=1}^{c_j}tp_t+p_{ij}\left(1-p_{c_j}\right) \nonumber \\
&=\mathbb{E}\left(\min\left\{\sum_{i':i'<i}x_{i'j}P_{i'j},c_j\right\}\right)+p_{ij}\left(1-p_{c_j}\right), 
\end{align*}
i.e.,
\begin{align}
w_{ij}=p_{ij}\left(1-p_{c_j}\right). \label{difference eq}
\end{align}
Hence, we can maintain the distribution of $\min\left\{\sum_{i':i'<i}x_{i'j}P_{i'j},c_j\right\}$ for every node $j$, so that for each arrival the algorithm takes $O\left(n\log n \right)$ time to find the nodes $j_1,\ldots,j_s$ to assign, plus $O\left(\sum_{k=1}^sc_{j_k}\right)$ time to update the distributions of the assigned nodes. We call this algorithm GREEDY.
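
\paragraph{Illustration} As a quick numerical check of \eqref{difference eq}, consider a hypothetical node $j$ (the numbers are chosen only for illustration) with $c_j=2$ that has already been assigned to two earlier arrivals, each accepting with probability $0.5$. The truncated sum $\min\left\{\sum_{i':i'<i}x_{i'j}P_{i'j},c_j\right\}$ then has distribution $p_0=0.25$, $p_1=0.5$, $p_2=0.25$ and expectation $0.5+2\cdot 0.25=1$. If a new arrival $i$ has $p_{ij}=0.4$, then \eqref{difference eq} gives $w_{ij}=0.4\left(1-0.25\right)=0.3$. Computing directly, after the assignment the truncated sum takes values $0,1,2$ with probabilities
\begin{align*}
0.25\cdot 0.6=0.15,\quad 0.5\cdot 0.6+0.25\cdot 0.4=0.4,\quad 0.25+0.5\cdot 0.4=0.45,
\end{align*}
respectively, so its expectation is $0.4+2\cdot 0.45=1.3$, an increase of $0.3$ as predicted. The three products above are also exactly the update of the maintained distribution that costs $O\left(c_j\right)$ time per assigned node.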

\begin{theorem}\label{thm:deterministic}
	GREEDY has a competitive ratio of at least $1/2$.
\end{theorem}
\begin{proof}
	Suppose when following the assignment produced by GREEDY, node $j$ is successfully assigned to $\alpha_jc_j$ arrivals in expectation, and when following an optimal assignment, node $j$ is successfully assigned to $b_j$ arrivals in expectation. Let $\mathrm{ALG}$ be the objective value when following the assignment produced by GREEDY, and let $\mathrm{OPT}$ be the objective value when following the optimal assignment. We have immediately $\mathrm{OPT}=\sum_jb_j$ and
	\begin{equation}\label{one hand eq}
	\mathrm{ALG}=\sum_j\alpha_jc_j\ge\sum_j\alpha_jb_j.
	\end{equation}
	
	On the other hand, suppose for arrival $i$, the optimal solution assigns nodes $j_{i1},\ldots,j_{is}$ while GREEDY assigns nodes $j_{i1}',\ldots,j_{is}'$. By the greedy policy of GREEDY, we have (in this proof, the variables $x_{ij}$'s and $w_{ij}$'s refer to the ones produced by GREEDY)
	\begin{align} \label{w and w' ineq}
	\sum_{k=1}^s w_{ij_{ik}'}\ge\sum_{k=1}^s w_{ij_{ik}}
	\end{align}
	
	Now let us fix a value $k\in\{1,\ldots,s\}$, and suppose \\$\min\left\{\sum_{i':i'<i}x_{i'j_{ik}}P_{i'j_{ik}},c_{j_{ik}}\right\}$ takes value $t$ with probability $p_t$, then by \eqref{difference eq} we have
	\begin{align} \label{w_ij ineq}
	w_{ij_{ik}}= p_{ij_{ik}}\left(1-p_{c_{j_{ik}}}\right)\ge p_{ij_{ik}}\left(1-\alpha_{j_{ik}}\right)
	\end{align}
	where the last inequality holds because
	\begin{align*}
	\alpha_{j_{ik}}
	={}&\frac{\mathbb{E}\left(\min\left\{\sum_{i'} x_{i'j_{ik}}P_{i'j_{ik}},c_{j_{ik}}\right\}\right)}{c_{j_{ik}}}\\
	\ge{}&\frac{\mathbb{E}\left(\min\left\{\sum_{i':i'<i}x_{i'j_{ik}}P_{i'j_{ik}},c_{j_{ik}}\right\}\right)}{c_{j_{ik}}}\\
	\ge{}&\frac{p_{c_{j_{ik}}}c_{j_{ik}}}{c_{j_{ik}}}
	={}p_{c_{j_{ik}}}.
	\end{align*}
	%	\begin{gather*}
	%	\alpha_{j_{ik}}
	%	={}\frac{\mathbb{E}\left(\min\left\{\sum_{i'} x_{i'j_{ik}}P_{i'j_{ik}},c_{j_{ik}}\right\}\right)}{c_{j_{ik}}}
	%	\ge{}\frac{\mathbb{E}\left(\min\left\{\sum_{i':i'<i}x_{i'j_{ik}}P_{i'j_{ik}},c_j\right\}\right)}{c_{j_{ik}}}
	%	\ge{}\frac{p_{c_{j_{ik}}}c_{j_{ik}}}{c_{j_{ik}}}
	%	={}p_{c_{j_{ik}}}.
	%	\end{gather*}
	By summing up \eqref{w_ij ineq} for $k$ from $1$ to $s$, we have 
	\begin{align} \label{sum w_ij ineq}
	\sum_{k=1}^s w_{ij_{ik}}\ge \sum_{k=1}^s p_{ij_{ik}}\left(1-\alpha_{j_{ik}}\right).
	\end{align}
	Therefore,
	\begin{align*}
	\mathrm{ALG}=&\sum_i\sum_{k=1}^s w_{ij_{ik}'}
	\ge \sum_i\sum_{k=1}^s w_{ij_{ik}} \tag*{\text{(by \eqref{w and w' ineq})}}\\
	&\ge\sum_i\sum_{k=1}^s p_{ij_{ik}}\left(1-\alpha_{j_{ik}}\right) \tag*{\text{(by \eqref{sum w_ij ineq})}}\\
	&=\sum_j\sum_{\left(i,k\right):j_{ik}=j}\left(1-\alpha_j\right)p_{ij}
	\ge\sum_j\left(1-\alpha_j\right)b_j.
	\end{align*}
	Combined with \eqref{one hand eq}, we have
	\begin{align*}
	\mathrm{ALG}&\ge{}\frac{1}{2}\left(\sum_j\alpha_jb_j+\sum_j\left(1-\alpha_j\right)b_j\right) \\
	&=\frac{1}{2}\sum_jb_j=\frac{1}{2}\mathrm{OPT}.
	\end{align*}
	Note the argument above works for any instance of the problem, so the competitive ratio of GREEDY is at least 1/2.
\end{proof}
The following theorem shows that no deterministic algorithm can achieve a competitive ratio better than $1/2$, meaning that GREEDY is optimal among all deterministic algorithms in the sense of competitive ratio. %In this paper, all the missing proofs can be found in the appendix.
\begin{theorem} \label{optimal theorem}
	For any deterministic algorithm $\mathcal{A}$ for our problem, the competitive ratio of $\mathcal{A}$ is no more than $1/2$ even if there are only 2 arrivals and only 1 node is allowed to be assigned to each arrival, i.e.,
	\begin{align*}
	\textstyle
	\inf_{\mathsf{ins}}\frac{\mathcal{A}\left(\mathsf{ins}\right)}{\max_{\mathcal{A}'}\mathcal{A}'\left(\mathsf{ins}\right)}\le\inf_{\mathsf{ins}: m\le 2, s= 1}\frac{\mathcal{A}\left(\mathsf{ins}\right)}{\max_{\mathcal{A}'}\mathcal{A}'\left(\mathsf{ins}\right)} \le \frac{1}{2}.
	\end{align*}
\end{theorem}
\begin{proof}
	We construct an instance $I_1$ with 2 arrivals and 2 nodes, and let $c_1=c_2=1$, $p_{11}=p_{12}=p_{21}=1$, and $p_{22}=0$. In addition, we construct another instance $I_2$ that is almost the same as $I_1$ except that $p_{21}=0$ and $p_{22}=1$. For $I_1$, an algorithm can assign node 2 to arrival 1 and assign node 1 to arrival 2 to achieve an objective value 2. For $I_2$, the objective value 2 can also be achieved by assigning node 1 to arrival 1 and assigning node 2 to arrival 2. Hence, we have $\max_{\mathcal{A'}}\mathcal{A'}\left(I_1\right)\ge 2$ and $\max_{\mathcal{A'}}\mathcal{A'}\left(I_2\right)\ge 2$.
	
	Now we compare the behavior of $\mathcal{A}$ when running on $I_1$ and $I_2$ respectively. Note when dealing with arrival 1, the information given to $\mathcal{A}$ is the same, and since $\mathcal{A}$ is a deterministic algorithm, it must assign the same node to arrival 1. If $\mathcal{A}$ assigns node 1 to arrival 1, then on instance $I_1$, no arrival will accept node 2 (since $p_{22}=0$), thus $\mathcal{A}\left(I_1\right)=1$. Similarly, if $\mathcal{A}$ assigns node 2 to arrival 1, then no arrival will accept node 1 on instance $I_2$, thus $\mathcal{A}\left(I_2\right)=1$. Then we have 
	\begin{displaymath}
	\textstyle
	\inf_{\textsf{ins}\in\left\{I_1,I_2\right\}}\frac{\mathcal{A}\left(\mathsf{ins}\right)}{\max_{\mathcal{A}'}\mathcal{A}'\left(\mathsf{ins}\right)}\le\frac{1}{2},
	\end{displaymath}
	so the competitive ratio of $\mathcal{A}$ is at most $1/2$.
\end{proof}
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

\section{Randomized Algorithm} \label{Randomized Algorithms}
We have proven the optimal competitive ratio of deterministic algorithms is $1/2$. One may ask whether higher competitive ratios can be achieved if randomized algorithms are allowed. The answer is yes. In this section, we will see that randomness helps improve the competitive ratio even in the very constrained case where $s=1$. 

Recall that our optimal deterministic algorithm GREEDY greedily assigns the node $j$ that maximizes $w_{ij}$ whenever an arrival $i$ arrives. Our randomized algorithm instead treats the $w_{ij}$'s as weights and randomly assigns a node according to these weights. Moreover, when arrival $i$ comes, there is no benefit in assigning a node whose $w_{ij}$ is not among the $\min\left\{m-i+1,n\right\}$ largest ones (for example, it is always optimal to assign the node with the largest $w_{ij}$ to the last arrival), so our randomized algorithm only chooses the node from those whose $w_{ij}$'s are among the $\min\left\{m-i+1,n\right\}$ largest. We call this algorithm RANDOM; it is formally described in Algorithm \ref{RANDOM}. We will show that RANDOM achieves a higher competitive ratio than GREEDY and is an optimal randomized algorithm in this case.
\begin{algorithm}[tb]
	\caption{RANDOM}\label{RANDOM}
	\begin{flushleft}
		When arrival $i$ arrives,
		\begin{enumerate}
			\item For all $j$, let 
			\begin{align*}
			w_{ij}={}&\mathbb{E}\left(\min\left\{P_{ij}+\sum_{i':i'<i}x_{i'j}P_{i'j},c_j\right\}\right)\\
			&-\mathbb{E}\left(\min\left\{\sum_{i':i'<i}x_{i'j}P_{i'j},c_j\right\}\right).
			\end{align*}
			Like the deterministic case, $w_{ij}$'s can be computed efficiently by maintaining the distribution of $\min\left\{\sum_{i':i'<i}x_{i'j}P_{i'j},c_j\right\}$.
			\item Let $k=\min\left\{m-i+1,n\right\}$, and find the $k$ largest $w_{ij}$'s: $w_{ij_1},w_{ij_2},\ldots,w_{ij_k}$.
			\item Assign node $j_t$ with probability $w_{ij_t}/\left(w_{ij_1}+w_{ij_2}+\cdots+w_{ij_k}\right)$ to arrival $i$.
		\end{enumerate}
	\end{flushleft}
\end{algorithm}
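
\paragraph{Illustration} To see how RANDOM behaves on a small case, consider the instance $I_1$ from the proof of Theorem~\ref{optimal theorem} ($m=n=2$, $s=1$, $c_1=c_2=1$, $p_{11}=p_{12}=p_{21}=1$, $p_{22}=0$); the calculation below is only a sanity check and not part of the analysis. For arrival 1 we have $k=\min\left\{2,2\right\}=2$ and $w_{11}=w_{12}=1$, so RANDOM assigns node 1 or node 2 with probability $1/2$ each. For arrival 2 we have $k=1$. If node 1 was assigned to arrival 1, then by \eqref{difference eq} $w_{21}=1\cdot\left(1-1\right)=0$ and $w_{22}=0$, so the objective value stays at 1 whichever node is chosen; if node 2 was assigned to arrival 1, then $w_{21}=1$ and the objective value becomes 2. Hence
\begin{align*}
\mathrm{RANDOM}\left(I_1\right)=\tfrac{1}{2}\cdot 1+\tfrac{1}{2}\cdot 2=\tfrac{3}{2},
\end{align*}
while the optimal value on $I_1$ is 2, giving a ratio of $3/4$ on this instance.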

The following theorem shows that the competitive ratio of RANDOM is at least $3/4$ if there are no more than 2 arrivals and only 1 node is allowed to be assigned to each arrival. Note that although RANDOM is similar in outline to GREEDY, the proof techniques are quite different. Compared with Theorem~\ref{optimal theorem}, RANDOM indeed improves the competitive ratio via randomness. 

\begin{theorem}\label{thm:random}
	RANDOM has a competitive ratio of at least $3/4$ if there are no more than 2 arrivals and only 1 node is allowed to be assigned to each arrival, i.e.,
	\begin{align*}
	\textstyle
	\inf_{\mathsf{ins}: m\le 2,s=1}\frac{\mathrm{RANDOM}\left(\mathsf{ins}\right)}{\max_{\mathcal{A}'}\mathcal{A}'\left(\mathsf{ins}\right)} \ge \frac{3}{4}.
	\end{align*}
\end{theorem}

\begin{proof}
	In this proof, we will compare the assignment produced by RANDOM with an optimal assignment. To avoid confusion, we use $x_{ij}$'s to refer to the ones produced by the optimal assignment, while we use $\bar{x}_{ij}$'s to refer to the ones produced by RANDOM (so $\bar{x}_{ij}$'s are random variables). We assume the optimal assignment assigns node $j_i$ to arrival $i$ while RANDOM assigns node $\bar{j}_i$ to arrival $i$. Note that $j_i$ and $\bar{j}_i$ are respectively functions of $x_{i1},x_{i2},\ldots,x_{in}$ and $\bar{x}_{i1},\bar{x}_{i2},\ldots,\bar{x}_{in}$, thus $\bar{j}_i$ is also a random variable. We define
	\begin{align*}
	\mathrm{OPT}_i={}&\mathbb{E}_X\left(\min\left\{P_{ij_i}+\sum_{i':i'< i}x_{i'j_i}P_{i'j_i},c_{j_i}\right\}\right)\\
	&-\mathbb{E}_X\left(\min\left\{\sum_{i':i'<i} x_{i'j_i}P_{i'j_i},c_{j_i}\right\}\right)
	\end{align*}
	and
	\begin{align*}
	\mathrm{ALG}_i={}&\mathbb{E}_X\left(\min\left\{P_{i\bar{j}_i}+\sum_{i':i'< i}\bar{x}_{i'\bar{j}_i}P_{i'\bar{j}_i},c_{\bar{j}_i}\right\}\right)\\
	&-\mathbb{E}_X\left(\min\left\{\sum_{i':i'<i} \bar{x}_{i'\bar{j}_i}P_{i'\bar{j}_i},c_{\bar{j}_i}\right\}\right)
	\end{align*}
	where $\mathbb{E}_X$ means the expectation is taken over all $P_{i'j}$'s, thus $\mathrm{ALG}_i$ is a random variable. Note by \eqref{difference eq} we have
	\begin{align}
	\mathrm{OPT}_1&=p_{1j_1},\label{opt1 eq} \\
	\mathrm{ALG}_1&=p_{1\bar{j}_1},\label{alg1 eq} \\
	\mathrm{OPT}_2&= p_{2j_2}\left(1-x_{1j_2}\left[c_{j_2}\le 1\right]p_{1j_2}\right), \label{opt2 eq}
	\end{align}
	where $\left[\text{condition}\right]$ is an indicator that equals 1 if the condition is true and 0 otherwise. Now the competitive ratio of the randomized algorithm can be expressed as 
	\begin{align*}
	\min\frac{\sum_i\mathbb{E}\left(\mathrm{ALG}_i\right)}{\sum_i\mathrm{OPT}_i},
	\end{align*}
	where the minimum is taken over instances. 
	
	We first analyze $\mathrm{ALG}_1$. Assume $p_{1\ell_1}$ and $p_{1\ell_2}$ are the largest 2 ones among all $p_{1j}$'s. Recall that when RANDOM deals with arrival 1, $w_{1j}$'s are exactly $p_{1j}$'s, so 
	\begin{align}
	\mathbb{E}\left(\mathrm{ALG}_1\right)&=\mathbb{E}\left(p_{1\bar{j}_1}\right) \nonumber \tag*{\text{(by \eqref{alg1 eq})}} \\
	&=\frac{p_{1\ell_1}}{p_{1\ell_1}+p_{1\ell_2}}\cdot p_{1\ell_1}+\frac{p_{1\ell_2}}{p_{1\ell_1}+p_{1\ell_2}}\cdot p_{1\ell_2} \label{Ealg1 intermediate eq} \\
	&= \frac{1+\left(p_{1\ell_2}/p_{1\ell_1}\right)^2}{1+p_{1\ell_2}/p_{1\ell_1}}\cdot p_{1\ell_1}\nonumber \\
	&\ge 2\left(\sqrt{2}-1\right)\mathrm{OPT}_1, \label{Ealg1 ineq}
	\end{align}
	where the inequality \eqref{Ealg1 ineq} holds by \eqref{opt1 eq} and taking the minimum of the function $\left(1+t^2\right)/\left(1+t\right)$ over $[0,1]$.
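	For completeness, here is a sketch of this minimization, where $f$ is shorthand introduced only for this remark. Writing $f\left(t\right)=\left(1+t^2\right)/\left(1+t\right)$, we have
	\begin{align*}
	f'\left(t\right)=\frac{t^2+2t-1}{\left(1+t\right)^2},
	\end{align*}
	which vanishes on $[0,1]$ only at $t=\sqrt{2}-1$, where $f\left(\sqrt{2}-1\right)=\left(4-2\sqrt{2}\right)/\sqrt{2}=2\left(\sqrt{2}-1\right)\approx 0.828$. Since $f\left(0\right)=f\left(1\right)=1$, this value is the minimum of $f$ over $[0,1]$.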
	
	We then analyze $\mathrm{ALG}_2$. According to the rule of RANDOM, since arrival 2 is the last arrival, the algorithm will deterministically assign the node $j$ that maximizes $w_{2j}$, so
	\begin{align*}
	\mathrm{ALG}_2\ge w_{2j_2}
	=p_{2j_2}\left(1-\bar{x}_{1j_2}\left[c_{j_2}\le 1\right]p_{1j_2}\right). \tag*{{\text{(by \eqref{difference eq})}}}
	\end{align*}
	Hence,
	\begin{align}
	\mathbb{E}\left(\mathrm{ALG}_2\right)\ge p_{2j_2}\left(1-\mathbb{E}\left(\bar{x}_{1j_2}\right)\left[c_{j_2}\le 1\right]p_{1j_2}\right). \label{Ealg2 intermediate ineq}
	\end{align}
	
	After comparing \eqref{Ealg2 intermediate ineq} with \eqref{opt2 eq}, we see that if $\mathbb{E}\left(\bar{x}_{1j_2}\right)\le x_{1j_2}$, then $\mathbb{E}\left(\mathrm{ALG}_2\right)\ge \mathrm{OPT}_2$, and thus
	\begin{align*}
	\mathbb{E}\left(\mathrm{ALG}_1\right)+\mathbb{E}\left(\mathrm{ALG}_2\right)&\ge2\left(\sqrt{2}-1\right)\mathrm{OPT}_1+\mathrm{OPT}_2\\
	&\ge 3\left(\mathrm{OPT}_1+\mathrm{OPT}_2\right)/4,
	\end{align*}
	which completes the proof in this case. Hence, in the rest of the proof, we assume $\mathbb{E}\left(\bar{x}_{1j_2}\right)> x_{1j_2}$. Under this assumption, RANDOM assigns node $j_2$ to arrival 1 with non-zero probability, which means $j_2\in\left\{\ell_1,\ell_2\right\}$ by the rules of RANDOM, and the optimal assignment does not assign node $j_2$ to arrival 1, which means $j_1\neq j_2$. Furthermore, we can assume $j_1\in\left\{\ell_1,\ell_2\right\}$; otherwise we can change $j_1$ to the index in $\left\{\ell_1,\ell_2\right\}\setminus \left\{j_2\right\}$, which does not reduce the target value of the optimal solution. As a result, we have $\left\{j_1,j_2\right\}=\left\{\ell_1,\ell_2\right\}$. Hence, we can rewrite \eqref{Ealg1 intermediate eq} as
	\begin{align}
	\mathbb{E}\left(\mathrm{ALG}_1\right)\ge \frac{p_{1j_1}^2+p_{1j_2}^2}{p_{1j_1}+p_{1j_2}}.\label{Ealg1 eq}
	\end{align}
	Also, according to the rules of RANDOM, node $j_2$ is assigned to arrival 1 with probability $p_{1j_2}/\left(p_{1j_1}+p_{1j_2}\right)$, \\so $\mathbb{E}\left(\bar{x}_{1j_2}\right)= p_{1j_2}/\left(p_{1j_1}+p_{1j_2}\right)$, and
	\begin{align}
	\mathbb{E}\left(\mathrm{ALG}_2\right)\ge\mathrm{OPT}_2\left(1-\frac{p_{1j_2}^2}{p_{1j_1}+p_{1j_2}}\right). \label{Ealg2 ineq}
	\end{align}
	
	By combining \eqref{opt1 eq}, \eqref{Ealg1 eq} and \eqref{Ealg2 ineq}, we have
	\begin{align}
	&\frac{\mathbb{E}\left(\mathrm{ALG}_1\right)+\mathbb{E}\left(\mathrm{ALG}_2\right)}{\mathrm{OPT}_1+\mathrm{OPT}_2} \nonumber\\
	\ge{}&\frac{\frac{p_{1j_1}^2+p_{1j_2}^2}{p_{1j_1}+p_{1j_2}}+\mathrm{OPT}_2\left(1-\frac{p_{1j_2}^2}{p_{1j_1}+p_{1j_2}}\right)}{p_{1j_1}+\mathrm{OPT}_2} \nonumber\\
	\ge{}&\min\left\{\frac{p_{1j_1}^2+p_{1j_2}^2}{p_{1j_1}\left(p_{1j_1}+p_{1j_2}\right)}, \frac{\frac{p_{1j_1}^2}{p_{1j_1}+p_{1j_2}}+1}{p_{1j_1}+1}\right\} \label{tmp1}\\
	\ge{}&\frac{3}{4}. \label{tmp2}
	\end{align}
	Here the inequality \eqref{tmp1} uses the fact that \\$\left(ka+b\right)/\left(a+c\right)\ge\min\left\{b/c,\left(k+b\right)/\left(1+c\right)\right\}$ for $0\le a,b,c\le 1$, and the inequality \eqref{tmp2} holds because it is equivalent to two inequalities, one for each argument of the ``min'', each of which reduces to a quadratic inequality that is easy to verify.
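	For concreteness, both steps can be sketched as follows, abbreviating $q_1=p_{1j_1}$ and $q_2=p_{1j_2}$ (shorthand used only here) and assuming $q_1>0$. For the fact behind \eqref{tmp1}, the derivative of $\left(ka+b\right)/\left(a+c\right)$ with respect to $a$ is $\left(kc-b\right)/\left(a+c\right)^2$, whose sign does not depend on $a$, so for $c>0$ the expression is monotone in $a\in[0,1]$ and is minimized at $a=0$ or $a=1$. For \eqref{tmp2}, clearing denominators in the two arguments of the ``min'' gives
	\begin{align*}
	\frac{q_1^2+q_2^2}{q_1\left(q_1+q_2\right)}\ge\frac{3}{4}&\iff q_1^2-3q_1q_2+4q_2^2=\left(q_1-\tfrac{3}{2}q_2\right)^2+\tfrac{7}{4}q_2^2\ge 0,\\
	\frac{\frac{q_1^2}{q_1+q_2}+1}{q_1+1}\ge\frac{3}{4}&\iff q_1^2+q_1+q_2-3q_1q_2\ge 0.
	\end{align*}
	The first condition always holds. The left-hand side of the second condition is linear in $q_2$, so on $q_2\in[0,1]$ it is at least the smaller of its endpoint values $q_1^2+q_1\ge 0$ (at $q_2=0$) and $\left(1-q_1\right)^2\ge 0$ (at $q_2=1$).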
\end{proof}

The competitive ratio of $3/4$ is tight. We formalize it as the following theorem.

\begin{theorem} \label{thm:random_tight}
	For any (randomized) algorithm $\mathcal{A}$ for our problem, the competitive ratio of $\mathcal{A}$ is no more than $3/4$ even if there are no more than 2 arrivals and only 1 node is allowed to be assigned to each arrival, i.e.,
	\begin{align*}
	\textstyle
	\inf_{\mathsf{ins}}\frac{\mathcal{A}\left(\mathsf{ins}\right)}{\max_{\mathcal{A}'}\mathcal{A}'\left(\mathsf{ins}\right)}\le\inf_{\mathsf{ins}: n\le 2,s=1}\frac{\mathcal{A}\left(\mathsf{ins}\right)}{\max_{\mathcal{A}'}\mathcal{A}'\left(\mathsf{ins}\right)} \le \frac{3}{4}.
	\end{align*}
\end{theorem}
\begin{proof}
	By Yao's lemma~\cite{4567946}, we only need to consider deterministic algorithms on randomized inputs. We construct a randomized instance $I$ with 2 arrivals and 2 nodes, and let $c_1=c_2=1$ and $p_{11}=p_{12}=1$. Moreover, we set $p_{21}=1,p_{22}=0$ with probability $1/2$ and $p_{21}=0,p_{22}=1$ with probability $1/2$. Now consider the best deterministic algorithm on this randomized instance. No matter which node the algorithm assigns to arrival 1, the expected competitive ratio is $(1+1/2)/2=3/4$, so the competitive ratio of any (randomized) algorithm cannot exceed $3/4$.
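	To make the count explicit, here is a sketch under the reading $p_{11}=p_{12}=1$, which the value $(1+1/2)/2$ suggests. By symmetry, suppose the deterministic algorithm assigns node 1 to arrival 1, which contributes 1. With probability $1/2$ we have $p_{21}=1$ and $p_{22}=0$; since $c_1=1$, the marginal gain of node 1 is $p_{21}\left(1-p_{11}\right)=0$, and node 2 contributes $p_{22}=0$. With probability $1/2$ we have $p_{22}=1$, and assigning node 2 contributes 1. Therefore
	\begin{align*}
	\mathbb{E}\left(\mathrm{ALG}\right)=1+\tfrac{1}{2}\cdot 0+\tfrac{1}{2}\cdot 1=\tfrac{3}{2},
	\end{align*}
	whereas an assignment that knows arrival 2 in advance gives the other node to arrival 1 and obtains $1+1=2$ in both realizations.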
\end{proof}

Unfortunately, RANDOM may perform asymptotically badly when the number of arrivals increases. Consider an instance where $p_{11}=1, p_{12}=\cdots=p_{1n}=\epsilon$, and $p_{ij}=0$ for all $i\ge 2$ and all $j$. An optimal assignment would assign node 1 to arrival 1, which obtains a target value of 1. However, RANDOM will produce an expected target value of $\left(1+(n-1)\epsilon^2\right)/\left(1+(n-1)\epsilon\right)$. So if we take $\epsilon=1/\sqrt{n-1}$, the expected target value produced by RANDOM converges to 0 as $n$ tends to infinity. 
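As a quick check of this limit (a worked substitution; the numerical value is illustrative only), taking $\epsilon=1/\sqrt{n-1}$ gives $(n-1)\epsilon^2=1$ and $(n-1)\epsilon=\sqrt{n-1}$, so
\begin{align*}
\frac{1+(n-1)\epsilon^2}{1+(n-1)\epsilon}=\frac{2}{1+\sqrt{n-1}}\xrightarrow{n\to\infty}0.
\end{align*}
For instance, $n=101$ already yields $2/11\approx 0.18$, while the optimal value stays at 1.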

\section{Ranking Algorithm}\label{Ranking Algorithm}
In this section, we consider the case where all $c_j$'s are the same (say $c$), and each $p_{ij}$ is either 0 or a fixed value $p$ ($0<p\le 1$). This case models markets in which influencers differ little in effectiveness and fame, which is typical of social media platforms that mainly display professionally produced content. 

For each arrival $i$, we define the \emph{feasible set} for arrival $i$ as $F_i=\{j\mid p_{ij}>0\}$. In this setting, when arrival $i$ comes, there are possibly multiple $j$'s that maximize $w_{ij}$'s. We show that if we choose the nodes according to an order randomly determined in advance, the competitive ratio can be improved to $1-1/e$. The idea of this algorithm comes from the RANKING algorithm in \cite{karp1990optimal}. We call this algorithm RANKING, which is formally described in Algorithm~\ref{RANKING}. Note the classical online bipartite matching problem~\cite{karp1990optimal} is exactly a special case in this setting where $p=c=s=1$. Since \citeauthor{karp1990optimal} have proved that the upper bound for the competitive ratio of the classical online bipartite matching problem is $(1-1/e)+o(1)$, our RANKING algorithm is optimal in this setting.


\begin{algorithm}[!htbp]
	\caption{RANKING}
	\label{RANKING}
	\begin{flushleft}
		\begin{enumerate}
			\item Sort all nodes in a random order. 
			\item When arrival $i$ arrives,
			\begin{enumerate}
				\item For all $j$, let $y_{ij}=\sum_{i':i'<i}x_{i'j}$, i.e. the number of arrivals to which node $j$ has already been assigned.
				\item We rename all $j$'s as $j_{i,1},j_{i,2},\ldots$ such that $w_{ij_{i,1}}=\cdots=w_{ij_{i,k_1}}>w_{ij_{i,k_1+1}}=\cdots=w_{ij_{i,k_2}}>\cdots$, where $j_{i,1}<\cdots<j_{i,k_1}$, $j_{i,k_1+1}<\cdots<j_{i,k_2}$, and so on. 
				\item Assign nodes $j_{i,1},\ldots,j_{i,s}$ to arrival $i$.
			\end{enumerate}
		\end{enumerate}
	\end{flushleft}
\end{algorithm}

\begin{theorem}
	In the special case where all $c_j$'s are the same, and each $p_{ij}$ is either 0 or a fixed value $p$ ($0<p\le 1$), RANKING has a competitive ratio of $1-1/e$.
\end{theorem}

The proof is analogous to the one in~\cite{karp1990optimal}. The key idea of their proof is to turn the original setting into one where arrivals are known at the beginning while nodes arrive one by one according to the random order generated by RANKING. However, in our setting, even when $s=1$, one node can be assigned to multiple arrivals, so applying this idea directly does not make the problem easier. We handle this difficulty by allowing nodes to arrive round after round, which makes our proof considerably more involved.

\begin{proof}
	We first prove this algorithm has a competitive ratio of $1-1/e$ in the case where $s=1$.
	
	Given an order $\sigma$ of nodes $\sigma_1,\ldots,\sigma_n$, we consider a setting where arrivals are known at the beginning while nodes arrive one by one instead according to $\sigma$. Specifically, node $\sigma_1$ arrives first, then node $\sigma_2$ arrives, and so on. When a node arrives, we are asked to assign it to an arrival that has not been assigned yet. The process above is repeated $m$ times (we call one complete process a \emph{turn}), so a node can be assigned to multiple arrivals through multiple turns. In this setting, we denote by \emph{time $k+n(t-1)$} the moment where node $\sigma_k$ in the $t$-th turn is being assigned. %Particularly,  we use ``by time $t$'' represent the moment after $t-1$-th assignments finishes and before the $t$-th assignment begins, and we use ``at time $t$'' represent the time when the $t$-th assigning behavior happens. 
	In particular, when we say ``by time $t$'', the assignment made at time $t$ itself is not included. We call this new setting $\sigma$-DUAL. For any order $\sigma$, we can see that the optimal assignment in $\sigma$-DUAL is the same as the optimal assignment in our original setting.
	
	Consider the following algorithm named $\sigma$-DUAL-RANKING in the setting $\sigma$-DUAL.
	\begin{algorithm}[!htbp]
		\caption{$\sigma$-DUAL-RANKING}
		\label{sigma-DUAL-RANKING}
		\begin{flushleft}
			\begin{enumerate}
				\item When node $j$ arrives,
				\begin{enumerate}
					\item Let $i_0$ be the smallest index $i$ such that $j\in F_i$ and no node has been assigned to arrival $i$ yet.
					\item Assign node $j$ to arrival $i_0$. If such $i_0$ does not exist, do nothing.
				\end{enumerate}
			\end{enumerate}
		\end{flushleft}
	\end{algorithm}
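	As a small illustration (a hypothetical instance, not part of the analysis), read the rule as follows: the arriving node $j$ is given to the smallest-index arrival $i$ with $j\in F_i$ that has no node yet. Take $s=1$, two nodes in the order $\sigma_1,\sigma_2$, and three arrivals with $F_1=\left\{\sigma_1,\sigma_2\right\}$, $F_2=\left\{\sigma_1\right\}$ and $F_3=\left\{\sigma_1,\sigma_2\right\}$. In turn 1 (times 1 and 2), node $\sigma_1$ is assigned to arrival 1, and node $\sigma_2$ is assigned to arrival 3 because arrival 1 already has a node and $\sigma_2\notin F_2$. In turn 2 (times 3 and 4), node $\sigma_1$ is assigned to arrival 2, and node $\sigma_2$ is not assigned because every arrival for which it is feasible already has a node. Nothing changes in turn 3, so the resulting assignment is
	\begin{align*}
	\sigma_1\mapsto\left\{1,2\right\},\qquad \sigma_2\mapsto\left\{3\right\}.
	\end{align*}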
	
	The following lemma makes precise that $\sigma$-DUAL-RANKING reproduces the output of RANKING.
	\begin{lemma} \label{dual lemma}
		When the random order drawn by RANKING is $\sigma$, $\sigma$-DUAL-RANKING generates the same assignment as RANKING.
	\end{lemma}
	\begin{proof}
		For ease of presentation, we write ``during the running of RANKING in the original setting'' as ``in RANKING'' and write ``during the running of $\sigma$-DUAL-RANKING in the new setting $\sigma$-DUAL'' as ``in $\sigma$-DUAL-RANKING''. 
		Observe that if $\sigma$-DUAL-RANKING does not assign node $\sigma_j$ to any arrival in some turn, it will not assign node $\sigma_j$ in any subsequent turn. Thus we only need to prove the following proposition.
		\begin{proposition} \label{dual proposition}
			For any $t$, at time $t$ in $\sigma$-DUAL-RANKING where node $\sigma_j$ is being assigned, 
			\begin{enumerate}
				\item if node $\sigma_j$ is assigned to arrival $i$ at time $t$, then RANKING will assign node $\sigma_j$ to arrival $i$ too; 
				\item if node $\sigma_j$ is not assigned to any arrival at time $t$, and has already been assigned to arrivals $i_1,\ldots,i_k$ by time $t$, then RANKING will not assign node $\sigma_j$ to any arrival other than $i_1,\ldots,i_k$.
			\end{enumerate}
		\end{proposition}
		We prove this proposition by mathematical induction on $t$.
		We first consider the first part of Proposition~\ref{dual proposition}, i.e., at time $t$ in $\sigma$-DUAL-RANKING, node $\sigma_j$ is assigned to arrival $i$. We assume that by this time node $\sigma_j$ has already been assigned to arrivals $i_1,\ldots,i_k$ (i.e., this is the $(k+1)$-th turn). Consider the moment immediately before arrival $i$ arrives in RANKING. If node $\sigma_j$ has been assigned to an arrival $i'$ ($i'<i$) other than $i_1,\ldots,i_k$, then by the induction hypothesis, no node has been assigned to arrival $i'$ by time $t$ in $\sigma$-DUAL-RANKING. But $\sigma$-DUAL-RANKING assigns node $\sigma_j$ to arrival $i$ at time $t$ while $i'<i$, which contradicts the rule of $\sigma$-DUAL-RANKING. Hence, immediately before arrival $i$ arrives in RANKING, node $\sigma_j$ has been assigned to at most $k$ arrivals. 
		
		Now suppose to the contrary that node $\sigma_j$ is not assigned to arrival $i$ by RANKING. Then, when arrival $i$ arrives in RANKING, another node $\sigma_{j'}$ must be assigned to it. Let $t'<t$ be a time in $\sigma$-DUAL-RANKING where node $\sigma_{j'}$ is being assigned. Note that by time $t'$, node $\sigma_{j'}$ has not been assigned to arrival $i$ (otherwise $\sigma$-DUAL-RANKING cannot assign node $\sigma_j$ to arrival $i$), so node $\sigma_{j'}$ must be assigned to some arrival $i''$ at time $t'$ (otherwise by the induction hypothesis, it will never be assigned to arrival $i$ in RANKING). Since $\sigma$-DUAL-RANKING assigns node $\sigma_{j'}$ to arrival $i''$ rather than $i$, we additionally have $i''<i$ by the rule of $\sigma$-DUAL-RANKING. Hence, by the induction hypothesis, when arrival $i$ arrives in RANKING, node $\sigma_{j'}$ has been assigned to at least $k$ (if $j'>j$) or $k+1$ (if $j'<j$) arrivals. Recall that we have shown that at the same time, node $\sigma_j$ has been assigned to at most $k$ arrivals, so RANKING will choose to assign node $\sigma_j$ to arrival $i$ by its rule, a contradiction. This proves the first part of Proposition~\ref{dual proposition}.
		
		We then consider the second part of Proposition~\ref{dual proposition}, i.e., in $\sigma$-DUAL-RANKING, node $\sigma_j$ is not assigned to any arrival at time $t$ and has already been assigned to arrivals $i_1,\ldots,i_k$ by time $t$. Suppose to the contrary that RANKING assigns node $\sigma_j$ to an arrival $i$ other than $i_1,\ldots,i_k$. Then by time $t$ in $\sigma$-DUAL-RANKING, some node $\sigma_{j'}$ must have been assigned to arrival $i$, since otherwise $\sigma$-DUAL-RANKING would assign node $\sigma_j$ to arrival $i$ (or an arrival $i'$ with $i'<i$) by its rule. By the induction hypothesis, node $\sigma_{j'}$ is also assigned to arrival $i$ in RANKING, a contradiction. 
	\end{proof}
	
	By Lemma~\ref{dual lemma}, it suffices to prove that, when $\sigma$ is randomly generated, the ratio of the expected target value generated by $\sigma$-DUAL-RANKING to the optimal value (note that the optimal value is the same for any $\sigma$) is at least $1-1/e$ in the setting $\sigma$-DUAL. Next, we focus on the setting $\sigma$-DUAL.
	
	We define $P_1,P_2,\ldots$ to be i.i.d. random variables whose value is 1 with probability $p$ and 0 with probability $1-p$. Let $v(k)=\mathbb{E}\left(\min\left\{P_1+\cdots+P_k,c\right\}\right)$ and $d(k)=v(k)-v(k-1)$. Now consider an arbitrary algorithm. For an arrival $i$, if the algorithm assigns node $\sigma_j$ to it at time $t$ in turn $k$, and node $\sigma_j$ has already been assigned to $k'$ ($k'<k$) arrivals by time $t$, then assigning this node to arrival $i$ increases the target value by $d(k'+1)$. We define the \emph{weight} of arrival $i$ to be $d(k'+1)$. In addition, we define the \emph{fake weight} of arrival $i$ to be $d(k)$. If no node is assigned to arrival $i$, both the fake weight and the weight of arrival $i$ are defined to be 0. Note the (fake) weights of arrivals depend on the algorithm, and the sum of the weights of all arrivals is exactly the target value generated by the algorithm.
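	As a sanity check on $d(k)$, here is a closed form under the assumption that $c$ is a positive integer (the shorthand $S_k=P_1+\cdots+P_k$, with $S_0=0$, is introduced only for this calculation). Since $S_{k-1}$ is an integer, adding $P_k$ changes the truncated sum only when $S_{k-1}\le c-1$:
	\begin{align*}
	\min\left\{S_{k-1}+P_k,c\right\}-\min\left\{S_{k-1},c\right\}=P_k\left[S_{k-1}\le c-1\right].
	\end{align*}
	Taking expectations and using the independence of $P_k$ and $S_{k-1}$ gives
	\begin{align*}
	d(k)=p\cdot\Pr\left(S_{k-1}\le c-1\right),
	\end{align*}
	which is non-increasing in $k$ because $S_{k-1}$ is non-decreasing in $k$. In particular, $d(k)=p$ for $k\le c$, and for $c=1$ we get $d(k)=p\left(1-p\right)^{k-1}$ and $v(k)=1-\left(1-p\right)^k$.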
	
	We call an algorithm a \emph{refusal algorithm} if when a node arrives, it either assigns it to the arrival chosen by the rule of $\sigma$-DUAL-RANKING or does not assign it to any arrival. We define the \emph{fake value} of a refusal algorithm to be the sum of the fake weights of all arrivals.
	
	\begin{lemma} \label{refusal lemma}
		The fake value of any refusal algorithm is no more than the target value generated by $\sigma$-DUAL-RANKING.
	\end{lemma}
	\begin{proof}
		We first fix a refusal algorithm $\mathcal{R}$. We denote by $z_i$ the weight of arrival $i$ under $\sigma$-DUAL-RANKING and by $z_i'$ the fake weight of arrival $i$ under $\mathcal{R}$. Suppose $\sigma$-DUAL-RANKING and $\mathcal{R}$ assign a node to arrival $i$ at time $t_i$ and $t_i'$, respectively (if no node is assigned to arrival $i$, $t_i$ or $t_i'$ is defined to be $+\infty$). We define 
		\begin{align*}
		z_i(t)=\begin{cases}
		z_i, &\text{if $t>t_i$}\\
		0, &\text{otherwise}
		\end{cases},\quad
		z_i'(t)=\begin{cases}
		z_i', &\text{if $t>t_i'$}\\
		0, &\text{otherwise}
		\end{cases}.
		\end{align*}
		Note $z_i(nm+1)=z_i$ and $z_i'(nm+1)=z_i'$.
		
		We prove the following stronger proposition instead by mathematical induction on $t$.
		\begin{proposition} \label{refusal proposition}
			For any $t$ and any arrival $i$, $z_i(t)\ge z_i'(t)$.
		\end{proposition}
		It trivially holds for $t=1$. 	
		Consider time $t-1$ in the $k$-th turn. If $\mathcal{R}$ does nothing at time $t-1$, then for any $i$, $z_i'(t)=z_i'(t-1)$ while $z_i(t)\ge z_i(t-1)$, so $z_i(t)\ge z_i'(t)$ by the induction hypothesis. 
		
		If $\mathcal{R}$ assigns node $\sigma_j$ to arrival $i$ at time $t-1$, then $z_i'(t)= d(k)$. 
		If by time $t-1$, $\sigma$-DUAL-RANKING has assigned some node to arrival $i$, then $z_i(t)\ge d(k)=z_i'(t)$ (note $d(k)$ is non-increasing in $k$). Otherwise, suppose $\sigma$-DUAL-RANKING assigns node $\sigma_j$ to arrival $i'$ at time $t-1$. Since $\sigma$-DUAL-RANKING is able to assign node $\sigma_j$ to arrival $i$ at this time, we have $i'\le i$. If $i'<i$, since $\mathcal{R}$ does not assign node $\sigma_j$ to arrival $i'$, it must have assigned some node to arrival $i'$ by time $t-1$, which means $z_{i'}'(t-1)>0$, thus $z_{i'}(t-1)>0$ by the induction hypothesis, i.e., $\sigma$-DUAL-RANKING must also have assigned some node to arrival $i'$ by time $t-1$, which contradicts our assumption that $\sigma$-DUAL-RANKING assigns node $\sigma_j$ to arrival $i'$ at time $t-1$. If $i'=i$, we have $z_i(t)=z_i'(t)$. Since $z_{i''}(t)\ge z_{i''}(t-1)$ and $z_{i''}'(t)=z_{i''}'(t-1)$ for any arrival $i''$ other than $i$, we have $z_i(t)\ge z_i'(t)$ for any $i$ by the induction hypothesis. 
	\end{proof}	
	Suppose algorithm $\mathcal{O}$ generates an optimal assignment. Let $A_k$ be the set of arrivals whose weight under $\mathcal{O}$ is $d(k)$. Let $\mathcal{R}$ be a refusal algorithm that does not assign any node to an arrival not in $A_k$ in turn $k$. We can see that $\mathcal{R}$ sequentially performs the optimal algorithm in \cite{karp1990optimal} on $A_1$, $A_2$, and so on. Suppose $\mathcal{R}$ assigns nodes to exactly $Y_k$ arrivals in $A_k$. Then by the conclusion of \cite{karp1990optimal}, we have $\mathbb{E}(Y_k)\ge (1-1/e)|A_k|$, where the expectation is taken over the random choice of $\sigma$. By Lemma~\ref{refusal lemma}, the expected target value generated by $\sigma$-DUAL-RANKING is no less than the expected fake value of $\mathcal{R}$, which satisfies $\mathbb{E}\left(\sum_kd(k)Y_k\right)=\sum_kd(k)\mathbb{E}\left(Y_k\right)\ge \sum_kd(k)(1-1/e)|A_k|=(1-1/e)\sum_kd(k)|A_k|$. Note that $\sum_kd(k)|A_k|$ is the optimal value, so we can conclude that RANKING has a competitive ratio of $1-1/e$ when $s=1$.
	
	Now consider the case where $s>1$. We first fix an instance $\tau$ and construct a new instance $\tau'$ as follows. We split each arrival $i$ into $s$ arrivals $i_1,\ldots,i_s$. Moreover, we remove $j_{i,1}$ from the feasible set for arrival $i_2$, remove $j_{i,1},j_{i,2}$ from the feasible set for arrival $i_3$, and so on. Specifically, for any $j$, we define $p_{i_kj}$ to be $p$ if $p_{ij}=p$ and $j\notin\{j_{i,1},\ldots,j_{i,k-1}\}$, and 0 otherwise. In the following analysis, we restrict $s=1$ whenever we talk about $\tau'$. Let $\mathrm{ALG}_{\tau}$ and $\mathrm{ALG}_{\tau'}$ respectively be the target values of RANKING when it runs on $\tau$ and $\tau'$ (note $s=1$ when RANKING runs on $\tau'$), and let $\mathrm{OPT}_{\tau}$ and $\mathrm{OPT}_{\tau'}$ respectively be the optimal target values on $\tau$ and $\tau'$ (again, note $s=1$ when $\tau'$ is analyzed). We can see that $\mathrm{ALG}_{\tau}=\mathrm{ALG}_{\tau'}$ and $\mathrm{ALG}_{\tau'}\ge (1-1/e)\mathrm{OPT}_{\tau'}$, where the inequality holds by our previous result for $s=1$. Moreover, for any assignment of $\tau$, if nodes $j_1,\ldots,j_s$ are assigned to arrival $i$, we can arrange them properly to arrivals $i_1,\ldots,i_s$ in $\tau'$ without changing the target value, so $\mathrm{OPT}_{\tau'}\ge\mathrm{OPT}_{\tau}$. By combining the inequalities above, we have $\mathrm{ALG}_{\tau}\ge (1-1/e)\mathrm{OPT}_{\tau}$. This means RANKING also has a competitive ratio of $1-1/e$ when $s>1$.
\end{proof}

%\section{Experiments} \label{Experiments}
%In previous sections, we propose two algorithms GREEDY and RANDOM, and analyze their competitive ratios. Recall that the competitive ratio of an algorithm reflects the performance on the \emph{worst-case} instance. How do the two algorithms perform on general instances? To answer this question, we design experiments that run our algorithms on randomly-generated instances. The results show that the performances of our algorithms on general instances are still satisfactory.
%
%We consider the setting where each Internet celebrity or advertiser can only cooperate with one each other. Specifically, we set all $c_j$'s to be 1 and $s=1$. For each pair of $(n,m)$, an instance is generated by drawing $p_{ij}$'s from an identical distribution on $[0,1]$ independently. We test 3 different distributions in our experiments: the uniform distribution $\mathcal{U}(0,1)$, the normal distribution with mean $0.5$ and standard deviation $0.2$, noted by $\mathcal{N}(0.5,0.2)$, and the beta distribution with parameters $\alpha=\beta=0.5$, noted by $\mathrm{Beta}(0.5,0.5)$. Note that the value drawn from the normal distribution may be out of range, so we discard such value and redraw a new one once it happens. The reason we use the beta distribution is that it takes extreme values with high probabilities and takes median values with low probabilities, which is opposite to the normal distribution. For each pair of $(n,m)$ and each distribution, we generate 20 instances and calculate the average objective values of each algorithm on these instances. 
%
%We consider two benchmarks, the optimal solution and another random algorithm which we call BAD. The BAD algorithm randomly assigns a kind of items to an agent when the agent arrives. Note that BAD is different from RANDOM since each kind of items is drawn by BAD with the same probability. The optimal solution gives the upper bound of optimal objective value of each instance while BAD can be seen as a lower bound. We test both our algorithms and benchmarks on different distributions and different $(m,n)$. The results are shown in Figure~\ref{uniform m fig}-\ref{beta n fig}. It is obvious that the average objective values obtained by our algorithms, especially GREEDY, are very close to the optimal objective value, and are much better than BAD in all experiments. The experimental results illustrate that our algorithms perform well not only on the worst-case instance, but also on randomly-generated instances. Another interesting result is that RANDOM always performs worse than GREEDY. It is because RANDOM sacrifices the local optimum property of GREEDY to deal with some extreme cases (like the counterexample given in the proof of Theorem~\ref{optimal theorem}). Hence, RANDOM performs better for the worst-case instances, but there are few such extreme instances in randomly-generated instances. 
%
%\begin{figure}[!b]
%	%\setlength{\belowcaptionskip}{-5pt}
%	\begin{subfigure}[t]{0.48\columnwidth}
%		\centering
%		\resizebox{\columnwidth}{!}{%
%			\begin{tikzpicture}%[scale=0.41]
%			\begin{axis}[
%			title={},
%			xlabel={The number of agents $m$},
%			ylabel={Average Objective Value},
%			y label style={at={(axis description cs:-0.05,0.5)}},
%			xmin=0, xmax=11,
%			ymin=0, ymax=6,
%			xtick={0,2,4,6,8,10},
%			ytick={0,1,2,3,4,5},
%			legend pos=north west,
%			ymajorgrids=true,
%			grid style=dashed,
%			legend cell align={left},
%			legend style={font=\small},
%			label style={font=\Large},
%			tick label style={font=\Large},
%			]
%			
%			\addplot[
%			mark=square*,
%			]
%			coordinates {
%				(2, 1.66002255)(3, 2.44359555)(4, 3.1232606)(5, 3.70432545)(6, 4.221251650000001)(7, 4.42196575)(8, 4.6546377)(9, 4.75869995)(10, 4.8536645499999995)
%			};
%			\addlegendentry{Optimal Solution}
%			
%			\addplot[
%			mark=triangle*,
%			]
%			coordinates {
%				(2, 1.6525343)(3, 2.3604473500000003)(4, 2.9907811499999997)(5, 3.5816573499999995)(6, 3.9773839000000004)(7, 4.184410000000001)(8, 4.3791815)(9, 4.51524335)(10, 4.7262425)
%			};
%			\addlegendentry{GREEDY}
%			
%			\addplot[
%			mark=*,
%			]
%			coordinates {
%				(2, 1.6246157999999997)(3, 2.1996548)(4, 2.6462335)(5, 3.1330551500000006)(6, 3.5558744499999997)(7, 3.656167300000001)(8, 3.961854100000001)(9, 4.23410655)(10, 4.359990150000001)
%			};
%			\addlegendentry{RANDOM}
%			
%			\addplot[
%			mark=diamond*,
%			]
%			coordinates {
%				(2, 0.9625723)(3, 1.3041722500000001)(4, 1.7138328)(5, 2.0142970500000006)(6, 2.1969241999999998)(7, 2.6109122000000005)(8, 3.2023263999999996)(9, 2.88553305)(10, 3.2897729)
%			};
%			\addlegendentry{BAD}
%			
%			\end{axis}
%			\end{tikzpicture}
%		}
%		\caption{$\mathcal{U}(0,1)$, $n=5$} \label{uniform m fig}
%	\end{subfigure}
%	\hfill
%	\begin{subfigure}[t]{0.48\columnwidth}
%		\centering
%		\resizebox{\columnwidth}{!}{%
%			\begin{tikzpicture}%[scale=0.41]
%			\begin{axis}[
%			title={},
%			xlabel={The number of kinds $n$},
%			ylabel={Average Objective Value},
%			y label style={at={(axis description cs:-0.05,0.5)}},
%			xmin=0, xmax=11,
%			ymin=0, ymax=6,
%			xtick={0,2,4,6,8,10},
%			ytick={0,1,2,3,4,5},
%			legend pos=north west,
%			ymajorgrids=true,
%			grid style=dashed,
%			legend cell align={left},
%			legend style={font=\tiny},
%			]
%			
%			\addplot[
%			mark=square*,
%			]
%			coordinates {
%				(2, 1.8548754000000003)(3, 2.6869717)(4, 3.36650355)(5, 3.70432545)(6, 4.048221549999999)(7, 4.143350849999999)(8, 4.3734623)(9, 4.479215250000001)(10, 4.424744500000001)
%			};
%			\addlegendentry{Optimal Solution}
%			
%			\addplot[
%			mark=triangle*,
%			]
%			coordinates {
%				(2, 1.83915025)(3, 2.5737658)(4, 3.1727771999999996)(5, 3.5816573499999995)(6, 3.861673500000001)(7, 3.932961749999999)(8, 4.194158300000001)(9, 4.37569735)(10, 4.37836155)
%			};
%			\addlegendentry{GREEDY}
%			
%			\addplot[
%			mark=*,
%			]
%			coordinates {
%				(2, 1.80106885)(3, 2.2997047000000004)(4, 2.931109)(5, 3.1330551500000006)(6, 3.44588455)(7, 3.5162657499999996)(8, 3.8088669499999996)(9, 3.9279992)(10, 4.0240171)
%			};
%			\addlegendentry{RANDOM}
%			
%			\addplot[
%			mark=diamond*,
%			]
%			coordinates {
%				(2, 1.45798825)(3, 1.75534945)(4, 1.9809969499999998)(5, 2.0142970500000006)(6, 2.1616333)(7, 2.2605318999999997)(8, 2.22269415)(9, 2.32565315)(10, 2.3038571999999995)
%			};
%			\addlegendentry{BAD}
%			
%			\end{axis}
%			\end{tikzpicture}
%		}
%		\caption{$\mathcal{U}(0,1)$, $m=5$} \label{uniform n fig}
%	\end{subfigure}
%	\par\bigskip
%	\begin{subfigure}{0.48\columnwidth}
%		\centering
%		\resizebox{\columnwidth}{!}{
%			\begin{tikzpicture}
%			\begin{axis}[
%			title={},
%			xlabel={The number of agents $m$},
%			ylabel={Average Objective Value},
%			y label style={at={(axis description cs:-0.05,0.5)}},
%			xmin=0, xmax=11,
%			ymin=0, ymax=6,
%			xtick={0,2,4,6,8,10},
%			ytick={0,1,2,3,4,5},
%			legend pos=north west,
%			ymajorgrids=true,
%			grid style=dashed,
%			legend cell align={left},
%			legend style={font=\tiny},
%			]
%			
%			\addplot[
%			mark=square*,
%			]
%			coordinates {
%				(2, 1.4029935499999997)(3, 2.113704)(4, 2.7118999500000003)(5, 3.4134326)(6, 3.7221173999999997)(7, 4.033320000000001)(8, 4.25848025)(9, 4.4323809999999995)(10, 4.529688800000001)
%			};
%			\addlegendentry{Optimal Solution}
%			
%			\addplot[
%			mark=triangle*,
%			]
%			coordinates {
%				(2, 1.3969806999999999)(3, 2.0596735500000003)(4, 2.65843295)(5, 3.17613375)(6, 3.523324400000001)(7, 3.7931289000000006)(8, 4.03043845)(9, 4.178227549999999)(10, 4.3405231)
%			};
%			\addlegendentry{GREEDY}
%			
%			\addplot[
%			mark=*,
%			]
%			coordinates {
%				(2, 1.3239539)(3, 1.9301576000000005)(4, 2.3216920499999993)(5, 2.6961798999999997)(6, 3.029899200000001)(7, 3.341006)(8, 3.7669975500000006)(9, 3.8117185)(10, 3.974915850000001)
%			};
%			\addlegendentry{RANDOM}
%			
%			\addplot[
%			mark=diamond*,
%			]
%			coordinates {
%				(2, 0.9891482499999998)(3, 1.3363322)(4, 1.7188244000000001)(5, 2.1063266499999997)(6, 2.25094405)(7, 2.6179131500000006)(8, 2.7165561499999997)(9, 3.1530276000000006)(10, 3.28673415)
%			};
%			\addlegendentry{BAD}
%			
%			\end{axis}
%			\end{tikzpicture}
%		}
%		\caption{$\mathcal{N}(0.5,0.2)$, $n=5$} \label{normal m fig}
%	\end{subfigure}
%	\hfill
%	\begin{subfigure}{0.48\columnwidth}
%		\centering
%		\resizebox{\columnwidth}{!}{
%			\begin{tikzpicture}
%			\begin{axis}[
%			title={},
%			xlabel={The number of kinds $n$},
%			ylabel={Average Objective Value},
%			y label style={at={(axis description cs:-0.05,0.5)}},
%			xmin=0, xmax=11,
%			ymin=0, ymax=6,
%			xtick={0,2,4,6,8,10},
%			ytick={0,1,2,3,4,5},
%			legend pos=north west,
%			ymajorgrids=true,
%			grid style=dashed,
%			legend cell align={left},
%			legend style={font=\tiny},
%			]
%			
%			\addplot[
%			mark=square*,
%			]
%			coordinates {
%				(2, 1.7972243999999997)(3, 2.44142045)(4, 2.9950454000000004)(5, 3.4134326)(6, 3.54517535)(7, 3.7790676999999997)(8, 3.7991529)(9, 3.8020595)(10, 3.9077512500000005)
%			};
%			\addlegendentry{Optimal Solution}
%			
%			\addplot[
%			mark=triangle*,
%			]
%			coordinates {
%				(2, 1.7235277499999995)(3, 2.3133278)(4, 2.8619068)(5, 3.17613375)(6, 3.3872798000000004)(7, 3.7037934999999997)(8, 3.7443774000000003)(9, 3.7371571499999994)(10, 3.8780859)
%			};
%			\addlegendentry{GREEDY}
%			
%			\addplot[
%			mark=*,
%			]
%			coordinates {
%				(2, 1.7021949499999998)(3, 2.2236949)(4, 2.5416424999999996)(5, 2.6961798999999997)(6, 3.0206657)(7, 3.2277662)(8, 3.2726144999999995)(9, 3.3264973500000012)(10, 3.4190117)
%			};
%			\addlegendentry{RANDOM}
%			
%			\addplot[
%			mark=diamond*,
%			]
%			coordinates {
%				(2, 1.5063326500000005)(3, 1.6816709499999998)(4, 2.0752585)(5, 2.1063266499999997)(6, 2.1518658000000004)(7, 2.1241787499999996)(8, 2.2522424)(9, 2.17383295)(10, 2.2188536)
%			};
%			\addlegendentry{BAD}
%			
%			\end{axis}
%			\end{tikzpicture}	
%		}
%		\caption{$\mathcal{N}(0.5,0.2)$, $m=5$} \label{normal n fig}
%	\end{subfigure}
%	\par\bigskip
%	\begin{subfigure}{0.48\columnwidth}
%		\centering
%		\resizebox{\columnwidth}{!}{
%			\begin{tikzpicture}
%			\begin{axis}[
%			title={},
%			xlabel={The number of agents $m$},
%			ylabel={Average Objective Value},
%			y label style={at={(axis description cs:-0.05,0.5)}},
%			xmin=0, xmax=11,
%			ymin=0, ymax=6,
%			xtick={0,2,4,6,8,10},
%			ytick={0,1,2,3,4,5},
%			legend pos=north west,
%			ymajorgrids=true,
%			grid style=dashed,
%			legend cell align={left},
%			legend style={font=\tiny},
%			]
%			
%			\addplot[
%			mark=square*,
%			]
%			coordinates {
%				(2, 1.6763816500000002)(3, 2.6189462500000005)(4, 3.4015234999999997)(5, 4.10504045)(6, 4.52862065)(7, 4.7306764999999995)(8, 4.8540690500000006)(9, 4.929872050000001)(10, 4.94741405)
%			};
%			\addlegendentry{Optimal Solution}
%			
%			\addplot[
%			mark=triangle*,
%			]
%			coordinates {
%				(2, 1.6587945999999998)(3, 2.55229755)(4, 3.3161684)(5, 3.7805780500000004)(6, 4.177982)(7, 4.469475049999999)(8, 4.69061725)(9, 4.785669099999999)(10, 4.805300750000001)
%			};
%			\addlegendentry{GREEDY}
%			
%			\addplot[
%			mark=*,
%			]
%			coordinates {
%				(2, 1.6158439000000002)(3, 2.38439325)(4, 2.87197205)(5, 3.36474495)(6, 3.85626495)(7, 4.11104785)(8, 4.1001904499999995)(9, 4.3982104500000005)(10, 4.58995045)
%			};
%			\addlegendentry{RANDOM}
%			
%			\addplot[
%			mark=diamond*,
%			]
%			coordinates {
%				(2, 0.8541746000000001)(3, 1.4236687999999997)(4, 1.7123871000000002)(5, 2.1341617)(6, 2.5032486)(7, 2.7030350000000003)(8, 2.7476379499999997)(9, 3.1779363)(10, 3.1885980500000004)
%			};
%			\addlegendentry{BAD}
%			
%			\end{axis}
%			\end{tikzpicture}	
%		}
%		\caption{$\mathrm{Beta}(0.5,0.5), n=5$} \label{beta m fig}
%	\end{subfigure}
%	\hfill
%	\begin{subfigure}{0.48\columnwidth}
%		\centering
%		\resizebox{\columnwidth}{!}{
%			\begin{tikzpicture}
%			\begin{axis}[
%			title={},
%			xlabel={The number of kinds $m$},
%			ylabel={Average Objective Value},
%			y label style={at={(axis description cs:-0.05,0.5)}},
%			xmin=0, xmax=11,
%			ymin=0, ymax=6,
%			xtick={0,2,4,6,8,10},
%			ytick={0,1,2,3,4,5},
%			legend pos=north west,
%			ymajorgrids=true,
%			grid style=dashed,
%			legend cell align={left},
%			legend style={font=\tiny},
%			]
%			
%			\addplot[
%			mark=square*,
%			]
%			coordinates {
%				(2, 1.9551606499999998)(3, 2.81513525)(4, 3.60969595)(5, 4.10504045)(6, 4.39840695)(7, 4.541378249999999)(8, 4.630830850000001)(9, 4.7461541)(10, 4.73362895)
%			};
%			\addlegendentry{Optimal Solution}
%			
%			\addplot[
%			mark=triangle*,
%			]
%			coordinates {
%				(2, 1.9532690000000001)(3, 2.7365128)(4, 3.4115587499999998)(5, 3.7805780500000004)(6, 4.10564325)(7, 4.4348744)(8, 4.581430050000002)(9, 4.669974500000001)(10, 4.6517331)
%			};
%			\addlegendentry{GREEDY}
%			
%			\addplot[
%			mark=*,
%			]
%			coordinates {
%				(2, 1.8816513499999996)(3, 2.478581850000001)(4, 3.1962933500000004)(5, 3.36474495)(6, 3.5950179500000004)(7, 4.0028744000000005)(8, 4.07495175)(9, 4.229692399999999)(10, 4.228336)
%			};
%			\addlegendentry{RANDOM}
%			
%			\addplot[
%			mark=diamond*,
%			]
%			coordinates {
%				(2, 1.479242)(3, 1.8330088000000004)(4, 1.8890096)(5, 2.1341617)(6, 2.1190378999999995)(7, 2.07917165)(8, 2.2485720499999995)(9, 2.58550355)(10, 2.4718831)
%			};
%			\addlegendentry{BAD}	   	
%			\end{axis}
%			\end{tikzpicture}	
%		}
%		\caption{$\mathrm{Beta}(0.5,0.5), m=5$} \label{beta n fig}
%	\end{subfigure}
%	\caption{ The average objective value for different distributions}
%\end{figure}

\section{Conclusion}
In this paper, we study a new variant of the online bipartite problem in which each agent rejects an assignment with some probability. In our setting, the outcomes of previous assignments are not revealed when a new agent arrives. We give a deterministic algorithm whose competitive ratio is tight for the problem. Next, we propose an optimal randomized algorithm with a competitive ratio of $3/4$ when there are at most two arrivals. Finally, we show that, in the special case where each probability is either $0$ or a fixed value $p$, a competitive ratio of $1-1/e$ can be achieved, and this ratio is tight.


%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
\begin{acknowledgements} % will be removed in pdf for initial submission,
                         % so you can already fill it to test with the
                         % ‘accepted’ class option
This work was partially supported by the National Key Research and Development Program of China (Grant No. 2020AAA0103401); the National Natural Science Foundation of China (Grant No. 62172422); the Beijing Outstanding Young Scientist Program (No. BJJWZYJH012019100020098); and the Intelligent Social Governance Interdisciplinary Platform, Major Innovation \& Planning Interdisciplinary Platform for the ``Double-First Class'' Initiative, Renmin University of China.
\end{acknowledgements}

\bibliography{chen_165}



\end{document}
