\subsection{Related work}
\label{sec:relatedwork}

Approximation algorithms for the maximization of a submodular objective function subject to various constraints have been extensively studied in the literature \citep{nemhauser1978analysis,badanidiyuru2014fast,mirzasoleiman2015lazier,calinescu2011maximizing} with the assumption of oracle access to $f$. The runtime of these algorithms is generally measured in queries to $f$ as this is the main bottleneck (see Section \ref{appdx:related_work} for a more comprehensive discussion on the runtime of algorithms for various submodular optimization problems).

The problem of submodular maximization under bandit feedback has been studied in many previous works \citep{hiranandani2020cascading, yue2011linear,nie2022explore,singla2016noisy}. In most of the works, the objective is to minimize the regret, where the strategy is to explore new solution sets while exploiting past sampling results to select sets with relatively high function values \citep{yue2011linear,takemori2020submodular,nie2022explore,zhang2019online}. Within this broader category, \citep{singla2016noisy,jawanpuriarevisiting,chen2024linear} examine the same pure-exploration setting as us. In particular,
% While there are many works assuming value oracle access to $f$, algorithms developed assuming noisy access to $f$ are relatively less explored \cite{horel2016maximization,singla2016noisy,hassidim2017submodular,qian2017subset,crawford2019submodular,huang2022efficient}. 
% One related setting to ours is that we have noisy access to $f$, but this noise is \textit{persistent} \cite{horel2016maximization,hassidim2017submodular,qian2017subset,crawford2019submodular,huang2022efficient}. Our noisy setting departs from this direction in that the noisy feedback is random and repeated samples should be taken to diminish the noise. 
 the algorithm \singla of \citet{singla2016noisy} is for a noisy setting identical to ours and is developed for the MSMC problem specifically. Their algorithm combines the standard greedy algorithm with the best arm identification problem found in combinatorial bandit literature \citep{chen2014combinatorial}. 
 % \textcolor{blue}{We consider a broader class of optimization problems compared to \citet{singla2016noisy}, and in addition our algorithms for MSMC are more efficient in terms of sample complexity.}
 An extensive comparison of our algorithms and results with \cite{singla2016noisy} are presented in Appendix Section \ref{app:related}, as well as an experimental comparison in Section \ref{sec:exp}. Additionally, \cite{chen2024linear} presents a concurrent study employing threshold-based greedy algorithms. However, their approach is tailored to the relatively restricted class of objective functions with linear structures and does not apply to the general setting considered here.
 %and was developed after our work.

Another related but different setting is that of stochastic submodular optimization \citep{karimi2017stochastic,staib2019distributionally,ozcan2023stochastic} which assumes the optimization objective $f$ is the expectation over some unknown distribution over a set of monotone submodular functions. Therefore a sample average function can be built, which is also monotone and submodular, and algorithms run on it. In contrast, in our setting, it is only assumed that we can sample noisy queries at each subset $X\subseteq U$.

%One of the main contributions of our paper is the adaptive threshold sampling algorithm, \samplong (\samp). Notably, 
The intuition behind \samp is similar to the best-arm-identification problem in the multi-armed bandit literature \citep{kalyanakrishnan2012pac,chen2014combinatorial}.
%In pure exploratory bandits under the PAC setting, the learner interacts with a total of $n$ arms. The goal is to find an arm $a$ that guarantees $r^*-r_a\leq\epsilon$ with probability at least $1-\delta$, using as few samples as possible. In this setting, \cite{kalyanakrishnan2012pac} propose the LUCB algorithm that achieves a sample complexity of $O(H^{\epsilon}\log\big(\frac{H^{\epsilon}}{\delta}\big))$, where $H^{\epsilon}=\sum_{i\in[n]}\frac{1}{(\max\{\Delta_i,\epsilon/2\})^2}$ and $\Delta_i=r^*-r_i$.
Both the algorithm LUCB of \cite{kalyanakrishnan2012pac} and \samp share a common underlying intuition: they leverage the difference between expectations to reduce the number of noisy queries required. In LUCB, this difference is between the expectation of the optimal arm and other arms, while in \samp, it is between the expectation of the input variable and the threshold value $w$.
%By adaptively adjusting the confidence interval and determining when to terminate the sampling process, both two algorithms can significantly reduce the number of queries compared to fixed approximation sampling, which doesn't consider the difference between expectations and only approximates the evaluated random variable to a close precision. 

% TODO more related \cite{ozcan2021submodular,ozcan2023stochastic,mokhtari2018conditional,hassidim2017robust}.

%The most basic algorithm proposed by \singla is based on the standard greedy algorithm. At each iteration, their basic algorithm chooses the element of highest marginal gain with high probability by adaptively sampling marginal gains to the needed precision. They use work done on the best arm identification problem in the multi-armed bandit literature in order to determine how many samples are needed. A downside to this basic algorithm is that if there are small differences in the marginal gains between different elements, this can result in a lot of samples. They then propose a randomized version of the standard greedy algorithm that is less dependent on small differences in marginal gains, but on the other hand the approximation guarantee is weaker since it is in expectation instead of with high probability. We provide a more detailed description of the algorithm of \singla compared to ours in Section \ref{sec:compare to the previous result}. In summary, our approach is related to \singla but there are a few advantages to ours: (i) Our algorithm has an approximation guarantee with high probability as opposed to in expectation; (ii) Our algorithm is not as sensitive to small differences in marginal gain since it doesn't choose the element of highest marginal gain; (iii) The algorithm of \singla has greater time complexity beyond query complexity because it requires {\color{blue} O(n) computations per each noisy query to $\Delta f$ (see discussion in Section \ref{sec:exp_results}) }; (iv) Our algorithm makes fewer queries of $f$ overall since it is based on a faster variant of the greedy algorithm (\threshold); (v) Our algorithm is simpler and requires a simpler analysis. We compare the algorithms experimentally in Section \ref{sec:exp_results}.

%{\color{blue}Another line of work assumes the optimization objective is the expectation of  monotone submodular functions over some unknown distribution, i.e., $f_P(S)=\mathbb{E}_{f\sim P}f(S)$ and $f$ is monotone, submodular
%\cite{karimi2017stochastic,staib2019distributionally}. In particular, \citeauthor{karimi2017stochastic} proposed an approach with a $(1-1/e)$-approximation guarantee in expectation for the class of weighted coverage functions. In the stochastic submodular setting, it is assumed that 
% each realization $f$ of the noisy objective is monotone and submodular and can be stored to query $f(S)$ for all subsets $S\subseteq U$. However, in our setting, we don't assume that the noisy query function is also a monotone submodular function and that a noisy sampling of $f$ can be used to query for all subsets. }

% The problem of monotone submodular optimization has been extensively studied in many machine learning fields such as data summarization, influence maximization, and graph-based algorithms.
%Most of the works that study the problem of monotone submodular optimization assume oracle access of the function values, which is a strong assumption that might fail in many real-world applications such as submodular bandit \cite{yue2011linear}, \cite{gabillon2013adaptive}. When such an oracle doesn't exist, the problem becomes more challenging, and additional access models need to be assumed. There are two different access models used in previous works, the deterministic noise model and the stochastic noise model. The deterministic noise model assumes a function $F$ that is a deterministic surrogate function of $f$. i.e., the querying result of $F(S)$ remains the same if multiple queries are made.  \cite{crawford2019submodular}, \cite{geng2022bicriteria} assume the surrogate function $F$ with
%$\epsilon$-additive noise to $f$, i.e., $|F(S)-f(S)|\leq\epsilon$. \cite{horel2016maximization}, \cite{golz2019migration}, \cite{nguyen2020streaming} consider noisy submodular objective with $\epsilon$-multiplicative approximate function $F$, i.e. $(1-\epsilon)f(S)\leq F(S)\leq(1+\epsilon)f(S)$. Another line of work assumes the stochatic access model \cite{hassidim2017submodular}, i.e., $F(S)$ is a random variable. \cite{karimi2017stochastic}, \cite{kempe2003maximizing} studies the problem where the submodular objective is the expectation of some submodular functions
%over some unknown distribution. Other works \cite{hassidim2017submodular} \cite{huang2022efficient} consider multiplicative i.i.d noise. Among all these works,  