['6c6', '< We study statistical estimation in the spiked tensor model, where we observe a q-tensor Y ∈ R n q in n q dimensions given by', '---', '> We study statistical estimation in the spiked tensor model, a problem known for its large computational-statistical gap. In this model, we observe a q-tensor Y ∈ R n q in n q dimensions, defined as:', '8c8', '< Here u ∼ Unif({+1, -1} n ) is some hidden signal, and W ∈ (R n ) ⊗q is a noise tensor whose entries are i.i.d. standard Gaussian N (0, 1). The parameter λ n > 0 is the signal-to-noise ratio (SNR). The goal is to estimate u given only access to Y . That is, we seek an estimator û : (R n ) ⊗q → S n-1 ( √ n) achieving nontrivial overlap with the signal:', '---', '> Here, u ∼ Unif({+1, -1} n ) is a hidden signal, and W ∈ (R n ) ⊗q is a noise tensor with i.i.d. standard Gaussian N (0, 1) entries. The parameter λ n > 0 denotes the signal-to-noise ratio (SNR). The goal is to estimate u given only Y . That is, we seek an estimator û : (R n ) ⊗q → S n-1 ( √ n) that achieves non-trivial overlap with the signal:', '11,13c11,16', '< The spiked tensor model is a famous problem because it exhibits a huge computational-statistical gap, referring to regimes of SNR where the statistical estimation problem is information-theoretically solvable, but no efficient algorithm has been found. For example, it is known that the Bayes optimal estimator achieves non-trivial overlap with the signal u when λ n > λ IT for some constant threshold λ IT = Θ (1), whereas the problem is information-theoretically impossible when λ n ≤ λ IT [1]. Furthermore, the maximum likelihood estimator also achieves non-trivial overlap with the signal when λ n > λ MLE for some λ MLE = Θ (1). However, the best-known polynomial-time classical algorithms for computing a non-trivial estimator require a much higher SNR of λ n = Θ(n (q-2)/4 ). 
These include tensor power iteration, gradient descent, approximate message passing, and spectral methods with tensor unfolding [2][3][4][5][6][7][8][9][10][11][12]. Indeed, assuming the secret leakage planted clique conjecture, Ref. [13] proves an Ω(n (q-2)/4 ) lower bound on the SNR needed by any polynomial-time classical algorithm. See Fig. 1 for an illustration of the different SNR thresholds and Section 2.1 for more background.', '< On the other hand, quantum algorithms are widely believed to have computational advantages over classical algorithms for many problem classes. In particular, we focus on the Quantum Approximate Optimization Algorithm (QAOA) [14], a general-purpose quantum optimization algorithm that can be applied to optimize any objective function on bit-strings. The QAOA has received an enormous amount of attention in the quantum computing community for several reasons. First, the QAOA is simple and allows efficient implementation on near-term quantum hardware with many applications [15][16][17][18][19]. Additionally, the QAOA is computationally universal [20], and its generalization can realize other powerful algorithms such as the quantum singular value transform [21]. Under common complexity-theoretic assumptions, no classical device can efficiently simulate the output distribution of the QAOA even at shallow depth [22,23]. Furthermore, the QAOA is guaranteed to find optimal solutions when its number of steps (or depth) diverges [14]. Nevertheless, analyzing the asymptotic performance of QAOA remains challenging: classical simulation of the algorithm is limited to small problem dimension n, and analytical computations are often highly non-trivial [24][25][26][27][28]. 
Given the enormous Ω(n (q-2)/4 ) computational-statistical gap in the spiked tensor model (compared to e.g., the constant factor gap in spin-glass optimization [29]), it is an interesting open question whether the QAOA, as a realistic quantum algorithm with asymptotic convergence guarantees, can provide any computational advantages.', '< In this work, we investigate the performance of QAOA for the spiked tensor model. In particular, we choose the log-likelihood objective of the spiked tensor model, C(z) = ⟨Y , z ⊗q ⟩/n (q-2)/2 . Its maximizer, the maximum likelihood estimator, achieves non-trivial overlap with the signal whenever λ n > λ MLE = Θ (1). While the infinite-step QAOA could compute the maximizer, we are interested in the performance of QAOA when the depth is polynomial in the problem size, and hope that it can surpass the Θ(n (q-2)/4 ) classical threshold. Although some limitations of the QAOA are known for certain random optimization problems in the low-depth regime [26,30,31,32], these negative results do not apply to the spiked tensor model since they rely on either sparse connectivity or concentration, both of which are absent in the current setting. Here, as a first attempt to bridge the gap in understanding how well a popular quantum algorithm may perform on a classically hard statistical estimation problem, we study the asymptotic behavior of the QAOA on the spiked tensor model in the constant-depth regime, where we are able to obtain rigorous and analytical results.', '---', '> ', '> The spiked tensor model is renowned for exhibiting a substantial computational-statistical gap. This gap refers to regimes of SNR where the statistical estimation problem is information-theoretically solvable, yet no efficient classical algorithm has been discovered. 
For instance, the Bayes optimal estimator can achieve non-trivial overlap with the signal u when λ n > λ IT for some constant threshold λ IT = Θ (1), while the problem becomes information-theoretically impossible when λ n ≤ λ IT [1]. Similarly, the maximum likelihood estimator (MLE) also achieves non-trivial overlap when λ n > λ MLE for some λ MLE = Θ (1). However, the most effective polynomial-time classical algorithms for computing a non-trivial estimator necessitate a much higher SNR of λ n = Θ(n (q-2)/4 ). These include methods such as tensor power iteration, gradient descent, approximate message passing, and spectral methods with tensor unfolding [2][3][4][5][6][7][8][9][10][11][12]. Furthermore, assuming the secret leakage planted clique conjecture, Ref. [13] establishes an Ω(n (q-2)/4 ) lower bound on the SNR required by any polynomial-time classical algorithm. Figure 1 illustrates these various SNR thresholds, with Section 2.1 providing additional background.', '> ', '> In contrast, quantum algorithms are widely posited to offer computational advantages over classical counterparts for numerous problem classes. Our focus is on the Quantum Approximate Optimization Algorithm (QAOA) [14], a versatile quantum optimization algorithm applicable to any objective function on bit-strings. QAOA has garnered significant attention within the quantum computing community due to its simplicity, efficient implementability on near-term quantum hardware, and broad applicability [15][16][17][18][19]. It is also computationally universal [20], and its generalizations can implement other powerful algorithms like the quantum singular value transform [21]. Under standard complexity-theoretic assumptions, classical devices cannot efficiently simulate the output distribution of QAOA, even at shallow depths [22,23]. Moreover, QAOA is guaranteed to find optimal solutions as its number of steps (or depth) increases indefinitely [14]. 
Nevertheless, analyzing the asymptotic performance of QAOA remains challenging, as classical simulation is restricted to small problem dimension n, and analytical computations are often highly non-trivial [24][25][26][27][28]. Given the enormous Ω(n (q-2)/4 ) computational-statistical gap in the spiked tensor model (substantially larger than, for example, the constant factor gap observed in spin-glass optimization [29]), it is an interesting open question whether QAOA, as a realistic quantum algorithm with asymptotic convergence guarantees, can offer any computational advantages in this context.', '> ', '> This work investigates the performance of QAOA for the spiked tensor model, specifically employing the log-likelihood objective function C(z) = ⟨Y , z ⊗q ⟩/n (q-2)/2 . The maximizer of this function, the maximum likelihood estimator (MLE), achieves non-trivial overlap with the signal whenever λ n > λ MLE = Θ (1). While infinite-step QAOA could, in principle, compute the MLE, we are particularly interested in the performance of QAOA at depths polynomial in the problem size, with the hope that it might surpass the Θ(n (q-2)/4 ) classical threshold. Although certain limitations of QAOA are known for specific random optimization problems in the low-depth regime [26,30,31,32], these negative results rely on either sparse connectivity or concentration, neither of which is present in the spiked tensor model. Therefore, these limitations do not apply here. As a first step toward understanding how well a popular quantum algorithm may perform on a classically hard statistical estimation problem, we study the asymptotic behavior of QAOA on the spiked tensor model in the constant-depth regime, where we are able to obtain rigorous analytical results.', '22c25', '< Tensor power iteration. A well-studied classical algorithm for the spiked tensor model is tensor power iteration [2,10,33]. 
Starting from a uniform random initialization û0 ∼ Unif(S n-1 ), the k-th iteration is given by ûk , where', '---', '> Tensor Power Iteration. A widely studied classical algorithm for the spiked tensor model is tensor power iteration [2,10,33]. Starting from a uniform random initialization û0 ∼ Unif(S n-1 ), the k-th iteration is defined as ûk, where:', '24,27c27,32', '< Here, Y [û ⊗(q-1) ] ∈ R n denotes contracting the order-q tensor Y ∈ R n q with the order-(q -1) tensor û⊗(q-1) ∈ R n q-1 . It is shown that with (log n) iterations, weak recovery is possible if the SNR satisfies λ n = Ω(n (q-2)/2 / polylog(n)) [10,33]. However, tensor power iteration does not match the best-known classical algorithms. Furthermore, we remark that rounding the tensor power iteration to sign(û k ) ∈ {±1} n does not give a better threshold.', '< Other classical algorithms and related results. [2] showed that the tensor power iteration and approximate message passing algorithms with random initialization can recover the signal provided λ n = Ω(n (q-1)/2 ). This SNR threshold was later improved to λ n = Ω(n (q-2)/2 ) by [3,10,33] for these same methods. The same threshold λ n = Ω(n (q-2)/2 ) could also be achieved by gradient descent and Langevin dynamics as proved in [9]. On maximum likelihood estimation for the spiked tensor model with a spherical prior, [5,6] studied the loss landscape, providing intuition that it contains many saddle points and local minima near the equator, but no bad critical points off the equator.', '< The best currently known polynomial-time algorithms can achieve a sharp threshold of λ n = Ω(n (q-2)/4 ). 
These include spectral methods with tensor unfolding [2,11], sum-of-squares algorithms [34][35][36], sophisticated iteration algorithms [37][38][39], and gradient descent on the smoothed landscape [40,41].', '< Another line of research has attempted to prove computational lower bounds in restricted computational models, including low-degree polynomials and statistical query algorithms [42,43]. Under the secret leakage planted clique conjecture, [13] proved that any classical polynomial-time algorithm requires λ n = Ω(n (q-2)/4 ) for weak recovery of the signal.', '---', '> Here, Y [û ⊗(q-1) ] ∈ R n denotes the contraction of the order-q tensor Y ∈ R n q with the order-(q -1) tensor û⊗(q-1) ∈ R n q-1 . It has been shown that with O(log n) iterations, weak recovery is achievable if the SNR satisfies λ n = Ω(n (q-2)/2 / polylog(n)) [10,33]. However, this performance does not match the recovery thresholds of the best-known classical algorithms. Furthermore, rounding the output of tensor power iteration to sign(û k ) ∈ {±1} n does not asymptotically improve this threshold.', '> Other Classical Algorithms and Related Results. Early work by [2] demonstrated that tensor power iteration and approximate message passing algorithms, when initialized randomly, could recover the signal provided the SNR satisfied λ n = Ω(n (q-1)/2 ). This SNR threshold was subsequently improved to λ n = Ω(n (q-2)/2 ) for these methods by [3,10,33]. Gradient descent and Langevin dynamics were also shown to achieve this λ n = Ω(n (q-2)/2 ) threshold, as proven in [9]. Regarding maximum likelihood estimation for the spiked tensor model with a spherical prior, [5,6] investigated the loss landscape, providing insights into its structure, which includes numerous saddle points and local minima near the equator, but no problematic critical points away from it.', '> ', '> The most advanced polynomial-time classical algorithms currently known can achieve a sharper threshold of λ n = Ω(n (q-2)/4 ). 
These include spectral methods leveraging tensor unfolding [2,11], sum-of-squares algorithms [34][35][36], sophisticated iterative algorithms [37][38][39], and gradient descent applied to a smoothed loss landscape [40,41].', '> ', '> A parallel research direction has focused on establishing computational lower bounds within restricted computational models, such as low-degree polynomials and statistical query algorithms [42,43]. Notably, under the secret leakage planted clique conjecture, Ref. [13] proved an Ω(n (q-2)/4 ) lower bound on the SNR necessary for any polynomial-time classical algorithm to achieve weak recovery of the signal.', '1272d1276', '< ']
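
The tensor power iteration described in the hunks above can be illustrated with a short numerical sketch for q = 3. This is a minimal illustration under stated assumptions, not the authors' implementation. The observation model Y = (λ n / n (q-1)/2 ) u ⊗q + W is an assumed normalization, since the defining equation is not reproduced in this excerpt. The update rule (contract Y with û ⊗(q-1), then renormalize) is the standard normalized form and is likewise assumed. The function names spiked_tensor, tensor_power_iteration, and overlap are hypothetical.

```python
import numpy as np


def spiked_tensor(n, lam, rng):
    """Sample an order-3 spiked tensor and its hidden signal.

    Assumed model (normalization not given in the excerpt):
        Y = lam / n^{(q-1)/2} * u^{(x)q} + W,  with q = 3, so the factor is lam / n.
    """
    u = rng.choice([-1.0, 1.0], size=n)        # u ~ Unif({+1, -1}^n)
    W = rng.standard_normal((n, n, n))         # i.i.d. N(0, 1) noise entries
    Y = (lam / n) * np.einsum("i,j,k->ijk", u, u, u) + W
    return Y, u


def tensor_power_iteration(Y, num_iters, rng):
    """Run tensor power iteration from a uniformly random unit vector."""
    n = Y.shape[0]
    x = rng.standard_normal(n)
    x /= np.linalg.norm(x)                     # uniform on the unit sphere
    for _ in range(num_iters):
        # Contract Y with x^{(x)(q-1)}: result_i = sum_{j,k} Y_ijk x_j x_k
        x = np.einsum("ijk,j,k->i", Y, x, x)
        x /= np.linalg.norm(x)
    return np.sqrt(n) * x                      # rescale to the sphere of radius sqrt(n)


def overlap(u_hat, u):
    """Normalized absolute overlap |<u_hat, u>| / (||u_hat|| ||u||), in [0, 1]."""
    return abs(u_hat @ u) / (np.linalg.norm(u_hat) * np.linalg.norm(u))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n = 20
    Y, u = spiked_tensor(n, lam=1e4, rng=rng)  # SNR chosen far above any threshold
    u_hat = tensor_power_iteration(Y, num_iters=30, rng=rng)
    print("overlap:", overlap(u_hat, u))
```

With the SNR set far above the thresholds discussed above, the printed overlap should be close to 1. Lowering lam toward the n (q-2)/2 scale is one way to see the recovery degrade on small instances, although small n gives only a rough picture of the asymptotic thresholds.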
