In this section, we formally state the assumptions and show our approach finds a stabilizing controller without suffering from exponential blow-up in $n$. Our first assumption is regarding the spectral properties of $A$, which requires distinct eigenvalues with specified eigengap.

\begin{assumption}[Spectral Property]
\label{assumption:eigengap}
    $A$ is diagonalizable with distinct eigenvalues $\lambda_1,\ldots,\lambda_n$ satisfying $|\lambda_1| > |\lambda_2| > \dots > |\lambda_k| > 1 > |\lambda_{k+1}| > \dots > |\lambda_n|$. %Moreover, $|\lambda_k| > 1+\frac{C}{T}$. $|\lambda_{k+1}| < 1 - \frac{C}{T}$. 
\end{assumption}

We assume the learner knows the value of $k$. However, we point out that our algorithm works as long as the learner picks a value $\hat{k}$ at least as large as $k$. In order to provide guarantee to the estimation of the open-loop unstable system dynamics, we also need an assumption on the distribution of noise $\eta$.

\begin{assumption}[pdf of $\eta$]
    \label{assumption:pdf}
    Let $M_1 := \Bar{P}^{-1}J \Bar{P}$ denote the Jordan normal form of $M_1$, and $\Bar{P} := [\Bar{P}_1,\Bar{P}_2,\cdots,\Bar{P}_k]^*$.
    There exists $C_z \in \mathbb{R}$, such that the supremum of the probability distribution function (pdf) of $\left|\Bar{P}_i^* \sum_{j = 1}^t M_1^{-j} P_1^* \eta_j\right|$ is upper bounded almost everywhere,.i.e.
   % \begin{equation*}
       $ \esssup \text{pdf} \left(\left|\Bar{P}_i^* \sum_{j = 1}^t M_1^{-j} P_1^* \eta_j\right|\right) < C_z$, 
   % \end{equation*}
    for all $i \in \{1,\dots,k\}$ and $t \in \mathbb{N}$. 
\end{assumption}
\Cref{assumption:pdf} holds for most common noise distributions, including bounded uniform distribution and Gaussian distributions(\Cref{lemm:upper_bd_Cz}). We further discuss this assumption in \Cref{Appendix:D1} and \ref{Appendix:Aux_D1}. 

With the above assumptions, our main result is as follows. 
\begin{theorem}
\label{thm:main}
Given a noisy LTI system $x_{t+1} = Ax_t + Bu_t + \eta_t$ subject to \Cref{assumption:eigengap}, \Cref{assumption:pdf}, and additionally, $|\lambda_1| |\lambda_{k+1}| < 1$. Further, denote $\gap := \left|\prod_{\substack{m_1 \neq m_2,\\m_1, m_2 \in \{1,\dots,k\} }}(\lambda_{m_1}^{-1} - \lambda_{m_2}^{-1})\right|.$ By running \cref{alg:LTS0} with parameters
$\gamma = O(1), \quad \delta = O(m^{-\frac{1}{2}}), \quad\tau = O(1), \quad \alpha = O(1) $, and $
        T = O\left(k \log k + \log(n-k) + \log m - \log\gap\right)$, the controller returned by \Cref{alg:LTS0} is a stabilizing controller. Further, Algorithm~\ref{alg:LTS0} guarantees that 
\begin{equation*}
    \norm{x_t} < \exp \left(O \left(k \log k + \log (n-k) + m - \log\gap\right)\right),
\end{equation*}
before termination. Here the big-O notation only shows dependence on $k,m$ and $n$, while omitting dependence on $C, C_z, |\lambda_1|, |\lambda_k|,|\lambda_{k+1}|, \theta$, $\mathcal{K}$, and $\mathcal{U}$. 
\end{theorem}

The precise bound given for each constant can be found at \eqref{eqn:tau_final},\eqref{eqn:bdd_gamma},\eqref{eqn:bdd_alpha}, and \eqref{eqn:bdd_delta} in the Appendix, and the bound for $T$ is given in \Cref{thm:projection}. Despite the more challenging setting with noises and potentially underactuated systems, \Cref{thm:main} achieves a similar guarantee as \citet{LTI}. Specifically, in the regime of $m=O(k)$,\footnote{We note that the regime of $m=O(k)$ is the most interesting regime as it covers the under-actuated setting, which is known to be more challenging. } the above Theorem shows that LTS\textsubscript{0}-N finds a stabilizing controller with an upper bound on state norm at $2^{\Tilde{O}(k)}$, which is better than the state-of-the-art $2^{\Theta(n)}$ complexity in the noisy settings. Therefore, our approach leverages instance specific properties (the dimension of unstable subspace $k$) to \emph{break the exponential lower bound \citep{Chen07} and learns to stabilize without the exponential blow-up in $n$ in noisy and under-actuated settings}. 

We also point out that constant $\gap$ is also $k$-dependent. In the worst case, the $\gap$ has an order of $2^{O(k^2)}$. This is still independent of $n$. We note that \citet{LTI} did not show explicit dependence on this constant. We leave it as future work whether this additional constant is essential or is an artifact of the proof. Moreover, our assumption that $|\lambda_1||\lambda_{k+1}| < 1$ is weaker than the assumption in \citet{LTI}, which requires $|\lambda_1|^2 |\lambda_{k+1}| < |\lambda_k|$. If the system can not satisfy this assumption, than the learner can pick $\hat{k}>k$, such that $|\lambda_1||\lambda_{\hat{k}+1}|<1$ for \Cref{alg:LTS0}. In the worst case, LTS\textsubscript{0}-N has the same complexity bound as the algorithm proposed in \citet{Chen07}. Overall, we can achieve a better bound than the previous works for the LTI system with additive noise using singular value decomposition to isolate the unstable component. The key steps will be shown in \Cref{thm:projection} in \Cref{sec:proof}. 


We demonstrate the effectiveness of our algorithm in simulation in \Cref{sec:simulation}, showing our algorithm's state norm does not blow-up with $n$ and also outperforms other benchmarks.