In this section, we propose Learning to Stabilize from Zero with Noise (LTS\textsubscript{0}-N). 
The algorithm is divided into 4 stages: (i) learn an orthonormal basis $P_1$ of the unstable subspace $E_u$ (Stage 1); (ii) learn $M_1$, the restriction of $A$ onto the subspace $E_u$ (Stage 2); (iii) learn $B_\tau = P_1^* A^{\tau-1}B$ (Stage 3); and (iv) design a controller that seeks to stabilize the ``unstable'' $E_u$ subspace (Stage 4). This is formally described in \Cref{alg:LTS0}. We provide detailed descriptions of the four stages in LTS\textsubscript{0}-N. 

\input{sections/pseudocode}

%In Stage 1, the system runs in open loop with 0 control input for $T$ steps, which will push the state close to $E_u$. Using the state trajectory $D:= [x_1, \cdots, x_T]$, we can construct the projector on to $E_u$ as $\hat{\Pi}_1 = U^{(k)}(U^{(k)})^*$, where $U^{(k)}$ is the first $k$ columns of the matrix $U$ in the compressed singular value decomposition of $D = U \Sigma V^*$, as an estimator of $\Pi_1 = P_1 P_1^*$. Therefore, the column space $\hat{P}_1$ of $\hat{\Pi}_1$ is an estimator of $P_1$. In Stage 2, the algorithm uses $\hat{P}_1$ to solve the least square problem for $\hat{M}_1$. In Stage 3, the algorithm only applies nonzero control every $\tau$ steps and wait for the system to "reset" in the next $\tau - 1$ steps, by which the algorithm learns the control matrix $B_\tau$. In Stage 4, the algorithm designs the controller $\hat{K}_1$ to stabilize the system. 