
\section{Omitted experiments}

\subsection{SBMs: Additional structure}
In addition to our SBM experiment in Section~\ref{sec:general}, we have a similar experiment for an SBM with 8 clusters where one pair of clusters have a higher affinity for each other. More precisely, \(P\) is an 8 by 8 matrix with \(0.5\) along its diagonal, \(P_{12} = P_{21} = 0.3\) and every other value is set to \(0.05\). The bounds for this SBM for a varied cluster size \(n\) can be seen in Figure~\ref{fig:sbm8cluster1pair}. 
\begin{figure}[h]
    \centering
        \includegraphics[width=0.5\linewidth]{Figures/Bounds8Clusters1PairRST.png}
        \caption{Stochastic block model with 8 blocks where one pair of blocks have a higher affinity to each other. Each point is an average over 10 realisations and the standard deviation is included as filled error bars.}
        \label{fig:sbm8cluster1pair}
\end{figure}

We also considered an SBM with multiple layers in its hierarchy of clusters. We took an SBM with 12 blocks that can be split into three sets of four. In each quartet, two pairs of the blocks have a higher affinity for each other. Precisely, this was the \(12 \times 12\) probability matrix \(P\) that was used:
\[
P = \begin{bmatrix}
Q & \bf{.05} & \bf{.05} \\
\bf{.05} & Q & \bf{.05} \\
\bf{.05} & \bf{.05} & Q \\
\end{bmatrix}
\]
Where 
\[Q = \begin{bmatrix}
& \cellcolor{yellow!90}.9 & \cellcolor{yellow!30}.7 & .2 & .2 & \\
& \cellcolor{yellow!30}.7 & \cellcolor{yellow!90}.9 & .2 & .2 & \\
& .2 & .2 & \cellcolor{yellow!90}.9 & \cellcolor{yellow!30}.7 & \\
& .2 & .2 & \cellcolor{yellow!30}.7 & \cellcolor{yellow!90}.9 & \\
\end{bmatrix}\]
and \(\bf{.05}\) is a \(4 \times 4\) matrix of \(0.05\). The bounds for this SBM for varied cluster size \(n\) can be seen in Figure~\ref{fig:sbm12cluster4quartets}.

\begin{figure}[h]
    \centering
    \includegraphics[width=0.5\linewidth]{Figures/Bounds12Clusters3QuartetsRST.png}
    \caption{Stochastic block model with 12 blocks where three sets of four blocks have a higher affinity to blocks inside their quartet. Inside each quartet, two pairs of blocks have an even higher affinity to each other. Each point is an average over 5 realisations and the standard deviation is included as filled error bars.}
    \label{fig:sbm12cluster4quartets}
\end{figure}


In both experiments, we see that Theorem~\ref{thm:rec} greatly outperforms the other bounds. 

\subsection{SBMs: Close to the perfect recovery threshold}
We now consider our bounds for an SBM with 2 blocks with probability matrix
\[P = \begin{pmatrix}
    p & q \\
    q & p \\
\end{pmatrix}\]
where \(p = \frac{\alpha \log(N)}{N}, q = \frac{\beta \log(N)}{N},\) with \(\alpha\) and \(\beta\) taken at or close to the perfect recovery threshold \(\sqrt{\alpha} - \sqrt{\beta} \ge \sqrt{2}\). Details of this threshold and its significance can be found in \cite{hajek2016achieving}. We study the results of several experiments for different values of $\alpha$ and $\beta$.
In Figure~\ref{fig:ThresholdVaryingNBeta20LogScale}, we try \(\beta = 20\) with \(\alpha\) satisfying \(\sqrt{\alpha} - \sqrt{\beta} = 2\) and study the bounds for a varied cluster size \(n\). In Figure~\ref{fig:ThresholdVaryingNBeta1LogScale}, we try the same experiment with \(\beta=1\).


\begin{figure}[h!]
    \centering
    \begin{subfigure}[t]{0.48\linewidth}
        \centering
        \includegraphics[width=\textwidth]{Figures/ThresholdVaryingNBeta20LogScale.png}
        \vspace{0.01cm}
        \caption{\(\beta = 20\)}
        \label{fig:ThresholdVaryingNBeta20LogScale}
    \end{subfigure}
    ~
    \begin{subfigure}[t]{0.48\linewidth}
        \centering
        \includegraphics[width=\textwidth]{Figures/ThresholdVaryingNBeta1LogScale.png}
        \vspace{0.01cm}
        \caption{\(\beta = 1\)}
        \label{fig:ThresholdVaryingNBeta1LogScale}
    \end{subfigure}
    \caption{\(\beta = 20\) (left) and \(\beta =1\) (right), \(\alpha = (\sqrt{\beta} + \sqrt{2})^2\), varying cluster size \(n\). An average is taken over 10 realisations for each value and the standard deviation is included as filled error bars.}
\end{figure}

The bounds do not vary much for our choices of \(n\), however we see that Theorem~\ref{thm:rec} provides a tighter bound on the true value than the other results.
We also considered varying \(\beta\) with \(\alpha\) satisfying \(\sqrt{\alpha} - \sqrt{\beta} = 2\) and cluster size fixed at \(n=500\). The results can be seen in Figure~\ref{fig:ThresholdVaryingBetaAtThresholdLogScale}. \

\begin{figure}[h]
    \centering
    \includegraphics[width=0.7\linewidth]{Figures/ThresholdVaryingBetaAtThresholdLogScale.png}
    \caption{cluster size \(n = 500\); \(\beta\) is varied and \(\alpha\) is parametrised by \(\beta\) with \(\alpha = (\sqrt{\beta} + \sqrt{2})^2\). Each value is an average over 10 realisations and the standard deviation is included as filled error bars. }
    \label{fig:ThresholdVaryingBetaAtThresholdLogScale}
\end{figure}

We notice a similar behaviour in that the bounds do not change much at all as we vary our parameter \(\beta\). Again, the bound from Theorem~\ref{thm:rec} is much closer to the true value. Finally, we consider varying \(\beta\) for a fixed choice of \(\alpha\) (\(\alpha = 35\)). This will mean that as \(\beta\) is increased we will get closer to the threshold and eventually will be in violation of it. The results are pictured in Figure~\ref{fig:ThresholdVaryingBetaFixedAlphaAtThresholdLogScale}.

\begin{figure}[h]
    \centering
    \includegraphics[width=0.7\linewidth]{Figures/ThresholdVaryingBetaFixedAlphaLogScale.png}
    \caption{\(n = 500\); \(\beta\) is varied and \(\alpha\) is fixed at \(\alpha = 35\). Each value is an average over 10 realisations and the standard deviation is included as filled error bars. }
    \label{fig:ThresholdVaryingBetaFixedAlphaAtThresholdLogScale}
\end{figure}
In this experiment, we see that the true distance of the eigenvectors from the indicator vectors of the clusters grows with \(\beta\), and the bound from Theorem~\ref{thm:rec} stays relatively close to it. The other bounds are rather poor and increase slightly.


\subsection{DSBMs: Cyclic cluster structure}

Similar to our experiment in Section~\ref{sec:digraphs}, we consider a graph \(\mathcal{G} \sim \text{DSBM}(k,n,P,F)\) where \(k = 4\), \(n = 100\) and \(P\) and \(F\) are defined as follows:
\[
F = \begin{pmatrix}
    &\cellcolor{yellow!30}.5 & \cellcolor{yellow!60}1 & \cellcolor{yellow!30}.5 & \cellcolor{yellow!0}0& \\
    &\cellcolor{yellow!0}0 & \cellcolor{yellow!30}.5 & \cellcolor{yellow!60}1 & \cellcolor{yellow!30}.5& \\
    &\cellcolor{yellow!30}.5 & \cellcolor{yellow!0}0 & \cellcolor{yellow!30}.5 & \cellcolor{yellow!60}1&  \\
    &\cellcolor{yellow!60}1 & \cellcolor{yellow!30}.5 & \cellcolor{yellow!0}0 & \cellcolor{yellow!30}.5& \\
\end{pmatrix}, \ P = \begin{pmatrix}
    &\epsilon & \cellcolor{yellow!60}1 & \epsilon & \cellcolor{yellow!60}1 &\\
    &\cellcolor{yellow!60}1 & \epsilon & \cellcolor{yellow!60}1 & \epsilon &\\
    &\epsilon & \cellcolor{yellow!60}1 & \epsilon & \cellcolor{yellow!60}1 &\\
    &\cellcolor{yellow!60}1 & \epsilon & \cellcolor{yellow!60}1 & \epsilon&\\
\end{pmatrix}.
\]
As before, \(\epsilon\) is a noise parameter that we vary. This results in a graph which has a distinct cyclic cluster structure for small \(\epsilon\). The results of this experiment can be seen in Figure~\ref{fig:4Cycle}.

\begin{figure}[h]
    \centering
    \includegraphics[width=0.7\linewidth]{Figures/4_cluster_cycle_noise.png}
    \caption{Comparison of the results given by the structure theorem of our paper (green for equation ~\ref{eq:ours_ray} and orange for equation ~\ref{eq:ours_psi}) and by Laenen and Sun (red) for two DSBMs at varying level of noise. The actual values are reported in blue. Each value is an average over 10 trials.}
    \label{fig:4Cycle}
\end{figure}

As one can see, our bounds significantly outperform the result of \cite{laenen2020higher}. In particular our bound from ~\ref{eq:ours_ray} hugs the true value very tightly for all noise levels.

% \subsection{Directed graph: Yellowstone food web}
% \label{app:yellowstone}

% \begin{figure}[h]
%     \centering
%     \includegraphics[width=0.7\textwidth]{Figures/YellowstoneTrophicCascade2.png}
%     \caption{A directed graph illustrating a food web for Yellowstone National Park, United States.}
%     \label{fig:YellowstoneTrophicCascade}
% \end{figure}


% We consider a small digraph representing a food web for Yellowstone National Park \citep{yellowstone_foodweb}, where vertices represent animal species and there is an edge from $u$ to $v$ if $u$ is predated by $v$.  As shown in Figure~\ref{fig:YellowstoneTrophicCascade}, this network can be partitioned into four clusters exhibiting an almost perfect directed path structure: only the outgoing edges for \emph{Mule deer} are not consistent with this structure. Indeed, $\Psi_4 = 0.027$ and Spectral Clustering using the Hermitian Laplacian perfectly recovers these clusters. Theorem~\ref{thm:digraph} suggests an error bound less than $0.086$, which is close to the actual bound \(\|f_1 - \beta \chi_{\mathcal{S}} \|^2 = 0.039\). On the other hand, Laenen and Sun's result can only obtain an upper bound of $0.662$, which is not indicative of the actual performance of Spectral Clustering.
