\section{Hyperparameters}
\label{appendix:hyperparams}
As with any optimization algorithm, our implementation of the SAA for VI algorithm uses certain constants and hyperparameters.
Table~\ref{table:hyperparameters} details the purpose of each such number, along with the rationale behind our chosen values.
We emphasize that SAA for VI performs well across many models without tuning these parameters (our experiments used a single setting): many can be considered constants, while others control tradeoffs between computation and precision in a straightforward way, such as tolerance parameters.
While the current hyperparameter values are not tuned, we are open to the possibility of further enhancing the algorithm's performance through careful tuning.

%We anticipate that most of these can be considered constants of the algorithm and do not need tuning.
The sequence of sample sizes is controlled by the first two hyperparameters.
We tested a variety of exponentially increasing sequences and determined that the performance was largely unaffected by the specific choice.
However, the initial sample size showed a more pronounced effect on performance as it could potentially `save work' by avoiding smaller sample sizes if larger ones are required.
This is not always predictable; our addendum, following \citet{giordano2023black}'s concurrent work, refines SAA for VI by tuning this value based on the model and approximation family.

The remaining hyperparameters, listed last in the table, mainly dictate when to halt the process.
For example, a user may deem being 1 nat away from the optimum as adequate, thus setting $\delta$ to 1 instead of 0.01.
The $\alpha$ (significance level for t-test) could also be adjusted depending on the desired balance between computation cost and approximation precision.
%In fact, all these choices represent different balancing strategies.
Similar parameters are used in most implementations of other optimization algorithms (maximum iterations, absolute/relative tolerance, etc.) and tend to be less critical than parameters like step sizes as they affect the trade-off between computational time and numerical precision rather than the fundamental operation of the algorithm.

%Nonetheless, we believe that the performance is relatively stable regardless of these values, reducing the necessity for user modification.
%Our experiments indicate that SAA for VI demonstrates robust performance with a single setting of hyperparameters.


\begin{table}[ht!]
  \centering
  \renewcommand{\arraystretch}{1.2}
  \begin{tabular}{@{}p{5cm}rp{4cm}p{5.2cm}@{}}
      \toprule
      Hyperparameter & \multicolumn{1}{c}{Value} & \multicolumn{1}{c}{Purpose} & \multicolumn{1}{c}{Justification} \\
      \midrule
      Initial sample size ($n$) & 32 & Sets the starting point for the sample size sequence & Arbitrary choice. Refined based on the work of \citet{giordano2023black} in the addendum. \\
      Sample size and max iterations sequence (2n, 2$\tau$) & 2 & Determines progression of sample sizes and max inner optimizer iterations & Arbitrary. We tested alternative sequences with negligible performance impact \\
      ELBO difference threshold ($\delta$) & 0.01 & Convergence criterion for the optimizer & Conservative choice ensuring precision \\
      Max.\ number of SAA steps ($\mathrm{max\_t}$) or max.\ sample size & $2^{18}$ & Limits total number of SAA steps or sample size & Chosen to ensure optimization usually concludes for other reasons \\
      Inner optimizer early exit count (\texttt{count} $<$ 3) & 3 & Specifies how many times inner optimizer can finish after few iterations & We found empirically that this counter was necessary, but we didn't explore other alternatives. \\
      \texttt{VERY\_SMALL\_ITER} for inner optimizer & 5 & Defines what is considered a small number of iterations for the inner optimizer & Arbitrary choice. It is related to the early exit \texttt{count}. \\
      Significance level ($\alpha$) for t-test & 1\% & Statistical significance criterion & Standard value in significance testing \\
      Test set sample size & 10k & Size of the sample set for ELBO estimation & Arbitrary. It is related to $\alpha$ \\
      Initial maximum number of iterations for inner optimizer ($\tau$) & 300 & Sets an initial limit for optimizer iterations & Arbitrary. However, it self-adjusts as needed \\
      \bottomrule
  \end{tabular}
  \caption{Hyperparameter choices for our SAA for VI experiments}
  \label{table:hyperparameters}
\end{table}


  