\section{Extended evaluation}
To provide a more detailed evaluation on the proposed metrics of our paper, we also evaluate against five different baseline predictors, namely:
\begin{itemize}
    \item Zero predictor (zero): Predicts an event for every step by reporting a time-to-event of zero. This should result in high undershot and low overshot rates. Furthermore, it should have a good but not zero consistency.
    \item Maximum predictor (max): This predicts the maximum time-to-event. This should result in high overshot and low undershot rates. Similar to the zero predictor, it should also have a good consistency. However, we also expect this to perform the worst out of all predictors on the mean square and mean absolute error.
    \item Random predictor (rand): Predicts a uniformly distributed random event length as the start of a sequence and consistently reduces the time-to-saccade by the update rate of the eye-tracker. Once, we predict a time-to-event below zero, we just report that the event is going to happen every step. We expect this predictor to have an overshot and undershot rate of 0.5 and an excellent consistency, due to its definition.
\end{itemize}
Using those predictors, we measure how the proposed metrics behave on the DGaze \cite{hu2020dgaze}, FixationNet \cite{hu2021fixationnet} and EGTEA Gaze+ \cite{li2018eye} datasets.
%We additionally use the sectioning approach described in our paper to create the figures (...). This allows us for a more in dept look into the behavior of the predictors over time.

First, Tab.~\ref{tab:zero} shows the evaluation of the zero predictor, which will report a time-to-event of zero.
As expected, it is evident that the predictor undershoots every prediction, which is also shown through the undershot rate. This also results in a high undershot error, as the full sequence undershots the actual target. Second, Tab.~\ref{tab:rand} shows the evaluation of the random predictor. As predicted, this predictor has a much lower undershot rate and very high consistency. However, it does not reach a 0.5 overshot and undershot rate. This might be due to the uniform sampling that does not reflect the general distribution of the data. At last, Tab.~\ref{tab:max} shows the maximum predictor. Here, it is shown that the predictor does not produce any undershots and thus has an excellent undershot error. However, this also results in the highest average time-to-saccade errors, meaning that it does not well in its prediction. Moreover, Fig.~\ref{fig:overshot} and Fig.~\ref{fig:undershot} show the overshot and undershot rate using 10 sections to visualize the behavior of the overshot and undershot over time. As expected, the zero and maximum predictors have the highest over- and undershot rate across the sequence lengths. Whereas, the random predictor is consistently at a 0.6 overshot and 0.4 undershot rate. It can also be infered that the mean and SGD predictors tend to overshoot as the sequence reaches the event. \\
\newpage

\begin{table}[t]
    \centering
    \caption{Results of the zero predictor using the metrics described in Sec.~3 of the main paper and the mean square error (mse) and mean absolute error (mae). A lower error is preferred in all cases.}
    \begin{tabular}{l|c|c|c|c|c|c}
        \toprule
        Metric & DGaze & FixationNet & EGTEA\\ 
        \midrule
        mse$\downarrow$
                       & 0.5168 \si{\second}$^2$ & 0.6695 \si{\second}$^2$ & 0.5168 \si{\second}$^2$\\
        mae$\downarrow$
                       & 0.5434 \si{\second}     & 0.6408 \si{\second}     & 0.5434 \si{\second}\\
        avg. tts mse$\downarrow$
                       & 0.2089 \si{\second}$^2$ & 0.3088 \si{\second}$^2$ & 0.2000 \si{\second}$^2$\\
        avg. tts mae$\downarrow$
                       & 0.4066 \si{\second}     & 0.3831 \si{\second}     & 0.3733 \si{\second}\\
        undershot mse$\downarrow$
                       & 0.2089 \si{\second}$^2$ & 0.3088 \si{\second}$^2$ & 0.2000 \si{\second}$^2$\\
        undershot mae$\downarrow$
                       & 0.4066 \si{\second}     & 0.3831 \si{\second}     & 0.3733 \si{\second}\\
        overshot rate$\downarrow$
                       & 0.0    & 0.0 & 0.0\\
        undershot rate$\downarrow$
                       & 1.0    & 1.0 & 1.0\\
        consistency$\downarrow$
                       & 1.0    & 1.0 & 1.0\\
        \bottomrule
    \end{tabular}
    \label{tab:zero}
\end{table}

\begin{table}[t]
    \centering
    \caption{Results of the random predictor using the metrics described in Sec.~3 of the main paper and the mean square error (mse) and mean absolute error (mae). A lower error is preferred in all cases.}
    \begin{tabular}{l|c|c|c|c|c|c}
        \toprule
        Metric & DGaze & FixationNet & EGTEA\\ 
        \midrule
        mse$\downarrow$
                       & 0.4585 \si{\second}$^2$ & 0.6805 \si{\second}$^2$ & 0.7973 \si{\second}$^2$\\
        mae$\downarrow$
                       & 0.5384 \si{\second}     & 0.6526 \si{\second}     & 0.7066 \si{\second}\\
        avg. tts mse$\downarrow$
                       & 0.5097 \si{\second}$^2$ & 0.7877 \si{\second}$^2$ & 0.9966 \si{\second}$^2$\\
        avg. tts mae$\downarrow$
                       & 0.5742 \si{\second}     & 0.1017 \si{\second}     & 0.8003 \si{\second}\\
        undershot mse$\downarrow$
                       & 0.0653 \si{\second}$^2$ & 0.1017 \si{\second}$^2$ & 0.0575 \si{\second}$^2$\\
        undershot mae$\downarrow$
                       & 0.1306 \si{\second}     & 0.1620 \si{\second}     & 0.1006 \si{\second}\\
        overshot rate$\downarrow$
                       & 0.62   & 0.62 & 0.72\\
        undershot rate$\downarrow$
                       & 0.38   & 0.38 & 0.28\\
        consistency$\downarrow$
                       & 0.24   & 0.25 & 0.20\\
        \bottomrule
    \end{tabular}
    \label{tab:rand}
\end{table}

%\begin{table}[!th]
%    \centering
%    \caption{Results of the mean predictor using the metrics described in Sec.~\ref{sec:Methodology} and the mean square error (mse) and mean absolute error (mae). A lower error is preferred in all cases.}
%    \begin{tabular}{l|c|c|c|c|c|c}
%        \toprule
%        Metric & DGaze & FixationNet & EGTEA\\ 
%        \midrule
%        mse$\downarrow$
%                       & 0.2315 \si{\second}$^2$ & 0.3647 \si{\second}$^2$ & 0.3042 \si{\second}$^2$\\
%        mae$\downarrow$
%                       & 0.3387 \si{\second}     & 0.4261 \si{\second}     & 0.3677 \si{\second}\\
%        undershot mse$\downarrow$
%                       & 0.2252 \si{\second}$^2$ & 0.3543 \si{\second}$^2$ & 0.2979 \si{\second}$^2$\\
%        undershot mae$\downarrow$
%                       & 0.3006 \si{\second}     & 0.3757 \si{\second}     & 0.3277 \si{\second}\\
%        overshot rate$\downarrow$
%                       & 0.39   & 0.44 & 0.47\\
%        undershot rate$\downarrow$
%                       & 0.61   & 0.56 & 0.53\\
%        consistency$\downarrow$
%                       & 1.0   & 1.0 & 1.0\\
%        \bottomrule
%    \end{tabular}
%    \label{tab:mean}
%\end{table}

\begin{table}[!h]
    \centering
    \caption{Results of the maximum predictor using the metrics described in Sec.~3 of the main paper and the mean square error (mse) and mean absolute error (mae). A lower error is preferred in all cases.}
    \begin{tabular}{l|c|c|c|c|c|c}
        \toprule
        Metric & DGaze & FixationNet & EGTEA\\ 
        \midrule
        mse$\downarrow$
                       & 2.7385 \si{\second}$^2$ & 3.9040 \si{\second}$^2$ & 4.1814 \si{\second}$^2$\\
        mae$\downarrow$
                       & 1.6050 \si{\second}     & 1.9092 \si{\second}     & 1.9899 \si{\second}\\
        avg. tts mse$\downarrow$
                       & 2.9792 \si{\second}$^2$ & 4.3473 \si{\second}$^2$ & 4.7266 \si{\second}$^2$\\
        avg. tts mae$\downarrow$
                       & 1.7134 \si{\second}     & 2.0669 \si{\second}     & 2.1601 \si{\second}\\
        undershot mse$\downarrow$
                       & 0.0000 \si{\second}$^2$ & 0.0000 \si{\second}$^2$ & 0.0000 \si{\second}$^2$\\
        undershot mae$\downarrow$
                       & 0.0000 \si{\second}     & 0.0000 \si{\second}     & 0.0000 \si{\second}\\
        overshot rate$\downarrow$
                       & 1.0   & 1.0 & 1.0\\
        undershot rate$\downarrow$
                       & 0.0   & 0.0 & 0.0\\
        consistency$\downarrow$
                       & 1.0   & 1.0 & 1.0\\
        \bottomrule
    \end{tabular}
    \label{tab:max}
\end{table}

\newpage
\begin{figure}[!th]
    \centering
    \includegraphics[width=0.5\textwidth]{images/overshot-dgaze.png}%
    \includegraphics[width=0.5\textwidth]{images/overshot-fixationnet.png}%
    \caption{Overshot rate calculated over 10 sections on the DGaze \cite{hu2020dgaze} and FixationNet \cite{hu2021fixationnet} datasets.}
    \label{fig:overshot}
\end{figure}

\begin{figure}[!th]
    \centering
    \includegraphics[width=0.5\textwidth]{images/undershot-dgaze.png}%
    \includegraphics[width=0.5\textwidth]{images/undershot-fixationnet.png}%
    \caption{Undershot rate calculated over 10 sections on the DGaze \cite{hu2020dgaze} and FixationNet \cite{hu2021fixationnet} datasets.}
    \label{fig:undershot}
\end{figure}
