\section{Curriculum Training}

\begin{figure*}[t]
  \centering
   \includegraphics[width=1.0\linewidth]{figures/curriculum_figure.png}

   \caption{The proposed curriculum scheme. It consists of five stages, with each stage introducing more synthetic samples of increasing haze density.}
   \label{fig:curriculum}
\end{figure*}

\par We adopt a state-of-the-art dehazing architecture \cite{dehazeformer} as the backbone for our framework and introduce a five‑stage curriculum learning strategy that exploits the monotonic fog‑density structure of our synthetic dataset. We start by testing our approach on \(1\) new fog density level being introduced at each stage, but the end goal is to introduce two per stage, for a total of \(10\) fog density levels. The models remain unchanged architecturally; the curriculum acts solely at the data‑level, enabling staged exposure to increasingly severe fog conditions.
\par Training directly on heavily fogged images often leads to unstable gradients and suboptimal convergence, as dense haze severely obscures scene structure and attenuates high‑frequency details. Our synthetic dataset provides five fog levels per ground‑truth scene, ranging from light to extremely dense haze. This provides a natural ordering of difficulty, allowing the model to first learn basic restoration behavior under mild degradations before progressively addressing more complex scattering effects.
\par The curriculum consists of five stages, as shown in Fig. ~\ref{fig:curriculum}. Stage 1 uses only the lightest fog level. Subsequent stages incrementally add fog levels while retaining all previous ones, culminating in Stage 5, in which all five fog levels are included. Each training iteration draws uniformly from the available levels, and all samples remain in the form of independent (foggy image, ground‑truth) pairs. No multi‑input fusion or joint conditioning on multiple fog levels is used.
\par During Stage 1, the model learns fundamental contrast enhancement and low‑level feature extraction. Stage 2 introduces moderate haze, prompting the network to generalize beyond lightly degraded inputs. In Stage 3, medium fog levels introduce stronger scattering and color distortions, requiring the model to infer missing details. Stage 4 further increases difficulty by adding heavy fog, while Stage 5 exposes the network to the full distribution of degradations. This progression maintains stability by anchoring the training signal with easier examples even as more challenging ones are introduced.
\par Training hyperparameters, including the optimizer and learning‑rate schedule, remain fixed to ensure that improvements stem from the curriculum itself rather than modified optimization dynamics.