\section{Ablation}
Table~\ref{tab:ablation_buffer} analyzes the contribution of each buffer-selection criterion across $\beta$ and sequences $S1$ and $S2$.
Within the difficult samples criterion ($R_{\text{diff}}$), uncertainty (U) outperforms lesion complexity (LC), so the $R_{\text{diff}}$ assigns greater weight to U.
Within the representativeness criterion ($R_{\text{rep}}$), lesion size (LS) outperforms corrected confidence (CC), leading to a higher weight for LS in the $R_{\text{rep}}$. Since U and LS show broadly comparable performance, the buffer $\mathcal{B}_t$ for dataset $D_t$ allocates an equal number of samples from $R_{\text{diff}}$ and $R_{\text{rep}}$.

As $\beta$ increases, $R_{\text{rep}}$ improves stability by anchoring dominant lesion characteristics, while $R_{\text{diff}}$ captures challenging boundary and morphology cases that are more prone to forgetting. When combined, the two criteria yield higher, AVG, ILM, and BWT scores across $\beta$ and sequences.
This indicates that jointly capturing both stable lesion structure and difficult boundary cases is necessary for effective replay under heterogeneous sequential streams.
The consistent improvements across $\beta$ support that the balanced selection mechanism retains knowledge more effectively than any single criterion.

Compared with the fixed-input design in~\cite{sadegheih2025modality}, ILI mechanism in CLMU-Net enables the model to accommodate an arbitrary and previously unknown number of input modalities by dynamically expanding the input layer when new modalities appear, rather than preallocating channels for a maximum set at the start of training. Fixed-input architectures must reserve channels for modalities that are absent in early tasks, leading to under-utilised capacity and potentially suboptimal representations, whereas ILI preserves parameters for previously seen modalities and allocates new channels only when required. As reported in Table~\ref{tab:ablation_inflation}, this design yields consistent gains over the fixed-input baseline across all values of $\beta$, with average improvements in \{AVG, ILM, BWT\} of \{7.61\%, 3.71\%, 46.38\%\} and \{9.03\%, 3.75\%, 41.25\%\} in $S1$ and $S2$, respectively, indicating better overall segmentation performance and substantially reduced forgetting in sequential multi-modal MRI segmentation without prior knowledge of the maximum modality set.
