\section{Discussion and conclusion}

We presented a learning strategy for modality-agnostic brain MRI segmentation, which builds on classical generative models for Bayesian segmentation. Sampling a wide range of model parameters during training enables the network to learn to segment a wide variety of contrasts and shapes. At test time, the network can therefore segment the neuroanatomy in an unprocessed scan of any contrast within seconds. While the network is trained in a supervised fashion, the only data required are a few label maps. Importantly, we do not require any real scans during training: images are synthesized from the labels and are thus always perfectly aligned with them, in contrast to techniques relying on manual delineations. Our method does require the training label maps to contain labels for all brain structures to be synthesized.
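
As a schematic reminder of this synthesis step, and under simplifying assumptions (a Gaussian mixture conditioned on the labels, omitting any spatial or intensity augmentation; the symbols $L$, $I$, $\theta$, $\mu_k$ and $\sigma_k$ are illustrative notation introduced here), the intensity of a training image $I$ at voxel $x$ can be viewed as drawn from a label map $L$ as
\[
\theta = \{\mu_k, \sigma_k\}_k \sim p(\theta), \qquad I(x) \mid L, \theta \sim \mathcal{N}\bigl(\mu_{L(x)},\, \sigma_{L(x)}^2\bigr),
\]
where $p(\theta)$ denotes a broad prior over the per-label means and standard deviations. Drawing a new $\theta$ for every training example is what exposes the network to a wide range of contrasts, while the target segmentation $L$ stays fixed and exactly aligned with $I$.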

While a supervised network excels on test data from the same domain it was trained on, its performance quickly decays when faced with more variability, even within the same type of MRI contrast. We emphasize that this effect is particularly pronounced here because we tackle the challenging task of segmenting \textit{unprocessed} scans. This is one reason why deep learning segmentation techniques have not yet been adopted by widely used neuroimaging packages like FreeSurfer or FSL, which can make few assumptions about the specific MRI contrast of the user's data. In contrast, \netname{} maintains accuracy across T1 variants as well as other MRI modalities.

In absolute terms, \netname{}'s Dice scores are consistently high: higher than those of SAMSEG, and not far from those of \textit{supervised} contrast-specific networks, such as the T1 baseline or the scores reported in recent literature~\cite{roy_quicknat_2019}. Compared with our recent article that uses a CNN to estimate the GMM and registration parameters of the Bayesian segmentation framework~\cite{dalca_unsupervised_2019}, the method proposed here achieves higher average Dice on the T1 (0.86 vs 0.82) and PD (0.83 vs 0.80) datasets. However, we highlight that a direct comparison is not possible due to differences in the datasets: in this work, we could only use 19 subjects from T1-39 for evaluation. More importantly, our previous method requires substantial preprocessing and modality-specific unsupervised retraining. This highlights the ability of our new method to segment a wide variety of contrasts without retraining or preprocessing; avoiding preprocessing eliminates the dependence on additional tools, which can be computationally expensive and require manual tuning.

We believe that the proposed learning strategy is applicable to many generative models from which sampling yields sensible data, even beyond neuroimaging. By greatly increasing the robustness of fast segmentation CNNs to a wide variety of MRI contrasts, without any need for retraining, \netname{} promises to enable the adoption of deep learning segmentation techniques by the neuroimaging community.