\section{Introduction}

Advancements in computational modeling have enabled high-fidelity patient-specific treatment simulations for acute ischemic stroke (AIS) \cite{luraghi2021first,liu2022simulation}.
These simulations support the promise of in-silico clinical trials (ISCTs) as alternatives to traditional trials for developing medical treatments and devices \cite{konduri2020silico,miller2023silico}. However, ISCTs require large virtual populations of high-quality 3D vascular geometries, which are challenging to create due to resource-intensive processes. Synthetic data generation, e.g., with deep generative models, addresses these limitations by generating diverse, high-quality synthetic geometries from limited real data. Because synthetic populations bypass privacy restrictions, they enable data sharing and support downstream tasks reliant on large datasets.



Several methods for generating 3D vascular geometry have been proposed in recent literature. \citet{danu2019deep} utilize an image-based generative approach and represent the geometry as discrete 3D voxel occupancy grids, limiting the resolution of the generated vessels. \citet{wolterink2018blood} generate single-branch centerline graphs sequentially using a generative adversarial network (GAN), but do not support bifurcations. Expanding on this concept, \citet{feldman2023vesselvae} generate vessel centerline tree graphs using a recursive variational autoencoder that does supports bifurcations. However, the method is limited by its recursion depth and does not support looping vessels or multiple disconnected graphs, which all occur in cerebral vascular structures such as the circle of Willis. As such, these methods are not directly applicable for generating cerebral vessel trees. In contrast, \citet{sinha2024representing} represent 3D vascular geometry using implicit neural representations (INRs) with occupancy fields, making this method free of topological restrictions.

Besides topological restrictions, a major limitation of the aforementioned methods is the lack of semantic labeling of the individual vessels in the generated trees, which is crucial in computational stroke treatment models for the automatic placement of thrombi in specific vessels and locations. Additionally, having access to semantic labels allows for more robust evaluation of the synthetic vessel trees by assessing the quality of each individual vessel within the tree. As a result, \citet{kuipers2024generating} introduced a point cloud-based diffusion approach for generating semantic cerebral vessel trees. However, a separate rule-based post-processing algorithms is required to reconstruct the vessel tree topology for which we show that it does not generalize well to topologically complex trees.

% Several methods for generating 3D vascular geometry have been proposed in recent literature.
% \citet{danu2019deep} utilize an image-based generative approach and represent the geometry as 3D voxel occupancy grids.
% Alternatively, some methods parameterize the vascular geometry via points on the respective centerlines, modeling point coordinates and corresponding radii.
% \citet{wolterink2018blood} generate single-branch centerline graphs sequentially using a generative adversarial network (GAN) \cite{goodfellow2020generative}.
% Expanding on this concept, \citet{feldman2023vesselvae} generate vessel centerline tree graphs using a recursive variational autoencoder that supports multiple branches. In contrast, \citet{sinha2024representing} represent 3D vascular geometry using implicit neural representations (INRs) with occupancy fields. A major limitation of these methods is the lack of semantic information in the generated trees, which is crucial, in computational stroke treatment models require for automatic placements of thrombi in specific vessels. Additionally, it allows for more robust evaluation of the synthetic vessel trees by assessing the quality of each separate vessel in the tree. As a result, \citet{kuipers2024generating} introduced a pointcloud-based diffusion approach for generating semantic cerebral vessel trees. However, this method requires separate post-processing algorithms to reconstruct the vessel tree topology that do not generalize well to topologically complex trees. 

\begin{wrapfigure}{R}{0.33\textwidth}
    \begin{center}
        \includegraphics[width=0.3\textwidth]{figures/1_non_closed_mesh.png}
    \end{center}
    \caption{Non-closed mesh from \cite{yang2020intra}.}
    \label{fig:1_non_closed_mesh}
\end{wrapfigure}

In this work, we employ INRs and propose representing cerebral vessel trees as semantic signed distance fields (SDFs), avoiding the need for separate post-processing algorithms.
SDFs represent the distance from a point to the surface of the shape, with the sign indicating whether the point is inside or outside the shape.
INRs provide several advantages over voxel- or point cloud-based methods, including memory efficiency, support for arbitrary resolution, continuity, and automatic differentiation, e.g., for computing surface normals \cite{berzins2024geometry}.
In the generative setting, INRs are typically optimized in a supervised manner using ground truth scalar fields, which require access to \textit{watertight} geometry, i.e., closed surfaces representing a volume \cite{chibane2020neural}.
However, watertight geometry is often unavailable, particularly for tubular vascular structures, as seen in Figure \ref{fig:1_non_closed_mesh}, or when the surface geometry is represented as a point cloud. Moreover, obtaining accurate watertight geometry often involves labor-intensive manual processing. Building on the approach of \citet{alblas2023going}, we leverage the inductive bias of SDFs, i.e., SDFs satisfy the Eikonal equation, extending the fully self-supervised learning of implicit neural shapes to the generative setting. As a result, our model does not require access to ground truth signed distances or occupancy grids, making it compatible with any type of surface representation, and will always yield a mesh that is watertight.

% In this work, we address the limitations of requiring watertight meshes in generative shape modeling.
% Building on the approach of \citet{wolterink2023going}, we leverage the inductive bias of SDFs, i.e., the gradient of an SDF with respect to an input coordinate needs to be of unit-length, extending the fully self-supervised learning of implicit neural shapes to the generative setting.
% Additionally, we augment the SDF to include semantic information, allowing the modeling of individual components within the geometry.
% This provides a deeper and more reliable evaluation of the quality of the generated shapes.

% An INR implicitly represents geometry as a function that maps input spatial coordinates to a scalar field, typically representing either occupancy grids \cite{zhang20233dshape2vecset} or signed distance fields (SDFs) \cite{park2019deepsdf}.
% Occupancy grids treat geometry as a volume, defined by voxels marked as lying inside our outside the shape volume.
% SDFs, on the other hand, represent the distance from a point to the surface of the shape, with the sign indicating whether the point is inside or outside the volume.
% INRs provide several advantages over voxel- or point cloud-based methods, including memory efficiency, support for arbitrary resolution, continuity, and automatic differentiation (e.g., for computing surface normals \cite{berzins2024geometry}), with continuity and high resolution being particularly important for treatment simulations.
% In the generative setting, INRs are typically optimized in a supervised manner using ground truth scalar fields, which require access to \textit{watertight} geometry, i.e., closed surfaces representing a volume \cite{chibane2020neural}.
% However, watertight geometry is often unavailable, particularly for tubular vascular structures, as seen in Figure \ref{fig:1_non_closed_mesh}, or when the surface geometry is represented as a point cloud.
% Moreover, obtaining accurate watertight geometry often involves labor-intensive manual processing.

Our generative framework is inspired by 3DShape2VecSet \cite{zhang20233dshape2vecset}, a state-of-the-art two-stage approach for 3D shape representation and generation .
% Our work bridges the gap between self-supervised SDF learning and large-scale shape representation and generation models.
In the first stage, a variational autoencoder (VAE) \cite{kingma2013auto} encodes semantic point clouds sampled from the shape surface, learning a distribution of robust latent semantic shape representations that are then decoded into semantic SDFs.
In the second stage, a latent diffusion model \cite{rombach2022high} samples latent shape representations from the VAE's prior distribution.
Latents sampled from this distribution are then decoded into a semantic SDF that represents a synthetic vascular tree.\footnote{Our code is available \href{https://github.com/ThijsKuipers1995/vessel-sdf}{here}.}


% Our contributions are twofold: (1) We introduce semantic signed distance fields to represent and generate topologically complex synthetic cerebral vessel trees, eliminating the need for post-processing algorithms to construct the final tree. (2) We demonstrate that generative models for generating vessel tree geometry can be effectively trained in a fully self-supervised manner, removing the requirement for watertight geometry and enabling compatibility with any type of surface representation.\footnote{Our code is available here.}

% In summary, we present a robust framework for fully self-supervised semantic 3D cerebral vessel tree generation that requires no post-processing algorithms to obtain the final trees.
% Our method requires only shape surfaces, readily obtainable from meshes, segmentations, and pointclouds.  We leverage latent diffusion for efficient, high-quality synthetic geometry generation.
% By incorporating semantic information into the shape representations, our approach enables the simultaneous generation of multi-component geometries.\footnote{Our code is available here.}
% Additionally, our interpretability prior enables direct qualitative analysis of the synthetic shapes, highlighting areas that are challenging to generate and that could be inaccurate.

