\documentclass{article}
%\usepackage{maeb_2025}
%\usepackage[preprint]{maeb_2025}
\usepackage[final]{maeb_2025}

% to compile a preprint version, e.g., for submission to arXiv, add add the
% [preprint] option:
%     \usepackage[preprint]{maeb_2025}

% to compile a camera-ready version, add the [final] option, e.g.:
%     \usepackage[final]{maeb_2025}

% to avoid loading the natbib package, add option nonatbib:
%    \usepackage[nonatbib]{maeb_2025}

\usepackage[utf8]{inputenc} % allow utf-8 input
\usepackage[T1]{fontenc}    % use 8-bit T1 fonts
\usepackage[hidelinks]{hyperref}       % hyperlinks
\usepackage{url}            % simple URL typesetting
\usepackage{booktabs}       % professional-quality tables
\usepackage{amsfonts}       % blackboard math symbols
\usepackage{amsmath}
\usepackage{nicefrac}       % compact symbols for 1/2, etc.
\usepackage{microtype}      % microtypography
\usepackage{xcolor}         % colors
\usepackage{xfrac}

\usepackage{algorithm}
\usepackage{algpseudocode}
\usepackage{csquotes}
\usepackage[acronym]{glossaries}
\usepackage{graphicx}
\usepackage{subcaption}

%%%%%%%%%%
% Glosario
%
\newglossaryentry{nbd}{
    name=$N$-bodies,
    description={Problema en mecánica celeste y física computacional que modela la evolución de $N$ partículas bajo interacciones gravitacionales mutuas.}
}


\newacronym{ai}{AI}{artificial intelligence}
\newacronym{dl}{DL}{deep learning}
\newacronym{fmm}{FMM}{fast multipole method}
\newacronym{gnn}{GNN}{graph neural network}
\newacronym{knn}{KNN}{$k$-nearest neighbors}
\newacronym{ml}{ML}{machine learning}
\newacronym{mlp}{MLP}{multilayer perceptron}
\newacronym{mse}{MSE}{mean squared error}

% Para el módulo de un vector
\providecommand{\vmod}[1]{\lVert#1\rVert}

\title{Efficient $n$-body simulations using physics informed graph neural networks}

\author{%
  Víctor Ramos Osuna\\%\thanks{Use footnote for providing further information about author (webpage, alternative address)---\emph{not} for acknowledging funding agencies.} \\
  Departmento de Sistemas Informáticos\\
  E.T.S.I Sistemas Informáticos -- Universidad Politécnica de Madrid\\
  C. de Alan Turing, s/n, 28031 Madrid, Spain \\
  \texttt{victor.ramos.osuna@upm.es} \\
  \And
  Alberto Díaz-Álvarez\\%\thanks{Use footnote for providing further information about author (webpage, alternative address)---\emph{not} for acknowledging funding agencies.} \\
  Departmento de Sistemas Informáticos\\
  E.T.S.I Sistemas Informáticos -- Universidad Politécnica de Madrid\\
  C. de Alan Turing, s/n, 28031 Madrid, Spain \\
  \texttt{alberto.diaz@upm.es} \\
  \And
  Raúl Lara Cabrera\\%\thanks{Use footnote for providing further information about author (webpage, alternative address)---\emph{not} for acknowledging funding agencies.} \\
  Departmento de Sistemas Informáticos\\
  E.T.S.I Sistemas Informáticos -- Universidad Politécnica de Madrid\\
  C. de Alan Turing, s/n, 28031 Madrid, Spain \\
  \texttt{raul.lara@upm.es} \\
}


\begin{document}

% EL PAPER TIENE QUE OCUPAR COMO MUCHO 9 PÁGINAS INCLUIDAS FIGURAS (POR LO VISTO LAS REFERENCIAS NO CUENTAN)

\maketitle

\begin{abstract}
    This paper presents a novel approach for accelerating \gls{nbd} simulations by integrating a physics-informed \acrfullpl{gnn} with traditional numerical methods. Our method implements a leapfrog-based simulation engine to generate datasets from diverse astrophysical scenarios which are then transformed into graph representations. A custom-designed \acrshort{gnn} is trained to predict particle accelerations with high precision. Experiments, conducted on $60$ training and $6$ testing simulations spanning from $3$ to $500$ bodies over $1000$ time steps, demonstrate that the proposed model achieves extremely low prediction errors—loss values while maintaining robust long-term stability, with accumulated errors in position, velocity, and acceleration remaining insignificant. Furthermore, our method yields a modest speedup of approximately 17\% over conventional simulation techniques. These results indicate that the integration of \acrlong{dl} with traditional physical simulation methods offers a promising pathway to significantly enhance computational efficiency without compromising accuracy.
\end{abstract}

\section{Introduction}
\label{s:introduction}

A \gls{nbd} system models the evolution of $N$ particles under mutual gravitational interactions, governed by Newton's laws. Analytical solutions exist only for $N = 2$; for larger $N$, numerical methods are required. Despite their precision, these approaches are computationally expensive due to the need to evaluate every pairwise interaction at each step~\cite{aarseth2003gravitational}, motivating the search for faster alternatives that preserve accuracy.

Over the past two decades, \gls{dl} techniques have opened new avenues for modelling such complex systems. Among them, \glspl{gnn} stand out for their ability to capture both local and global relationships in graph-structured data, as demonstrated in various physical prediction tasks~\cite{battaglia2018relational}. Although gravity is a long-range force, we adopt a local interaction approximation—similar to hierarchical methods like Barnes-Hut~\cite{barnes1986hierarchical}—to improve efficiency. While this simplifies the problem, it enables the \gls{gnn} to approximate global dynamics effectively.

In this work, we propose an approach that combines the generation of simulation data from an \gls{nbd} system with a regression model based on \glspl{gnn}. The methodology presented consists of three main modules:

\begin{enumerate}
    \item \textbf{Physical simulation and data generation}: A simulation engine employing a leapfrog integration scheme~\cite{iserles1986generalized} iteratively updates the positions, velocities, and accelerations of particles. It generates diverse scenarios (e.g. random distributions, 3D discs, multi-disc systems) to create a dataset containing both particle attributes and their target accelerations.
    \item \textbf{Transformation to graph representation}: The simulation data is converted into graphs, with each particle represented as a node (including attributes such as position) and edges established based on proximity.
    \item \textbf{\Glspl{gnn}-based regression model}: A graph neural network based on the \textit{EdgeConv} architecture~\cite{wang2019dynamicgraphcnnlearning} propagates and transforms node information to predict particle accelerations. The model includes encoders for nodes and edges, mapping initial features into a higher-dimensional space, and an output layer tailored for the regression task.
\end{enumerate}

The \textbf{primary objective} of this work is to \textbf{validate the ability of the \gls{gnn} model to predict accelerations in \gls{nbd} systems with sufficient precision}, demonstrating that the integration of realistic physical simulation with \gls{dl} can substantially improve the simulation time of complex dynamics while maintaining a high degree of fidelity to reality.

The remainder of the paper is organised as follows. Section~\nameref{s:related-works} reviews recent work on the application of \gls{dl} in computational physics. This is followed by Sections~\nameref{s:methodology} and~\nameref{s:experiments-and-results}, which detail the structure, execution, and results of the experiments. Finally, the paper concludes with Section~\nameref{s:conclusion}, which presents the findings and outlines potential future directions for this work.

\section{Related works}
\label{s:related-works}

The simulation of \gls{nbd} systems has long been a core topic in computational physics, especially in astrophysics. Early work from the 1980s relied on direct methods to compute gravitational interactions between all particle pairs. While precise for small systems, these algorithms have $O(N^2)$ complexity, which scales poorly. This led to the development of more efficient alternatives, notably the Barnes-Hut method~\cite{barnes1986hierarchical}, which reduces complexity to $O(N\log N)$ using a hierarchical approximation of forces, and the \gls{fmm}~\cite{greengard1987fast}, which achieves $O(N)$ by grouping distant particles into clusters.

In parallel, numerical integration schemes were developed to simulate system evolution over time. The leapfrog integrator, introduced in 1967, remains one of the most widely used methods due to its ability to preserve physical invariants such as energy~\cite{iserles1986generalized}. These techniques have enabled simulations of stellar dynamics and galaxy formation~\cite{hernquist1990analytical,navarro1997universal}, often using parametric density profiles to approximate mass distribution.

Since the 2010s, the rise of \gls{ml} has encouraged the combination of traditional methods with \gls{ai} techniques. Sánchez-González et al.\cite{sanchez2020learning} showed that \glspl{gnn} can learn latent representations that capture system dynamics across scales, enabling accurate predictions of trajectories and accelerations without explicitly computing all interactions. This significantly reduces computational cost, allowing application to scenarios where full simulation would be prohibitive, such as diffusion, turbulence, or granular systems\cite{raissi2019physics,pfaff2020learning}. Similar successes have been reported in related areas like fluid dynamics and materials modelling~\cite{maurizi2022predicting,sergeev2024graph}.

\Glspl{gnn} have also proven highly effective for regression tasks in physics. They have been used to predict continuous quantities—such as forces, energies, or accelerations—with high accuracy~\cite{battaglia2016interaction,kipf2016semi}, thanks to their ability to capture both local structure and global connectivity. This allows them to infer variations in force fields from particle distributions~\cite{gilmer2017neural}, and generalise across different configurations, including materials and biological systems~\cite{chen2023md,yang2022bionet,bongini2022biognn}.

In summary, while classical models remain fundamental for \gls{nbd} simulation, the literature shows that \gls{dl}—and \glspl{gnn} in particular—can ease their computational burden. Rather than replacing numerical methods (e.g., finite difference or finite element), \glspl{gnn} serve as efficient computational shortcuts that complement physical models, providing data-driven approximations that reduce cost without compromising physical fidelity.

\section{Methodology}
\label{s:methodology}

We will divide this section into the four main stages that have comprised this study: (i) generation of a dataset from a physical \gls{nbd} simulation, (ii) transformation of the raw dataset into a graph structure, (iii) definition of a \gls{gnn} architecture for a regression problem, that is, the prediction of each body's acceleration, and (iv)training and evaluation of the model.

All experiments were executed on a workstation equipped with an AMD Ryzen 5 5600X 6-Core Processor (6 cores, 12 threads) running Ubuntu GNU/Linux 24.04, with a frequency of approximately 4650 MHz, and 32 GB of system memory. The machine uses an NVIDIA GeForce RTX 2070 graphics card with 8 GB of dedicated memory, running under CUDA 12.4 with driver version 550.120.

\subsection{Data generation from physical simulation}

The foundation of the methodology is a simulation engine that integrates the evolution of \gls{nbd} systems. It begins with a set of initial conditions defined in terms of the particles’ positions, velocities, and masses. The simulation engine employs a leapfrog integration scheme, which guarantees stable numerical integration and preserves system invariants (e.g. total energy) over time.

\subsubsection{Gravitational acceleration formula}

For each particle $i$ in a system of $N$ particles, the acceleration is computed by summing the contributions from all particles $j$ (excluding $i$, as is evident) as follows:

\begin{equation}
    \label{eq:newton2}
    \Vec{a_i} = G \sum_{j=1, j \neq i}^{N} m_j \frac{\Vec{r_j} - \Vec{r_i}}{\left (\vmod{\Vec{r_j} - \Vec{r_i}}^2 + \epsilon^2 \right )^{\sfrac{3}{2}}}
\end{equation}

where $G$ is the gravitational constant, $m_j$ is the mass of particle $j$, $\Vec{r_i}$ and $\Vec{r_j}$ are the positions of particles $i$ and $j$ respectively, and $\epsilon$ is the smoothing parameter used to avoid singularities. The pseudocode is presented in Algorithm~\ref{alg:dataset-generation}.

\begin{algorithm}
    \caption{\Gls{nbd} simulation using leapfrog method}
    \label{alg:dataset-generation}
    \begin{algorithmic}[1]
    \Procedure{NBodySimulation}{$R, V, M, dt, T, G, \varepsilon$}
        \State $A \gets \text{Accelerations}(R, M, G, \varepsilon)$
        \For{$t \gets 1$ \textbf{to} $T$}
            \For{$i \gets 1$ \textbf{to} $N$}
                \State $V[i] \gets V[i] + A[i] \cdot \frac{dt}{2}$
                \State $R[i] \gets R[i] + V[i] \cdot dt$
            \EndFor
            \State $A \gets \text{Accelerations}(R, M, G, \varepsilon)$
            \For{$i \gets 1$ \textbf{to} $N$}
                \State $V[i] \gets V[i] + A[i] \cdot \frac{dt}{2}$
            \EndFor
            \State Store $(R, V, A)$
        \EndFor
    \EndProcedure
    \end{algorithmic}
\end{algorithm}

In this algorithm, the function \texttt{Accelerations} is responsible for implementing the formula from Equation~\ref{eq:newton2} for each particle.

\subsubsection{Dataset generation}

The dataset is generated from multiple simulations by varying the initial parameters (i.e. random distribution, $3D$ disc, multi-disc), organising the information into three sets:

\begin{itemize}
    \item \textbf{Positions $\{\Vec{r_t}\}$} for each frame $t$.
    \item \textbf{Features}: These may include additional information (e.g. mass).
    \item \textbf{Regression labels} corresponding to the accelerations $\{\Vec{a_t}\}$.
\end{itemize}

Additionally, the approach includes the possibility of incorporating temporal information by shifting the position sequence, which is achieved by concatenating previous frames as part of the feature vector for each time step.

\subsection{Transformation to graph structure}
\label{ss:dataset-to-graph}

The dataset is structured in a tabular format, but to capture the inherent relational structure of these  \gls{nbd} systems, we will transform it into a graph-based structure. To achieve this:

\begin{enumerate}
    \item Each particle is represented as a \textbf{node}, with its initial feature being a vector that includes its position, its mass, and other data derived from its temporal evolution.
    \item These nodes will be connected by \textbf{edges} based on a \gls{knn} criterion, where each node is connected to its $k$ closest neighbors in terms of Euclidean distance. The parameter $k$ has been experimentally defined. Optionally, edge attributes may be computed, for example, the magnitude of the position difference: $e_{ij} = \vmod{\Vec{r_i} - \Vec{r_j}}$. However, in this work, no attributes have been incorporated into the edges.
\end{enumerate}

Algorithm~\ref{alg:dataset-to-graph-knn} describes the process by which each frame of the dataset is transformed into a graph.

\begin{algorithm}
    \caption{Frame to graph transformation using KNN}
    \label{alg:dataset-to-graph-knn}
    \begin{algorithmic}[1]
        \Procedure{GraphTransformation}{$R, X, Y, k$}
            \For{$i \gets 1$ \textbf{to} $N$}
                \State Add node $i$ with feature $X[i]$
                \State Find $k$ nearest neighbours of $i$
                \For{each neighbour $j$}
                    \State Add edge $(i,j)$ with attribute $\|R[i] - R[j]\|$
                \EndFor
            \EndFor
            \State Build graph $G$ with assigned $Y$ tags
            \State \Return $G$
        \EndProcedure
    \end{algorithmic}
\end{algorithm}


\subsection{Model architecture and inference}

The model architecture is structured into three main modules: (i) the initial transformation of input features, (ii) the iterative propagation of information through message passing layers, and (iii) the generation of the final output for the regression task.

It has been designed to be modular, allowing experiments with different types of aggregation functions --such as mean or maximum--modifying the depth of the network, or incorporating attention mechanisms to weight the messages from neighbouring nodes differently.

All the implementation is based on PyTorch and PyTorch Geometric, leveraging vectorised operations and GPU acceleration, which is essential when training with large datasets and/or models.

Algorithm~\ref{alg:forward-pass} describes the inference process through the proposed model.

\begin{algorithm}
    \caption{Inference process}
    \label{alg:forward-pass}
    \begin{algorithmic}[1]
        \Procedure{ForwardPass}{$G, L$}
            \For{node $i$ in $G$}
                \State $h[i] \gets \text{NodeEncoder}(X[i])$
            \EndFor
            \For{$l \gets 1$ \textbf{to} $L$}
                \For{node $i$ in $G$}
                    \State $m[i] \gets \sum_{j \in \mathcal{N}(i)} \phi\big(h[i], h[j], E[i,j]\big)$
                    \State $h[i] \gets \text{LayerNorm}\left( \text{concat}(h[i], m[i]) \right)$
                \EndFor
            \EndFor
            \For{node $i$ in $G$}
                \State $\hat{a}[i] \gets \text{Linear}(h[i])$
            \EndFor
            \State \Return $\{\hat{a}[i]\}_{i=1}^{N}$
        \EndProcedure
    \end{algorithmic}
\end{algorithm}

\subsubsection{Initial features transformation}

Before message propagation begins, each node in the graph is transformed via an encoder—based on an \gls{mlp}—that projects the original features into a $d$-dimensional space, thereby facilitating the capture of non-linear relationships. The MLP employs a Tanh activation function to preserve both positive and negative values in the transformed features.

In other words, if $\Vec{x_i} \in \mathbb{R}^{d_{in}}$ is the input feature vector for node $i$, then the encoder produces:

\begin{equation} 
    \Vec{h_i^{(0)}} = f_{\text{node}}(\Vec{x_i}), \quad f_{\text{node}}: \mathbb{R}^{d_{in}} \rightarrow \mathbb{R}^d \end{equation}

This process may incorporate regularization through dropout to stabilize training, although it is not employed in the present work.

Similarly, although it is not used in this paper, it is possible to enable a similar encoder via a configuration parameter to map edge attributes into a space compatible with dimension $d$. That is, if $\Vec{e_{ij}}$ is the attribute associated with the edge between nodes $i$ and $j$, then:

\begin{equation} 
    \Vec{z_{ij}} = f_{\text{edge}}(\Vec{e_{ij}}), \quad f_{\text{edge}}: \mathbb{R}^{d_{e}} \rightarrow \mathbb{R}^d 
\end{equation}

As in the node encoder, a Tanh activation function can be applied in $f_{\text{edge}}$ to maintain the full range of values. This allows the additional information regarding the relationships between nodes to be integrated coherently with the node representations.

\subsubsection{Message passing and aggregation}

The architecture is built upon message passing layers based on \textit{EdgeConv}~\cite{wang2019dynamicgraphcnnlearning}, a variant of graph convolutions that eases the integration of topological information from the nodes.

\paragraph{Message function} For each node $i$, a function $\phi$ is defined that combines the information from $i$ with that of each of its neighbors $j$. This function can be implemented using an \gls{mlp} and takes into account both the current representations $\Vec{h_i^{(l)}}$ and $\Vec{h_j^{(l)}}$, as well as the edge attributes $\Vec{z_{ij}}$—when available. The combination is expressed as:

\begin{equation} 
    \Vec{m_{ij}^{(l)}} = \phi \left ( \Vec{h_j^{(l)}}, \Vec{h_i^{(l)}}, \Vec{z_{ij}} \right ) 
\end{equation}

The function $\phi$ is designed to be invariant to the order of neighbors and to capture complex interactions. In our implementation, the \gls{mlp} employs a Tanh activation function to maintain negative values in the message representations, ensuring that the transformations preserve important directional information.

\paragraph{Aggregation and update}

The messages from all neighbours are aggregated using a summation function. The choice of this function is due to the properties it exhibits in the \glspl{gnn} literature~\cite{gilmer2017neural,xu2018powerful}.

\begin{equation}
    \Vec{m_i^{(l)}} = \sum_{j \in \mathcal{N}(i)} \Vec{m_{ij}^{(l)}}
\end{equation}

Where $\mathcal{N}(i)$ denotes the nodes in the neighborhood of node $i$. Next, the node is updated via a residual operation that concatenates the previous representation and the aggregated message, followed by a LayerNorm operation:

\begin{equation} \Vec{h_i^{(l+1)}} = \text{LayerNorm} \left ( \text{concat} \left( \Vec{h_i^{(l)}}, \Vec{m_i^{(l)}} \right) \right) \end{equation}

This residual update—$\Vec{h_i^{(l)}}$—helps preserve information across layers, while LayerNorm ensures stable training dynamics by normalizing the concatenated features.

\paragraph{Regarding depth and receptive field}

The number of layers $L$ is a critical hyperparameter: as $L$ increases, each node is able to access information from more distant neighbours in the graph, which is essential for capturing long-range interactions in \gls{nbd} systems. However, an excessive number of layers may lead to over-smoothing~\cite{li2020deepergcn}, meaning that the node representations become overly homogeneous and the model's ability to differentiate between nodes is reduced, compounded by an increase in computational load that negates the original efficiency benefits.

\subsubsection{Output layer and mapping to the regression task}

The final module of the architecture is responsible for converting the representations obtained after the message passing layers into the desired output.

Each final node $i$ has a representation $\Vec{h_i^{(L)}}$ which is transformed by a linear layer that maps the embedded space to a vector of dimension $d_{\text{out}}$. In our case, for the prediction of accelerations in a $3D$ space, $d_{\text{out}} = 3$:

\begin{equation}
    \hat{\Vec{a_i}} = W_{\text{out}} \Vec{h_i^{(L)}} + \Vec{b_{\text{out}}}
\end{equation}

where $W_{\text{out}}$ and $\Vec{b_{\text{out}}}$ are the parameters of the output layer.

\subsection{Training and evaluation}

After defining the dataset—through graphs derived from the raw simulation data—and the \gls{gnn} architecture, the training and evaluation process aims to optimize both the model’s predictive capability and generalization for the problem. Algorithm~\ref{alg:train-gnn} describes the training process of the model.

\begin{algorithm}
    \caption{Training process}
    \label{alg:train-gnn}
    \begin{algorithmic}[1]
        \Procedure{TrainGNN}{$D, E, B$}
            \For{$\text{epoch} \gets 1$ \textbf{to} $E$}
                \State Shuffle $D$
                \State Divide $D$ into $B$ batches
                \For{batch in $B$}
                    \State $loss \gets 0$
                    \For{graph $G$ in batch}
                        \State $\hat{a} \gets \text{ForwardPass}(G)$
                        \State $loss \gets loss + \text{MSE}\big(\hat{a}, Y\big)$
                    \EndFor
                    \State Backpropagate $loss$
                    \State Update weights (ADAM)
                \EndFor
            \EndFor
            \State Final test evaluation
        \EndProcedure
    \end{algorithmic}
\end{algorithm}

The dataset has been divided with an approximately 90\%-10\% split, where 90\% of the data is used for training and 10\% for testing. No validation set was used. The loss function employed is the \gls{mse} between the model's predicted accelerations and the actual accelerations.

For the training process, the Adam optimizer is used to update the model parameters. The implemented training cycle consists of:

\begin{itemize}
    \item \textbf{Batch extraction}: Iterating over the training dataset in batches, where each batch contains a set of graphs along with their corresponding labels.
    \item \textbf{Inference}: For each batch, the model performs a forward pass through the graph, generating acceleration predictions $\Vec{\hat{a_i}}$ for each node.
    \item \textbf{Loss calculation}: \Gls{mse} is computed between the predictions and the actual accelerations.
    \item \textbf{Backpropagation}: The loss is backpropagated and the model weights are updated.
\end{itemize}


\section{Experiments and results}
\label{s:experiments-and-results}

The model was trained on a dataset comprising 60 random simulations, with 10 simulations for each of the following particle counts: $3$, $25$, $50$, $100$, $250$, and $500$. Each simulation ran for $1000$ iterations. Evaluation was carried out on $6$ additional simulations—one for each of the same particle counts—also executed over $1000$ steps.

The simulations were carried out by modelling spiral galaxies that include a black hole. The simulation parameters are described in Table~\ref{tbl:sim-params}\footnote{The complete source code used in this work is publicly available at \url{https://github.com/KNODIS-Research-Group/efficient-n-body-simulations-using-physics-informed-gnn}.}.

\begin{table}[h]
    \caption{Simulation parameters and their respective values}
    \label{tbl:sim-params}
    \centering
    \begin{tabular}{@{}ll@{}}
        \toprule
        \textbf{Parameter}                     & \textbf{Value}       \\
        \midrule
        Time step                          & $0.0001$             \\
        Gravitational constant                & $4.5 \times 10^{-6}$ \\
        Total galaxy mass               & $1.0$                \\
        Radial scale                          & $3.0$                \\
        Vertical scale                        & $0.3$                \\
        Black hole mass proportion & $0.01$               \\
        Arms                       & $2$                  \\ \bottomrule
    \end{tabular}
\end{table}

These parameters have been adjusted based on the total mass of the galaxy, in an effort to simulate the initial state of a galaxy in a dimensionless system in a relatively realistic manner. However, absolute fidelity was not the aim, as the focus of this paper is on the simulation of \gls{nbd} systems; thus, while these values are representative, they do not correspond to any actual configuration of a theoretical or known galaxy.

\subsection{Stepwise analysis}

The results obtained from the model were grouped by scene. The data show that the loss is extremely low, with values on the order of $10^{-14}$ to $10^{-13}$. Figure~\ref{fig:stepwise-loss-and-time} summarises the results for the loss and the average step times used during the simulation execution.

\begin{figure}[ht]
    \centering
    \begin{subfigure}[b]{0.45\textwidth}
        \centering
        \includegraphics[width=\textwidth]{images/stepwise-loss.png}
        \label{subfig:a}
    \end{subfigure}
    \hfill
    \begin{subfigure}[b]{0.45\textwidth}
        \centering
        \includegraphics[width=\textwidth]{images/stepwise-time.png}
        \label{subfig:b}
    \end{subfigure}
    \caption{Results of the error analysis (left) and average scene execution time (right) for $1000$ simulation steps performed using the model.}
    \label{fig:stepwise-loss-and-time}
\end{figure}

These results indicate that the model achieves more than acceptable accuracy and consistency in its predictions at each step.

\subsection{Rollout evaluation}

Errors for position, velocity, and acceleration of all particles have been averaged across scenes for each step. The acceleration error remains almost constant (with only minimal variation at the sixth decimal place), which indicates that the prediction of acceleration is very stable over time. In contrast, the errors in position and velocity start at zero and gradually increase, reaching values on the order of $10^{-21}$ and $10^{-17}$ respectively by the final step of the simulation, thereby demonstrating good control over error propagation in these variables. Figure~\ref{fig:rollout-evolution} illustrates the evolution of these metrics over time.

\begin{figure}[ht]
    \centering
    \includegraphics[width=.5\textwidth]{images/rollout-evolution.png}
    \caption{Evolution of the \gls{mse} in position, velocity, and acceleration over $1000$ simulation steps. The value at each step is the average of the values for all particles across all scenarios for that step.}
    \label{fig:rollout-evolution}
\end{figure}

In the figure, one can observe the complete stability of the acceleration predictions and an almost imperceptible increase in the errors in position and velocity. Focusing on the accumulated errors (see Figure~\ref{fig:cumulative-errors}), which are obtained by summing the errors at each step, it reveals that:

\begin{figure}[ht]
    \centering
        \begin{subfigure}[b]{0.32\textwidth}
        \centering
        \includegraphics[width=\textwidth]{images/cumulative-position-error.png}
        \caption{}
        \label{subfig:a}
    \end{subfigure}
    \hfill
    \begin{subfigure}[b]{0.32\textwidth}
        \centering
        \includegraphics[width=\textwidth]{images/cumulative-velocity-error.png}
        \caption{}
        \label{subfig:b}
    \end{subfigure}
    \hfill
    \begin{subfigure}[b]{0.32\textwidth}
        \centering
        \includegraphics[width=\textwidth]{images/cumulative-acceleration-error.png}
        \caption{c}
        \label{subfig:c}
    \end{subfigure}
    \caption{Accumulated errors in position (left), velocity (centre) and acceleration (right). The error accumulation is shown for each of the $6$ scenes, from the one with the fewest particles (scene $0$, $3$ bodies) to that with the most particles (scene $5$, $500$).}
    \label{fig:cumulative-errors}
\end{figure}

\begin{enumerate}
    \item The accumulated error in the position variable at the end of the simulation is on the order of $10^{-18}$, with a final average of approximately $8.27 \times 10^{-19}$.
    \item The accumulated error in velocity is on the order of $10^{-14}$, with a final average of approximately $1.6 \times 10^{-14}$.
    \item The accumulated error in acceleration reaches values on the order of $10^{-11}$, with a final average of approximately $1.26 \times 10^{-11}$.
\end{enumerate}

These low error values indicate that the error propagation over time is practically negligible.

\subsection{Regarding computational efficiency}

A speedup of approximately $1.17$ times has been achieved, that is, the model runs roughly 17\% faster than the base simulation. Although moderate, it represents a promising avenue for further research, particularly as it has been realised without sacrificing the accuracy of the predictions.

\section{Conclusion}
\label{s:conclusion}

Following the analysis of the results obtained in this study, it appears that integrating \glspl{gnn} techniques with physical \gls{nbd} simulations can accelerate their execution while maintaining a high degree of accuracy, with very low losses and negligible accumulated errors.

It should be noted that the dataset was divided into a training-test split without a dedicated validation set. This decision was primarily driven by the focus on establishing baseline performance. However, the absence of a validation set is a limitation, as it restricts our ability to monitor overfitting. Future work will aim to incorporate a separate validation set or adopt cross-validation techniques to enhance performance evaluation and robustness.

Another relevant consideration is that the black hole is treated uniformly as any other particle, which may inadequately capture its dominant gravitational influence. A differentiated treatment—such as assigning a higher weight or designing a specialized connectivity scheme—should be explored in future work to better reflect its unique role in the system. Additionally, further improvements to the implementation may enhance the model’s performance and make it even more competitive compared to traditional simulation techniques.

Finally, although this study focuses on small- to medium-scale systems (up to 500 bodies), real-world astrophysical simulations often involve thousands or even millions of particles. Addressing scalability is therefore a key avenue for future research. Possible approaches include using sparse graph representations, hierarchical modeling strategies, or hardware acceleration to extend the applicability of the method to larger and more complex systems.

\begin{ack}
    The support of the Comunidad Autónoma de Madrid under ALENTAR-J-CM project (reference TEC-2024/COM-224) is gratefully acknowledged.
\end{ack}

\bibliographystyle{apalike}
\bibliography{maeb_2025}

\end{document}