Equivariant Modelling for Catalysis on 2D MXenes

Published: 31 Oct 2025, Last Modified: 24 Nov 2025SIMBIOCHEM 2025 SpotlightEveryoneRevisionsBibTeXCC BY-NC-SA 4.0
Keywords: MXenes, equivariance, catalysis
TL;DR: Generating 2D MXenes dataset; training an equivariant model for catalysis on 2D MXenes and achieving computational acceleration while maintaining robustness
Abstract: Merging advanced computations with machine learning, we aim to accelerate the exploration of catalytic behaviour in novel materials. We focus on two-dimensional (2D) Ti$_2$CT$_y$ MXenes, whose versatile surface chemistry makes them particularly compelling candidates for catalysis. However, resolving their composition and structure under realistic conditions requires going beyond the systems typically studied with density functional theory (DFT), as the computational cost of such calculations limits accessible system sizes and timescales, calling instead for more efficient approaches. To address this challenge, we generate a comprehensive dataset of 50,000 DFT calculations for training and 10,000 for testing, encompassing both Ti$_2$CT$_y$ MXene configurations and molecular systems, along with an augmented dataset where systems are artificially repeated to investigate how well the model generalises to larger systems. Employing advances in geometric deep learning, we train and validate an equivariant (i.e., symmetry-aware) model (EquiformerV2) that accurately predicts atomic forces and formation energies --- quantities that DFT must repeatedly compute for structural and catalytic investigations --- for these 2D materials. This combined DFT–ML framework achieves computational acceleration of the order ${\sim}10^3$--$10^4$ (on a CPU) while maintaining DFT-level accuracy (${\sim} {\pm} 45$ meV/Å for forces and ${\sim} {\pm} 6$ meV for per-atom energies), paving the way for more efficient investigations of MXene catalytic behaviour. Moreover, we confirm that the total energy prediction error of the model grows linearly with the number of atoms in an input system, while the force error remains the same, which, along with the equivariant model design, is a necessity for a robust model. The dataset and the trained models with the code will be made publicly available upon acceptance.
Release To Public: Yes, please release this paper to the public
Submission Number: 24
Loading