SIREN: Scalable isotropic recursive column multimodal neural architecture

Hubert Truchan, Alan Farag, Zahra Ahmadi

Published: 01 Jan 2026, Last Modified: 20 Nov 2025, Information Fusion, CC BY-SA 4.0
Abstract: Highlights
• Introduces a multimodal isotropic architecture with <18 MB, <4.5M parameters, and 23 ms per observation.
• Scales to complex modalities via isotropic blocks, recurrent links, and column links.
• Presents a 2D embedding that avoids compression, enabling flexible downsizing.
• Achieves higher accuracy than SOTA, with recursion boosting semantic features.
• Releases the Pentostreda dataset with five modalities for classification and regression.