Neuromodulation Gated Transformer

Published: 01 Jan 2023, Last Modified: 15 May 2025CoRR 2023EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: We introduce a novel architecture, the Neuromodulation Gated Transformer (NGT), which is a simple implementation of neuromodulation in transformers via a multiplicative effect. We compare it to baselines and show that it results in the best average performance on the SuperGLUE benchmark validation sets.
Loading