On the computation of the gradient in implicit neural networks

Béla J. Szekeres, Ferenc Izsák

Published: 2024, Last Modified: 02 Mar 2026J. Supercomput. 2024EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: Implicit neural networks and the related deep equilibrium models are investigated. To train these networks, the gradient of the corresponding loss function should be computed. Bypassing the implicit function theorem, we develop an explicit representation of this quantity, which leads to an easily accessible computational algorithm. The theoretical findings are also supported by numerical simulations.

External IDs:dblp:journals/tjs/SzekeresI24