Representation with a capital 'R': measuring functional alignment with causal perturbation

Published: 10 Oct 2024, Last Modified: 08 Nov 2024UniRepsEveryoneRevisionsBibTeXCC BY 4.0
Track: Extended Abstract Track
Keywords: representation, functional mechanism, alignment, deep encoding models, fMRI, constraints on encoding, information routing
TL;DR: Studies of alignment must move beyond conventional "representation" as encoding, in favor of "Representation" (with a capital R), which specifies how information is used functionally.
Abstract: Ambiguity surrounding the term 'representation' in biological and artificial neural systems hampers our ability to assess their alignment. In this paper, we draw a critical distinction between two notions of representation: the conventional 'representation' as a mere encoding, and 'Representation-with-a capital-R', which entails functional use within a system. We argue that while current methods in neuroscience and artificial intelligence often focus on the former, advancing our understanding requires a shift toward the latter. We critique existing linking methods such as representational similarity analysis and encoding models, highlighting key limitations in their ability to capture functional correspondence. We then propose an updated paradigm that involves specifying explicit models of information readout and testing their functional properties using causal perturbation. This framework, which treats neural networks as a new species of model organism, may help reveal the principles governing functional representations in both biological and artificial systems.
Submission Number: 67
Loading