Characterizing the Representational Capacity of Neural Processes

Robin Young

Published: 25 May 2026, Last Modified: 25 May 2026. ProbML 2026 Proceedings Track. CC BY 4.0

Keywords: Neural Processes, representational capacity, expressiveness hierarchy, Gaussian Processes, meta-learning, attention mechanisms

Abstract: What functions can Neural Processes represent? We analyze the representational capacity of popular NP architectures: Conditional Neural Processes (CNPs), Attentive Neural Processes (ANPs), Transformer Neural Processes (TNPs), and their latent variants. We prove these architectures form a strict hierarchy. CNP-representable functions are exactly those depending on finitely many expected features of the context distribution. ANPs strictly generalize CNPs via query-dependent reweighting, enabling kernel smoothers. ConvCNPs and ANPs are incomparable; each contains functions outside the other, separated by stationarity versus translation equivariance. TNPs with $L$ self-attention layers capture $L$-hop context interactions. For latent NPs, we show finite-dimensional latents provide coherent sampling but do not circumvent encoder limitations; matching GP posterior distributions requires latent dimension scaling with context size. These results provide a theoretical foundation for architecture selection based on task structure.
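Two of the abstract's claims can be illustrated with a small NumPy sketch. It is not the paper's code or construction: the feature map `toy_phi`, the function names, and the Gaussian attention scores are all assumptions chosen for illustration. The first function mean-pools features over the context set, so its output depends only on the empirical context distribution (duplicating every context point leaves it unchanged). The second uses query-dependent softmax weights, which with squared-distance scores reduces to a Nadaraya-Watson kernel smoother.

```python
import numpy as np

def cnp_representation(xc, yc, phi):
    """Mean-pool features over the context set (CNP-style aggregation).

    phi is a hypothetical finite-dimensional feature map, not the paper's.
    """
    return np.mean([phi(x, y) for x, y in zip(xc, yc)], axis=0)

def attention_smoother(xq, xc, yc, lengthscale=1.0):
    """Softmax attention with scores -(xq - xc)^2 / (2 * lengthscale^2).

    The resulting weights are normalized Gaussian kernel weights, so the
    output is a Nadaraya-Watson kernel smoother evaluated at each query.
    """
    scores = -((xq[:, None] - xc[None, :]) ** 2) / (2.0 * lengthscale ** 2)
    scores -= scores.max(axis=1, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=1, keepdims=True)
    return weights @ yc

def toy_phi(x, y):
    # Assumed toy features; any fixed finite set would serve the same purpose.
    return np.array([1.0, x, y, x * y, x ** 2])

xc = np.array([-1.0, 0.0, 2.0])
yc = np.array([0.5, -0.3, 1.2])

# Mean pooling sees only the empirical distribution of the context:
r = cnp_representation(xc, yc, toy_phi)
r_dup = cnp_representation(np.tile(xc, 2), np.tile(yc, 2), toy_phi)

# Query-dependent reweighting: with a short lengthscale, each query is
# dominated by its nearest context point.
xq = np.array([-1.0, 0.5, 2.0])
pred = attention_smoother(xq, xc, yc, lengthscale=0.1)
```

In this sketch the pooled vector `r` is the same for every query, whereas the smoother recomputes its weights per query point, which is the distinction the abstract draws between CNPs and ANPs.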

Submission Number: 20
