Automated Natural Language Explanation of Deep Visual Neurons with Large Models (Student Abstract)

Chenxu Zhao, Wei Qian, Yucheng Shi, Mengdi Huai, Ninghao Liu

Published: 2024, Last Modified: 23 Jan 2026AAAI 2024EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: Interpreting deep neural networks through examining neurons offers distinct advantages when it comes to exploring the inner workings of Deep Neural Networks. Previous research has indicated that specific neurons within deep vision networks possess semantic meaning and play pivotal roles in model performance. Nonetheless, the current methods for generating neuron semantics heavily rely on human intervention, which hampers their scalability and applicability. To address this limitation, this paper proposes a novel post-hoc framework for generating semantic explanations of neurons with large foundation models, without requiring human intervention or prior knowledge. Experiments are conducted with both qualitative and quantitative analysis to verify the effectiveness of our proposed approach.