A CNN-transformer hybrid approach for decoding visual neural activity into text

Jiang Zhang, Chen Li, Ganwanming Liu, Min Min, Chong Wang, Jiyi Li, Yuting Wang, Hongmei Yan, Zhentao Zuo, Wei Huang, Huafu Chen

Published: 2022, Last Modified: 13 May 2025Comput. Methods Programs Biomed. 2022EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: Highlights•A CNN-Transformer hybrid decoding model is proposed to decode visual neural activities evoked by natural images into texts about the visual stimuli.•A specific architecture of the transformer is investigated to improve the decoding performance.•The function of visual durations, attention mapping, and visual regions are explored to understand the neural mechanism in the human brain.