Visual question answering from another perspective: CLEVR mental rotation tests

Published: 01 Jan 2023, Last Modified: 13 May 2025Pattern Recognit. 2023EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•We propose a version of CLEVR that is inspired by mental rotation tests.•Latent feature volumes can be used instead of feature maps for VQA tasks grounded in 3D.•Contrastive learning can be used to learn an encoder that maps images to latent volumes.
Loading