Published: 01 Jan 2024, Last Modified: 07 Aug 2024, Comput. Vis. Image Underst. 2024, CC BY-SA 4.0
Abstract: Highlights
• Directly regress the camera pose from the query image.
• Transformer encoders can aggregate task-informative latent representations.
• We show how to learn both single- and multi-scene camera pose regression.