Learning Unorthogonalized Matrices for Rotation Estimation

22 Sept 2023 (modified: 11 Feb 2024)Submitted to ICLR 2024EveryoneRevisionsBibTeX
Primary Area: representation learning for computer vision, audio, language, and other modalities
Code Of Ethics: I acknowledge that I and all co-authors of this work have read and commit to adhering to the ICLR Code of Ethics.
Keywords: geometry, optimization, pose estimation
Submission Guidelines: I certify that this submission complies with the submission instructions as described on https://iclr.cc/Conferences/2024/AuthorGuide.
Abstract: Estimating 3D rotations is a common procedure for 3D computer vision. The accuracy depends heavily on the rotation representation. Rotation matrices, recently, have been popular due to their continuity, especially for pose estimation tasks. The learning process usually incorporates orthogonalizations to generate orthonormal matrices. We observe that common orthogonalization procedures like Gram- Schmidt-based and SVD-based may slow down the training efficiency via a gradi- ent analysis. To this end, we advocate removing orthogonalization from the learn- ing process and learning unorthogonalized ‘Pseudo’ Rotation Matrices (PRoM). To prove the superiority of PRoM over orthogonalization incorporated methods, we conduct an optimization analysis to explicitly demonstrate that PRoM can con- verge at a higher rate and to a better solution. By replacing the orthogonalization incorporated representation with our proposed PRoM in various rotation-related tasks, we can achieve state-of-the-art results on large-scale benchmarks
Anonymous Url: I certify that there is no URL (e.g., github page) that could be used to find authors' identity.
No Acknowledgement Section: I certify that there is no acknowledgement section in this submission for double blind review.
Submission Number: 4833
Loading