ST-KeyS: Self-supervised Transformer for Keyword Spotting in historical handwritten documents

Published: 01 Jan 2026, Last Modified: 25 Jul 2025Pattern Recognit. 2026EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•A self-supervised approach based on vision transformers for keyword spotting.•Learning useful representations from unlabeled data using a masked auto-encoder.•Improving feature embedding using a two-stage downstream task.•The proposed approach outperforms state-of-the-art approaches on three datasets.
Loading