Generative compositor for few-shot visual information extraction

Zhibo Yang, Wei Hua, Sibo Song, Cong Yao, Yingying Zhu, Wenqing Cheng, Xiang Bai

Published: 2025, Last Modified: 05 Mar 2026Pattern Recognit. 2025EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: Highlights•A novel VIE method, called Generative Compositor, leverages layout and prompt priors.•Three pre-training tasks to improve the model’s spatial contextual capabilities.•A prompt-aware resampler for distilling and merging the multi-modal embeddings.•Significant improvements in few-shot settings.

External IDs:dblp:journals/pr/YangHSYZCB25