Precision at scale: Domain-specific datasets on-demand

Published: 2026, Last Modified: 09 Nov 2025Pattern Recognit. 2026EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•Precision at Scale (PaS) automatically creates domain-specific datasets on-demand.•It leverages LLMs, VLMs, and generative models for data collection and curation.•PaS includes a task-agnostic framework for assessing dataset diversity.•Training on PaS datasets outperform model pretraining on large-scale general-domain datasets.•PaS datasets efficiently fine-tune SoTA VLMs in specialized domains.
Loading