Abstract: Highlights•Precision at Scale (PaS) automatically creates domain-specific datasets on-demand.•It leverages LLMs, VLMs, and generative models for data collection and curation.•PaS includes a task-agnostic framework for assessing dataset diversity.•Training on PaS datasets outperform model pretraining on large-scale general-domain datasets.•PaS datasets efficiently fine-tune SoTA VLMs in specialized domains.
External IDs:dblp:journals/pr/RodriguezdeVeraESNR26
Loading