Mixture of Soft Prompts for Controllable Data Generation

Derek Chen; Celine Lee; Yunan Lu; Domenic Rosati; Zhou Yu

Mixture of Soft Prompts for Controllable Data Generation

Derek Chen, Celine Lee, Yunan Lu, Domenic Rosati, Zhou Yu

Published: 07 Oct 2023, Last Modified: 01 Dec 2023EMNLP 2023 FindingsEveryoneRevisionsBibTeX

Submission Type: Regular Long Paper

Submission Track: Efficient Methods for NLP

Submission Track 2: Natural Language Generation

Keywords: data augmentation, parameter efficient training, few-shot learning, structured prediction

TL;DR: Use LLM to generate data and train smaller model, which outperforms original larger LLM.

Abstract: Large language models (LLMs) effectively generate fluent text when the target output follows natural language patterns. However, structured prediction tasks confine the output format to a limited ontology, causing even very large models to struggle since they were never trained with such restrictions in mind. The difficulty of using LLMs for direct prediction is exacerbated in few-shot learning scenarios, which commonly arise due to domain shift and resource limitations. We flip the problem on its head by leveraging the LLM as a tool for data augmentation rather than direct prediction. Our proposed Mixture of Soft Prompts (MSP) serves as a parameter-efficient procedure for generating multi-attribute data in a controlled manner. Denoising mechanisms are further applied to improve the quality of synthesized data. Automatic metrics show our method is capable of producing diverse and natural text, while preserving label semantics. Moreover, MSP achieves state-of-the-art results on three benchmarks when compared against strong baselines. Our method offers an alternate data-centric approach for applying LLMs to complex prediction tasks.

Submission Number: 3831

Loading