Structure and Component: Factors Influencing the Compositionality Development of Pre-trained Language Models During Fine-tuning

Structure and Component: Factors Influencing the Compositionality Development of Pre-trained Language Models During Fine-tuning

ACL ARR 2026 January Submission2984 Authors

04 Jan 2026 (modified: 20 Mar 2026)ACL ARR 2026 January SubmissionEveryoneRevisionsBibTeXCC BY 4.0

Keywords: compositionality, structure, component, fine-tuning, pre-trained language model

Abstract: In this work, we investigate factors influencing the compositionality development of pre-trained language models during fine-tuning from the perspective of two key concepts in linguistic compositionality: structure and component. Our key finding is that even fine-tuning of identical symbolic essences can lead to different compositionality developments of the same pre-trained language model, driven by the following factors: (1) structures uncommon in natural language lead to high compositionality, (2) components uncommon in natural language lead to high compositionality, and (3) component combinations uncommon in natural language lead to low compositionality. This phenomenon is intrinsically linked to linguistic compositionality, offering a new perspective for compositionality research on language models.

Paper Type: Short

Research Area: Semantics: Lexical, Sentence-level Semantics, Textual Inference and Other areas

Research Area Keywords: compositionality

Contribution Types: NLP engineering experiment

Languages Studied: English

Submission Number: 2984

Loading