Stylistic Contrastive Learning for Human-Like AI Text Generation

Published: 08 Oct 2025, Last Modified: 08 Oct 2025 · Agents4Science · CC BY 4.0
Keywords: Stylistic Contrastive Learning, Human-like Text Generation, Stylometry, Style Control for LLMs, Large Language Models
Abstract: AI-generated text is often fluent yet stylistically off: it leans formal, repeats safe phrasing, underuses idioms, and exhibits templated discourse, making it detectably non-human to both algorithms and attentive readers. We synthesize recent evidence quantifying these gaps (lexical diversity, syntactic variety, idiomaticity, and discourse planning) and propose Stylistic Contrastive Learning (SCL), a training framework that learns a human-style embedding and pushes generations toward it via a supervised contrastive objective. We instantiate SCL on GPT-5 and evaluate across essays, news-style expositions, and dialogues. SCL reduces stylometric detectability relative to a GPT-5 baseline by 18–22 percentage points (e.g., from 72% to 54% on essays), increases distinct-n and idiom use, and raises human "sounds-human" ratings while preserving topical fidelity. Ablations identify idiom frequency and discourse markers as the strongest perceptual drivers. We discuss implications for evaluation, alignment, and detection.
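The page does not give the paper's exact objective, but as a rough illustration of the kind of supervised contrastive loss the abstract describes, here is a minimal PyTorch sketch in the style of Khosla et al. (2020), applied to style embeddings where human-written and model-generated texts form the two label classes. The function name, batch layout, and temperature value are illustrative assumptions, not taken from the paper.

```python
import torch
import torch.nn.functional as F

def supervised_contrastive_loss(embeddings: torch.Tensor,
                                labels: torch.Tensor,
                                temperature: float = 0.1) -> torch.Tensor:
    """Supervised contrastive loss over a batch of style embeddings.

    Samples sharing a label (e.g., label 1 = human-written) are pulled
    together in embedding space; samples with different labels are pushed
    apart. This is an illustrative stand-in for the paper's SCL objective.
    """
    z = F.normalize(embeddings, dim=1)              # (B, d) unit vectors
    sim = z @ z.T / temperature                     # pairwise cosine similarities
    B = z.size(0)
    self_mask = torch.eye(B, dtype=torch.bool, device=z.device)
    sim = sim.masked_fill(self_mask, float('-inf'))  # exclude self-pairs

    # log-softmax over each anchor's row of similarities
    log_prob = sim - torch.logsumexp(sim, dim=1, keepdim=True)
    log_prob = log_prob.masked_fill(self_mask, 0.0)  # avoid -inf * 0 = nan below

    # positives: same label, not the anchor itself
    pos_mask = (labels.unsqueeze(0) == labels.unsqueeze(1)) & ~self_mask
    pos_counts = pos_mask.sum(1).clamp(min=1)
    loss = -(log_prob * pos_mask).sum(1) / pos_counts

    # average only over anchors that have at least one positive
    return loss[pos_mask.any(1)].mean()

# Hypothetical usage: `style_encoder` is an assumed text-to-embedding model.
# z = style_encoder(texts)                          # (B, d) embeddings
# labels = torch.tensor([1, 1, 0, 0])               # 1 = human, 0 = AI
# loss = supervised_contrastive_loss(z, labels)
```

At generation time, one plausible use of such an embedding (consistent with the abstract's "pushes generations toward it") is to penalize the distance between a generation's style embedding and the learned human-style centroid; the abstract does not specify this mechanism.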
Supplementary Material: zip
Submission Number: 344