AesX: Enhance Your Images with Stunning Aesthetic Beauty

Yuyan Chen; Zhendong Hou; Lei Xia; Jiahao Li; Zhuolin Ji; Zhixu Li

AesX: Enhance Your Images with Stunning Aesthetic Beauty

Yuyan Chen, Zhendong Hou, Lei Xia, Jiahao Li, Zhuolin Ji, Zhixu Li

Published: 18 Apr 2026, Last Modified: 22 Apr 2026ACL 2026 Industry Track PosterEveryoneRevisionsBibTeXCC BY 4.0

Keywords: Large Multi-modal Models, Text-to-Image Generation, Aesthetic Score

Abstract: In the fields of advertising design, artistic creation, and cultural dissemination, there is an increasingly urgent demand for high-quality images that cater to fine-grained aesthetic preferences. Although existing large-scale models can generally meet basic requirements for clarity and alignment with textual elements, they still face significant bottlenecks in achieving precise control and aesthetic optimization. To address this limitation, we propose a set of comprehensive preference indicators across two major dimensions, text-image consistency and aesthetic quality, encompassing multiple criteria ranging from exposure and clarity to visual guidance and innovativeness. Building on these indicators, we have developed a generative framework named AesX to steer the model consistently toward a generation path that more closely aligns with human aesthetic sensibilities. Our experimental findings demonstrate that this approach yields significant improvements in both target recognition accuracy and overall visual aesthetic presentation.

Submission Type: Discovery

Copyright Form: pdf

Submission Number: 467

Loading