Can LLMs Replace Human Evaluators? An Empirical Study of LLM-as-a-Judge in Software Engineering

Ruiqi Wang, Jiyu Guo, Cuiyun Gao, Guodong Fan, Chun Yong Chong, Xin Xia

Published: 22 Jun 2025, Last Modified: 08 Jan 2026Proceedings of the ACM on Software EngineeringEveryoneRevisionsCC BY-SA 4.0
External IDs:doi:10.1145/3728963
Loading