Re-evaluating metamorphic testing of chess engines: A replication study

Published: 01 Jan 2025, Last Modified: 22 Jul 2025Inf. Softw. Technol. 2025EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•We replicated the original study varying depth and dataset (real games).•Metamorphic relations (MRs) are less effective on real positions and higher depths.•MRs must consider depth; effectiveness drops past threshold.•Stockfish code review explains discrepancies in transformed and low-depth positions.•Replication matters: MRs must be causally related to domain’s specificities.
Loading