Generating Code to Verify Cryptic Crossword Reasoning

Martin Andrews; Sam Witteveen

Generating Code to Verify Cryptic Crossword Reasoning

Martin Andrews, Sam Witteveen

Published: 06 Mar 2025, Last Modified: 19 Apr 2025DL4C @ ICLR 2025EveryoneRevisionsBibTeXCC BY 4.0

Track: long paper (up to 9 pages)

Keywords: Cryptic Crosswords, Code Generation, Reasoning Verification, Large Language Models, Natural Language Understanding, Interpretability, Algorithmic Reasoning

TL;DR: Using code generation, we automate and verify linguistic reasoning for cryptic crossword solving, demonstrating a novel application of deep learning for complex puzzle-solving

Abstract: Cryptic crossword clues are challenging linguistic reasoning puzzles, with fresh, real-world test cases published daily in major newspapers globally. Unlike standard crosswords, cryptic clues uniquely combine a ‘definition’ with intricate ‘wordplay’ – a logical puzzle embedded in language – designed to prove each answer’s correctness through reasoning alone, independent of grid context. This work describes an open-licensed, LLM-driven system for automated cryptic crossword solving which generates: (i) answer hypotheses and wordplay explanations; and (ii) code-based verification of the proposed reasoning, ensuring solution validity. Critically, our system achieves state-of-the-art performance on the challenging Cryptonite dataset, comprising cryptic clues from prestigious UK newspapers like The Times and The Telegraph. Furthermore, by expressing each verified solution and its reasoning in executable Python code, our system offers unprecedented interpretability, allowing for detailed inspection and analysis of the automated cryptic crossword solving process.

Anonymization: This submission has been anonymized for double-blind review via the removal of identifying information such as names, affiliations, and identifying URLs.

Submission Number: 10

Loading