Towards Finding Longer Proofs

Zsolt Zombori; Adrián Csiszárik; Henryk Michalewski; Cezary Kaliszyk; Josef Urban

Towards Finding Longer Proofs

Zsolt Zombori, Adrián Csiszárik, Henryk Michalewski, Cezary Kaliszyk, Josef Urban

25 Sept 2019 (modified: 22 Jun 2025)ICLR 2020 Conference Blind SubmissionReaders: Everyone

TL;DR: We present FLoP, a reinforcement learning based guidance system for automated theorem proving geared towards Finding Longer Proofs.

Abstract: We present a reinforcement learning (RL) based guidance system for automated theorem proving geared towards Finding Longer Proofs (FLoP). FLoP focuses on generalizing from short proofs to longer ones of similar structure. To achieve that, FLoP uses state-of-the-art RL approaches that were previously not applied in theorem proving. In particular, we show that curriculum learning significantly outperforms previous learning-based proof guidance on a synthetic dataset of increasingly difficult arithmetic problems.

Code: http://bit.ly/code_atpcurr

Keywords: automated theorem proving, reinforcement learning, curriculum learning, internal guidance

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 2 code implementations](https://www.catalyzex.com/paper/towards-finding-longer-proofs/code)

Original Pdf: pdf

8 Replies

Loading