Towards Finding Longer Proofs

Zsolt Zombori; Adrián Csiszárik; Henryk Michalewski; Cezary Kaliszyk; Josef Urban

Towards Finding Longer Proofs

Zsolt Zombori, Adrián Csiszárik, Henryk Michalewski, Cezary Kaliszyk, Josef Urban

28 Sept 2020 (modified: 22 Jun 2025)ICLR 2021 Conference Blind SubmissionReaders: Everyone

Keywords: automated reasoning, reinforcement learning, reasoning by analogy

Abstract: We present a reinforcement learning (RL) based guidance system for automated theorem proving geared towards Finding Longer Proofs (FLoP). FLoP is a step towards learning to reason by analogy, reducing the dependence on large scale search in automated theorem provers. We use several simple, structured datasets with very long proofs to show that FLoP can successfully generalise a single training proof to a large class of related problems, implementing a simple form of analogical reasoning. On these benchmarks, FLoP is competitive with strong theorem provers despite using very limited search.

One-sentence Summary: FLoP is a theorem prover that uses RL based guidance to implement a simple form of analogical reasoning to overcome fundamental limitations of search based approaches.

Code Of Ethics: I acknowledge that I and all co-authors of this work have read and commit to adhering to the ICLR Code of Ethics

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 2 code implementations](https://www.catalyzex.com/paper/towards-finding-longer-proofs/code)

Reviewed Version (pdf): https://openreview.net/references/pdf?id=msZz1biPgh

8 Replies

Loading