Keywords: cryptography, math, datasets, AI4math, transformers
TL;DR: We release the largest preprocessed datasets for the Learning with Errors (LWE) problem
Abstract: AI-powered attacks on Learning with Errors (LWE)—an important hard math problem in post-quantum cryptography—rival or outperform "classical" attacks on LWE under certain parameter settings. Despite the promise of this approach, a dearth of accessible data limits AI practitioners' ability to study and improve these attacks. Creating LWE data for AI model training is time- and compute-intensive and requires significant domain expertise. To fill this gap and accelerate AI research on LWE attacks, we propose the TAPAS datasets, a ${\bf t}$oolkit for ${\bf a}$nalysis of ${\bf p}$ost-quantum cryptography using ${\bf A}$I ${\bf s}$ystems. These datasets cover several LWE settings and can be used off-the-shelf by AI practitioners to prototype new approaches to cracking LWE. This work documents TAPAS dataset creation, establishes attack performance baselines, and lays out directions for future work.
Croissant File: json
Dataset URL: https://huggingface.co/datasets/facebook/TAPAS
Primary Area: Other (please use sparingly, only use the keyword field for more details)
Submission Number: 1520
Loading