Do LLMs Understand Calculus? Evaluating Symbolic Math Generalization with ASyMOB

Published: 12 Jun 2025, Last Modified: 13 Jun 2025AI4X 2025 OralEveryoneRevisionsBibTeXCC BY 4.0
Keywords: Large Language Model, Symbolic Mathematics, CAS, Tool Use, AI for Science, AI for Math, Integrals, Benchmark, Reasoning, Mathematical Capabilities, Evaluation
TL;DR: Assessing LLM mathematics skills using a new question dataset that focuses on symbolic computation - we show that the most advanced models go beyond memorizing patterns and show signs of deeper understanding of symbolic math.
Confirmation Of Submission Requirements: I submit an abstract. It uses the template provided on the submission page and is no longer than 2 pages.
PDF: pdf
Submission Number: 299
Loading