Do LLMs Understand Calculus? Evaluating Symbolic Math Generalization with ASyMOB
Keywords: Large Language Model, Symbolic Mathematics, CAS, Tool Use, AI for Science, AI for Math, Integrals, Benchmark, Reasoning, Mathematical Capabilities, Evaluation
TL;DR: We assess LLM mathematical skills with a new question dataset focused on symbolic computation, and show that the most advanced models go beyond memorizing patterns and exhibit signs of a deeper understanding of symbolic math.
Confirmation Of Submission Requirements: I submit an abstract. It uses the template provided on the submission page and is no longer than 2 pages.
Submission Number: 299