Data Augmentation for Physical Commonsense Reasoning

Published: 17 Dec 2024, Last Modified: 18 Dec 2024UMich CSE595 NLP FA2024EveryoneRevisionsBibTeXCC BY 4.0
Keywords: NLP, Sense Reasoning
Abstract: Tiered Reasoning for Intuitive Physics (TRIP) is a new NLP dataset from the SLED Lab at University of Michigan. It poses a multi-layered physical commonsense reasoning task where AI systems must determine which of two stories (each describing a series of physical actions applied to household objects) is plausible. Further, they must justify their decision by identifying which sentences in the implausible story are conflicting with each other, and the specific physical state changes that cause the conflict. When we evaluate systems on this tiered prediction, baseline results are very low (only up to 10% of stories are classified correctly and verified with coherent supporting evidence). For this project, we are going to use three potential approaches to improve the baseline performance on TRIP.
Archival Option: Yes
Submission Number: 4
Loading

OpenReview is a long-term project to advance science through improved peer review with legal nonprofit status. We gratefully acknowledge the support of the OpenReview Sponsors. © 2025 OpenReview