Toggle navigation
OpenReview
.net
Login
×
Back to
ACL
ACL ARR 2025 July Submissions
ReVision: A Dataset and Baseline VLM for Privacy-Preserving Task-Oriented Visual Instruction Rewriting
ACL ARR 2025 July Submission875 Authors
29 Jul 2025 (modified: 19 Aug 2025)
ACL ARR 2025 July Submission
Readers:
Everyone
Agnus: Robust Entity Disambiguation using decoder-only LMs
ACL ARR 2025 July Submission872 Authors
29 Jul 2025 (modified: 03 Sept 2025)
ACL ARR 2025 July Submission
Readers:
Everyone
MobileGUI-RL: Advancing Mobile GUI Agent through Reinforcement Learning in Online Environment
ACL ARR 2025 July Submission867 Authors
29 Jul 2025 (modified: 17 Aug 2025)
ACL ARR 2025 July Submission
Readers:
Everyone
Context-Aware Whisper for Arabic ASR Under Linguistic Varieties
ACL ARR 2025 July Submission863 Authors
29 Jul 2025 (modified: 19 Aug 2025)
ACL ARR 2025 July Submission
Readers:
Everyone
GitChameleon 2.0: Evaluating AI Code Generation Against Python Library Version Incompatibilities
ACL ARR 2025 July Submission861 Authors
29 Jul 2025 (modified: 20 Aug 2025)
ACL ARR 2025 July Submission
Readers:
Everyone
Test-Time Scaling with Repeated Sampling Improves Multilingual Text Generation
ACL ARR 2025 July Submission858 Authors
29 Jul 2025 (modified: 19 Aug 2025)
ACL ARR 2025 July Submission
Readers:
Everyone
Beyond Accuracy: Alignment and Error Detection across Languages in the Bi-GSM8K Math-Teaching Benchmark
ACL ARR 2025 July Submission853 Authors
29 Jul 2025 (modified: 01 Sept 2025)
ACL ARR 2025 July Submission
Readers:
Everyone
Modeling Annotator Disagreement with Demographic-Aware Experts and Synthetic Perspectives
ACL ARR 2025 July Submission839 Authors
28 Jul 2025 (modified: 31 Aug 2025)
ACL ARR 2025 July Submission
Readers:
Everyone
Counterfactual Evaluation for Blind Attack Detection in LLM-based Evaluation Systems
ACL ARR 2025 July Submission838 Authors
28 Jul 2025 (modified: 26 Aug 2025)
ACL ARR 2025 July Submission
Readers:
Everyone
Syllable Tokenization Does Not Improve Phonological Awareness in Large Language Models
ACL ARR 2025 July Submission837 Authors
28 Jul 2025 (modified: 06 Sept 2025)
ACL ARR 2025 July Submission
Readers:
Everyone
Ready to Translate, Not to Represent? Bias and Performance Gaps in Multilingual LLMs Across Language Families and Domains
ACL ARR 2025 July Submission836 Authors
28 Jul 2025 (modified: 20 Aug 2025)
ACL ARR 2025 July Submission
Readers:
Everyone
ReGATE: Learning Faster and Better with Fewer Tokens in MLLMs
ACL ARR 2025 July Submission835 Authors
28 Jul 2025 (modified: 01 Sept 2025)
ACL ARR 2025 July Submission
Readers:
Everyone
R2-KG: General-Purpose Dual-Agent Framework for Reliable Reasoning on Knowledge Graphs
ACL ARR 2025 July Submission833 Authors
28 Jul 2025 (modified: 01 Sept 2025)
ACL ARR 2025 July Submission
Readers:
Everyone
Scaling Down, Powering Up: RLHF-Enhanced Small LLMs for Healthcare Misinformation Detection
ACL ARR 2025 July Submission831 Authors
28 Jul 2025 (modified: 07 Sept 2025)
ACL ARR 2025 July Submission
Readers:
Everyone
Seeing and Solving: An Interpreter-Solver Framework for Geometric Reasoning with Large Vision and Language Models
ACL ARR 2025 July Submission825 Authors
28 Jul 2025 (modified: 03 Sept 2025)
ACL ARR 2025 July Submission
Readers:
Everyone
Task Matters: Knowledge Requirements Shape LLM Responses to Context–Memory Conflict
ACL ARR 2025 July Submission819 Authors
28 Jul 2025 (modified: 02 Sept 2025)
ACL ARR 2025 July Submission
Readers:
Everyone
LayerNorm vs RMSNorm: Geometric Perspective and a Case Against Mean Subtraction
ACL ARR 2025 July Submission818 Authors
28 Jul 2025 (modified: 30 Aug 2025)
ACL ARR 2025 July Submission
Readers:
Everyone
RoD-TAL: A Benchmark for Answering Questions in Romanian Driving License Exams
ACL ARR 2025 July Submission817 Authors
28 Jul 2025 (modified: 20 Aug 2025)
ACL ARR 2025 July Submission
Readers:
Everyone
Talk Before You Retrieve: Agent-Led Discussions for Better RAG in Medical QA
ACL ARR 2025 July Submission816 Authors
28 Jul 2025 (modified: 19 Aug 2025)
ACL ARR 2025 July Submission
Readers:
Everyone
The Rarity Blind Spot: A Framework for Evaluating Statistical Reasoning in LLMs
ACL ARR 2025 July Submission815 Authors
28 Jul 2025 (modified: 27 Aug 2025)
ACL ARR 2025 July Submission
Readers:
Everyone
What Would You Ask When You First Saw $a^2+b^2=c^2$? Evaluating LLM on Curiosity-Driven Question Generation
ACL ARR 2025 July Submission809 Authors
28 Jul 2025 (modified: 05 Sept 2025)
ACL ARR 2025 July Submission
Readers:
Everyone
Identifying the Achilles' Heel: An Iterative Method for Uncovering Factual Errors in Large Language Models
ACL ARR 2025 July Submission805 Authors
28 Jul 2025 (modified: 21 Aug 2025)
ACL ARR 2025 July Submission
Readers:
Everyone
Atomic Calibration of LLMs in Long-Form Generations
ACL ARR 2025 July Submission801 Authors
28 Jul 2025 (modified: 22 Aug 2025)
ACL ARR 2025 July Submission
Readers:
Everyone
BhashaSetu: Cross-Lingual Knowledge Transfer from High-Resource to Low-Resource Language
ACL ARR 2025 July Submission799 Authors
28 Jul 2025 (modified: 20 Aug 2025)
ACL ARR 2025 July Submission
Readers:
Everyone
How Should We Build A Benchmark? Revisiting 274 Code-Related Benchmarks For LLMs
ACL ARR 2025 July Submission798 Authors
28 Jul 2025 (modified: 03 Sept 2025)
ACL ARR 2025 July Submission
Readers:
Everyone
«
‹
3
4
5
6
7
8
9
10
11
12
›
»