ACL ARR 2024 December Submissions
Entering Real Social World! Benchmarking the Social Intelligence of Large Language Models from a First-person Perspective
ACL ARR 2024 December Submission2230 Authors
16 Dec 2024 (modified: 05 Feb 2025)
ACL ARR 2024 December Submission
Readers:
Everyone
Evaluating the Long-Term Memory of Large Language Models
ACL ARR 2024 December Submission2229 Authors
16 Dec 2024 (modified: 05 Feb 2025)
ACL ARR 2024 December Submission
Readers:
Everyone
NDP: Next Distribution Prediction as a More Broad Target
ACL ARR 2024 December Submission2228 Authors
16 Dec 2024 (modified: 05 Feb 2025)
ACL ARR 2024 December Submission
Readers:
Everyone
DISC: Plug-and-Play Decoding Intervention with Similarity of Characters for Chinese Spelling Check
ACL ARR 2024 December Submission2226 Authors
16 Dec 2024 (modified: 05 Feb 2025)
ACL ARR 2024 December Submission
Readers:
Everyone
Benchmarking Multimodal Idiomaticity: Tasks and Methods for Idiomatic Language Understanding in Text and Images
ACL ARR 2024 December Submission2220 Authors
16 Dec 2024 (modified: 05 Feb 2025)
ACL ARR 2024 December Submission
Readers:
Everyone
Spatial Layouts in News Homepages Capture Human Preferences
ACL ARR 2024 December Submission2219 Authors
16 Dec 2024 (modified: 05 Feb 2025)
ACL ARR 2024 December Submission
Readers:
Everyone
Automating Legal Concept Interpretation with LLMs: Retrieval, Generation, and Evaluation
ACL ARR 2024 December Submission2211 Authors
16 Dec 2024 (modified: 05 Feb 2025)
ACL ARR 2024 December Submission
Readers:
Everyone
Self-Resolve: Resolving Consistent Reasoning Structures for Fact Verification
ACL ARR 2024 December Submission2194 Authors
16 Dec 2024 (modified: 05 Feb 2025)
ACL ARR 2024 December Submission
Readers:
Everyone
Structure-aware Domain Knowledge Injection for Large Language Models
ACL ARR 2024 December Submission2191 Authors
16 Dec 2024 (modified: 05 Feb 2025)
ACL ARR 2024 December Submission
Readers:
Everyone
Tracing and Dissecting How LLMs Recall Factual Knowledge for Real World Questions
ACL ARR 2024 December Submission2187 Authors
16 Dec 2024 (modified: 05 Feb 2025)
ACL ARR 2024 December Submission
Readers:
Everyone
Verify with Caution: The Pitfalls of Relying on Imperfect Factuality Metrics
ACL ARR 2024 December Submission2183 Authors
16 Dec 2024 (modified: 22 Feb 2025)
ACL ARR 2024 December Submission
Readers:
Everyone
A Novel Multi-Document Retrieval Benchmark: Journalist Source-Selection in Newswriting
ACL ARR 2024 December Submission2176 Authors
16 Dec 2024 (modified: 05 Feb 2025)
ACL ARR 2024 December Submission
Readers:
Everyone
Activating and Probing: Deep Detection of Jailbreaking Prompts in Large Language Models
ACL ARR 2024 December Submission2171 Authors
16 Dec 2024 (modified: 05 Feb 2025)
ACL ARR 2024 December Submission
Readers:
Everyone
Unconstrained Model Fusion for Enhanced LLM Reasoning
ACL ARR 2024 December Submission2162 Authors
16 Dec 2024 (modified: 05 Feb 2025)
ACL ARR 2024 December Submission
Readers:
Everyone
Linguini: A benchmark for language-agnostic linguistic reasoning
ACL ARR 2024 December Submission2154 Authors
16 Dec 2024 (modified: 05 Feb 2025)
ACL ARR 2024 December Submission
Readers:
Everyone
SafeAligner: Safety Alignment against Jailbreak Attacks via Response Disparity Guidance
ACL ARR 2024 December Submission2150 Authors
16 Dec 2024 (modified: 05 Feb 2025)
ACL ARR 2024 December Submission
Readers:
Everyone
HALLUCINOGEN: A Benchmark for Evaluating Object Hallucination in Large Visual-Language Models
ACL ARR 2024 December Submission2145 Authors
16 Dec 2024 (modified: 05 Feb 2025)
ACL ARR 2024 December Submission
Readers:
Everyone
Mitigating Hallucination in Multimodal Large Language Model via Hallucination-targeted Direct Preference Optimization
ACL ARR 2024 December Submission2144 Authors
16 Dec 2024 (modified: 05 Feb 2025)
ACL ARR 2024 December Submission
Readers:
Everyone
SAGE: A Generic Framework for LLM Safety Evaluation
ACL ARR 2024 December Submission2143 Authors
16 Dec 2024 (modified: 05 Feb 2025)
ACL ARR 2024 December Submission
Readers:
Everyone
NewsInterview: a Dataset and a Playground to Evaluate LLMs' Grounding Gap via Informational Interviews
ACL ARR 2024 December Submission2142 Authors
16 Dec 2024 (modified: 05 Feb 2025)
ACL ARR 2024 December Submission
Readers:
Everyone
From Induction to Deduction: Hierarchical Rule Learning for LLM Reasoning Tasks
ACL ARR 2024 December Submission2132 Authors
16 Dec 2024 (modified: 05 Feb 2025)
ACL ARR 2024 December Submission
Readers:
Everyone
BattleAgentBench: A Benchmark for Evaluating Cooperation and Competition Capabilities of Language Models in Multi-Agent Systems
ACL ARR 2024 December Submission2129 Authors
16 Dec 2024 (modified: 05 Feb 2025)
ACL ARR 2024 December Submission
Readers:
Everyone
Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models
ACL ARR 2024 December Submission2128 Authors
16 Dec 2024 (modified: 05 Feb 2025)
ACL ARR 2024 December Submission
Readers:
Everyone
Making Sense of Korean Sentences: A Comprehensive Evaluation of LLMs through KoSEnd Dataset
ACL ARR 2024 December Submission2127 Authors
16 Dec 2024 (modified: 05 Feb 2025)
ACL ARR 2024 December Submission
Readers:
Everyone
Waste-Bench: A Comprehensive Benchmark for Evaluating VLLMs in Cluttered Environments
ACL ARR 2024 December Submission2125 Authors
16 Dec 2024 (modified: 15 Feb 2025)
ACL ARR 2024 December Submission
Readers:
Everyone