ReaGeo: Reasoning-Enhanced End-to-End Geocoding with LLMs

ReaGeo: Reasoning-Enhanced End-to-End Geocoding with LLMs

ACL ARR 2026 January Submission9929 Authors

06 Jan 2026 (modified: 20 Mar 2026)ACL ARR 2026 January SubmissionEveryoneRevisionsBibTeXCC BY 4.0

Keywords: Geocoding, End-to-End, Large Language Models, Reinforcement Learning, Chain-of-Thought

Abstract: This paper proposes ReaGeo, an end-to-end geocoding framework based on large language models, designed to overcome the limitations of traditional multi-stage approaches that rely on text or vector similarity retrieval over geographic databases, including workflow complexity, error propagation, and heavy dependence on structured geographic knowledge bases. The method converts geographic coordinates into geohash sequences, reformulating the coordinate prediction task as a text generation problem, and introduces a Chain-of-Thought mechanism to enhance the model’s reasoning over spatial relationships. Furthermore, reinforcement learning with a distance-deviation-based reward is applied to optimize the generation accuracy. Comprehensive experiments show that ReaGeo can accurately handle explicit address queries in single-point predictions and effectively resolve vague relative location queries. In addition, the model demonstrates strong predictive capability for non-point geometric regions, highlighting its versatility and generalization ability in geocoding tasks.

Paper Type: Long

Research Area: Language Models

Research Area Keywords: Applications, Chain-of-Thought, Fine-Tuning, Prompting

Languages Studied: English

Submission Number: 9929

Loading