Rethinking Input Methods As Back-Transliteration and the Quest for Linguistic Equity

ACL ARR 2026 January Submission10029 Authors

06 Jan 2026 (modified: 20 Mar 2026)ACL ARR 2026 January SubmissionEveryoneRevisionsBibTeXCC BY 4.0
Keywords: input methods, language bias, large language model, back-transliteration
Abstract: Back-transliteration serves a dual purpose: processing informal Romanized text and powering phonetic Input Methods (IMEs). This survey bridges the fragmented research landscape by introducing a "maturity spectrum" that contrasts the sophisticated engines of CJK languages with the foundational models of Arabic and Indic scripts. We systematically analyze the convergent technological evolution—from statistical models to Large Language Models (LLMs). We demonstrate that despite the rise of LLMs, specialized back-transliteration remains essential for latency-sensitive input. The paper concludes with a technical roadmap for building the next generation of universal input systems.
Paper Type: Long
Research Area: Ethics, Bias, and Fairness
Research Area Keywords: Ethics, Bias, and Fairness
Contribution Types: Surveys
Languages Studied: English
Submission Number: 10029
Loading