NASH: Numerically Aware Scoring Heuristic for Robust Semantic Similarity

Published: 01 Jul 2026, Last Modified: 24 Apr 2026ACL 2026 FindingsEveryoneCC BY 4.0
Abstract: Numerical precision is critical in financial NLP, yet embedding-based semantic similarity metrics exhibit numerical blindness---failing to distinguish contradictory values within similar contexts. We introduce NASH (Numerically Aware Scoring Hueristic), a model-agnostic metric that decouples numerical verification from textual semantic evaluation through a three-stage pipeline: (1) modal separation via numeric masking, (2) dual-channel similarity estimation through masked-text similarity and context-aware numeric alignment, and (3) IDF-weighted aggregation. NASH functions as a drop-in enhancement to existing embedding-based metrics. Validated on our proposed NumFinE financial numerical evaluation benchmark and established semantic similarity datasets (STS-B, Financial-STS), NASH achieves substantial improvements in numerical sensitivity (up to +159.6% on listwise ranking) while preserving general semantic performance, establishing a reliable standard for numeracy-aware evaluation.
Loading