Preprocessing of natural language process variables using a data-driven method improves the association with suicide risk in a large veterans affairs population

Published: 2025, Last Modified: 08 May 2026Comput. Biol. Medicine 2025EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•AMC preprocessing improves associations between NLP variables and suicide risk.•Over 90 % of AMC-processed NLP variables are significantly associated with suicide.•AMC outperforms quantile categorization in whole and undersampled cohorts.•AMC refines risk modeling for suicide prevention in clinical settings.•AMC may enhance NLP-based suicide risk prediction in Veterans Affairs EHR notes.
Loading