Comparative ranking of marginal confounding impact of natural language processing-derived versus structured features in pharmacoepidemiology
Abstract: Highlights•Unstructured clinical notes contain potential confounder information.•Natural language processing can auto-generate features from notes.•We ranked the marginal confounding impact of features on estimated causal effects.•Prespecified knowledge and claims data dominated the top 25 ranked features.•39 % of the top 100 ranked features were natural language processing generated.
External IDs:dblp:journals/cbm/PlasekWWYDTNBLZ25
Loading