Beta-CoRM: A Bayesian approach for n-gram profiles analysis

Published: 01 Jan 2025, Last Modified: 25 Sept 2025Comput. Stat. Data Anal. 2025EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•We develop a feature allocation model for grouped data with binary attributes and demonstrate its use on n-gram data.•Show how the model can be estimated using a simple, exact Markov chain Monte Carlo method.•Introduce a post-hoc variable selection step which finds variable that maximally discriminate among groups.•The variable selection method leads to better out-of-sample classification accuracy in simulated and real data.
Loading