Abstract: Highlights•An approach for LM creation combining syntactical and statistical analysis of training texts.•A combined knowledge-based statistical phoneme set selection method for obtaining an optimal set for ASR.•Results of the experiments on Russian ASR with a large vocabulary over 200 K words.
Loading