A Bayesian Model for Learning SCFGs with Discontiguous RulesDownload PDFOpen Website

2012 (modified: 10 Nov 2022)EMNLP-CoNLL 2012Readers: Everyone
Abstract: We describe a nonparametric model and corresponding inference algorithm for learning Synchronous Context Free Grammar derivations for parallel text. The model employs a Pitman-Yor Process prior which uses a novel base distribution over synchronous grammar rules. Through both synthetic grammar induction and statistical machine translation experiments, we show that our model learns complex translational correspondences--- including discontiguous, many-to-many alignments---and produces competitive translation results. Further, inference is efficient and we present results on significantly larger corpora than prior work.
0 Replies

Loading