Validating the web-based evaluation of NLG systems

2009 (modified: 13 Nov 2022), ACL/IJCNLP (Short Papers) 2009. Readers: Everyone
Abstract: The GIVE Challenge is a recent shared task in which NLG systems are evaluated over the Internet. In this paper, we validate this novel NLG evaluation methodology by comparing the Internet-based results with results we collected in a lab experiment. We find that the results delivered by both methods are consistent, but the Internet-based approach offers the statistical power necessary for more fine-grained evaluations and is cheaper to carry out.