Focused Evaluation for Image Description with Binary Forced-Choice Tasks

2016 (modified: 16 Jul 2019)
VL@ACL 2016
Readers: Everyone
Abstract: Current evaluation metrics for image description may be too coarse. We therefore propose a series of binary forced-choice tasks that each focus on a different aspect of the captions. We evaluate a number of different off-the-shelf image description systems. Our results indicate strengths and shortcomings of both generation-based and ranking-based approaches.
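
To make the idea of a binary forced-choice task concrete, the following is a minimal sketch and not the authors' implementation. It assumes each test item pairs an image with one correct caption and one distractor, and that a system exposes some scoring function for (image, caption) pairs. The names `Item`, `Scorer`, and `forced_choice_accuracy`, as well as the choice to count ties as errors, are assumptions made for illustration.

```python
from typing import Callable, Iterable, Tuple

# Hypothetical item layout: (image_id, correct_caption, distractor_caption).
Item = Tuple[str, str, str]
# Hypothetical scorer: maps (image_id, caption) to a score, higher is better.
Scorer = Callable[[str, str], float]


def forced_choice_accuracy(items: Iterable[Item], score: Scorer) -> float:
    """Return the fraction of items where the correct caption outscores the distractor.

    Ties are counted as errors; this is an assumption of the sketch.
    """
    total = 0
    correct = 0
    for image_id, gold, distractor in items:
        total += 1
        if score(image_id, gold) > score(image_id, distractor):
            correct += 1
    if total == 0:
        raise ValueError("no items to evaluate")
    return correct / total
```

Under these assumptions, a separate item set could be built for each aspect of the captions, and the same function would then report one accuracy per task for each system.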