Find the Cliffhanger: Multi-modal Trailerness in Soap Operas

Published: 01 Jan 2024, Last Modified: 20 Jun 2024MMM (2) 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Creating a trailer requires carefully picking out and piecing together brief enticing moments out of a longer video, making it a challenging and time-consuming task. This requires selecting moments based on both visual and dialogue information. We introduce a multi-modal method for predicting the trailerness to assist editors in selecting trailer-worthy moments from long-form videos. We present results on a newly introduced soap opera dataset, demonstrating that predicting trailerness is a challenging task that benefits from multi-modal information. Code is available at https://github.com/carlobretti/cliffhanger.
Loading