TV100: a TV series dataset that pre-trained CLIP has not seen

Published: 01 Jan 2024, Last Modified: 13 Nov 2024Frontiers Comput. Sci. 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: The era of pre-trained models has ushered in a wealth of new insights for the machine learning community. Among the myriad of questions that arise, one of paramount importance is: ‘Do pre-trained models possess comprehensive knowledge?’ This paper seeks to address this crucial inquiry. In line with our objective, we have made publicly available a novel dataset comprised of images from TV series released post-2021. This dataset holds significant potential for use in various research areas, including the evaluation of novel class iscovery and long-tailed learning, among others.
Loading