The Tournesol dataset: Which videos should be more largely recommended?

10 May 2025 (modified: 30 Oct 2025)Submitted to NeurIPS 2025 Datasets and Benchmarks TrackEveryoneRevisionsBibTeXCC BY 4.0
Keywords: Recommendation, Ethics, Preferences, Human
TL;DR: We publish a dataset of human judgments on which videos should be more largely recommended by algorithms.
Abstract: This paper introduces the Tournesol public dataset, which was collected as part of the online deployed platform https://tournesol.app. Our dataset contains a list of 1,116,318 comparative judgments of YouTube videos by 8,804 users of the Tournesol platform. 263,668 of these judgments were about which video should be more largely recommended, while the remaining evaluate secondary criteria like content reliability, topic importance and layman-friendliness. The dataset also exports information about users’ pretrust statuses and vouches. It is published at https://api.tournesol.app/exports/all under ODC-By license. The data is currently used by Tournesol to make community-driven video content recommendations to over 6,000 users.
Croissant File: json
Dataset URL: https://api.tournesol.app/exports/all
Supplementary Material: zip
Primary Area: AL/ML Datasets & Benchmarks for social sciences (e.g. climate, health, life sciences, physics, social sciences)
Submission Number: 1405
Loading