Toggle navigation
OpenReview
.net
Login
×
Back to
NeurIPS
NeurIPS 2023 Track Datasets and Benchmarks Submissions
Evaluating Open-QA Evaluation
Cunxiang Wang
,
Sirui Cheng
,
Qipeng Guo
,
Yuanhao Yue
,
Bowen Ding
,
Zhikun Xu
,
Yidong Wang
,
Xiangkun Hu
,
Zheng Zhang
,
Yue Zhang
Published: 26 Sept 2023, Last Modified: 15 Jan 2024
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
Estimating Generic 3D Room Structures from 2D Annotations
Denys Rozumnyi
,
Stefan Popov
,
Kevis-kokitsi Maninis
,
Matthias Nießner
,
Vittorio Ferrari
Published: 26 Sept 2023, Last Modified: 02 Nov 2023
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
Visual Abductive Reasoning Meets Driving Hazard Prediction: Problem Formulation and Dataset
Korawat Charoenpitaks
,
Van-Quang Nguyen
,
Masanori Suganuma
,
Masahiro Takahashi
,
Ryoma Niihara
,
Takayuki Okatani
31 May 2023 (modified: 12 Dec 2023)
Submitted to NeurIPS 2023 Datasets and Benchmarks
Readers:
Everyone
Symmetry-Informed Geometric Representation for Molecules, Proteins, and Crystalline Materials
Shengchao Liu
,
weitao Du
,
Yanjing Li
,
Zhuoxinran Li
,
Zhiling Zheng
,
Chenru Duan
,
Zhi-Ming Ma
,
Omar M. Yaghi
,
Anima Anandkumar
,
Christian Borgs
,
Jennifer T Chayes
,
Hongyu Guo
,
Jian Tang
Published: 26 Sept 2023, Last Modified: 02 Nov 2023
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
LoRA: A Logical Reasoning Augmented Dataset for Visual Question Answering
Jingying Gao
,
Qi Wu
,
Alan Blair
,
Maurice Pagnucco
Published: 26 Sept 2023, Last Modified: 05 Jan 2024
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
ADGym: Design Choices for Deep Anomaly Detection
Minqi Jiang
,
Chaochuan Hou
,
Ao Zheng
,
Songqiao Han
,
Hailiang Huang
,
Qingsong Wen
,
Xiyang Hu
,
Yue Zhao
Published: 26 Sept 2023, Last Modified: 06 Jan 2024
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
Digital Typhoon: Long-term Satellite Image Dataset for the Spatio-Temporal Modeling of Tropical Cyclones
Asanobu Kitamoto
,
Jared Hwang
,
Bastien Vuillod
,
Lucas Gautier
,
Yingtao Tian
,
Tarin Clanuwat
Published: 26 Sept 2023, Last Modified: 02 Nov 2023
NeurIPS 2023 Datasets and Benchmarks Spotlight
Readers:
Everyone
LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios
Yazhe Niu
,
Yuan Pu
,
Zhenjie Yang
,
Xueyan Li
,
Tong Zhou
,
Jiyuan Ren
,
Shuai Hu
,
Hongsheng Li
,
Yu Liu
Published: 26 Sept 2023, Last Modified: 02 Nov 2023
NeurIPS 2023 Datasets and Benchmarks Spotlight
Readers:
Everyone
The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data Only
Guilherme Penedo
,
Quentin Malartic
,
Daniel Hesslow
,
Ruxandra Cojocaru
,
Hamza Alobeidli
,
Alessandro Cappelli
,
Baptiste Pannier
,
Ebtesam Almazrouei
,
Julien Launay
Published: 26 Sept 2023, Last Modified: 02 Nov 2023
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
VisoGender: A dataset for benchmarking gender bias in image-text pronoun resolution
Siobhan Mackenzie Hall
,
Fernanda Gonçalves Abrantes
,
Hanwen Zhu
,
Grace Sodunke
,
Aleksandar Shtedritski
,
Hannah Rose Kirk
Published: 26 Sept 2023, Last Modified: 02 Nov 2023
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
AVeriTeC: A Dataset for Real-world Claim Verification with Evidence from the Web
NeurIPS 2023 Track Datasets and Benchmarks Submission468 Authors
Published: 26 Sept 2023, Last Modified: 02 Feb 2024
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
SiT Dataset: Socially Interactive Pedestrian Trajectory Dataset for Social Navigation Robots
Jongwook Bae
,
Jungho Kim
,
Junyong Yun
,
Changwon Kang
,
Jeongseon Choi
,
Chanhyeok Kim
,
Junho Lee
,
Jungwook Choi
,
Jun Won Choi
Published: 26 Sept 2023, Last Modified: 16 Jan 2024
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
LithoBench: Benchmarking AI Computational Lithography for Semiconductor Manufacturing
Su Zheng
,
Haoyu Yang
,
Binwu Zhu
,
Bei Yu
,
Martin D.F. Wong
Published: 26 Sept 2023, Last Modified: 02 Nov 2023
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
Lo-Hi: Practical ML Drug Discovery Benchmark
Simon Steshin
Published: 26 Sept 2023, Last Modified: 02 Nov 2023
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
Evaluating Self-Supervised Learning for Molecular Graph Embeddings
Hanchen Wang
,
Jean Kaddour
,
Shengchao Liu
,
Jian Tang
,
Joan Lasenby
,
Qi Liu
Published: 26 Sept 2023, Last Modified: 02 Nov 2023
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
BeaverTails: Towards Improved Safety Alignment of LLM via a Human-Preference Dataset
Jiaming Ji
,
Mickel Liu
,
Juntao Dai
,
Xuehai Pan
,
Chi Zhang
,
Ce Bian
,
Boyuan Chen
,
Ruiyang Sun
,
Yizhou Wang
,
Yaodong Yang
Published: 26 Sept 2023, Last Modified: 02 Nov 2023
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
Revealing the unseen: Benchmarking video action recognition under occlusion
Shresth Grover
,
Vibhav Vineet
,
Yogesh S Rawat
Published: 26 Sept 2023, Last Modified: 02 Nov 2023
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
CMMA: Benchmarking Multi-Affection Detection in Chinese Multi-Modal Conversations
Yazhou Zhang
,
Yang Yu
,
Qing Guo
,
Benyou Wang
,
Dongming Zhao
,
Sagar Uprety
,
Dawei Song
,
Qiuchi Li
,
Jing Qin
Published: 26 Sept 2023, Last Modified: 30 Dec 2023
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs
Jinyang Li
,
Binyuan Hui
,
GE QU
,
Jiaxi Yang
,
Binhua Li
,
Bowen Li
,
Bailin Wang
,
Bowen Qin
,
Ruiying Geng
,
Nan Huo
,
Xuanhe Zhou
,
Chenhao Ma
,
Guoliang Li
,
Kevin Chang
,
Fei Huang
,
Reynold Cheng
,
Yongbin Li
Published: 26 Sept 2023, Last Modified: 02 Nov 2023
NeurIPS 2023 Datasets and Benchmarks Spotlight
Readers:
Everyone
Understanding Social Reasoning in Language Models with Language Models
Kanishk Gandhi
,
Jan-Philipp Fränken
,
Tobias Gerstenberg
,
Noah Goodman
Published: 26 Sept 2023, Last Modified: 14 Jan 2024
NeurIPS 2023 Datasets and Benchmarks Spotlight
Readers:
Everyone
Rectifying Open-Set Object Detection: Proper Evaluation and a Taxonomy
Yusuke Hosoya
,
Masanori Suganuma
,
Takayuki Okatani
31 May 2023 (modified: 12 Dec 2023)
Submitted to NeurIPS 2023 Datasets and Benchmarks
Readers:
Everyone
Tartarus: A Benchmarking Platform for Realistic And Practical Inverse Molecular Design
AkshatKumar Nigam
,
Robert Pollice
,
Gary Tom
,
Kjell Jorner
,
John Willes
,
Luca Thiede
,
Anshul Kundaje
,
Alan Aspuru-Guzik
Published: 26 Sept 2023, Last Modified: 02 Nov 2023
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
Does Continual Learning Meet Compositionality? New Benchmarks and An Evaluation Framework
Weiduo Liao
,
Ying Wei
,
Mingchen Jiang
,
Qingfu Zhang
,
Hisao Ishibuchi
Published: 26 Sept 2023, Last Modified: 02 Nov 2023
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models
Peng Xu
,
Wenqi Shao
,
Kaipeng Zhang
,
Peng Gao
,
Shuo Liu
,
Fanqing Meng
,
Siyuan Huang
,
Meng Lei
,
Ping Luo
,
Yu Qiao
31 May 2023 (modified: 12 Dec 2023)
Submitted to NeurIPS 2023 Datasets and Benchmarks
Readers:
Everyone
Hyper-Skin: A Hyperspectral Dataset for Reconstructing Facial Skin-Spectra from RGB Images
Pai Chet Ng
,
Zhixiang Chi
,
Yannick Verdie
,
Juwei Lu
,
Konstantinos N Plataniotis
Published: 26 Sept 2023, Last Modified: 02 Nov 2023
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
«
‹
3
4
5
6
7
8
9
10
11
12
›
»