Toggle navigation
OpenReview
.net
Login
×
Back to
NeurIPS
NeurIPS 2023 Track Datasets and Benchmarks Submissions
How hard are computer vision datasets? Calibrating dataset difficulty to viewing time
David Mayo
,
Jesse Cummings
,
Xinyu Lin
,
Dan Gutfreund
,
Boris Katz
,
Andrei Barbu
Published: 26 Sept 2023, Last Modified: 16 Jan 2024
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
M3Exam: A Multilingual, Multimodal, Multilevel Benchmark for Examining Large Language Models
Wenxuan Zhang
,
Mahani Aljunied
,
Chang Gao
,
Yew Ken Chia
,
Lidong Bing
Published: 26 Sept 2023, Last Modified: 02 Nov 2023
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
RaLEs: a Benchmark for Radiology Language Evaluations
Juan Manuel Zambrano Chaves
,
Nandita Bhaskhar
,
Maayane Attias
,
Jean-Benoit Delbrouck
,
Daniel Rubin
,
Andreas Markus Loening
,
Curtis Langlotz
,
Akshay S Chaudhari
Published: 26 Sept 2023, Last Modified: 02 Nov 2023
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
M5HisDoc: A Large-scale Multi-style Chinese Historical Document Analysis Benchmark
Yongxin Shi
,
Chongyu Liu
,
Dezhi Peng
,
Cheng Jian
,
Jiarong Huang
,
Lianwen Jin
Published: 26 Sept 2023, Last Modified: 02 Nov 2023
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
A Performance-Driven Benchmark for Feature Selection in Tabular Deep Learning
Valeriia Cherepanova
,
Roman Levin
,
Gowthami Somepalli
,
Jonas Geiping
,
C. Bayan Bruss
,
Andrew Gordon Wilson
,
Tom Goldstein
,
Micah Goldblum
Published: 26 Sept 2023, Last Modified: 15 Jan 2024
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
LogicBench: A Benchmark for Evaluation of Logical Reasoning
Mihir Parmar
,
Neeraj Varshney
,
Nisarg Patel
,
Santosh Mashetty
,
Man Luo
,
Arindam Mitra
,
Chitta Baral
01 Jun 2023 (modified: 12 Dec 2023)
Submitted to NeurIPS 2023 Datasets and Benchmarks
Readers:
Everyone
LOVM: Language-Only Vision Model Selection
Orr Zohar
,
Shih-Cheng Huang
,
Kuan-Chieh Wang
,
Serena Yeung
Published: 26 Sept 2023, Last Modified: 02 Nov 2023
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
TpuGraphs: A Performance Prediction Dataset on Large Tensor Computational Graphs
Phitchaya Mangpo Phothilimthana
,
Sami Abu-El-Haija
,
Kaidi Cao
,
Bahare Fatemi
,
Michael Burrows
,
Charith Mendis
,
Bryan Perozzi
Published: 26 Sept 2023, Last Modified: 27 Dec 2023
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
Holistic Evaluation of Text-to-Image Models
Tony Lee
,
Michihiro Yasunaga
,
Chenlin Meng
,
Yifan Mai
,
Joon Sung Park
,
Agrim Gupta
,
Yunzhi Zhang
,
Deepak Narayanan
,
Hannah Benita Teufel
,
Marco Bellagente
,
Minguk Kang
,
Taesung Park
,
Jure Leskovec
,
Jun-Yan Zhu
,
Li Fei-Fei
,
Jiajun Wu
,
Stefano Ermon
,
Percy Liang
Published: 26 Sept 2023, Last Modified: 02 Nov 2023
NeurIPS 2023 Datasets and Benchmarks Spotlight
Readers:
Everyone
ToolQA: A Dataset for LLM Question Answering with External Tools
Yuchen Zhuang
,
Yue Yu
,
Kuan Wang
,
Haotian Sun
,
Chao Zhang
Published: 26 Sept 2023, Last Modified: 02 Nov 2023
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
ChessGPT: Bridging Policy Learning and Language Modeling
Xidong Feng
,
Yicheng Luo
,
Ziyan Wang
,
Hongrui Tang
,
Mengyue Yang
,
Kun Shao
,
David Henry Mguni
,
Yali Du
,
Jun Wang
Published: 26 Sept 2023, Last Modified: 02 Nov 2023
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
Live Graph Lab: Towards Open, Dynamic and Real Transaction Graphs with NFT
Zhen Zhang
,
Bingqiao Luo
,
Shengliang Lu
,
Bingsheng He
Published: 26 Sept 2023, Last Modified: 02 Nov 2023
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
BubbleML: A Multiphase Multiphysics Dataset and Benchmarks for Machine Learning
Sheikh Md Shakeel Hassan
,
Arthur Feeney
,
Akash Dhruv
,
Jihoon Kim
,
Youngjoon Suh
,
Jaiyoung Ryu
,
Yoonjin Won
,
Aparna Chandramowlishwaran
Published: 26 Sept 2023, Last Modified: 15 Nov 2023
NeurIPS 2023 Datasets and Benchmarks Spotlight
Readers:
Everyone
Datasets and Benchmarks for Nanophotonic Structure and Parametric Design Simulations
Jungtaek Kim
,
Mingxuan Li
,
Oliver Hinder
,
Paul Leu
Published: 26 Sept 2023, Last Modified: 02 Nov 2023
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation
Yuanxin Liu
,
Lei Li
,
Shuhuai Ren
,
Rundong Gao
,
Shicheng Li
,
Sishuo Chen
,
Xu Sun
,
Lu Hou
Published: 26 Sept 2023, Last Modified: 27 Dec 2023
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision Tasks
Micah Goldblum
,
Hossein Souri
,
Renkun Ni
,
Manli Shu
,
Viraj Uday Prabhu
,
Gowthami Somepalli
,
Prithvijit Chattopadhyay
,
Mark Ibrahim
,
Adrien Bardes
,
Judy Hoffman
,
Rama Chellappa
,
Andrew Gordon Wilson
,
Tom Goldstein
Published: 26 Sept 2023, Last Modified: 27 Dec 2023
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
INSPECT: A Multimodal Dataset for Pulmonary Embolism Diagnosis and Prognosis
Shih-Cheng Huang
,
Zepeng Huo
,
Ethan Steinberg
,
Chia-Chun Chiang
,
Matthew P. Lungren
,
Curtis Langlotz
,
Serena Yeung
,
Nigam Shah
,
Jason Alan Fries
Published: 26 Sept 2023, Last Modified: 13 Jan 2024
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
Scientific Document Retrieval using Multi-level Aspect-based Queries
Jianyou Wang
,
Kaicheng Wang
,
Xiaoyue Wang
,
Prudhviraj Naidu
,
Leon Bergen
,
Ramamohan Paturi
Published: 26 Sept 2023, Last Modified: 02 Nov 2023
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
DataPerf: Benchmarks for Data-Centric AI Development
Mark Mazumder
,
Colby Banbury
,
Xiaozhe Yao
,
Bojan Karlaš
,
William A Gaviria Rojas
,
Sudnya Diamos
,
Greg Diamos
,
Lynn He
,
Alicia Parrish
,
Hannah Rose Kirk
,
Jessica Quaye
,
Charvi Rastogi
,
Douwe Kiela
,
David Jurado
,
David Kanter
,
Rafael Mosquera
,
Will Cukierski
,
Juan Ciro
,
Lora Aroyo
,
Bilge Acun
et al. (27 additional authors not shown)
Published: 26 Sept 2023, Last Modified: 02 Nov 2023
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
On the Need for a Language Describing Distribution Shifts: Illustrations on Tabular Datasets
Jiashuo Liu
,
Tianyu Wang
,
Peng Cui
,
Hongseok Namkoong
Published: 26 Sept 2023, Last Modified: 02 Nov 2023
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
AVIDa-hIL6: A Large-Scale VHH Dataset Produced from an Immunized Alpaca for Predicting Antigen-Antibody Interactions
Hirofumi Tsuruta
,
Hiroyuki Yamazaki
,
Ryota Maeda
,
Ryotaro Tamura
,
Jennifer N. Wei
,
Zelda E Mariet
,
Poomarin Phloyphisut
,
Hidetoshi Shimokawa
,
Joseph R. Ledsam
,
Lucy J Colwell
,
Akihiro Imura
Published: 26 Sept 2023, Last Modified: 02 Nov 2023
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
Improving multimodal datasets with image captioning
Thao Nguyen
,
Samir Yitzhak Gadre
,
Gabriel Ilharco
,
Sewoong Oh
,
Ludwig Schmidt
Published: 26 Sept 2023, Last Modified: 02 Nov 2023
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
Evaluating Graph Neural Networks for Link Prediction: Current Pitfalls and New Benchmarking
Juanhui Li
,
Harry Shomer
,
Haitao Mao
,
Shenglai Zeng
,
Yao Ma
,
Neil Shah
,
Jiliang Tang
,
Dawei Yin
Published: 26 Sept 2023, Last Modified: 02 Nov 2023
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
GeoDE: a Geographically Diverse Evaluation Dataset for Object Recognition
Vikram V. Ramaswamy
,
Sing Yu Lin
,
Dora Zhao
,
Aaron Bryan Adcock
,
Laurens van der Maaten
,
Deepti Ghadiyaram
,
Olga Russakovsky
Published: 26 Sept 2023, Last Modified: 02 Nov 2023
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
OpenAGI: When LLM Meets Domain Experts
NeurIPS 2023 Track Datasets and Benchmarks Submission625 Authors
Published: 26 Sept 2023, Last Modified: 02 Feb 2024
NeurIPS 2023 Datasets and Benchmarks Poster
Readers:
Everyone
«
‹
1
2
3
4
5
6
7
8
9
10
›
»