Toggle navigation
OpenReview
.net
Login
×
Back to
NeurIPS
NeurIPS 2024 Workshop LanGame Submissions
Beyond Benchmarking: Automated Capability Discovery via Model Self-Exploration
Cong Lu
,
Shengran Hu
,
Jeff Clune
Published: 30 Oct 2024, Last Modified: 13 Dec 2024
LanGame Poster
Readers:
Everyone
S2L-RM: Short-to-Long Reward Modeling
Changyu Chen
,
Zichen Liu
,
Haonan Wang
,
Chao Du
,
Tianyu Pang
,
Qian Liu
,
Arunesh Sinha
,
Pradeep Varakantham
,
Min Lin
Published: 30 Oct 2024, Last Modified: 13 Dec 2024
LanGame Poster
Readers:
Everyone
AidanBench: Evaluating Novel Idea Generation on Open-Ended Questions
Aidan McLaughlin
,
Anuja Uppuluri
,
James Campbell
Published: 30 Oct 2024, Last Modified: 13 Dec 2024
LanGame Poster
Readers:
Everyone
LlaMa meets Cheburashka: impact of cultural background for LLM quiz reasoning
Mikhail Lifar
,
Bogdan Protsenko
,
Daniil Kupriianenko
,
Nazar Chubkov
,
Kulaev Kirill Dmitrievich
,
Alexander Guda
,
Irina Piontkovskaya
Published: 30 Oct 2024, Last Modified: 13 Dec 2024
LanGame Poster
Readers:
Everyone
Sample-Efficient Alignment for LLMs
Zichen Liu
,
Changyu Chen
,
Chao Du
,
Wee Sun Lee
,
Min Lin
Published: 30 Oct 2024, Last Modified: 13 Dec 2024
LanGame Poster
Readers:
Everyone
PIANIST: Learning Partially Observable World Models with LLMs for Multi-Agent Decision Making
Jonathan Light
,
Sixue Xing
,
Yuanzhe Liu
,
Weiqin Chen
,
Min Cai
,
Xiusi Chen
,
Guanzhi Wang
,
Wei Cheng
,
Yisong Yue
,
Ziniu Hu
Published: 30 Oct 2024, Last Modified: 13 Dec 2024
LanGame Poster
Readers:
Everyone
Efficacy of Language Model Self-Play in Non-Zero-Sum Games
Austen Liao
,
Nicholas Tomlin
,
Dan Klein
Published: 30 Oct 2024, Last Modified: 13 Dec 2024
LanGame Spotlight
Readers:
Everyone
GameBench: Evaluating Strategic Reasoning Abilities of LLM Agents
Anthony Costarelli
,
Mat Allen
,
Roman Hauksson
,
Grace Sodunke
,
Suhas Hariharan
,
Carlson Cheng
,
Wenjie Li
,
Joshua M Clymer
,
Arjun Yadav
Published: 30 Oct 2024, Last Modified: 13 Dec 2024
LanGame Poster
Readers:
Everyone
Creativity Has Entered the Chat, With a Stranger: Novelty is a Nash Equilibrium
Kotaro Sakamoto
,
Shiro Takagi
,
Shuhei Ogawa
,
Yutaka Matsuo
Published: 30 Oct 2024, Last Modified: 13 Dec 2024
LanGame Poster
Readers:
Everyone
On Reward Functions For Self-Improving Chain-of-Thought Reasoning Without Supervised Datasets (Abridged Version)
Thomas Foster
,
Eltayeb Ahmed
,
Jonathan Cook
,
Shalev Lifshitz
,
Tim Rocktäschel
,
Jakob Nicolaus Foerster
Published: 30 Oct 2024, Last Modified: 13 Dec 2024
LanGame Poster
Readers:
Everyone
Situated Instruction Following Under Ambiguous Human Intent
So Yeon Min
,
Xavier Puig
,
Devendra Singh Chaplot
,
Tsung-Yen Yang
,
Akshara Rai
,
Priyam Parashar
,
Russ Salakhutdinov
,
Yonatan Bisk
,
Roozbeh Mottaghi
Published: 30 Oct 2024, Last Modified: 13 Dec 2024
LanGame Poster
Readers:
Everyone
Embodied-RAG: General Non-parametric Embodied Memory for Retrieval and Generation
Quanting Xie
,
So Yeon Min
,
Tianyi Zhang
,
Kedi Xu
,
Aarav Bajaj
,
Russ Salakhutdinov
,
Matthew Johnson-Roberson
,
Yonatan Bisk
Published: 30 Oct 2024, Last Modified: 13 Dec 2024
LanGame Poster
Readers:
Everyone
Embodied LLM Agents Learn to Cooperate in Organized Teams
Xudong Guo
,
Kaixuan Huang
,
Jiale Liu
,
Wenhui Fan
,
Natalia Vélez
,
Qingyun Wu
,
Huazheng Wang
,
Thomas L. Griffiths
,
Mengdi Wang
Published: 30 Oct 2024, Last Modified: 13 Dec 2024
LanGame Poster
Readers:
Everyone
OnThePlanning Abilities of OpenAI’s o1 Models: Feasibility, Optimality, and Generalizability
Kevin Wang
,
Junbo Li
,
Neel P. Bhatt
,
Yihan Xi
,
qiang liu
,
ufuk topcu
,
Zhangyang Wang
Published: 30 Oct 2024, Last Modified: 13 Dec 2024
LanGame Poster
Readers:
Everyone
Mimicking Human Emotions: Persona-Driven Behavior of LLMs in the ‘Buy and Sell’ Negotiation Game
mingyu jeon
,
Jae Young Suh
Published: 30 Oct 2024, Last Modified: 13 Dec 2024
LanGame Poster
Readers:
Everyone
TICKing All the Boxes: Generated Checklists Improve LLM Evaluation and Generation
Jonathan Cook
,
Tim Rocktäschel
,
Jakob Nicolaus Foerster
,
Dennis Aumiller
,
Alex Wang
Published: 30 Oct 2024, Last Modified: 13 Dec 2024
LanGame Poster
Readers:
Everyone
What Makes Your Model a Low-empathy or Warmth Person: Exploring the Oringins of Personality in LLMs
Shu Yang
,
Shenzhe Zhu
,
Liang Liu
,
Mengdi Li
,
Lijie Hu
,
Di Wang
Published: 30 Oct 2024, Last Modified: 13 Dec 2024
LanGame Poster
Readers:
Everyone
Games as Ontology Engines: AI and LLMs Invoke Spatiotemporal and Metaphysical Realities in Virtual Worlds
Jasmine Roberts
,
Andrzej Banburski
Published: 30 Oct 2024, Last Modified: 13 Dec 2024
LanGame Poster
Readers:
Everyone
Stutter Makes Smarter: Learning Self-Improvement for Large Language Models
Pei-Chen Ho
,
Meng-Hsi Chen
,
Alberto Bernacchia
,
Philipp Ennen
,
Yen-Chen Wu
,
Da-shan Shiu
Published: 30 Oct 2024, Last Modified: 13 Dec 2024
LanGame Poster
Readers:
Everyone
Evolving Alignment via Asymmetric Self-Play
Ziyu Ye
,
Rishabh Agarwal
,
Tianqi Liu
,
Rishabh Joshi
,
Sarmishta Velury
,
Quoc V Le
,
Qijun Tan
,
Yuan Liu
Published: 30 Oct 2024, Last Modified: 13 Dec 2024
LanGame Spotlight
Readers:
Everyone
Sharing Minds during MARL Training for Enhanced Cooperative LLM Agents
Jiaxuan Gao
,
Yule Wen
,
Chao Yu
,
Yi Wu
Published: 30 Oct 2024, Last Modified: 13 Dec 2024
LanGame Poster
Readers:
Everyone
Rethinking Mixture-of-Agents: Is Mixing Different Large Language Models Beneficial?
Wenzhe Li
,
Yong Lin
,
Mengzhou Xia
,
Chi Jin
Published: 30 Oct 2024, Last Modified: 13 Dec 2024
LanGame Poster
Readers:
Everyone
PokéChamp: an Expert-level Minimax Language Agent for Competitive Pokémon
Seth Karten
,
Andy Luu Nguyen
,
Chi Jin
Published: 30 Oct 2024, Last Modified: 13 Dec 2024
LanGame Poster
Readers:
Everyone
Communication via Shared Memory Improves Multi-agent Pathfinding
Alsu Sagirova
,
Yuri Kuratov
,
Mikhail Burtsev
Published: 30 Oct 2024, Last Modified: 13 Dec 2024
LanGame Poster
Readers:
Everyone
CoPS: Empowering LLM Agents with Provable Cross-Task Experience Sharing
Chen Yang
,
Chenyang Zhao
,
Quanquan Gu
,
Dongruo Zhou
Published: 30 Oct 2024, Last Modified: 13 Dec 2024
LanGame Poster
Readers:
Everyone
«
‹
1
2
›
»