OpenReview
.net
OpenReview
.net
Login
OpenReview
.net
Login
Shuang Wu
Researcher, Tencent AI Lab
Joined
September 2017
Names
Shuang Wu
(Preferred)
,
shuang wu
Emails
****@mails.tsinghua.edu.cn
(Confirmed)
,
****@tencent.com
(Confirmed)
,
****@bytedance.com
(Confirmed)
Personal Links
Google Scholar
ORCID
Career & Education History
Researcher
Tencent AI Lab
(tencent.com)
2020
–
Present
PhD student
Tsinghua University, Tsinghua University
(tsinghua.edu.cn)
2015
–
2020
Undergrad student
Huazhong University of Science and Technology, Tsinghua University
(hust.edu.cn)
2011
–
2015
Advisors, Relations & Conflicts
PhD Advisor
Luping Shi
2015
–
2020
Expertise
reinforcement learning
2018
–
Present
deep learning
2015
–
Present
neuromorphic computing
2015
–
Present
Publications
Greedy when Sure and Conservative when Uncertain about the Opponents
Haobo Fu
,
Ye Tian
,
Hongxiang Yu
,
Weiming Liu
,
Shuang Wu
,
Jiechao Xiong
,
Ying Wen
,
Kai Li
,
Junliang Xing
,
QIANG FU
,
Yang Wei
ICML 2022 Spotlights
Readers:
Everyone
Enhance Reasoning for Large Language Models with Reinforcement Learning in the Game Werewolf
Shuang Wu
,
Liwen Zhu
,
Tao Yang
,
Shiweixu
,
QIANG FU
,
Yang Wei
,
Haobo Fu
Submitted to ICLR 2025
Readers:
Everyone
PreCo: Enhancing Generalization in Co-Design of Modular Soft Robots via Brain-Body Pre-Training
Yuxing Wang
,
Shuang Wu
,
Tiantian Zhang
,
Yongzhe Chang
,
Haobo Fu
,
QIANG FU
,
Xueqian Wang
Published: 30 Aug 2023, Last Modified: 17 Oct 2023
CoRL 2023 Oral
Readers:
Everyone
Curriculum-based Co-design of Morphology and Control of Voxel-based Soft Robots
Yuxing Wang
,
Shuang Wu
,
Haobo Fu
,
QIANG FU
,
Tiantian Zhang
,
Yongzhe Chang
,
Xueqian Wang
Published: 01 Feb 2023, Last Modified: 02 Mar 2023
ICLR 2023 poster
Readers:
Everyone
Quality-Similar Diversity via Population Based Reinforcement Learning
Shuang Wu
,
Jian Yao
,
Haobo Fu
,
Ye Tian
,
Chao Qian
,
Yaodong Yang
,
QIANG FU
,
Yang Wei
Published: 01 Feb 2023, Last Modified: 01 Mar 2023
ICLR 2023 poster
Readers:
Everyone
Speedup Training Artificial Intelligence for Mahjong via Reward Variance Reduction
Jinqiu Li
,
Shuang Wu
,
Haobo Fu
,
Qiang Fu
,
Enmin Zhao
,
Junliang Xing
2022 (modified: 24 Apr 2023)
CoG 2022
Readers:
Everyone
Greedy when Sure and Conservative when Uncertain about the Opponents
Haobo Fu
,
Ye Tian
,
Hongxiang Yu
,
Weiming Liu
,
Shuang Wu
,
Jiechao Xiong
,
Ying Wen
,
Kai Li
,
Junliang Xing
,
Qiang Fu
,
Wei Yang
2022 (modified: 24 Apr 2023)
ICML 2022
Readers:
Everyone
Actor-Critic Policy Optimization in a Large-Scale Imperfect-Information Game
Haobo Fu
,
Weiming Liu
,
Shuang Wu
,
Yijia Wang
,
Tao Yang
,
Kai Li
,
Junliang Xing
,
Bin Li
,
Bo Ma
,
Qiang Fu
,
Wei Yang
2022 (modified: 24 Apr 2023)
ICLR 2022
Readers:
Everyone
Actor-Critic Policy Optimization in a Large-Scale Imperfect-Information Game
Haobo Fu
,
Weiming Liu
,
Shuang Wu
,
Yijia Wang
,
Tao Yang
,
Kai Li
,
Junliang Xing
,
Bin Li
,
Bo Ma
,
QIANG FU
,
Yang Wei
Published: 28 Jan 2022, Last Modified: 13 Feb 2023
ICLR 2022 Poster
Readers:
Everyone
Hybrid neural state machine for neural network
Lei Tian
,
Zhenzhi Wu
,
Shuang Wu
,
Luping Shi
Published: 01 Jan 2021, Last Modified: 09 May 2023
Sci. China Inf. Sci. 2021
Readers:
Everyone
View all 19 publications
Co-Authors
Bin Li
Bo Ma
Chao Qian
Dong Wu
Enmin Zhao
Feng Chen
Guanrui Wang
Guoqi Li
Haobo Fu
Hongxiang Yu
Jian Yao
Jiechao Xiong
Jinqiu Li
Junliang Xing
Kai Li
Lei Deng
Lei Tian
Liu Liu
Liwen Zhu
Luping Shi
Pei Tang
QIANG FU
Qiang Fu
Shiweixu
Tao Yang
View all 42 co-authors