OpenReview.net
Qichao Zhang
Associate Professor, Institute of Automation, Chinese Academy of Sciences
Joined June 2021
Names
Qichao Zhang (Preferred), qichao Zhang
Emails
****@ia.ac.cn (Confirmed)
Personal Links
Google Scholar
ORCID
Career & Education History
Associate Professor, Institute of Automation, Chinese Academy of Sciences (ia.ac.cn), 2019 – Present
Assistant Professor, Institute of Automation, Chinese Academy of Sciences (ia.ac.cn), 2017 – 2019
PhD student, Institute of Automation, Chinese Academy of Sciences (ia.ac.cn), 2014 – 2017
MS student, Control Theory and Control Engineering, Northeastern University (neu.edu), 2012 – 2014
Advisors, Relations & Conflicts
Coworker: Yuanheng Zhu, 2014 – 2024
PhD Advisor: Dongbin Zhao, 2014 – 2017
Expertise
Deep reinforcement learning, Present
Autonomous driving, 2016 – 2021
Decision-making and control, 2014 – 2021
Offline reinforcement learning, 2014 – 2021
Publications
SRFT: A Single-Stage Method with Supervised and Reinforcement Fine-Tuning for Reasoning
Yuqian Fu, Tinghong Chen, Jiajun Chai, Xihuai Wang, Songjun Tu, Guojun Yin, Wei Lin, Qichao Zhang, Yuanheng Zhu, Dongbin Zhao
ICLR 2026 Poster
Readers: Everyone
Learning When to Think: Shaping Adaptive Reasoning in R1-Style Models via Multi-Stage RL
Songjun Tu, Jiahao Lin, Qichao Zhang, Xiangyu Tian, Linjing Li, Xiangyuan Lan, Dongbin Zhao
NeurIPS 2025 Poster
Readers: Everyone
ReasonPlan: Unified Scene Prediction and Decision Reasoning for Closed-loop Autonomous Driving
Xueyi Liu, Zuodong Zhong, Qichao Zhang, Yuxin Guo, Yupeng Zheng, Junli Wang, Dongbin Zhao, Yun-Fu Liu, Zhiguo Su, Yinfeng Gao, Qiao Lin, Chen Huiyong
CoRL 2025 Poster
Readers: Everyone
Enhancing LLM Reasoning with Iterative DPO: A Comprehensive Empirical Investigation
Songjun Tu, Jiahao Lin, Xiangyu Tian, Qichao Zhang, Linjing Li, Yuqian Fu, Nan Xu, Wei He, Xiangyuan Lan, Dongmei Jiang, Dongbin Zhao
COLM 2025
Readers: Everyone
Deep-Reinforcement-Learning-Based Driving Policy at Intersections Utilizing Lane Graph Networks
Yuqi Liu, Qichao Zhang, Yinfeng Gao, Dongbin Zhao
IEEE Trans. Cogn. Dev. Syst. 2024
Readers: Everyone
Unsupervised Zero-Shot Reinforcement Learning via Dual-Value Forward-Backward Representation
Jingbo Sun, Songjun Tu, Qichao Zhang, Haoran Li, Xin Liu, Yaran Chen, Ke Chen, Dongbin Zhao
ICLR 2025 Poster
Readers: Everyone
Prototypical Context-Aware Dynamics for Generalization in Visual Control With Model-Based Reinforcement Learning
Junjie Wang, Qichao Zhang, Yao Mu, Dong Li, Dongbin Zhao, Yuzheng Zhuang, Ping Luo, Bin Wang, Jianye Hao
IEEE Trans. Ind. Informatics 2024
Readers: Everyone
Dynamic-Horizon Model-Based Value Estimation With Latent Imagination
Junjie Wang, Qichao Zhang, Dongbin Zhao
IEEE Trans. Neural Networks Learn. Syst. 2024
Readers: Everyone
High-quality Synthetic Data is Efficient for Model-based Offline Reinforcement Learning
Qichao Zhang, Xing Fang, Kaixuan Xu, Weixin Zhao, Haoran Li, Dongbin Zhao
IJCNN 2024
Readers: Everyone
Planning-Inspired Hierarchical Trajectory Prediction via Lateral-Longitudinal Decomposition for Autonomous Driving
Ding Li, Qichao Zhang, Zhongpu Xia, Yupeng Zheng, Kuan Zhang, Menglong Yi, Wenda Jin, Dongbin Zhao
IEEE Trans. Intell. Veh. 2024
Readers: Everyone
View all 35 publications
Co-Authors
Bin Wang
Bo Zhao
Bu Jin
Chaoxu Mu
Chen Huiyong
Chengliang Zhong
Dawei Ding
Ding Li
Donald C. Wunsch
Dong Li
Dongbin Zhao
Dongmei Jiang
Guogang Liao
Guojun Yin
Guyue Zhou
Hao Zhao
Haoran Li
Huan-ang Gao
Hufei Zhu
Jiahao Lin
Jiajun Chai
Jianye HAO
Jianye Hao
Jingbo Sun
Junjie Wang
View all 81 co-authors