OpenReview.net
Zhibo Yang
Researcher, Alibaba Group
Joined February 2020
Names
Zhibo Yang (Preferred), ZhiBo Yang
Emails
****@gmail.com (Confirmed)
Personal Links
Homepage
Google Scholar
DBLP
ORCID
Career & Education History
Researcher
Alibaba Group (alibaba-inc.com)
2014 – Present
MS student
Tsinghua University (tsinghua.edu.cn)
2010 – 2014
Advisors, Relations & Conflicts
Coworker
Cong Yao
2021 – 2023
Coworker
Sibo Song
2019 – 2023
Expertise
Visual Large Language Models
2023 – Present
Visual Document Understanding
2020 – Present
Publications
Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking
Mingxin Li, Yanzhao Zhang, Dingkun Long, Keqin Chen, Sibo Song, Shuai Bai, ZhiBo Yang, Pengjun Xie, An Yang, Dayiheng Liu, Jingren Zhou, Junyang Lin
OpenReview Archive Direct Upload
Readers: Everyone
From Narrow to Panoramic Vision: Attention-Guided Cold-Start Reshapes Multimodal Reasoning
Ruilin Luo, Chufan Shi, Yizhen Zhang, Cheng Yang, Songtao Jiang, Tongkun Guan, Ruizhe Chen, Ruihang Chu, Peng Wang, Mingkun Yang, Lei Wang, Yujiu Yang, Junyang Lin, Zhibo Yang
ICLR 2026 Poster
Readers: Everyone
DocThinker: Explainable Multimodal Large Language Models with Rule-based Reinforcement Learning for Document Understanding
Wenwen Yu, Zhibo Yang, Yuliang Liu, Xiang Bai
CoRR 2025
Readers: Everyone
OmniParser V2: Structured-Points-of-Thought for Unified Visual Text Parsing and Its Generality to Multimodal Large Language Models
Wenwen Yu, Zhibo Yang, Jianqiang Wan, Sibo Song, Jun Tang, Wenqing Cheng, Yuliang Liu, Xiang Bai
CoRR 2025
Readers: Everyone
Qwen2.5-VL Technical Report
Shuai Bai, Keqin Chen, Xuejing Liu, Jialin Wang, Wenbin Ge, Sibo Song, Kai Dang, Peng Wang, Shijie Wang, Jun Tang, Humen Zhong, Yuanzhi Zhu, Ming-Hsuan Yang, Zhaohai Li, Jianqiang Wan, Pengfei Wang, Wei Ding, Zheren Fu, Yiheng Xu, Jiabo Ye et al. (7 additional authors not shown)
CoRR 2025
Readers: Everyone
HierCode: A lightweight hierarchical codebook for zero-shot Chinese text recognition
Yuyi Zhang, Yuanzhi Zhu, Dezhi Peng, Peirong Zhang, Zhenhua Yang, Zhibo Yang, Cong Yao, Lianwen Jin
Pattern Recognit. 2025
Readers: Everyone
LORE++: Logical location regression network for table structure recognition with pre-training
Rujiao Long, Hangdi Xing, Zhibo Yang, Qi Zheng, Zhi Yu, Fei Huang, Cong Yao
Pattern Recognit. 2025
Readers: Everyone
HIP: Hierarchical Point Modeling and Pre-training for Visual Information Extraction
Rujiao Long, Pengfei Wang, Zhibo Yang, Wenqing Cheng
ICDAR (1) 2025
Readers: Everyone
Generative compositor for few-shot visual information extraction
Zhibo Yang, Wei Hua, Sibo Song, Cong Yao, Yingying Zhu, Wenqing Cheng, Xiang Bai
Pattern Recognit. 2025
Readers: Everyone
SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild
Jiawei Liu, Yuanzhi Zhu, Feiyu Gao, Zhibo Yang, Peng Wang, Junyang Lin, Xinggang Wang, Wenyu Liu
CoRR 2025
Readers: Everyone
Co-Authors
An Yang
Chao Zhang
Cheng Yang
Cheng-Lin Liu
Chufan Shi
Chunhua Shen
Cong Yao
Daniel Lopresti
Dayiheng Liu
Dezhi Peng
Ding Liang
Dingkun Long
Enze Xie
Fei Huang
Feiyu Gao
Gui-Song Xia
Haiyang Xu
Hang Zhang
Hangdi Xing
Humen Zhong
Jiabo Ye
Jialin Wang
Jianqiang Wan
Jiawei Liu
Jingren Zhou