﻿Company Name,Publication Date,Paper Title,Authors
ByteDance Ltd.,2025-06-27,Investigating the Overlooked Hessian Structure: From CNNs to LLMs,Qian-Yuan Tang* 1 Yufei Gu* 2 Yunfeng Cai 3 Mingming Sun 4 Ping Li 5 Xun Zhou 6 Zeke Xie†
ByteDance Ltd.,2025-06-26,Active Reward Modeling: Adaptive Preference Labeling for Large Language Model Alignment,"Yunyi Shen, Hao Sun, Jean-Francois Ton"
ByteDance Ltd.,2025-06-20,Polybasic Speculative Decoding Through a Theoretical Perspective,"Ruilin Wang, Huixia Li, Yuexiao Ma, Xiawu Zheng, Fei Chao, Xuefeng Xiao, Rongrong Ji"
ByteDance Ltd.,2025-06-16,Robust Multi-bit Text Watermark with LLM-based Paraphrasers,"Xiaojun Xu, Jinghan Jia, Yuanshun Yao, Yang Liu, Hang Li"
ByteDance Ltd.,2025-06-15,Improving Zero-Shot Adversarial Robustness in Vision-Language Models by Closed-form Alignment of Adversarial Path Simplices,"Junhao Dong, Piotr Koniusz, Yifei Zhang, Hao Zhu, Weiming Liu, Xinghua Qu, Yew-Soon Ong"
ByteDance Ltd.,2025-06-12,Expert Race: A Flexible Routing Strategy for Scaling Diffusion Transformer with Mixture of Experts,"Yike Yuan, Ziyu Wang, Zihao Huang, Defa Zhu, Xun Zhou, Jingyi Yu, Qiyang Min"
ByteDance Ltd.,2025-06-11,Seedance 1.0: Exploring the Boundaries of Video Generation Models,"Yu Gao, Haoyuan Guo, Tuyen Hoang, Weilin Huang, Lu Jiang, Fangyuan Kong, Huixia Li, Jiashi Li, Liang Li, Xiaojie Li, Xunsong Li, Yifu Li, Shanchuan Lin, Zhijie Lin, Jiawei Liu, Shu Liu, Xiaonan Nie, Zhiwu Qing, Yuxi Ren, Li Sun, Zhi Tian, Rui Wang, Sen Wang, Guoqiang Wei, Guohong Wu, Jie Wu, Ruiqi Xia, Fei Xiao, Xuefeng Xiao, Jiangqiao Yan, Ceyuan Yang, Jianchao Yang, Runkai Yang, Tao Yang, Yihang Yang, Zilyu Ye, Xuejiao Zeng, Yan Zeng, Heng Zhang, Yang Zhao, Xiaozheng Zheng, Peihao Zhu, Jiaxin Zou, Feilong Zuo"
ByteDance Ltd.,2025-06-06,BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning,"Han Zhong, Yutong Yin, Shenao Zhang, Xiaojun Xu, Yuanxin Liu, Yifei Zuo, Zhihan Liu, Boyi Liu, Sirui Zheng, Hongyi Guo, Liwei Wang, Mingyi Hong, Zhaoran Wang"
ByteDance Ltd.,2025-06-06,Astra: Toward General-Purpose Mobile Robots via Hierarchical Multimodal Learning,"Sheng Chen, Peiyu He, Jiaxin Hu, Ziyang Liu, Yansheng Wang, Tao Xu, Chi Zhang, Chongchong Zhang, Chao An, Shiyu Cai, Duo Cao, Kangping Chen, Shuai Chu, Tianwei Chu, Mingdi Dan, Min Du, Weiwei Fang, Pengyou Fu, Junkai Hu, Xiaowei Jiang, Zhaodi Jiang, Fuxuan Li, Jun Li, Minghui Li, Mingyao Li, Yanchang Li, Zhibin Li, Guangming Liu, Kairui Liu, Lihao Liu, Weizhi Liu, Xiaoshun Liu, Yufei Liu, Yunfei Liu, Qiang Lu, Yuanfei Luo, Xiang Lv, Hongying Ma, Sai Ma, Lingxian Mi, Sha Sa, Hongxiang Shu, Lei Tian, Chengzhi Wang, Jiayu Wang, Kaijie Wang, Qingyi Wang, Renwen Wang, Tao Wang, Wei Wang, Xirui Wang, Chao Wei, Xuguang Wei, Zijun Xia, Zhaohao Xiao, Tingshuai Yan, Liyan Yang, Yifan Yang, Zhikai Yang, Zhong Yin, Li Yuan, Liuchun Yuan, Chi Zhang, Jinyang Zhang, Junhui Zhang, Linge Zhang, Zhenyi Zhang, Zheyu Zhang, Dongjie Zhu, Hang Li, Yangang Zhang"
ByteDance Ltd.,2025-06-05,SeedEdit 3.0: Fast and High-Quality Generative Image Editing,"Peng Wang, Yichun Shi, Xiaochen Lian, Zhonghua Zhai, Xin Xia, Xuefeng Xiao, Weilin Huang, Jianchao Yang"
ByteDance Ltd.,2025-06-04,Sounding that Object: Interactive Object-Aware Image to Audio Generation,"Tingle Li, Baihe Huang, Xiaobin Zhuang, Dongya Jia, Jiawei Chen, Yuping Wang, Zhuo Chen, Gopala Anumanchipalli, Yuxuan Wang"
ByteDance Ltd.,2025-05-31,An All-Atom Generative Model for Designing Protein Complexes,"Ruizhe Chen, Dongyu Xue, Xiangxin Zhou, Zaixiang Zheng, Xiangxiang Zeng, Quanquan Gu"
ByteDance Ltd.,2025-05-27,PaSa: An LLM Agent for Comprehensive Academic Paper Search,"Yichen He, Guanhua Huang, Peiyuan Feng, Yuan Lin, Yuchen Zhang, Hang Li, Weinan E"
ByteDance Ltd.,2025-05-27,Designing Cyclic Peptides via Harmonic SDE with Atom-Bond Modeling,"Xiangxin Zhou, Mingyu Li, Yi Xiao, Jiahan Li, Dongyu Xue, Zaixiang Zheng, Jianzhu Ma, Quanquan Gu"
ByteDance Ltd.,2025-05-25,DiTAR: Diffusion Transformer Autoregressive Modeling for Speech Generation,"Dongya Jia, Zhuo Chen, Jiawei Chen, Chenpeng Du, Jian Wu, Jian Cong, Xiaobin Zhuang, Chumin Li, Zhen Wei, Yuping Wang, Yuxuan Wang"
ByteDance Ltd.,2025-05-23,Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling,"Hongzhi Huang, Defa Zhu, Banggu Wu, Yutao Zeng, Ya Wang, Qiyang Min, Xun Zhou"
ByteDance Ltd.,2025-05-21,MMaDA: Multimodal Large Diffusion Language Models,"Ling Yang, Ye Tian, Bowen Li, Xinchen Zhang, Ke Shen, Yunhai Tong, Mengdi Wang"
ByteDance Ltd.,2025-05-20,Emerging Properties in Unified Multimodal Pretraining,"Chaorui Deng, Deyao Zhu, Kunchang Li, Chenhui Gou, Feng Li, Zeyu Wang, Shu Zhong, Weihao Yu, Xiaonan Nie, Ziang Song, Guang Shi, Haoqi Fan"
ByteDance Ltd.,2025-05-17,Model Merging in Pre-training of Large Language Models,"Yunshui Li, Yiyuan Ma, Shen Yan, Chaoyi Zhang, Jing Liu, Jianqiao Lu, Ziwen Xu, Mengzhao Chen, Minrui Wang, Shiyi Zhan, Jin Ma, Xunhao Lai, Deyi Liu, Yao Luo, Xingyan Bin, Hongbin Ren, Mingji Han, Wenhao Hao, Bairen Yi, LingJun Liu, Bole Ma, Xiaoying Jia, Xun Zhou, Siyuan Qiao, Liang Xiang, Yonghui Wu"
ByteDance Ltd.,2025-05-13,Seed1.5-VL Technical Report,"Dong Guo, Faming Wu, Feida Zhu, Fuxing Leng, Guang Shi, Haobin Chen, Haoqi Fan, Jian Wang, Jianyu Jiang, Jiawei Wang, Jingji Chen, Jingjia Huang, Kang Lei, Liping Yuan, Lishu Luo, Pengfei Liu, Qinghao Ye, Rui Qian, Shen Yan, Shixiong Zhao, Shuai Peng, Shuangye Li, Sihang Yuan, Sijin Wu, Tianheng Cheng, Weiwei Liu, Wenqian Wang, Xianhan Zeng, Xiao Liu, Xiaobo Qin, Xiaohan Ding, Xiaojun Xiao, Xiaoying Zhang, Xuanwei Zhang, Xuehan Xiong, Yanghua Peng, Yangrui Chen, Yanwei Li, Yanxu Hu, Yi Lin, Yiyuan Hu, Yiyuan Zhang, Youbin Wu, Yu Li, Yudong Liu, Yue Ling, Yujia Qin, Zanbo Wang, Zhiwu He, Aoxue Zhang, Bairen Yi, Bencheng Liao, Can Huang, Can Zhang, Chaorui Deng, Chaoyi Deng, Cheng Lin, Cheng Yuan, Chenggang Li, Chenhui Gou, Chenwei Lou, Chengzhi Wei, Chundian Liu, Chunyuan Li, Deyao Zhu, Donghong Zhong, Feng Li, Feng Zhang, Gang Wu, Guodong Li, Guohong Xiao, Haibin Lin, Haihua Yang, Haoming Wang, Heng Ji, Hongxiang Hao, Hui Shen, Huixia Li, Jiahao Li, Jialong Wu, Jianhua Zhu, Jianpeng Jiao, Jiashi Feng, Jiaze Chen, Jianhui Duan, Jihao Liu, Jin Zeng, Jingqun Tang, Jingyu Sun, Joya Chen, Jun Long, Junda Feng, Junfeng Zhan, Junjie Fang, Junting Lu, Kai Hua, Kai Liu, Kai Shen, Kaiyuan Zhang, Ke Shen et al. (97 additional authors not shown)"
ByteDance Ltd.,2025-05-09,Understanding Stragglers in Large Model Training Using What-if Analysis,"Jinkun Lin, Ziheng Jiang, Zuquan Song, Sida Zhao, Menghan Yu, Zhanghan Wang, Chenyuan Wang, Zuocheng Shi, Xiang Shi, Wei Jia, Zherui Liu, Shuguang Wang, Haibin Lin, Xin Liu, Aurojit Panda, Jinyang Li"
ByteDance Ltd.,2025-05-05,MxMoE: Mixed-precision Quantization for MoE with Accuracy and Performance Co-Design,"Haojie Duanmu, Xiuhong Li, Zhihang Yuan, Size Zheng, Jiangfei Duan, Xingcheng Zhang, Dahua Lin"
ByteDance Ltd.,2025-04-25,ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference,"Hanshi Sun, Li-Wen Chang, Wenlei Bao, Size Zheng, Ningxin Zheng, Xin Liu, Harry Dong, Yuejie Chi, Beidi Chen"
ByteDance Ltd.,2025-04-24,Let the Code LLM Edit Itself When You Edit the Code,"Zhenyu He, Jun Zhang, Shengjie Luo, Jingjing Xu, Zhi Zhang, Di He"
ByteDance Ltd.,2025-04-22,Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation,"Haotong Lin, Sida Peng, Jingxiao Chen, Songyou Peng, Jiaming Sun, Minghuan Liu, Hujun Bao, Jiashi Feng, Xiaowei Zhou, Bingyi Kang"
ByteDance Ltd.,2025-04-15,Seedream 3.0 Technical Report,"Yu Gao, Haoyuan Guo, Tuyen Hoang, Weilin Huang, Lu Jiang, Fangyuan Kong, Huixia Li, Jiashi Li, Liang Li, Xiaojie Li, Xunsong Li, Yifu Li, Shanchuan Lin, Zhijie Lin, Jiawei Liu, Shu Liu, Xiaonan Nie, Zhiwu Qing, Yuxi Ren, Li Sun, Zhi Tian, Rui Wang, Sen Wang, Guoqiang Wei, Guohong Wu, Jie Wu, Ruiqi Xia, Fei Xiao, Xuefeng Xiao, Jiangqiao Yan, Ceyuan Yang, Jianchao Yang, Runkai Yang, Tao Yang, Yihang Yang, Zilyu Ye, Xuejiao Zeng, Yan Zeng, Heng Zhang, Yang Zhao, Xiaozheng Zheng, Peihao Zhu, Jiaxin Zou, Feilong Zuo"
ByteDance Ltd.,2025-04-10,Seed-Thinking-v1.5: Advancing Superb Reasoning Models with Reinforcement Learning,"Jiaze Chen, TianTian Fan, Xin Liu, Lingjun Liu, Zhiqi Lin, Mingxuan Wang, Chengyi Wang, Xiangpeng Wei, Wenyuan Xu,Yufeng Yuan, Yu Yue, Lin Yan, Qiying Yu, Xiaochen Zuo, Chi Zhang"
ByteDance Ltd.,2025-04-03,ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation,"Ali Athar, Xueqing Deng, Liang-Chieh Chen"
ByteDance Ltd.,2025-04-03,Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving,"Daoguang Zan, Zhirong Huang, Wei Liu, Hanwu Chen, Linhao Zhang, Shulin Xin, Lu Chen, Qi Liu, Xiaojian Zhong, Aoyan Li, Siyao Liu, Yongsheng Xiao, Liangqiang Chen, Yuyu Zhang, Jing Su, Tianyu Liu, Rui Long, Kai Shen, Liang Xiang"
ByteDance Ltd.,2025-04-01,Recitation over Reasoning: How Cutting-Edge Language Models Can Fail on Elementary School-Level Reasoning Problems?,"Kai Yan, Yufei Xu, Zhengyin Du, Xuesong Yao, Zheyu Wang, Xiaowen Guo, Jiecao Chen"
ByteDance Ltd.,2025-03-25,RayFlow: Instance-Aware Diffusion Acceleration via Adaptive Flow Trajectories,"Huiyang Shao, Xin Xia, Yuhong Yang, Yuxi Ren, Xing Wang, Xuefeng Xiao"
ByteDance Ltd.,2025-03-24,Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders,"Rui Chen, Jianfeng Zhang, Yixun Liang, Guan Luo, Weiyu Li, Jiarui Liu, Xiu Li, Xiaoxiao Long, Jiashi Feng, Ping Tan"
ByteDance Ltd.,2025-03-22,SeedVR: Seeding Infinity in Diffusion Transformer Towards Generic Video Restoration,"Jianyi Wang, Zhijie Lin, Meng Wei, Yang Zhao, Ceyuan Yang, Fei Xiao, Chen Change Loy, Lu Jiang"
ByteDance Ltd.,2025-03-20,Multi-Reward as Condition for Instruction-based Image Editing,"Xin Gu, Ming Li, Libo Zhang, Fan Chen, Longyin Wen, Tiejian Luo, Sijie Zhu"
ByteDance Ltd.,2025-03-18,"DAPO: An Open-Source LLM Reinforcement Learning System at Scale
Download Paper","Qiying Yu, Zheng Zhang, Ruofei Zhu, Yufeng Yuan, Xiaochen Zuo, Yu Yue, Tiantian Fan, Gaohong Liu, Lingjun Liu, Xin Liu, Haibin Lin, Zhiqi Lin, Bole Ma, Guangming Sheng, Yuxuan Tong, Chi Zhang, Mofan Zhang, Wang Zhang, Hang Zhu, Jinhua Zhu, Jiaze Chen, Jiangjie Chen, Chengyi Wang, Hongli Yu, Weinan Dai, Yuxuan Song, Xiangpeng Wei, Hao Zhou, Jingjing Liu, Wei-Ying Ma, Ya-Qin Zhang, Lin Yan, Mu Qiao, Yonghui Wu, Mingxuan Wang"
ByteDance Ltd.,2025-03-18,Hyper-Connections,"Defa Zhu, Hongzhi Huang, Zihao Huang, Yutao Zeng, Yunyao Mao, Banggu Wu, Qiyang Min, Xun Zhou"
ByteDance Ltd.,2025-03-17,FlexWorld: Progressively Expanding 3D Scenes for Flexiable-View Synthesis,"Luxi Chen, Zihan Zhou, Min Zhao, Yikai Wang, Ge Zhang, Wenhao Huang, Hao Sun, Ji-Rong Wen, Chongxuan Li"
ByteDance Ltd.,2025-03-14,ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance,"Jiannan Huang, Jun Hao Liew, Hanshu Yan, Yuyang Yin, Yao Zhao, Humphrey Shi, Yunchao Wei"
ByteDance Ltd.,2025-03-10,Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model,"Lixue Gong, Xiaoxia Hou, Fanshi Li, Liang Li, Xiaochen Lian, Fei Liu, Liyang Liu, Wei Liu, Wei Lu, Yichun Shi, Shiqi Sun, Yu Tian, Zhi Tian, Peng Wang, Xun Wang, Ye Wang, Guofeng Wu, Jie Wu, Xin Xia, Xuefeng Xiao, Linjie Yang, Zhonghua Zhai, Xinyu Zhang, Qi Zhang, Yuwei Zhang, Shijia Zhao, Jianchao Yang, Weilin Huang"
ByteDance Ltd.,2025-03-05,VideoWorld: Exploring Knowledge Learning from Unlabeled Videos,"Zhongwei Ren, Yunchao Wei, Xun Guo, Yao Zhao, Bingyi Kang, Jiashi Feng, Xiaojie Jin"
ByteDance Ltd.,2025-03-04,LLaVA-Critic: Learning to Evaluate Multimodal Models,"Tianyi Xiong, Xiyao Wang, Dong Guo, Qinghao Ye, Haoqi Fan, Quanquan Gu, Heng Huang, Chunyuan Li"
ByteDance Ltd.,2025-03-03,The Rise and Down of Babel Tower: Investigating the Evolution Process of Multilingual Code Large Language Model,"Jiawei Chen, Wentao Chen, Jing Su, Jingjing Xu, Hongyu Lin, Mengjie Ren, Yaojie Lu, Xianpei Han, Le Sun"
ByteDance Ltd.,2025-03-01,TC-MoE: Augmenting Mixture of Experts with Ternary Expert Choice Download Paper,"Shen Yan, Xingyan Bin, Sijun Zhang, Yisen Wang, Zhouchen Lin"
ByteDance Ltd.,2025-02-28,FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference,"Xunhao Lai, Jianqiao Lu, Yao Luo, Yiyuan Ma, Xun Zhou"
ByteDance Ltd.,2025-02-27,Comet: Fine-grained Computation-communication Overlapping for Mixture-of-Experts,"Shulai Zhang, Ningxin Zheng, Haibin Lin, Ziheng Jiang, Wenlei Bao, Chengquan Jiang, Qi Hou, Weihao Cui, Size Zheng, Li-Wen Chang, Quan Chen, Xin Liu"
ByteDance Ltd.,2025-02-27,SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines,"M-A-P Team, Xinrun Du, Yifan Yao, Kaijing Ma, Bingli Wang, Tianyu Zheng, Kang Zhu, Minghao Liu, Yiming Liang, Xiaolong Jin, Zhenlin Wei, Chujie Zheng, Kaixin Deng, Shian Jia, Sichao Jiang, Yiyan Liao, Rui Li, Qinrui Li, Sirun Li, Yizhi Li, Yunwen Li, Dehua Ma, Yuansheng Ni, Haoran Que, Qiyao Wang, Zhoufutu Wen, Siwei Wu, Tianshun Xing, Ming Xu, Zhenzhu Yang, Zekun Moore Wang, Junting Zhou, Yuelin Bai, Xingyuan Bu, Chenglin Cai, Liang Chen, Yifan Chen, Chengtuo Cheng, Tianhao Cheng, Keyi Ding, Siming Huang, Yun Huang, Yaoru Li, Yizhe Li, Zhaoqun Li, Tianhao Liang, Chengdong Lin, Hongquan Lin, Yinghao Ma, Tianyang Pang, Zhongyuan Peng, Zifan Peng, Qige Qi, Shi Qiu, Xingwei Qu, Shanghaoran Quan, Yizhou Tan, Zili Wang, Chenqing Wang, Hao Wang, Yiya Wang, Yubo Wang, Jiajun Xu, Kexin Yang, Ruibin Yuan, Yuanhao Yue, Tianyang Zhan, Chun Zhang, Jinyang Zhang, Xiyue Zhang, Xingjian Zhang, Yue Zhang, Yongchi Zhao, Xiangyu Zheng, Chenghua Zhong, Yang Gao, Zhoujun Li, Dayiheng Liu, Qian Liu, Tianyu Liu, Shiwen Ni, Junran"
ByteDance Ltd.,2025-02-26,Towards Semantic Equivalence of Tokenization in Multimodal LLM,"Shengqiong Wu, Hao Fei, Xiangtai Li, Jiayi Ji, Hanwang Zhang, Tat-Seng Chua, Shuicheng Yan"
ByteDance Ltd.,2025-02-25,You Only Sample Once: Taming One-Step Text-to-Image Synthesis by Self-Cooperative Diffusion GANS,"Yihong Luo, Xiaolong Chen, Xinghua Qu, Tianyang Hu, Jing Tang"
ByteDance Ltd.,2025-02-25,X-NeMo: Expressive Neural Motion Reenactment via Disentangled Latent Attention,"XiaoChen Zhao, Hongyi Xu, Guoxian Song, You Xie, Chenxu Zhang, Xiu Li, Linjie Luo, Jinli Suo, Yebin Liu"
ByteDance Ltd.,2025-02-17,MagicArticulate: Make Your 3D Models Articulation-Ready,"Chaoyue Song, Jianfeng Zhang, Xiu Li, Fan Yang, Yiwen Chen, Zhongcong Xu, Jun Hao Liew, Xiaoyang Guo, Fayao Liu, Jiashi Feng, Guosheng Lin"
ByteDance Ltd.,2025-02-13,"MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency","Dongzhi Jiang, Renrui Zhang, Ziyu Guo, Yanwei Li, Yu Qi, Xinyan Chen, Liuhui Wang, Jianhan Jin, Claire Guo, Shen Yan, Bo Zhang, Chaoyou Fu, Peng Gao, Hongsheng Li"
ByteDance Ltd.,2025-02-12,"One Example Shown, Many Concepts Known! Counterexample-Driven Conceptual Reasoning in Mathematical LLMs","Yinghui Li, Jiayi Kuang, Haojing Huang, Zhikun Xu, Xinnian Liang, Yi Yu, Wenlian Lu, Yangning Li, Xiaoyu Tan, Chao Qu, Ying Shen, Hai-Tao Zheng, Philip S. Yu

"
ByteDance Ltd.,2025-02-10,MARS: Unleashing the Power of Variance Reduction for Training Large Models,"Huizhuo Yuan, Yifeng Liu, Shuang Wu, Xun Zhou, Quanquan Gu"
ByteDance Ltd.,2025-02-05,Teaching Language Models to Critique via Reinforcement Learning,"Zhihui Xie, Jie Chen, Liyu Chen, Weichao Mao, Jingjing Xu, Lingpeng Kong"
ByteDance Ltd.,2025-02-06,Ultra-Sparse Memory Network,"Zihao Huang, Qiyang Min, Hongzhi Huang, Defa Zhu, Yutao Zeng, Ran Guo, Xun Zhou"
ByteDance Ltd.,2025-02-05,Shot2Story: A New Benchmark for Comprehensive Understanding of Multi-shot Videos,"Mingfei Han, Linjie Yang, Xiaojun Chang, Lina Yao, Heng Wang"
ByteDance Ltd.,2025-02-05,BFS-Prover: Scalable Best-First Tree Search for LLM-based Automatic Theorem Proving,"Ran Xin, Chenguang Xi, Jie Yang, Feng Chen, Hang Wu, Xia Xiao, Yifan Sun, Shen Zheng, Kai Shen"
ByteDance Ltd.,2025-01-21,Video Depth Anything: Consistent Depth Estimation for Super-Long Videos,"Sili Chen, Hengkai Guo, Shengnan Zhu, Feihu Zhang, Zilong Huang, Jiashi Feng, Bingyi Kang"
ByteDance Ltd.,2025-01-21,UI-TARS: Pioneering Automated GUI Interaction with Native Agents,"Yujia Qin, Yining Ye, Junjie Fang, Haoming Wang, Shihao Liang, Shizuo Tian, Junda Zhang, Jiahao Li, Yunxin Li, Shijue Huang, Wanjun Zhong, Kuanye Li, Jiale Yang, Yu Miao, Woyu Lin, Longxiang Liu, Xu Jiang, Qianli Ma, Jingyu Li, Xiaojun Xiao, Kai Cai, Chuang Li, Yaowei Zheng, Chaolin Jin, Chen Li, Xiao Zhou, Minchao Wang, Haoli Chen, Zhaojian Li, Haihua Yang, Haifeng Liu, Feng Lin, Tao Peng, Xin Liu, Guang Shi"
ByteDance Ltd.,2025-01-20,X-Dyna: Expressive Dynamic Human Image Animation,"Di Chang, Hongyi Xu, You Xie, Yipeng Gao, Zhengfei Kuang, Shengqu Cai, Chenxu Zhang, Guoxian Song, Chao Wang, Yichun Shi, Zeyuan Chen, Shijie Zhou, Linjie Luo, Gordon Wetzstein, Mohammad Soleymani"
ByteDance Ltd.,2025-01-16,VideoWorld: Exploring Knowledge Learning from Unlabeled Videos,"Zhongwei Ren, Yunchao Wei, Xun Guo, Yao Zhao, Bingyi Kang, Jiashi Feng, Xiaojie Jin"
ByteDance Ltd.,2025-01-14,Diffusion Adversarial Post-Training for One-Step Video Generation,"Shanchuan Lin, Xin Xia, Yuxi Ren, Ceyuan Yang, Xuefeng Xiao, Lu Jiang"
ByteDance Ltd.,2024-12-19,Parallelized Autoregressive Visual Generation,"Yuqing Wang, Shuhuai Ren, Zhijie Lin, Yujin Han, Haoyuan Guo, Zhenheng Yang, Difan Zou, Jiashi Feng, Xihui Liu"
ByteDance Ltd.,2024-12-08,MaskBit: Embedding-free Image Generation via Bit Tokens,"Mark Weber, Lijun Yu, Qihang Yu, Xueqing Deng, Xiaohui Shen, Daniel Cremers, Liang-Chieh Chen"
ByteDance Ltd.,2024-12-03,FullStack Bench: Evaluating LLMs as Full Stack Coders,"Siyao Liu, He Zhu, Jerry Liu, Shulin Xin, Aoyan Li, Rui Long, Li Chen, Jack Yang, Jinxiang Xia, Z.Y, Peng, Shukai Liu, Zhaoxiang Zhang, Jing Mai, Ge Zhang, Wenhao Huang, Kai Shen, Liang Xiang"
ByteDance Ltd.,2024-11-26,DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention,"Lianghui Zhu, Zilong Huang, Bencheng Liao, Jun Hao Liew, Hanshu Yan, Jiashi Feng, Xinggang Wang"
ByteDance Ltd.,2024-11-20,DSTC: Direct Preference Learning with Only Self-Generated Tests and Code to Improve Code LMs,"Zhihan Liu, Shenao Zhang, Yongfei Liu, Boyi Liu, Yingxiang Yang, Zhaoran Wang"
ByteDance Ltd.,2024-11-13,LSH-MoE: Communication-efficient MoE Training via Locality-Sensitive Hashing,"Xiaonan Nie, Qibin Liu, Fangcheng Fu, Shenhan Zhu, Xupeng Miao, Xiaoyang Li, Yang Zhang, Shouda Liu, Bin Cui"
ByteDance Ltd.,2024-11-11,SeedEdit: Align Image Re-Generation to Image Editing,"Yichun Shi, Peng Wang, Weilin Huang"
ByteDance Ltd.,2024-11-06,Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models,"Zhijian Zhuo, Ya Wang, Yutao Zeng, Xiaoqing Li, Xun Zhou, Jinwen Ma"
ByteDance Ltd.,2024-11-05,Classification Done Right for Vision-Language Pre-Training,"Zilong Huang, Qinghao Ye, Bingyi Kang, Jiashi Feng, Haoqi Fan"
ByteDance Ltd.,2024-11-04,How Far is Video Generation from World Model: A Physical Law Perspective,"Bingyi Kang, Yang Yue, Rui Lu, Zhijie Lin, Yang Zhao, Kaixin Wang, Gao Huang, Jiashi Feng"
ByteDance Ltd.,2024-10-27,AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions,"Ziming Li, Qianbo Zang, David Ma, Jiawei Guo, Tuney Zheng, Minghao Liu, Xinyao Niu, Yue Wang, Jian Yang, Jiaheng Liu, Wanjun Zhong, Wangchunshu Zhou, Wenhao Huang, Ge Zhang"
ByteDance Ltd.,2024-10-24,Why Does the Effective Context Length of LLMs Fall Short?,"Chenxin An, Jun Zhang, Ming Zhong, Lei Li, Shansan Gong, Yao Luo, Jingjing Xu, Lingpeng Kong"
ByteDance Ltd.,2024-10-22,Merging LoRAs like Playing LEGO: Pushing the Modularity of LoRA to Extremes Through Rank-Wise Clustering,"Ziyu Zhao, Tao Shen, Didi Zhu, Zexi Li, Jing Su, Xuwu Wang, Kun Kuang, Fei Wu"
ByteDance Ltd.,2024-10-10,Reward-Augmented Data Enhances Direct Preference Alignment of LLMs,"Shenao Zhang, Zhihan Liu, Zhaoran Wang"
ByteDance Ltd.,2024-10-09,KOR-Bench: Benchmarking Language Models on Knowledge-Orthogonal Reasoning Tasks,"Kaijing Ma, Xinrun Du, Yunran Wang, Haoran Zhang, Zhoufutu Wen, Xingwei Qu, Jian Yang, Jiaheng Liu, Minghao Liu, Xiang Yue, Wenhao Huang, Ge Zhang"
ByteDance Ltd.,2024-10-03,FAN: Fourier Analysis Networks,"Yihong Dong, Ge Li, Yongding Tao, Xue Jiang, Kechi Zhang, Jia Li, Jing Su, Jun Zhang, Jingjing Xu"
ByteDance Ltd.,2024-10-03,Loong: Generating Minute-level Long Videos with Autoregressive Language Models,"Yuqing Wang, Tianwei Xiong, Daquan Zhou, Zhijie Lin, Yang Zhao, Bingyi Kang, Jiashi Feng, Xihui Liu"
ByteDance Ltd.,2024-10-03,Video Instruction Tuning With Synthetic Data,"Yuanhan Zhang, Jinming Wu, Wei Li, Bo Li, Zejun Ma, Ziwei Liu, Chunyuan Li"
ByteDance Ltd.,2024-10-02,HybridFlow: A Flexible and Efficient RLHF Framework,"Guangming Sheng, Chi Zhang, Zilingfeng Ye, Xibin Wu, Wang Zhang, Ru Zhang, Yanghua Peng, Haibin Lin, Chuan Wu"
ByteDance Ltd.,2024-09-13,Seed-Music: A Unified Framework for High Quality and Controlled Music Generation,"Ye Bai, Haonan Chen, Jitong Chen, Zhuo Chen, Yi Deng, Xiaohong Dong, Lamtharn Hantrakul, Weituo Hao, Qingqing Huang, Zhongyi Huang, Dongya Jia, Feihu La, Duc Le, Bochen Li, Chumin Li, Hui Li, Xingxing Li, Shouda Liu, Wei-Tsung Lu, Yiqing Lu, Andrew Shaw, Janne Spijkervet, Yakun Sun, Bo Wang, Ju-Chiang Wang, Yuping Wang, Yuxuan Wang, Ling Xu, Yifeng Yang, Chao Yao, Shuo Zhang, Yang Zhang, Yilin Zhang, Hang Zhao, Ziyi Zhao, Dejian Zhong, Shicen Zhou, Pei Zou"
ByteDance Ltd.,2024-09-03,"An X-ray Significantly Variable, Luminous, Type 2 Quasar at z = 2.99 with a Massive Host Galaxy","Xiurui Zhao, Stefano Marchesi, Marco Ajello, Francesca Civano, Roberto Gilli, Giorgio Lanzuisi, Iván E. López, Ross Silver, Nuria Torres-Albà, Peter G. Boorman, Andrealuna Pizzetti"
ByteDance Ltd.,2024-07-29,ByteCheckpoint: A Unified Checkpointing System for Large Foundation Model Development,"Borui Wan, Mingji Han, Yiyao Sheng, Yanghua Peng, Haibin Lin, Mofan Zhang, Zhichao Lai, Menghan Yu, Junda Zhang, Zuquan Song, Xin Liu, Chuan Wu"
ByteDance Ltd.,2024-07-28,"LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models","Feng Li, Renrui Zhang, Hao Zhang, Yuanhan Zhang, Bo Li, Wei Li, Zejun Ma, Chunyuan Li"
ByteDance Ltd.,2024-07-10,IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model,"Yatai Ji, Shilong Zhang, Jie Wu, Peize Sun, Weifeng Chen, Xuefeng Xiao, Sidi Yang, Yujiu Yang, Ping Luo"
ByteDance Ltd.,2024-07-10,Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition,"Ye Bai, Jingping Chen, Jitong Chen, Wei Chen, Zhuo Chen, Chuang Ding, Linhao Dong, Qianqian Dong, Yujiao Du, Kepan Gao, Lu Gao, Yi Guo, Minglun Han, Ting Han, Wenchao Hu, Xinying Hu, Yuxiang Hu, Deyu Hua, Lu Huang, Mingkun Huang, Youjia Huang, Jishuo Jin, Fanliu Kong, Zongwei Lan, Tianyu Li, Xiaoyang Li, Zeyang Li, Zehua Lin, Rui Liu, Shouda Liu, Lu Lu, Yizhou Lu, Jingting Ma, Shengtao Ma, Yulin Pei, Chen Shen, Tian Tan, Xiaogang Tian, Ming Tu, Bo Wang, Hao Wang, Yuping Wang, Yuxuan Wang, Hanzhang Xia, Rui Xia, Shuangyi Xie, Hongmin Xu, Meng Yang, Bihong Zhang, Jun Zhang, Wanyi Zhang, Yang Zhang, Yawei Zhang, Yijie Zheng, Ming Zou"
ByteDance Ltd.,2024-06-19,SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words,"Junyi Ao, Yuancheng Wang, Xiaohai Tian, Dekun Chen, Jun Zhang, Lu Lu, Yuxuan Wang, Haizhou Li, Zhizheng Wu"
ByteDance Ltd.,2024-06-13,Depth Anything V2,"Lihe Yang, Bingyi Kang, Zilong Huang, Zhen Zhao, Xiaogang Xu, Jiashi Feng, Hengshuang Zhao"
ByteDance Ltd.,2024-06-11,Autoregressive Pretraining with Mamba in Vision,"Sucheng Ren, Xianhang Li, Haoqin Tu, Feng Wang, Fangxun Shu, Lei Zhang, Jieru Mei, Linjie Yang, Peng Wang, Heng Wang, Alan Yuille, Cihang Xie"
ByteDance Ltd.,2024-06-11,An Image is Worth 32 Tokens for Reconstruction and Generation,"Qihang Yu, Mark Weber, Xueqing Deng, Xiaohui Shen, Daniel Cremers, Liang-Chieh Chen"
ByteDance Ltd.,2024-06-04,Seed-TTS: A Family of High-Quality Versatile Speech Generation Models,"Philip Anastassiou, Jiawei Chen, Jitong Chen, Yuanzhe Chen, Zhuo Chen, Ziyi Chen, Jian Cong, Lelai Deng, Chuang Ding, Lu Gao, Mingqing Gong, Peisong Huang, Qingqing Huang, Zhiying Huang, Yuanyuan Huo, Dongya Jia, Chumin Li, Feiya Li, Hui Li, Jiaxin Li, Xiaoyang Li, Xingxing Li, Lin Liu, Shouda Liu, Sichao Liu, Xudong Liu, Yuchen Liu, Zhengxi Liu, Lu Lu, Junjie Pan, Xin Wang, Yuping Wang, Yuxuan Wang, Zhen Wei, Jian Wu, Chao Yao, Yifeng Yang, Yuanhao Yi, Junteng Zhang, Qidi Zhang, Shuo Zhang, Wenjie Zhang, Yang Zhang, Zilin Zhao, Dejian Zhong, Xiaobin Zhuang"
ByteDance Ltd.,2024-05-29,PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator,"Hanshu Yan, Xingchao Liu, Jiachun Pan, Jun Hao Liew, Qiang Liu, Jiashi Feng"
ByteDance Ltd.,2024-05-28,3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting,"Qihang Zhang, Yinghao Xu, Chaoyang Wang, Hsin-Ying Lee, Gordon Wetzstein, Bolei Zhou, Ceyuan Yang"
ByteDance Ltd.,2024-05-28,Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment,"Xin Xiao, Bohong Wu, Jiacong Wang, Chunyuan Li, Xun Zhou, Haoyuan Guo"
ByteDance Ltd.,2024-05-23,Unveiling the Tapestry of Consistency in Large Vision-Language Models,"Yuan Zhang, Fei Xiao, Tao Huang, Chun-Kai Fan, Hongyuan Dong, Jiawen Li, Jiacong Wang, Kuan Cheng, Shanghang Zhang, Haoyuan Guo"
ByteDance Ltd.,2024-05-02,StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation,"Yupeng Zhou, Daquan Zhou, Ming-Ming Cheng, Jiashi Feng, Qibin Hou"
ByteDance Ltd.,2024-05-01,DINOISER: Diffused Conditional Sequence Learning by Manipulating Noises,"Jiasheng Ye, Zaixiang Zheng, Yu Bao, Lihua Qian, Mingxuan Wang"
ByteDance Ltd.,2024-04-29,PLLaVA : Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning,"Lin Xu, Yilin Zhao, Daquan Zhou, Zhijie Lin, See Kiong Ng, Jiashi Feng"
ByteDance Ltd.,2024-04-21,Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis,"Yuxi Ren, Xin Xia, Yanzuo Lu, Jiacheng Zhang, Jie Wu, Pan Xie, Xing Wang, Xuefeng Xiao"
ByteDance Ltd.,2024-04-15,HQ-Edit: A High-Quality Dataset for Instruction-based Image Editing,"Mude Hui, Siwei Yang, Bingchen Zhao, Yichun Shi, Heng Wang, Peng Wang, Yuyin Zhou, Cihang Xie"
ByteDance Ltd.,2024-04-09,Magic-Boost: Boost 3D Generation with Mutli-View Conditioned Diffusion,"Fan Yang, Jianfeng Zhang, Yichun Shi, Bowen Chen, Chenxu Zhang, Huichao Zhang, Xiaofeng Yang, Jiashi Feng, Guosheng Lin"
ByteDance Ltd.,2024-03-20,Magic-Me: Identity-Specific Video Customized Diffusion,"Ze Ma, Daquan Zhou, Chun-Hsiao Yeh, Xue-She Wang, Xiuyu Li, Huanrui Yang, Zhen Dong, Kurt Keutzer, Jiashi Feng"
ByteDance Ltd.,2024-03-10,Painting with Words: Elevating Detailed Image Captioning with Benchmark and Alignment Learning,"Qinghao Ye, Xianhan Zeng, Fu Li, Chunyuan Li, Haoqi Fan"
ByteDance Ltd.,2024-03-02,SDXL-Lightning: Progressive Adversarial Diffusion Distillation,"Shanchuan Lin,Anran Wang,Xiao Yang"
ByteDance Ltd.,2024-02-23,"MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs","Ziheng Jiang,Haibin Lin,Yinmin Zhong,Qi Huang,Yangrui Chen,Zhi Zhang,Yanghua Peng,Xiang Li,Cong Xie,Shibiao Nong,Yulu Jia,Sun He,Hongmin Chen,Zhihao Bai,Qi Hou,Shipeng Yan,Ding Zhou,Yiyao Sheng,Zhuo Jiang,Haohan Xu,Haoran Wei,Zhang Zhang,Pengfei Nie,Leqi Zou,Sida Zhao,Liang Xiang,Zherui Liu,Zhe Li,Xiaoying Jia,Jianxi Ye,Xin Jin,Xin Liu"
ByteDance Ltd.,2024-01-19,Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data,"Lihe Yang,Bingyi Kang,Zilong Huang,Xiaogang Xu,Jiashi Feng,Hengshuang Zhao"
ByteDance Ltd.,2023-12-12,Vista-LLaMA:Reliable Video Narrator via Equal Distance to Visual Tokens,"Fan Ma, Xiaojie Jin, Heng Wang, Yuchen Xian, Jiashi Feng, Yi Yang"
ByteDance Ltd.,2023-12-04,PixelLM:Pixel Reasoning with Large Multimodal Model,"Zhongwei Ren, Zhicheng Huang, Yunchao Wei, Yao Zhao, Dongmei Fu, Jiashi Feng, Xiaojie Jin"
ByteDance Ltd.,2023-11-29,Diffusion Glancing Transformer for Parallel Sequence to Sequence Learning,"Lihua Qian,Mingxuan Wang,Yang Liu,Hao Zhou"
ByteDance Ltd.,2023-11-27,MagicAnimate:Temporally Consistent Human Image Animation using Diffusion Model,"Zhongcong Xu,Jianfeng Zhang,Jun Hao Liew,Hanshu Yan,Jia-Wei Liu,Chenxu Zhang,Jiashi Feng,Mike Zheng Shou"
ByteDance Ltd.,2023-11-18,Make Pixels Dance: High-Dynamic Video Generation,"Yan Zeng,Guoqiang Wei,Jiani Zheng,Jiaxin Zou,Yang Wei,Yuchen Zhang,Hang Li"
ByteDance Ltd.,2023-10-20,SALMONN: Towards Generic Hearing Abilities for Large Language Models,"Changli Tang,Wenyi Yu,Guangzhi Sun,Xianzhao Chen,Tian Tan,Wei Li,Lu Lu,Zejun Ma,Chao Zhang"
ByteDance Ltd.,2023-09-09,AudioLDM 2: Learning holistic audio generation with self-supervised pretraining,"Haohe Liu,Qiao Tian,Yi Yuan,Xubo Liu,Xinhao Mei,Qiuqiang Kong,Yuping Wang,Wenwu Wang,Yuxuan Wang,Mark D. Plumbley"
ByteDance Ltd.,2023-08-28,MagicEdit: High-Fidelity and Temporally Coherent Video Editing,"Jun Hao Liew,Hanshu Yan,Jianfeng Zhang,Zhongcong Xu,Jiashi Feng"
ByteDance Ltd.,2023-06-13,PolyVoice: Language Models for Speech to Speech Translation,"Qianqian Dong,Zhiying Huang,Qiao Tian,Chen Xu,Tom Ko,Yunlong Zhao,Siyuan Feng,Tang Li,Kexin Wang,Xuxin Cheng,Fengpeng Yue,Ye Bai,Xi Chen,Lu Lu,Zejun Ma,Yuping Wang,Mingxuan Wang,Yuxuan Wang"
ByteDance Ltd.,2023-05-25,Efficient Neural Music Generation,"Max W. Y. Lam,Qiao Tian,Tang Li,Zongyu Yin,Siyuan Feng,Ming Tu,Yuliang Ji,Rui Xia,Mingbo Ma,Xuchen Song,Jitong Chen,Yuping Wang,Yuxuan Wang"
ByteDance Ltd.,2023-02-20,ByteTransformer: A High-Performance Transformer Boosted for Variable-Length Inputs,"Yujia Zhai,Chengquan Jiang,Leyuan Wang,Xiaoying Jia,Shang Zhang,Zizhong Chen,Xin Liu,Yibo Zhu"
"Hangzhou DeepSeek Artificial Intelligence Basic Technology Research Co., Ltd.",2024-01-05,DeepSeek LLM: Scaling Open-Source Language Models with Longtermism,"Xiao Bi, Deli Chen, Guanting Chen, Shanhuang Chen, Damai Dai, Chengqi Deng, Honghui Ding, Kai Dong, Qiushi Du, Zhe Fu, Huazuo Gao, Kaige Gao, Wenjun Gao, Ruiqi Ge, Kang Guan, Daya Guo, Jianzhong Guo, Guangbo Hao, Zhewen Hao, Ying He, Wenjie Hu, Panpan Huang, Erhang Li, Guowei Li, Jiashi Li, Yao Li, Y.K. Li, Wenfeng Liang, Fangyun Lin, A.X. Liu, Bo Liu, Wen Liu, Xiaodong Liu, Xin Liu, Yiyuan Liu, Haoyu Lu, Shanghao Lu, Fuli Luo, Shirong Ma, Xiaotao Nie, Tian Pei, Yishi Piao, Junjie Qiu, Hui Qu, Tongzheng Ren, Zehui Ren, Chong Ruan, Zhangli Sha, Zhihong Shao, Junxiao Song, Xuecheng Su, Jingxiang Sun, Yaofeng Sun, Minghui Tang, Bingxuan Wang, Peiyi Wang, Shiyu Wang, Yaohui Wang, Yongji Wang, Tong Wu, Y. Wu, Xin Xie, Zhenda Xie, Ziwei Xie, Yiliang Xiong, Hanwei Xu, R.X. Xu, Yanhong Xu, Dejian Yang, Yuxiang You, Shuiping Yu, Xingkai Yu, B. Zhang, Haowei Zhang, Lecong Zhang, Liyue Zhang, Mingchuan Zhang, Minghua Zhang, Wentao Zhang, Yichao Zhang, Chenggang Zhao, Yao Zhao, Shangyan Zhou, Shunfeng Zhou, Qihao Zhu, Yuheng Zou"
"Hangzhou DeepSeek Artificial Intelligence Basic Technology Research Co., Ltd.",2024-05-07,"DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model","DeepSeek-AI, Aixin Liu, Bei Feng, Bin Wang, Bingxuan Wang, Bo Liu, Chenggang Zhao, Chengqi Dengr, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Hanwei Xu, Hao Yang, Haowei Zhang, Honghui Ding, Huajian Xin, Huazuo Gao, Hui Li, Hui Qu, J.L. Cai, Jian Liang, Jianzhong Guo, Jiaqi Ni, Jiashi Li, Jin Chen, Jingyang Yuan, Junjie Qiu, Junxiao Song, Kai Dong, Kaige Gao, Kang Guan, Lean Wang, Lecong Zhang, Lei Xu, Leyi Xia, Liang Zhao, Liyue Zhang, Meng Li, Miaojun Wang, Mingchuan Zhang, Minghua Zhang, Minghui Tang, Mingming Li, Ning Tian, Panpan Huang, Peiyi Wang, Peng Zhang, Qihao Zhu, Qinyu Chen, Qiushi Du, R.J. Chen, R.L. Jin, Ruiqi Ge, Ruizhe Pan, Runxin Xu, Ruyi Chen, S.S. Li, Shanghao Lu, Shangyan Zhou, Shanhuang Chen, Shaoqing Wu, Shengfeng Ye, Shirong Ma, Shiyu Wang, Shuang Zhou, Shuiping Yu, Shunfeng Zhou, Size Zheng, T. Wang, Tian Pei, Tian Yuan, Tianyu Sun, W.L. Xiao, Wangding Zeng, Wei An, Wen Liu, Wenfeng Liang, Wenjun Gao, Wentao Zhang, X.Q. Li, Xiangyue Jin, Xianzu Wang, Xiao Bi, Xiaodong Liu, Xiaohan Wang, Xiaojin Shen, Xiaokang Chen, Xiaosha Chen, Xiaotao Nie, Xiaowen Sun et al. (57 additional authors not shown)"
"Hangzhou DeepSeek Artificial Intelligence Basic Technology Research Co., Ltd.",2024-12-27,DeepSeek-V3 Technical Report,"DeepSeek-AI, Aixin Liu, Bei Feng, Bing Xue, Bingxuan Wang, Bochao Wu, Chengda Lu, Chenggang Zhao, Chengqi Deng, Chenyu Zhang, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fucong Dai, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Han Bao, Hanwei Xu, Haocheng Wang, Haowei Zhang, Honghui Ding, Huajian Xin, Huazuo Gao, Hui Li, Hui Qu, J.L. Cai, Jian Liang, Jianzhong Guo, Jiaqi Ni, Jiashi Li, Jiawei Wang, Jin Chen, Jingchang Chen, Jingyang Yuan, Junjie Qiu, Junlong Li, Junxiao Song, Kai Dong, Kai Hu, Kaige Gao, Kang Guan, Kexin Huang, Kuai Yu, Lean Wang, Lecong Zhang, Lei Xu, Leyi Xia, Liang Zhao, Litong Wang, Liyue Zhang, Meng Li, Miaojun Wang, Mingchuan Zhang, Minghua Zhang, Minghui Tang, Mingming Li, Ning Tian, Panpan Huang, Peiyi Wang, Peng Zhang, Qiancheng Wang, Qihao Zhu, Qinyu Chen, Qiushi Du, R.J. Chen, R.L. Jin, Ruiqi Ge, Ruisong Zhang, Ruizhe Pan, Runji Wang, Runxin Xu, Ruoyu Zhang, Ruyi Chen, S.S. Li, Shanghao Lu, Shangyan Zhou, Shanhuang Chen, Shaoqing Wu, Shengfeng Ye, Shengfeng Ye, Shirong Ma, Shiyu Wang, Shuang Zhou, Shuiping Yu, Shunfeng Zhou, Shuting Pan, T. Wang, Tao Yun, Tian Pei, Tianyu Sun, W.L. Xiao, Wangding Zeng et al. (100 additional authors not shown)"
"Hangzhou DeepSeek Artificial Intelligence Basic Technology Research Co., Ltd.",2024-01-25,DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence,"Daya Guo, Qihao Zhu, Dejian Yang, Zhenda Xie, Kai Dong, Wentao Zhang, Guanting Chen, Xiao Bi, Y. Wu, Y.K. Li, Fuli Luo, Yingfei Xiong, Wenfeng Liang"
"Hangzhou DeepSeek Artificial Intelligence Basic Technology Research Co., Ltd.",2024-06-17,DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence,"DeepSeek-AI, Qihao Zhu, Daya Guo, Zhihong Shao, Dejian Yang, Peiyi Wang, Runxin Xu, Y. Wu, Yukun Li, Huazuo Gao, Shirong Ma, Wangding Zeng, Xiao Bi, Zihui Gu, Hanwei Xu, Damai Dai, Kai Dong, Liyue Zhang, Yishi Piao, Zhibin Gou, Zhenda Xie, Zhewen Hao, Bingxuan Wang, Junxiao Song, Deli Chen, Xin Xie, Kang Guan, Yuxiang You, Aixin Liu, Qiushi Du, Wenjun Gao, Xuan Lu, Qinyu Chen, Yaohui Wang, Chengqi Deng, Jiashi Li, Chenggang Zhao, Chong Ruan, Fuli Luo, Wenfeng Liang"
"Hangzhou DeepSeek Artificial Intelligence Basic Technology Research Co., Ltd.",2024-02-05,DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models,"Zhihong Shao, Peiyi Wang, Qihao Zhu, Runxin Xu, Junxiao Song, Xiao Bi, Haowei Zhang, Mingchuan Zhang, Y.K. Li, Y. Wu, Daya Guo"
"Hangzhou DeepSeek Artificial Intelligence Basic Technology Research Co., Ltd.",2024-05-23,DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data,"Huajian Xin, Daya Guo, Zhihong Shao, Zhizhou Ren, Qihao Zhu, Bo Liu, Chong Ruan, Wenda Li, Xiaodan Liang"
"Hangzhou DeepSeek Artificial Intelligence Basic Technology Research Co., Ltd.",2024-03-08,DeepSeek VL: Towards Real-World Vision-Language Understanding,"Haoyu Lu, Wen Liu, Bo Zhang, Bingxuan Wang, Kai Dong, Bo Liu, Jingxiang Sun, Tongzheng Ren, Zhuoshu Li, Hao Yang, Yaofeng Sun, Chengqi Deng, Hanwei Xu, Zhenda Xie, Chong Ruan"
"Hangzhou DeepSeek Artificial Intelligence Basic Technology Research Co., Ltd.",2024-12-13,DeepSeek VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding,"Zhiyu Wu, Xiaokang Chen, Zizheng Pan, Xingchao Liu, Wen Liu, Damai Dai, Huazuo Gao, Yiyang Ma, Chengyue Wu, Bingxuan Wang, Zhenda Xie, Yu Wu, Kai Hu, Jiawei Wang, Yaofeng Sun, Yukun Li, Yishi Piao, Kang Guan, Aixin Liu, Xin Xie, Yuxiang You, Kai Dong, Xingkai Yu, Haowei Zhang, Liang Zhao, Yisong Wang, Chong Ruan"
"Hangzhou DeepSeek Artificial Intelligence Basic Technology Research Co., Ltd.",2025-01-22,DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning,"DeepSeek-AI, Daya Guo, Dejian Yang, Haowei Zhang, Junxiao Song, Ruoyu Zhang, Runxin Xu, Qihao Zhu, Shirong Ma, Peiyi Wang, Xiao Bi, Xiaokang Zhang, Xingkai Yu, Yu Wu, Z.F. Wu, Zhibin Gou, Zhihong Shao, Zhuoshu Li, Ziyi Gao, Aixin Liu, Bing Xue, Bingxuan Wang, Bochao Wu, Bei Feng, Chengda Lu, Chenggang Zhao, Chengqi Deng, Chenyu Zhang, Chong Ruan, Damai Dai, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fucong Dai, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Han Bao, Hanwei Xu, Haocheng Wang, Honghui Ding, Huajian Xin, Huazuo Gao, Hui Qu, Hui Li, Jianzhong Guo, Jiashi Li, Jiawei Wang, Jingchang Chen, Jingyang Yuan, Junjie Qiu, Junlong Li, J.L. Cai, Jiaqi Ni, Jian Liang, Jin Chen, Kai Dong, Kai Hu, Kaige Gao, Kang Guan, Kexin Huang, Kuai Yu, Lean Wang, Lecong Zhang, Liang Zhao, Litong Wang, Liyue Zhang, Lei Xu, Leyi Xia, Mingchuan Zhang, Minghua Zhang, Minghui Tang, Meng Li, Miaojun Wang, Mingming Li, Ning Tian, Panpan Huang, Peng Zhang, Qiancheng Wang, Qinyu Chen, Qiushi Du, Ruiqi Ge, Ruisong Zhang, Ruizhe Pan, Runji Wang, R.J. Chen, R.L. Jin, Ruyi Chen, Shanghao Lu, Shangyan Zhou, Shanhuang Chen, Shengfeng Ye, Shiyu Wang, Shuiping Yu, Shunfeng Zhou, Shuting Pan, S.S. Li et al. (100 additional authors not shown)"
"Hangzhou DeepSeek Artificial Intelligence Basic Technology Research Co., Ltd.",2025-04-03,Inference-Time Scaling for Generalist Reward Modeling,"Zijun Liu, Peiyi Wang, Runxin Xu, Shirong Ma, Chong Ruan, Peng Li, Yang Liu, Yu Wu"