RedStone: Curating General, Code, Math, and QA Data for Large Language Models.

Yaoyao Chang, Lei Cui 0001, Li Dong 0004, Shaohan Huang, Yangyu Huang, Yupan Huang, Scarlett Li, Tengchao Lv, Shuming Ma, Qinzheng Sun, Wenhui Wang 0003, Furu Wei, Ying Xin, Mao Yang 0004, Qiufeng Yin, Xingxing Zhang 0002

07 Nov 2025CoRR 2024EveryoneCC BY-SA 4.0
Loading