2025
1.
Yuan Yao, Tianyu Yu, Ao Zhang, Chongyi Wang,Junbo Cui, Hongji Zhu, Tianchi Cai, Haoyu Li, Weilin Zhao, Huarong Zhou, Zhihui He, Zhensheng Zou, Haoye Zhang, Shengding Hu, Zhi Zheng, Jie Zhou, Jie Cai, Jie Zhou, Xu Han, Guoyang Zeng, Dahai Li, Zhiyuan Liu, Maosong Sun. MiniCPM-V: A GPT-4V Level Multimodal LLM on Your Phone.
Nature Communications. [Project: MiniCPM-V]
2. Tianyu Yu, Bo Ji, Shouli Wang, Shu Yao, Zefan Wang, Ganqu Cui, Lifan Yuan, Ning Ding, Yuan Yao†, Zhiyuan Liu, Maosong Sun, Tat-Seng Chua. († indicates corresponding author) RLPR: Scaling RLVR to General Domain without Verifiers. Preprint.
3. Ji Qi, Yuan Yao†, Yushi Bai, Bin Xu, Juanzi Li, Zhiyuan Liu, Tat-Seng Chua. († indicates corresponding author) Quicksviewer: An LMM for Efficient Video Understanding via Reinforced Compression of Video Cubes. Preprint
4. Wentong Chen, Junbo Cui, Jinyi Hu, Yujia Qin, Junjie Fang, Yue Zhao, Chongyi Wang, Jun Liu, Guirong Chen, Yupeng Huo, Yuan Yao†, Yankai Lin, Zhiyuan Liu, Maosong Sun. († indicates corresponding author) GUICourse: From General Vision Language Models to Versatile GUI Agents. ACL 2025.
5. Ganqu Cui, Lifan Yuan, Zefan Wang, Hanbin Wang, Wendi Li, Bingxiang He, Yuchen Fan, Tianyu Yu, Qixin Xu, Weize Chen, Jiarui Yuan, Huayu Chen, Kaiyan Zhang, Xingtai Lv, Shuo Wang, Yuan Yao, Xu Han, Hao Peng, Yu Cheng, Zhiyuan Liu, Maosong Sun, Bowen Zhou, Ning Ding. Process Reinforcement through Implicit Rewards. Preprint.
6. Tianyu Yu, Haoye Zhang, Qiming Li, Qixin Xu, Yuan Yao†, Da Chen, Xiaoman Lu, Ganqu Cui, Yunkai Dang, Taiwen He, Xiaocheng Feng, Jun Song, Bo Zheng, Zhiyuan Liu, Tat-Seng Chua, Maosong Sun. († indicates corresponding author) RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness. CVPR 2025. Highlights.
7.
Yuan Yao, Tianyu Yu, Chongyi Wang, Junbo Cui, Bokai Xu, Hongji Zhu, Tianchi Cai, Fuwei Huang, Tianran Wang, Wenshuo Ma, etc. MiniCPM-o: A GPT-4o Level MLLM for Vision, Speech, and Multimodal Live Streaming on Your Phone.
[Project: MiniCPM-o]