Yinmin Zhong
Yinmin Zhong
About Me
Publications
Light
Dark
Automatic
3
RLHFuse: Efficient RLHF Training for Large Language Models with Inter- and Intra-Stage Fusion
Reinforcement Learning from Human Feedback (RLHF) enhances the alignment between LLMs and human preference. The workflow of RLHF …
Yinmin Zhong
,
Zili Zhang
,
Bingyang Wu
,
Shengyu Liu
,
Yukun Chen
,
Changyi Wan
,
Hanpeng Hu
,
Lei Xia
,
Ranchen Ming
,
Yibo Zhu
,
Xin Jin
PDF
Cite
DistTrain: Addressing Model and Data Heterogeneity with Disaggregated Training for Multimodal Large Language Models
Multimodal large language models (LLMs) have demonstrated significant potential in a wide range of AI applications. Yet, training …
Zili Zhang
,
Yinmin Zhong
,
Ranchen Ming
,
Hanpeng Hu
,
Jianjian Sun
,
Zheng Ge
,
Yibo Zhu
,
Xin Jin
PDF
Cite
Fast Distributed Inference Serving for Large Language Models
Large language models (LLMs) power a new generation of interactive AI applications exemplified by ChatGPT. The interactive nature of …
Bingyang Wu
,
Yinmin Zhong
,
Zili Zhang
,
Gang Huang
,
Xuanzhe Liu
,
Xin Jin
PDF
Cite
Cite
×