Chao Jin

I am currently a third-year Ph.D. candidate at the Computer Systems Research Group in the School of Computer Science at Peking University, advised by Prof. Xin Jin.

My research interests include machine learning systems, distributed systems, and cloud computing, with a recent focus on the intersection of large language models (LLMs), generative AI, and innovative system design.

I received my B.S. degree in computer science from the School of Electronics Engineering and Computer Science (EECS), Peking University in 2023.

Email: chaojin (at) pku (dot) edu (dot) cn

Selected Publications (view all »)

MegaScale-MoE: Large-Scale Communication-Efficient Training of Mixture-of-Experts Models in Production
Chao Jin*, Ziheng Jiang*, Zhihao Bai, Zheng Zhong, Juncai Liu, Xiang Li, Ningxin Zheng, Xi Wang, Cong Xie, Qi Huang, Wen Heng, Yiyuan Ma, Wenlei Bao, Size Zheng, Yanghua Peng, Haibin Lin, Xuanzhe Liu, Xin Jin, Xin Liu
(* Equal Contribution)
European Conference on Computer Systems (EuroSys 2026), Edinburgh, UK, April 27-30, 2026. (To appear)
[PDF]

SpecRL: Efficient RL for LLMs with Dynamic and Online Speculative Decoding
Chao Jin, Yinmin Zhong, Zili Zhang, Yimin Jiang, Yibo Zhu
The 1st Frontier AI Systems Workshop (FAISys 2025), Hong Kong, November 14-15, 2025.
[PDF] [Slides]

MegaScale-Infer: Serving Mixture-of-Experts at Scale with Disaggregated Expert Parallelism
Ruidong Zhu*, Ziheng Jiang*, Chao Jin*, Peng Wu, Cesar A. Stuardo, Dongyang Wang, Xinlei Zhang, Huaping Zhou, Haoran Wei, Yang Cheng, Jianzhe Xiao, Xinyi Zhang, Lingjun Liu, Haibin Lin, Li-Wen Chang, Jianxi Ye, Xiao Yu, Xuanzhe Liu, Xin Jin, Xin Liu
(* Equal Contribution)
ACM Special Interest Group on Data Communication (SIGCOMM 2025), Coimbra, Portugal, September 8-11, 2025.
[PDF] [Slides]

RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation
Chao Jin, Zili Zhang, Xuanlin Jiang, Fangyue Liu, Xin Liu, Xuanzhe Liu, Xin Jin
ACM Transactions on Computer Systems (TOCS 2025), 2025.
[PDF]

Ditto: Efficient Serverless Analytics with Elastic Parallelism
Chao Jin, Zili Zhang, Xingyu Xiang, Songyun Zou, Gang Huang, Xuanzhe Liu, Xin Jin
ACM Special Interest Group on Data Communication (SIGCOMM 2023), New York City, September 10-14, 2023.
[PDF] [Slides]

Teaching

  • Teaching Assistant, Operating Systems (Honor Track) at PKU, 2024 Spring.
  • Teaching Assistant, Introduction to Computer System at PKU, 2021 Fall.

Internship

  • Stepfun [2025.4 - 2025.8], Advised by Ranchen Ming, Research Intern of RL Training Infra.
  • ByteDance Seed [2023.7 - 2025.3], Advised by Ziheng Jiang and Haibin Lin, Research Intern of Seed-Foundation-MLSys.
  • Alibaba Cloud [2022.3 - 2022.8], Advised by Rui Miao, Research Intern of Networked Systems.

Honors and Awards

  • [2024] Merit Student (2/56)
  • [2024] Huawei Scholarship at School of Computer Science, Peking University
  • [2023] Presidential Scholarship of Peking University (highest honor for Ph.D. students at Peking University)
  • [2021] Learning Excellence Award at Peking University
  • [2020] Merit Student
  • [2020] Xiaomi Scholarship at Peking University