Full Publications

2025

Towards Swift Serverless LLM Cold Starts with ParaServe
Chiheng Lou, Sheng Qi, Chao Jin, Dapeng Nie, Haoran Yang, Xuanzhe Liu, Xin Jin
In Preprint.
[PDF]

FaaSPR: Latency-oriented Placement and Routing Optimization for Serverless Workflow Processing
Yunshan Jia, Chao Jin, Qing Li, Xuanzhe Liu, Xin Jin
IEEE Transactions on Networking (TON), 2025.
[PDF]

2024

RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation
Chao Jin, Zili Zhang, Xuanlin Jiang, Fangyue Liu, Xin Liu, Xuanzhe Liu, Xin Jin
In Preprint.
[PDF]

Pyxis: Scheduling Mixed Tasks in Disaggregated Datacenters
Sheng Qi, Chao Jin, Mosharaf Chowdhury, Zhenming Liu, Gang Huang, Xuanzhe Liu, Xin Jin
IEEE Transactions on Parallel and Distributed Systems (TPDS), 2024.
[PDF]

Jolteon: Unleashing the Promise of Serverless for Serverless Workflows
Zili Zhang, Chao Jin, Xin Jin
USENIX Symposium on Networked Systems Design and Implementation (NSDI 2024), Santa Clara, April 16–18, 2024.
[PDF] [Slides]

2023

Ditto: Efficient Serverless Analytics with Elastic Parallelism
Chao Jin, Zili Zhang, Xingyu Xiang, Songyun Zou, Gang Huang, Xuanzhe Liu, Xin Jin
ACM Special Interest Group on Data Communication (SIGCOMM 2023), New York City, September 10–14, 2023.
[PDF] [Slides]

Fast, Approximate Vector Queries on Very Large Unstructured Datasets
Zili Zhang, Chao Jin, Linpeng Tang, Xuanzhe Liu, Xin Jin
USENIX Symposium on Networked Systems Design and Implementation (NSDI 2023), Boston, April 17–19, 2023.
[PDF] [Slides]

2022

Melon: Breaking the Memory Wall for Resource-Efficient On-Device Machine Learning
Qipeng Wang, Mengwei Xu, Chao Jin, Xinran Dong, Jinliang Yuan, Gang Huang, Xin Jin, Yunxin Liu, Xuanzhe Liu
Proceedings of the 20th ACM International Conference on Mobile Systems, Applications, and Services (MobiSys 2022), Portland, June 25–July 1, 2022.
[PDF]