Selected Papers and Preprints: (Author name is listed in alphabetical order)
Position: LLM Serving Needs Mathematical Optimization and Algorithmic Foundations, Not Just Heuristics,
Zijie Zhou [Paper]
ICML 2026
A Queueing-Theoretic Framework for Stability Analysis of LLM Inference with KV Cache Memory Constraints,
Chengyi Nie, Nian Si, Zijie Zhou
ICML 2026
A Universal Load Balancing Principle and Its Application to Large Language Model Serving,
Zixi Chen, Tianci Bu, Chendong Song, Xin Lu, Yinyu Ye, Zijie Zhou [Paper]
Theoretically Optimal Attention/FFN Ratios in Disaggregated LLM Serving,
Chendong Song, Meixuan Wang, Hang Zhou, Hong Liang, Yuan Lyu, Zixi Chen, Yuwei Fan, Zijie Zhou [Paper]
LLM Serving Optimization with Variable Prefill and Decode Lengths,
Meixuan Wang, Yinyu Ye, Zijie Zhou [Paper]
INFORMS Undergraduate Operations Research Prize Competition Finalist
Adaptively Robust LLM Inference Optimization under Prediction Uncertainty,
Zixi Chen, Yinyu Ye, Zijie Zhou [Paper]
Online Scheduling for LLM Inference with KV Cache Constraints,
Patrick Jaillet, Jiashuo Jiang, Konstantina Mellou, Marco Molinaro, Chara Podimata, Zijie Zhou [Paper]
Pigeonhole design: Balancing sequential experiments from an online matching perspective,
Jinglong Zhao, Zijie Zhou [Paper]
Management Science, 2025
Online Resource Allocation with Convex-Set Advice,
Negin Golrezaei, Patrick Jaillet, Zijie Zhou [Paper]
Operations Research (accepted)
When Should you Offer an Upgrade: Online Upgrading Mechanisms for Resource Allocation,
Patrick Jaillet, Chara Podimata, Andrew Vakhutinsky, Zijie Zhou [Paper]
WINE 2024, INFORMS Service Science Best Student Paper Competition Finalist
Near-Optimal Primal-Dual Algorithms for Quantity-Based Network Revenue Management,
Rui Sun, Xinshang Wang, Zijie Zhou [Paper]
Major Revision in Mathematics of Operations Research
Grace Period is All You Need: Realize Individual Fairness in Revenue Management,
Patrick Jaillet, Chara Podimata, Zijie Zhou [Paper]
WINE 2024