Selected Papers and Preprints: (Author names are listed in alphabetical order)
A Universal Load Balancing Principle and Its Application to Large Language Model Serving,
Zixi Chen, Tianci Bu, Chendong Song, Xin Lu, Yinyu Ye, Zijie Zhou [Paper]
Theoretically Optimal Attention/FFN Ratios in Disaggregated LLM Serving,
Chendong Song, Meixuan Wang, Hang Zhou, Hong Liang, Yuan Lyu, Zixi Chen, Yuwei Fan, Zijie Zhou [Paper]
LLM Serving Optimization with Variable Prefill and Decode Lengths,
Meixuan Wang, Yinyu Ye, Zijie Zhou [Paper]
INFORMS Undergraduate Operations Research Prize Competition Finalist
Adaptively Robust LLM Inference Optimization under Prediction Uncertainty,
Zixi Chen, Yinyu Ye, Zijie Zhou [Paper]
Online Scheduling for LLM Inference with KV Cache Constraints,
Patrick Jaillet, Jiashuo Jiang, Konstantina Mellou, Marco Molinaro, Chara Podimata, Zijie Zhou [Paper]
Pigeonhole Design: Balancing Sequential Experiments from an Online Matching Perspective,
Jinglong Zhao, Zijie Zhou [Paper]
Management Science, 2025
When Should you Offer an Upgrade: Online Upgrading Mechanisms for Resource Allocation,
Patrick Jaillet, Chara Podimata, Andrew Vakhutinsky, Zijie Zhou [Paper]
WINE 2024, INFORMS Service Science Best Student Paper Competition Finalist
Near-Optimal Primal-Dual Algorithms for Quantity-Based Network Revenue Management,
Rui Sun, Xinshang Wang, Zijie Zhou [Paper]
Major Revision in Mathematics of Operations Research
Online Resource Allocation with Convex-Set Advice,
Negin Golrezaei, Patrick Jaillet, Zijie Zhou [Paper]
Minor Revision in Operations Research
Grace Period is All You Need: Realize Individual Fairness in Revenue Management,
Patrick Jaillet, Chara Podimata, Zijie Zhou [Paper]
WINE 2024