Publications:
A full and updated list of publications can be found in my DBLP
Conference Proceedings and Journals:
Fast Inference for Augmented Large Language Models [openreview] (NeurIPS 2025)
Rana Shahout, Cong Liang, Shiji Xin, Qianru Lao, Yong Cui, Minlan Yu, Michael Mitzenmacher.
Don't Stop Me Now: Embedding Based Scheduling for LLMs (ICLR 2025) [openreview]
Rana Shahout, Eran Malach, Chunwei Liu, Weifan Jiang, Minlan Yu, Michael Mitzenmacher.
Queueing, Predictions, and Large Language Models: Challenges and Open Problems (Stochastic Systems 2025) [paper]
Michael Mitzenmacher, Rana Shahout.
Faster, Cheaper, Just as Good: Cost- and Latency-Constrained Routing for LLMs (ICLR workshop 2025) [openreview]
Javid Lakha, Minlan Yu, Rana Shahout.
Prefix and Output Length-Aware Scheduling for Efficient Online LLM Inference (ICLR workshop 2025) [openreview]
Iñaki Arango, Ayush Noori, Yepeng Huang, Rana Shahout and Minlan Yu.
Geometric Sketch: The Inflatable-Shrinkable Sketch (AINA 2025) [paper]
Dvir Biton, Roy Friedman, Rana Shahout.
Palimpzest: Optimizing AI-Powered Analytics with Declarative Query Processing (CIDR 2025)
Chunwei Liu, Matthew Russo, Michael Cafarella, Lei Cao, Peter Baile Chen, Zui Chen, Michael Franklin, Tim Kraska, Samuel Madden, Rana Shahout, Gerardo Vitagliano.
SkipPredict: When to Invest in Predictions for Scheduling (NeurIPS 2024)[openreview]
Rana Shahout, Michael Mitzenmacher.
From Logs to Causal Diagnosis of Large Systems (VLDB 2025)
Markos Markakis, Brit Youngmann, Trinity Gao, Ziyu Zhang, Rana Shahout, Peter Baile Chen, Chunwei Liu, Ibrahim Sabek, Michael Cafarella.
Learning-Based Heavy Hitters and Flow Frequency Estimation in Streams (IEEE ICNP 2024) [arXiv]
Rana Shahout, Michael Mitzenmacher.
Learning-Augmented Frequency Estimation in Sliding Windows (IEEE ICNP workshop 2024) [arXiv]
Rana Shahout, Ibrahim Sabek, Michael Mitzenmacher.
Distributed Recoverable Sketches (OPODIS 2024)
Diana Cohen, Roy Friedman, Rana Shahout.
Geometric Sketch: an Inflatable and Shrinkable Sketch (AINA 2024) [paper]
Dvir Biton, Roy Friedman, Rana Shahout.
Sawmill: From Logs to Causal Diagnosis of Large Systems (demo) (SIGMOD 2024) [ACM]
Markos Markakis, An Bo Chen, Brit Youngmann, Trinity Gao, Ziyu Zhang, Rana Shahout, Peter Baile Chen, Chunwei Liu, Ibrahim Sabek, Michael Cafarella
Press ECCS to Doubt (Your Causal Graph) (GUIDEAI@SIGMOD 2024) [ACM]
Markos Markakis, Ziyu Zhang, Rana Shahout, Trinity Gao, Chunwei Liu, Ibrahim Sabek, Michael Cafarella.
Best paper award
Sketching the Path to Efficiency: Lightweight Learned Cache Replacement (OPODIS 2023) [paper]
Rana Shahout, Roy Friedman.
Rana Shahout, Yehonatan Peisakhovsky, Sasha Stoikov, Nikhil Garg.
Together is Better: Heavy Hitters Quantile Estimation (ACM SIGMOD 2023) [ACM]
Rana Shahout, Roy Friedman, Ran Ben Basat.
Box Queries over Multidimensional Streams (Information Systems 2022)
Roy Friedman, Rana Shahout*.
* The authors are listed alphabetically
CELL: Counter Estimation for Per-flow Traffic in Streams and Sliding Windows (ICNP 2021) [IEEE]
Rana Shahout, Roy Friedman, Dolev Adas.
Box Queries over Multidimensional Streams (DEBS 2021). [ACM]
Roy Friedman, Rana Shahout*.
* The authors are listed alphabetically
EvenDB: optimizing key-value storage for spatial locality (EuroSys 2020). [ACM]
Eran Gilad, Edward Bortnikov, Anastasia Braginsky, Yonatan Gottesman, Eshcar Hillel, Idit Keidar, Nurit Moscovici, Rana Shahout.
Stream Frequency Over Interval Queries (VLDB 2019). [VLDB]
Ran Ben-Basat, Roy Friedman, Rana Shahout*.
* The authors are listed alphabetically
Frequent elements on query defined ranges (INFOCOM 2018) [IEEE]
Ran Ben Basat, Rana Shahout, Roy Friedman.
Preprints:
From Score Distributions to Balance: Plug-and-Play Mixture-of-Experts Routing [arXiv]
Rana Shahout, Colin Cai, Yilun Du, Minlan Yu, Michael Mitzenmacher.
Intra-Request Branch Orchestration for Efficient LLM Reasoning [arXiv]
Weifan Jiang, Rana Shahout, Yilun Du, Michael Mitzenmacher, Minlan Yu.
Federated Learning Clients Clustering with Adaptation to Data Drifts [arXiv]
Minghao Li, Dmitrii Avdiukhin, Rana Shahout, Nikita Ivkin, Vladimir Braverman, Minlan Yu.