Yufan Li

yufan_li (at) g (dot) harvard (dot) edu

Department of Statistics, Harvard University

1 Oxford St, Cambridge, MA, 02138

CVGoogle Scholar

I'm a 5-th year  Ph.D. student in the statistics department at Harvard University. I'm fortunate to be advised by Professors Subhabrata Sen  and Pragya Sur. My PhD research focuses on (i) establishing robust theoretical foundation for high dimensional statistics particularly for data with complex global dependencies; (ii) designing ML algorithms with provable guarantees for fundamental problems in data sciences. I also interned at Google DeepMind during the summer of 2024, hosted by Ben Adlam, where I worked on transformer pretraining and scaling laws. Before my PhD, I obtained my bachelor's degree from University of Toronto and masters degree from Harvard University. 

Education

Harvard University, Department of Statistics, Cambridge, MA, Aug 2020-May 2025

Harvard University, SEAS, Cambridge, MA, , Aug. 2018 -May 2020

University of Toronto, Applied Science & Engineering, Toronto, ON, , Sep.2013 -May 2018

Internship Experience

Google DeepMind, Science of Scaling @ Path to AGI, Cambridge, MA, May-Oct 2024

High Dimensional Statistics & Probability

Spectrum-Aware Debiasing: A Modern Inference Framework with Application to Principal Component Regression, with Pragya Sur [in submission at Annals of Statistics]

Random Linear Estimation with Rotationally-Invariant Designs: Asymptotics at High Temperature, with Zhou Fan, Subhabrata Sen & Yihong Wu [published IEEE Transactions on Information Theory]

TAP Equations for Orthogonally Invariant Spin Glasses at High Temperature, with Zhou Fan & Subhabrata Sen [accepted Annales de l'Institut Henri Poincaré B: Probabilités et Statistiques]

Machine Learning and Methodologies

Understanding Optimal Feature Transfer via a Fine-Grained Bias-Variance Analysis, with Ben Adlam & Subhabrata Sen [in submission]

"Solvable" Batched Bandits: Balance Risk and Reward in Phased Release Problem, with Iavor Bojinov & Jialiang Mao  [published NeurIPS 2023]

ROTI-GCV: Generalized Cross-Validation for Right-Rotationally Invariant Data, with Kevin Luo & Pragya Sur [in submission at AISTATS]

 Conferences & Summer Schools

Consulting, Reading Group & Teaching

Statistical Consultant, Harvard Statistics Consulting Service, Dec. 2021-Present

Probability & Math. Physics Reading Group, HT Yau's group, Dec. 2021-Present

Teaching Fellow, Department of Statistics, Harvard University, 2021-Present