Yaoqing Yang

Assistant Professor

15 Thayer Drive, Hanover, NH 03755-4404

Yaoqing.Yang AT dartmouth.edu

My current research focuses on diagnosing and mitigating failures in machine learning models. For example, I analyze shape and geometric features in high-dimensional spaces, such as loss landscapes, weight matrix spectral densities, and decision boundaries, to provide actionable insights for addressing common failure modes in these models. I also apply these techniques to applications such as 3D point clouds and graphs. My research draws inspiration from statistical learning and information theory.

You are welcome to email me if you want to work with me. Please apply to our PhD program using the link below.

PhD in Dartmouth CS

More information about me.

Postdoc, RISE Lab, EECS, UC Berkeley.

PhD, ECE, CMU.

BS, EE, Tsinghua.

Google Scholar | CV | LinkedIn

News

[Aug 2025] Accepted the invitation to serve as an Area Chair at ICLR 2026.

[July 2025] Our paper "From spikes to heavy tails" is accepted by Transactions on Machine Learning Research.

[June 2025] I am honored to receive a grant to study HT-SR theory for the quantification and evaluation of AI models.

[May 2025] One paper is accepted by the findings of ACL 2025.

[May 2025] We have two papers accepted by ICML 2025. See you in Vancouver.

[April 2025] Accepted the invitation to serve as a Session Chair at ICLR 2025.

[April 2025] I gave a talk at CMU CyLab.

[Feb 2025] I gave a talk at Lawrence Berkeley National Laboratory. It was nice to go back and visit New Dumplings again (a low-profile Michelin one-star restaurant on San Pablo Avenue).

[Feb 2025] Accepted the invitation to serve as an Area Chair at NeurIPS 2025.

[Jan 2025] Our paper "Mitigating memorization in language models" is accepted by ICLR 2025 as a spotlight.

[Dec 2024] We will organize a workshop on AI for Science at ICLR 2025. Stay tuned!

[Nov 2024] I gave a talk at Google Research and received helpful feedback to improve our work.

[Nov 2024] I am honored to receive the Burke Research Initiation Award from Dartmouth.

[Sep 2024] We have two papers accepted by NeurIPS 2024.

[Aug 2024] Accepted the invitation to serve as an Area Chair at ICLR 2025.

[Aug 2024] I am honored to receive a grant from DOE to study scientific foundation models.

[Aug 2024] We uploaded a video to introduce our new ICML paper on model diagnosis.

[July 2024] I am honored to receive a grant from DARPA.

[July 2024] Two new papers are online. The first paper analyzes the heavy-tailed weight matrix spectrum from the feature learning perspective, and the second paper introduces a new ensemble learning method called SharpBalance.

[June 2024] Accepted the invitation to serve as an Area Chair at NeurIPS 2024.

[May 2024] Two papers accepted by ICML 2024. Stay tuned!

[Jan 2024] Our paper "Teach LLMs to phish: stealing private information from language models" is accepted by ICLR 2024.

[Sep 2023] Our paper on "Temperature balancing" has been accepted by NeurIPS 2023 as a spotlight.

[Sep 2023] Our paper "When are ensembles really effective" is accepted by NeurIPS 2023.

Selected publications

From spikes to heavy tails: unveiling the spectral evolution of neural networks

Vignesh Kothapalli, Tianyu Pang, Shenyang Deng, Zongmin Liu, Yaoqing Yang

Transactions on Machine Learning Research 2025

Summary: This paper uncovers the mechanism behind the emergence of heavy-tailed empirical spectral densities (ESDs). We show that heavy-tailed ESDs arise from the interaction between an ESD spike, caused by feature learning, and the bulk, resulting from iid random weight initialization. This theory reveals several surprising facts: the emergence of a heavy-tailed spectrum (1) does not require SGD noise during training, (2) does not need the model to be overparameterized or sufficiently interpolated, and (3) does not require more than one spike in the ESD.

Yaoqing Yang

News

Selected publications

Talks and seminars