Large Language Models
Spectral Transformers
Optimization
MLCommons training algorithms benchmark
SAMUEL: adaptive gradient methods with local guarantees.
GGT: efficient full matrix adaptive regularization.
Extreme Tensoring: memory efficient adaptive regularization.
Control and Reinforcement Learning
AI For Better Medical Ventilators
Non-stochastic Control Theory
Differentiable Reinforcement Learning