Jascha links

Understanding and correcting pathologies in the training of learned optimizers

Unbiased Gradient Estimation in Unrolled Computation Graphs with Persistent Evolution Strategies

Tasks, stability, architecture, and compute: Training more effective learned optimizers, and using them to train themselves

Reverse engineering learned optimizers reveals known and novel mechanisms

Meta-Learning Update Rules for Unsupervised Representation Learning

Page updated

Google Sites

Report abuse