Understanding and correcting pathologies in the training of learned optimizers
Unbiased Gradient Estimation in Unrolled Computation Graphs with Persistent Evolution Strategies
Reverse engineering learned optimizers reveals known and novel mechanisms
Meta-Learning Update Rules for Unsupervised Representation Learning