Neural NID Rules: Relational Out-of-Training distribution in World Models
Alternative proof for lower bound in Gaussian bandits using properties of the Renyi divergences.
Alternative proof compared to Azar et al 2012