Positional Normalization
Neural Information Processing Systems (NeurIPS), 2019, Spotlight
Abstract: A widely deployed method for reducing the training time of deep neural networks is to normalize activations at each layer. Although various normalization schemes have been proposed, they all follow a common theme: normalize across spatial dimensions and discard the extracted statistics. In this paper, we propose a novel normalization method that noticeably departs from this convention. Our approach, which we refer to as Positional Normalization (PONO), normalizes exclusively across channels --- a naturally appealing dimension, which captures the first and second moments of features extracted at a particular image position. We argue that these moments convey structural information about the input image and the extracted features, which opens a new avenue along which a network can benefit from feature normalization: Instead of disregarding the PONO normalization constants, we propose to re-inject them into later layers to preserve or transfer structural information in generative networks.
Paper
[PDF]
Code and Applications Update
[ PONO_github ]
Bibtex
@inproceedings{li2019positional,
title={Positional Normalization},
author={Li, Boyi and Wu, Felix and Weinberger, Kilian Q and Belongie, Serge},
booktitle={Advances in Neural Information Processing Systems},
pages={1620--1632},
year={2019}
}
Video