Bayesian attention modules.Fan, X., Zhang, S., Chen, B., & Zhou, M. (2020). Advances in Neural Information Processing Systems, 33, 16362-16376.Â