Variance Reduction for Policy Gradient with

Action-Dependent Factorized Baselines


Cathy Wu, Aravind Rajeswaran, Yan Duan, Vikash Kumar,

Alexandre M Bayen, Sham Kakade, Igor Mordatch, Pieter Abbeel