Unsupervised Domain Adaptation with Dynamics-Aware Rewards in Reinforcement Learning