RL from Cross-domain Videos with Video Prediction Models