Multi-task Batch Reinforcement Learning with Metric Learning