Path Planning for Multi-Arm Manipulators using Soft Actor-Critic with Hindsight Experience Replay