Safe Multi-Agent Learning with Soft Constrained Policy Optimization in Real Robot Control