Toward Scalable Multi-Robot Control: Fast Policy Learning in Distributed MPC