Learning to Assemble Through Large-Scale Structured RL