Ring-threading in Simulation
To evaluate DPIIL's scalability, a second experiment was conducted for learning a ring-threading task with a 6-DOF UR5e robot in a Robosuite environment. The goal is to grasp a ring with the random initial positions and insert it through a peg with a fixed position, regardless of the physical contact. The dimension of the state is 51D, consisting of the robot's joint angles and the ring's position, as described in Table 1. The action is 6D, specifying the end-effector translation (e.g., x, y, z-axes), rotation (e.g., y, z-axes), and gripper manipulation (e.g., open or closed).