Graph-based Reinforcement Learning meets Mixed Integer Programs:

An application to 3D robot assembly discovery