Scaffolding Dexterous Manipulation

with Vision-Language Models