DEF-oriCORN: efficient 3D scene understanding for robust language-directed pick-and-place