Zero-shot object detection with attention