Zero-Shot Building Attribute Extraction from Large-Scale 

Vision & Language Models