Vision Language Models Can Parse Floor Plan Maps


David DeFazio*, Hrudayangam Mehta*, Jeremy Blackburn, Shiqi Zhang

*Equal Contribution


Binghamton University

