Vision Language Models Can Parse Floor Plan Maps


David DeFazio*, Hrudayangam Mehta*, Meng Wang*, Ping Yang, Jeremy Blackburn, Shiqi Zhang

*Equal Contribution


Binghamton University


Paper    Code    Video