Enhancing Vision-Language Navigation with Multimodal Event Knowledge from Real-World Indoor Tour Videos

Enhancing Vision-Language Navigation with Multimodal Event Knowledge from Real-World Indoor Tour Videos

Enhancing Vision-Language Navigation with Multimodal Event Knowledge

from Real-World Indoor Tour Videos

Anonymous

YouTube-Event-KG

Open-source data can be downloaded in link.

Spatio-EventNav

Spatio-EventNav

Code of fusion strategy is coming soon...

Page updated

Google Sites

Report abuse