ViTaPEs

Visuotactile Position Embeddings for Cross-Modal Alignment in Multimodal Transformers