Video Text Spotting
TransDETR: End-to-end Video Text Spotting with Transformer
Contrastive Learning of Semantic and Visual Representations for Text Tracking
Video Retrieval with Vision and Text Aggregation