Call for Papers and Submission Guidelines
Call for Papers
The workshop will provide a common platform to discuss recent progress, challenges, and opportunities in developing transformer-based models for various computer vision applications. To this end, we welcome original research contributions on transformer-based models in topics including, but not limited to, the following:
• Theoretical insights into transformer-based models
• Novel transformer models for spatial (image) and temporal (video) data modeling
• Efficient transformer architectures, including novel mechanisms for self-attention
• Visualizing and interpreting transformer networks
• Generative models for transformer networks
• Hybrid network designs combining the strengths of transformer models with convolutional and graph-based models
• Unsupervised, weakly supervised, and semi-supervised learning with transformer models
• Multi-modal learning combining visual data with text, speech, and knowledge graphs
• Leveraging multi-spectral data like satellite imagery and infrared images in transformer models for improved semantic understanding of visual content
• Transformer-based designs for low-level vision problems such as image super-resolution, deblurring, de-raining, and denoising
• Novel transformer-based methods for high-level vision problems such as object detection, segmentation, activity recognition, and pose estimation
• Transformer models for volumetric, mesh, and point-cloud data processing in 3D and 4D data regimes.
SUBMISSION GUIDELINES
Call for Papers: PDF
Format: All submissions should follow the Springer LNCS instructions adapted for ACCV 2022.
Page Limit: 14 pages
Submission Site: https://cmt3.research.microsoft.com/VTTAACCV2022
The workshop is organized in conjunction with the 16th Asian Conference on Computer Vision (ACCV 2022).
Vision Transformers: Theory and Applications 2022
visiontransformer.accv [ at ] gmail.com
© VTTA-2022