Publications

Note: An overview of my publications is also available in Google Scholar and DBLP.

Publications are listed in reverse chronological order. Click on the name of a publication to view the full paper. These papers are made available for personal use only, subject to the authors' and publishers' copyright.

2024

VideoGrounding-DINO: Towards Open-Vocabulary Spatio-Temporal Video Grounding.

Syed Talal Wasim, Muzammal Naseer, Salman Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), USA, 2024. (Code)

Rethinking Transformers Pre-training for Multi-Spectral Satellite Imagery.

Mubashir Noman, Muzammal Naseer, Hisham Cholakkal, Rao Muhammad Anwer, Salman Khan, Fahad Shahbaz Khan

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), USA, 2024. (Code)

SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation.

Bin Xie, Jiale Cao, Jin Xie, Fahad Shahbaz Khan, Yanwei Pang

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), USA, 2024. (Code)

Self-Distilled Masked Auto-Encoders are Efficient Video Anomaly Detectors.

Nicolae-Catalin Ristea, Florinel-Alin Croitoru, Radu Tudor Ionescu, Marius Popescu, Fahad Shahbaz Khan, Mubarak Shah

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), USA, 2024. (Code)

GLaMM: Pixel Grounding Large Multimodal Model.

Hanoona Rasheed, Muhammad Maaz, Sahal Shaji, Abdelrahman Shaker, Salman Khan, Hisham Cholakkal, Rao M. Anwer, Eric Xing, Ming-Hsuan Yang, Fahad Shahbaz Khan

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), USA, 2024. (Code)

Visual-Augmented Dynamic Semantic Prototype for Generative Zero-Shot Learning.

Wenjin Hou, Shiming Chen, Shuhuang Chen, Ziming Hong, Yan Wang, Xuetao Feng, Salman Khan, Fahad Shahbaz Khan, Xinge You

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), USA, 2024. 

VSCode: General Visual Salient and Camouflaged Object Detection with 2D Prompt Learning.

Ziyang Luo, Nian Liu, Wangbo Zhao, Xuguang Yang, Dingwen Zhang, Deng-Ping Fan, Fahad Shahbaz Khan, Junwei Han

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), USA, 2024. (Code)

Composed Video Retrieval via Enriched Context and Discriminative Embeddings.

Omkar Thawakar, Muzammal Naseer, Rao Muhammad Anwer, Salman Khan, Michael Felsberg, Mubarak Shah, Fahad Shahbaz Khan

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), USA, 2024. (Code)

GeoChat: Grounded Large Vision-Language Model for Remote Sensing.

Kartik Kuckreja, Muhammad Sohail Danish, Muzammal Naseer, Abhijit Das, Salman Khan, Fahad Shahbaz Khan

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), USA, 2024. (Code)

Progressive Semantic-Guided Vision Transformer for Zero-Shot Learning.

Shiming Chen, Wenjin Hou, Salman Khan, Fahad Shahbaz Khan

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), USA, 2024. 

2023

Boosting Adversarial Transferability using Dynamic Cues.

Muzammal Naseer, Ahmad Mahmood, Salman Khan, Fahad Shahbaz Khan

International Conference on Learning Representations (ICLR), Rwanda, 2023. (Code)

Self-regulating Prompts: Foundational Model Adaptation without Forgetting.

Muhammad Uzair Khattak, Syed Talal Wasim, Muzammal Naseer, Salman Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan

IEEE International Conference on Computer Vision (ICCV), France, 2023. (Code)

SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications.

Abdelrahman Shaker, Muhammad Maaz, Hanoona Rasheed, Salman Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan

IEEE International Conference on Computer Vision (ICCV), France, 2023. (Code)

Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition.

Syed Talal Wasim, Muhammad Uzair Khattak, Muzammal Naseer, Salman Khan, Mubarak Shah, Fahad Shahbaz Khan

IEEE International Conference on Computer Vision (ICCV), France, 2023. (Code)

Multi-grained Temporal Prototype Learning for Few-shot Video Object Segmentation.

Nian Liu, Kepan Nan, Wangbo Zhao, Yuanwei Liu, Xiwen Yao, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, Junwei Han, Fahad Shahbaz Khan

IEEE International Conference on Computer Vision (ICCV), France, 2023. (Code)

3D Instance Segmentation via Enhanced Spatial and Semantic Supervision.

Salwa Al Khatib, Mohamed El Amine Boudjoghra, Jean Lahoud, Fahad Shahbaz Khan

IEEE International Conference on Computer Vision (ICCV), France, 2023. (Code)

Generative Multiplane Neural Radiance for 3D-Aware Image Generation.

Amandeep Kumar, Ankan Kumar Bhunia, Sanath Narayan, Hisham Cholakkal, Rao Muhammad Anwer, Salman Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan

IEEE International Conference on Computer Vision (ICCV), France, 2023. (Code)

PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category Discovery.

Sheng Zhang, Salman Khan, Zhiqiang Shen, Muzammal Naseer, Guangyi Chen, Fahad Shahbaz Khan

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Canada, 2023. (Code)

3D-Aware Multi-Class Image-to-Image Translation With NeRFs.

Senmao Li, Joost van de Weijer, Yaxing Wang, Fahad Shahbaz Khan, Meiqin Liu, Jian Yang

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Canada, 2023. (Code)

Burstormer: Burst Image Restoration and Enhancement Transformer.

Akshay Dudhane, Syed Waqas Zamir, Salman Khan, Fahad Shahbaz Khan, Ming-Hsuan Yang

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Canada, 2023. (Code)

Discriminative Co-Saliency and Background Mining Transformer for Co-Salient Object Detection.

Long Li, Junwei Han, Ni Zhang, Nian Liu, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Canada, 2023. (Code)

Person Image Synthesis via Denoising Diffusion Model.

Ankan Kumar Bhunia, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, Jorma Laaksonen, Mubarak Shah, Fahad Shahbaz Khan

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Canada, 2023. (Code)

Bridging Precision and Confidence: A Train-Time Loss for Calibrating Object Detection.

Muhammad Akhtar Munir, Muhammad Haris Khan, Salman Khan, Fahad Shahbaz Khan

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Canada, 2023. (Code)

MaPLe: Multi-Modal Prompt Learning.

Muhammad Uzair Khattak, Hanoona Rasheed, Muhammad Maaz, Salman Khan, Fahad Shahbaz Khan

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Canada, 2023. (Code)

Vita-CLIP: Video and Text Adaptive CLIP via Multimodal Prompting.

Syed Talal Wasim, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan, Mubarak Shah

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Canada, 2023. (Code)

Fine-Tuned CLIP Models Are Efficient Video Learners.

Hanoona Rasheed, Muhammad Uzair Khattak, Muhammad Maaz, Salman Khan, Fahad Shahbaz Khan

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Canada, 2023. (Code)

Gated Multi-Resolution Transfer Network for Burst Restoration and Enhancement.

Nancy Mehta, Akshay Dudhane, Subrahmanyam Murala, Syed Waqas Zamir, Salman Khan, Fahad Shahbaz Khan

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Canada, 2023. (Code)

2022

Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection.

Hanoona Abdul Rasheed, Muhammad Maaz, Muhammad Uzair Khattak, Salman Khan, Fahad Shahbaz Khan

Conference on Neural Information Processing Systems (NeurIPS), USA, 2022. (Code)

An Investigation into Whitening Loss for Self-supervised Learning.

Xi Weng, Lei Huang, Lei Zhao, Rao Muhammad Anwer, Salman Khan, Fahad Shahbaz Khan

Conference on Neural Information Processing Systems (NeurIPS), USA, 2022. (Code)

On Improving Adversarial Transferability of Vision Transformers.

Muzammal Naseer, Kanchana Ranasinghe, Salman Khan, Fahad Shahbaz Khan and Fatih Porikli

International Conference on Learning Representations (ICLR), Virtual, 2022. (Spotlight) (Code)

PSTR: End-to-End One-Step Person Search With Transformers.

Jiale Cao, Yanwei Pang, Rao Muhammad Anwer, Hisham Cholakkal, Jin Xie, Mubarak Shah and Fahad Shahbaz Khan

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), USA, 2022. (Code)

Self-Supervised Predictive Convolutional Attentive Block for Anomaly Detection.

Nicolae-Catalin Ristea, Neelu Madan, Radu Tudor Ionescu, Kamal Nasrollahi, Fahad Shahbaz Khan, Thomas Moeslund and Mubarak Shah

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), USA, 2022. (Oral) (Code)

Self-supervised Video Transformer.

Kanchana Ranasinghe, Muzammal Naseer, Salman H. Khan, Fahad Shahbaz Khan and Michael Ryoo

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), USA, 2022. (Oral) (Code)

Burst Image Restoration and Enhancement.

Akshay Dudhane, Syed Waqas Zamir, Salman H. Khan, Fahad Shahbaz Khan and Ming-Hsuan Yang

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), USA, 2022. (Oral) (Code)

Restormer: Efficient Transformer for High-Resolution Image Restoration.

Syed Waqas Zamir, Aditya Arora, Salman H. Khan, Munawar Hayat, Fahad Shahbaz Khan and Ming-Hsuan Yang

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), USA, 2022. (Oral) (Code)

UBnormal: New Benchmark for Supervised Open-Set Video Anomaly Detection.

Andra Acsintoae, Andrei Florescu, Mariana-Iuliana Georgescu, Tudor Mare, Paul Sumedrea, Radu Tudor Ionescu, Fahad Shahbaz Khan and Mubarak Shah

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), USA, 2022.  (Code + Dataset)

Energy-based Latent Aligner for Incremental Learning.

K. J. Joseph, Salman Khan, Fahad Shahbaz Khan, Rao Muhammad Anwer and Vineeth Balasubramanian

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), USA, 2022. (Code)

Spatio-temporal Relation Modeling for Few-shot Action Recognition.

Anirudh Thatipelli, Sanath Narayan, Salman Khan, Rao Muhammad Anwer, Fahad Shahbaz Khan and Bernard Ghanem

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), USA, 2022. (Code)

OW-DETR: Open-world Detection Transformer.

Akshita Gupta, Sanath Narayan, K. J. Joseph, Salman H. Khan, Fahad Shahbaz Khan and Mubarak Shah

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), USA, 2022. (Code)


Dense Gaussian Processes for Few-Shot Segmentation.

Joakim Johnander, Johan Edstedt, Michael Felsberg, Fahad Shahbaz Khan and Martin Danelljan

European Conference on Computer Vision (ECCV), Israel, 2022. (Code)


DoodleFormer: Creative Sketch Drawing with Transformers.

Ankan Kumar Bhunia, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan, Jorma Laaksonen and Michael Felsberg

European Conference on Computer Vision (ECCV), Israel, 2022. (Code)


OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning.

Mamshad Nayeem Rizve, Navid Kardan, Salman Khan, Fahad Shahbaz Khan and Mubarak Shah

European Conference on Computer Vision (ECCV), Israel, 2022. (Code)


Class-Agnostic Object Detection with Multi-modal Transformer.

Muhammad Maaz, Hanoona Abdul Rasheed, Salman Khan, Fahad Shahbaz Khan, Rao Muhammad Anwer and Ming-Hsuan Yang

European Conference on Computer Vision (ECCV), Israel, 2022. (Code)


Video Instance Segmentation via Multi-Scale Spatio-Temporal Split Attention Transformer.

Omkar Thawakar, Sanath Narayan, Jiale Cao, Hisham Cholakkal, Rao Muhammad Anwer, Muhammad Haris Khan, Salman Khan, Michael Felsberg and Fahad Shahbaz Khan

European Conference on Computer Vision (ECCV), Israel, 2022. (Code)

2021

From Handcrafted to Deep Features for Pedestrian Detection: A Survey.

Jiale Cao, Yanwei Pang, Jin Xie, Fahad Shahbaz Khan and Ling Shao.  

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021.

Distilled Siamese Networks for Visual Tracking.

Jianbing Shen, Yuanpei Liu, Xingping Dong, Xiankai Lu, Fahad Shahbaz Khan and Steven Hoi.  

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021.

A Background-Agnostic Framework with Adversarial Training for Abnormal Event Detection in Video.

Mariana-Iuliana Georgescu, Radu Tudor Ionescu, Fahad Shahbaz Khan, Marius Popescu and Mubarak Shah.  

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021.

Incremental Object Detection via Meta-Learning.

K J Joseph, Jathushan Rajasegaran, Salman Khan, Fahad Shahbaz Khan and Vineeth Balasubramanian.  

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021.

Transformers in Vision: A Survey.

Salman Khan, Muzammal Naseer, Munawar Hayat, Syed Waqas Zamir, Fahad Shahbaz Khan and Mubarak Shah.  

ACM Computing Surveys (ACM CSUR), 2021.

Intriguing Properties of Vision Transformers.

Muzammal Naseer, Kanchana Ranasinghe, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan and Ming-Hsuan Yang

Conference on Neural Information Processing Systems (NeurIPS), Virtual, 2021. (Spotlight) (Code)

Towards Open World Object Detection.

K. J. Joseph, Salman H. Khan, Fahad Shahbaz Khan and Vineeth Balasubramanian

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Virtual, 2021. (Oral) (Code)

Exploring Complementary Strengths of Invariant and Equivariant Representations for Few-Shot Learning.

Mamshad Nayeem Rizve, Salman H. Khan, Fahad Shahbaz Khan and Mubarak Shah

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Virtual, 2021. 

Anomaly Detection in Video via Self-Supervised and Multi-Task Learning.

Mariana-Iuliana Georgescu, Antonio Barbalau, Radu Tudor Ionescu, Fahad Shahbaz Khan, Marius Popescu and Mubarak Shah

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Virtual, 2021. 

Multi-Stage Progressive Image Restoration.

Syed Waqas Zamir, Aditya Arora, Salman H. Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang and Ling Shao

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Virtual, 2021. (Code)

Learning To Fuse Asymmetric Feature Maps in Siamese Trackers.

Wencheng Han, Xingping Dong, Fahad Shahbaz Khan, Ling Shao and Jianbing Shen

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Virtual, 2021. (Code)

Handwriting Transformers.

Ankan Kumar Bhunia, Salman H. Khan, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan and Mubarak Shah

IEEE International Conference on Computer Vision (ICCV), Virtual, 2021.

D2-Net: Weakly-Supervised Action Localization via Discriminative Embeddings and Denoised Activations.

Sanath Narayan, Hisham Cholakkal, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang and Ling Shao

IEEE International Conference on Computer Vision (ICCV), Virtual, 2021. (Code)

Discriminative Region-based Multi-Label Zero-Shot Learning.

Sanath Narayan, Akshita Gupta, Salman H. Khan, Fahad Shahbaz Khan, Ling Shao and Mubarak Shah

IEEE International Conference on Computer Vision (ICCV), Virtual, 2021. (Code)

Orthogonal Projection Loss.

Kanchana Ranasinghe, Muzammal Naseer, Munawar Hayat, Salman H. Khan and Fahad Shahbaz Khan

IEEE International Conference on Computer Vision (ICCV), Virtual, 2021. (Code)

On Generating Transferable Targeted Perturbations.

Muzammal Naseer, Salman H. Khan, Munawar Hayat, Fahad Shahbaz Khan and Fatih Porikli

IEEE International Conference on Computer Vision (ICCV), Virtual, 2021. (Code)

2020

Towards Partial Supervision for Generic Object Counting in Natural Scenes.

Hisham Cholakkal, Guolei Sun, Salman Khan, Fahad Shahbaz Khan, Ling Shao and Luc Van Gool.  

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020.

Confidence Propagation through CNNs for Guided Sparse Depth Regression.

Abdelrahman Eldesokey, Michael Felsberg and Fahad Shahbaz Khan.  

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), vol 42(10), 2423-2436, 2020. (Code)

SipMask: Spatial Information Preservation for Fast Image and Video Instance Segmentation.

Jiale Cao, Rao Muhammad Anwer, Hisham Cholakkal, Fahad Shahbaz Khan, Yanwei Pang and Ling Shao.  

European Conference on Computer Vision (ECCV), Glasgow, Scotland, 2020.  (Code)

Fixing Localization Errors to Improve Image Classification.

Guolei Sun, Salman Khan, Wen Li, Hisham Cholakkal, Fahad Shahbaz Khan and Luc Van Gool.  

European Conference on Computer Vision (ECCV), Glasgow, Scotland, 2020.  

Latent Embedding Feedback and Discriminative Features for Zero-Shot Classification.

Sanath Narayan, Akshita Gupta, Fahad Shahbaz Khan, Cees Snoek, and Ling Shao.  

European Conference on Computer Vision (ECCV), Glasgow, Scotland, 2020.  (Code)

Count- and Similarity-aware R-CNN for Pedestrian Detection.

Jin Xie, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan, Yanwei Pang, Ling Shao and Mubarak Shah.  

European Conference on Computer Vision (ECCV), Glasgow, Scotland, 2020.  (Code)

Learning Enriched Features for Real Image Restoration and Enhancement.

Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang and Ling Shao.  

European Conference on Computer Vision (ECCV), Glasgow, Scotland, 2020.  (Code)

Learning Fast and Robust Target Models for Video Object Segmentation.

Andreas Robinson, Felix Jaremo Lawin, Martin Danelljan, Fahad Shahbaz Khan and Michael Felsberg.  

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, USA, 2020. (Oral) (Code)

A Self-supervised Approach for Adversarial Robustness.

Muzammal Naseer, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan and Fatih Porikli.  

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, USA, 2020. (Oral) (Code)

CycleISP: Real Image Restoration via Improved Data Synthesis.

Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang and Ling Shao. 

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, USA, 2020. (Oral) (Code)

iTAML: An Incremental Task-Agnostic Meta-learning Approach.

Jathushan Rajasegaran, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan and Mubarak Shah.

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, USA, 2020. (Code)

MineGAN: Effective Knowledge Transfer From GANs to Target Domains With Few Images.

Yaxing Wang, Abel Gonzalez-Garcia, David Berga, Luis Herranz, Fahad Shahbaz Khan and Joost van de Weijer. 

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, USA, 2020.  (Code)

D2Det: Towards High Quality Object Detection and Instance Segmentation.

Jiale Cao, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan, Yanwei Pang and Ling Shao.

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, USA, 2020.  (Code)

Semi-Supervised Learning for Few-Shot Image-to-Image Translation.

Yaxing Wang, Salman Khan, Abel Gonzalez-Garcia, Joost van de Weijer and Fahad Shahbaz Khan. 

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, USA, 2020.  (Code)

Learning Human-Object Interaction Detection Using Interaction Points.

Tiancai Wang, Tong Yang, Martin Danelljan, Fahad Shahbaz Khan, Xiangyu Zhang and Jian Sun. 

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, USA, 2020.  (Code)

AnimalWeb: A Large-Scale Hierarchical Dataset of Annotated Animal Faces.

Muhammad Haris Khan, John McDonagh, Salman Khan, Muhammad Shahabuddin, Aditya Arora, Fahad Shahbaz Khan, Ling Shao and Georgios Tzimiropoulos.  

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, USA, 2020.  

Fine-grained Recognition: Accounting for Subtle Differences between Similar Classes.

Guolei Sun, Hisham Cholakkal, Salman H. Khan, Fahad Shahbaz Khan and Ling Shao.  

AAAI Conference on Artificial Intelligence (AAAI), New York, USA, 2020.

2019

Random Path Selection for Incremental Learning.

Jathushan Rajasegaran, Munawar Hayat, Salman Khan, Fahad Shahbaz Khan and Ling Shao. 

Neural Information Processing Systems (NeurIPS),  Vancouver, Canada, 2019.   (Code)

Cross-Domain Transferability of Adversarial Perturbations.

Muzammal Naseer, Salman Khan, Harris Khan, Fahad Shahbaz Khan and Fatih Porikli.  

Neural Information Processing Systems (NeurIPS),  Vancouver, Canada, 2019.   (Code)

3C-Net: Category Count and Center Loss for Weakly-Supervised Action Localization.

Sanath Narayan, Hisham Cholakkal, Fahad Shahbaz Khan and Ling Shao.  

IEEE International Conference on Computer Vision (ICCV), Seoul, Korea, 2019. (Code)

Deep Contextual Attention for Human-Object Interaction Detection.

Tiancai Wang, Rao Muhammad Anwer, Muhammad Haris Khan, Fahad Shahbaz Khan, Yanwei Pang, Ling Shao and Jorma Laaksonen.  

IEEE International Conference on Computer Vision (ICCV), Seoul, Korea, 2019.

Enriched Feature Guided Refinement Network for Object Detection.

Jing Nie, Rao Muhammad Anwer, Hisham Cholakkal, Fahad Shahbaz Khan, Yanwei Pang and Ling Shao.  

IEEE International Conference on Computer Vision (ICCV), Seoul, Korea, 2019. (Code)

Mask-Guided Attention Network for Occluded Pedestrian Detection.

Yanwei Pang, Jin Xie, Muhammad Haris Khan, Rao Muhammad Anwer, Fahad Shahbaz Khan and Ling Shao.  

IEEE International Conference on Computer Vision (ICCV), Seoul, Korea, 2019. (Code)

Learning the Model Update for Siamese Trackers.

Lichao Zhang, Abel Gonzalez-Garcia, Joost van de Weijer, Martin Danelljan and Fahad Shahbaz Khan.  

IEEE International Conference on Computer Vision (ICCV), Seoul, Korea, 2019. (Code)

Learning Rich Features at High-Speed for Single-Shot Object Detection.

Tiancai Wang, Rao Muhammad Anwer, Hisham Cholakkal, Fahad Shahbaz Khan, Yanwei Pang and Ling Shao.  

IEEE International Conference on Computer Vision (ICCV), Seoul, Korea, 2019. (Code)

ATOM: Accurate Tracking by Overlap Maximization.

Martin Danelljan, Goutam Bhat, Fahad Shahbaz Khan and Michael Felsberg.  

IEEE Conference on Computer Vision and Pattern Recognition (CVPR),  Long Beach, USA, 2019.  (Oral)  (Code)

A Generative Appearance Model for End-to-end Video Object Segmentation.

Joakim Johnander, Martin Danelljan, Emil Brissman, Fahad Shahbaz Khan and Michael Felsberg.

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, USA, 2019. (Oral)

Object Counting and Instance Segmentation with Image-level Supervision.

Hisham Cholakkal, Guolei Sun, Fahad Shahbaz Khan and Ling Shao. 

IEEE Conference on Computer Vision and Pattern Recognition (CVPR),  Long Beach, USA, 2019.  (Code)

Object-centric Auto-encoders and Dummy Anomalies for Abnormal Event Detection in Video.

Radu Tudor Ionescu, Fahad Shahbaz Khan, Mariana-Iuliana Georgescu and Ling Shao. 

IEEE Conference on Computer Vision and Pattern Recognition (CVPR),  Long Beach, USA, 2019.

Out-of-Distribution Detection for Generalized Zero-Shot Action Recognition.

Devraj Mandal, Sanath Narayan, Saikumar Dwivedi, Vikram Gupta, Shuaib Ahmed, Fahad Shahbaz Khan and Ling Shao. 

IEEE Conference on Computer Vision and Pattern Recognition (CVPR),  Long Beach, USA, 2019.  (Code)

Efficient Featurized Image Pyramid Network for Single Shot Detector.

Yanwei Pang, Tiancai Wang, Rao Muhammad Anwer, Fahad Shahbaz Khan and Ling Shao.  

IEEE Conference on Computer Vision and Pattern Recognition (CVPR),  Long Beach, USA, 2019. 

Synthetic Data Generation for End-to-End Thermal Infrared Tracking.

Lichao Zhang, Abel Gonzalez-Garcia, Joost van de Weijer, Martin Danelljan and Fahad Shahbaz Khan.  

IEEE Transactions on Image Processing (TIP), vol 28(4), 1837-1850, 2019.

2018

Density Adaptive Point Set Registration.

Felix Järemo Lawin, Martin Danelljan, Fahad Shahbaz Khan, Per-Erik Forssén and Michael Felsberg. 

IEEE Conference on Computer Vision and Pattern Recognition (CVPR),  Salt Lake City, USA, 2018. (Oral)  (Code)

Unveiling the Power of Deep Tracking.

Goutam Bhat, Joakim Johnander, Martin Danelljan, Fahad Shahbaz Khan and Michael Felsberg. 

European Conference on Computer Vision (ECCV),  Munich, Germany, 2018. 

Binary Patterns Encoded Convolutional Neural Networks for Texture Recognition and Remote Sensing Scene Classification.

Rao Anwer, Fahad Shahbaz Khan, Joost van de Weijer, Matthieu Molinier and Jorma Laaksonen.

ISPRS Journal of Photogrammetry and Remote Sensing (PHOTO), vol 138, 74-85, 2018.

Beyond Eleven Color Names for Image Understanding.

Lu Yu, Lichao Zhang, Joost van de Weijer, Fahad Shahbaz Khan, Yongmei Cheng and C. Alejandro Párraga.  

Machine Vision and Applications (MVA), vol 29(2), 361-373, 2018. (Code + Data)

Scale coding bag of deep features for human attribute and action recognition.

Fahad Shahbaz Khan, Joost van de Weijer, Rao Muhammad Anwer, Andrew D. Bagdanov, Michael Felsberg and Jorma Laaksonen.  

Machine Vision and Applications (MVA), vol 29(1), 55-71, 2018.

Propagating Confidences through CNNs for Sparse Data Regression.

Abdelrahman Eldesokey, Michael Felsberg and Fahad Shahbaz Khan.  

British Machine Vision Conference (BMVC), Newcastle upon Tyne, United Kingdom, 2018. (Spotlight Oral) 

Combining Local and Global Models for Robust Re-detection.

Goutam Bhat, Martin Danelljan, Fahad Shahbaz Khan and Michael Felsberg.  

IEEE International Conference on Advanced Video and Signal-based Surveillance (AVSS),  Auckland, New Zealand, 2018. (Oral) 

On the Optimization of Advanced DCF-Trackers.

Joakim Johnander, Goutam Bhat, Martin Danelljan, Fahad Shahbaz Khan and Michael Felsberg.

European Conference on Computer Vision Workshops (ECCVW), Munich, Germany, 2018.

Two-Stream Part-Based Deep Representation for Human Attribute Recognition.

Rao Muhammad Anwer, Fahad Shahbaz Khan and Jorma Laaksonen.  

IEEE Conference on Biometrics (ICB),  Gold Coast, Australia, 2018. 

Deep motion and appearance cues for visual tracking.

Martin Danelljan, Goutam Bhat, Susanna Gladh, Fahad Shahbaz Khan and Michael Felsberg. 

Pattern Recognition Letters (PRL), 2018. (Special Issue: ICPR 2016 Best Papers)

2017

Discriminative Scale Space Tracking.

Martin Danelljan, Gustav Häger, Fahad Shahbaz Khan and Michael Felsberg.  

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), vol 39(8), 1561-1575, 2017. (Webpage + Code)

ECO: Efficient Convolution Operators for Tracking.

Martin Danelljan, Goutam Bhat, Fahad Shahbaz Khan and Michael Felsberg.  

IEEE Conference on Computer Vision and Pattern Recognition (CVPR),  Hawaii, USA, 2017.  (Webpage + Code)

DCCO: Towards Deformable Continuous Convolution Operators for Visual Tracking.

Joakim Johnander, Martin Danelljan, Fahad Shahbaz Khan and Michael Felsberg.  

Computer Analysis of Images and Patterns (CAIP),  Ystad, Sweden, 2017. (Oral)

Deep Projective 3D Semantic Segmentation.

Felix Järemo Lawin, Martin Danelljan, Patrik Tosteberg, Goutam Bhat, Fahad Shahbaz Khan and Michael Felsberg.

Computer Analysis of Images and Patterns (CAIP),  Ystad, Sweden, 2017. (Oral)

TEX-Nets: Binary Patterns Encoded Convolutional Neural Networks for Texture Recognition.

Rao Muhammad Anwer, Fahad Shahbaz Khan, Joost van de Weijer and Jorma Laaksonen.  

ACM Conference on Multimedia Retrieval (ICMR),  Bucharest, Romania, 2017. (Oral)

Top-Down Deep Appearance Attention for Action Recognition.

Rao Muhammad Anwer, Fahad Shahbaz Khan, Joost van de Weijer and Jorma Laaksonen. 

Scandinavian Conference on Image Analysis  (SCIA),  Tromso, Norway, 2017. (Oral)

Ellipse Detection for Visual Cyclists Analysis In the Wild.

Abdelrahman Eldesokey, Michael Felsberg, Fahad Shahbaz Khan.  

Computer Analysis of Images and Patterns (CAIP),  Ystad, Sweden, 2017.

2016

Adaptive Decontamination of the Training Set: A Unified Formulation for Discriminative Visual Tracking.

Martin Danelljan, Gustav Häger, Fahad Shahbaz Khan and Michael Felsberg.  

IEEE Conference on Computer Vision and Pattern Recognition (CVPR),  Las Vegas, USA, 2016. (Webpage + Code)

A Probabilistic Framework for Color-Based Point Set Registration.

Martin Danelljan, Giulia Meneghetti, Fahad Shahbaz Khan and Michael Felsberg.  

IEEE Conference on Computer Vision and Pattern Recognition (CVPR),  Las Vegas, USA, 2016. 

Beyond Correlation Filters: Learning Continuous Convolution Operators for Visual Tracking.

Martin Danelljan, Andreas Robinson, Fahad Shahbaz Khan and Michael Felsberg.  

European Conference on Computer Vision (ECCV),  Amsterdam, The Netherlands, 2016. (Oral)  (Webpage + Code)

Deep motion features for visual tracking.

Susanna Gladh, Martin Danelljan, Fahad Shahbaz Khan and Michael Felsberg.  

IEEE Conference on Pattern Recognition (ICPR),  Cancun, Mexico, 2016. (Oral) (Best Paper Award)

Aligning the dissimilar: A probabilistic method for feature-based point set registration.

Martin Danelljan, Giulia Meneghetti, Fahad Shahbaz Khan and Michael Felsberg. 

IEEE Conference on Pattern Recognition (ICPR),  Cancun, Mexico, 2016. (Oral) 

Combining Visual Tracking and Person Detection for Long Term Tracking on a UAV.

Gustav Häger, Goutam Bhat, Martin Danelljan, Fahad Shahbaz Khan, Michael Felsberg, Piotr Rudol and Patrick Doherty. 

Symposium on Visual Computing (ISVC),  Las Vegas, USA, 2016.

Combining Holistic and Part-based Deep Representations for Computational Painting Categorization.

Rao Muhammad Anwer, Fahad Shahbaz Khan, Joost van de Weijer and Jorma Laaksonen.  

ACM Conference on Multimedia Retrieval (ICMR), New York, USA, 2016.

2015

Recognizing Actions Through Action-Specific Person Detection.

Fahad Shahbaz Khan, Jiaolong Xu, Joost van de Weijer, Andrew D. Bagdanov, Rao Muhammad Anwer and Antonio M. López.  

IEEE Transactions on Image Processing (TIP), vol 24(11), 4422-4432, 2015.

Compact color-texture description for texture classification.

Fahad Shahbaz Khan, Rao Muhammad Anwer, Joost van de Weijer, Michael Felsberg and Jorma Laaksonen.  

Pattern Recognition Letters (PRL), vol 51, 16-22, 2015.

Learning Spatially Regularized Correlation Filters for Visual Tracking.

Martin Danelljan, Gustav Häger, Fahad Shahbaz Khan and Michael Felsberg.  

IEEE International Conference on Computer Vision (ICCV), Santiago, Chile, 2015. (Winner of OPENCV Challenge) (Webpage + Code)

Convolutional Features for Correlation Filter Based Visual Tracking.

Martin Danelljan, Gustav Häger, Fahad Shahbaz Khan and Michael Felsberg.  

IEEE International Conference on Computer Vision Workshops (ICCVW), Santiago, Chile, 2015. (Top Rank: VOT 2015 Challenge)

Coloring Channel Representations for Visual Tracking.

Martin Danelljan, Gustav Häger, Fahad Shahbaz Khan and Michael Felsberg.  

Scandinavian Conference on Image Analysis  (SCIA),  Copenhagen, Denmark, 2015. 

Deep Semantic Pyramids for Human Attributes and Action Recognition.

Fahad Shahbaz Khan, Rao Muhammad Anwer, Joost van de Weijer, Michael Felsberg and Jorma Laaksonen. 

Scandinavian Conference on Image Analysis  (SCIA),  Copenhagen, Denmark, 2015. 

An Overview of Color Name Applications in Computer Vision.

Joost van de Weijer and Fahad Shahbaz Khan.  

Computational Color Imaging Workshop  (CCIW),  Saint Etienne, France, 2015. 

2014

Accurate Scale Estimation for Robust Visual Tracking.

Martin Danelljan, Gustav Häger, Fahad Shahbaz Khan and Michael Felsberg.  

British Machine Vision Conference  (BMVC),  Nottingham, United Kingdom, 2014. (Winner of VOT 2014 Challenge) (Webpage + Code)

Adaptive Color Attributes for Real-Time Visual Tracking.

Martin Danelljan, Fahad Shahbaz Khan, Michael Felsberg and Joost van de Weijer.  

IEEE Conference on Computer Vision and Pattern Recognition (CVPR),  Columbus, USA, 2014. (Oral) (Webpage + Code)

Painting-91: a large scale database for computational painting categorization.

Fahad Shahbaz Khan, Shida Beigpour, Joost van de Weijer and Michael Felsberg. 

Machine Vision and Applications (MVA), vol 25(5), 1385-1397, 2014. (Dataset)

Semantic Pyramids for Gender and Action Recognition.

Fahad Shahbaz Khan, Joost van de Weijer, Rao Muhammad Anwer, Michael Felsberg and Carlo Gatta. 

IEEE Transactions on Image Processing (TIP), vol 23(8), 3633-3645, 2014.

A Low-Level Active Vision Framework for Collaborative Unmanned Aircraft Systems.

Martin Danelljan, Fahad Shahbaz Khan, Michael Felsberg, Karl Granström, Fredrik Heintz, Piotr Rudol, Mariusz Wzorek, Jonas Kvarnström and Patrick Doherty.  

European Conference on Computer Vision Workshops (ECCVW),  Zurich, Switzerland, 2014.

Scale Coding Bag-of-Words for Action Recognition.

Fahad Shahbaz Khan, Joost van de Weijer, Andrew D. Bagdanov and Michael Felsberg.  

IEEE Conference on Pattern Recognition (ICPR),  Stockholm, Sweden, 2014.

2013

Discriminative Color Descriptors.

Rahat Khan, Joost van de Weijer, Fahad Shahbaz Khan, Damien Muselet, Christophe Ducottet and Cécile Barat. 

IEEE Conference on Computer Vision and Pattern Recognition (CVPR),  Portland, USA, 2013.

Coloring Action Recognition in Still Images.

Fahad Shahbaz Khan, Rao Muhammad Anwer, Joost van de Weijer, Andrew D. Bagdanov, Antonio M. López and Michael Felsberg.  

International Journal of Computer Vision (IJCV), vol 105(3), 205-221, 2013.

Evaluating the Impact of Color on Texture Recognition.

Fahad Shahbaz Khan, Joost van de Weijer, Sadiq Ali and Michael Felsberg.  

Computer Analysis of Images and Patterns (CAIP),  York, United Kingdom, 2013.

2012

Modulating Shape Features by Color Attention for Object Recognition.

Fahad Shahbaz Khan, Joost van de Weijer and María Vanrell.  

International Journal of Computer Vision (IJCV), vol 98(1), 49-64, 2012. (Webpage + Code)

Color attributes for object detection.

Fahad Shahbaz Khan, Rao Muhammad Anwer, Joost van de Weijer, Andrew D. Bagdanov, María Vanrell and Antonio M. López.  

IEEE Conference on Computer Vision and Pattern Recognition (CVPR),  Rhode Island, USA, 2012. (Webpage + Code)

Discriminative compact pyramids for object and scene recognition.

Noha M. Elfiky, Fahad Shahbaz Khan, Joost van de Weijer and Jordi Gonzàlez.  

Pattern Recognition (PR), vol 45(4), 1627-1636, 2012.

2011

Portmanteau Vocabularies for Multi-Cue Image Representation.

Fahad Shahbaz Khan, Joost van de Weijer, Andrew D. Bagdanov and María Vanrell.  

Neural Information Processing Systems (NIPS), Granada, Spain, 2011. (Webpage + Code)

2010

The Impact of Color on Bag-of-Words Based Object Recognition.

David Augusto Rojas Vigo, Fahad Shahbaz Khan, Joost van de Weijer and Theo Gevers.  

IEEE Conference on Pattern Recognition (ICPR), Istanbul, Turkey, 2010.

2009

Top-down color attention for object recognition.

Fahad Shahbaz Khan, Joost van de Weijer and María Vanrell.  

IEEE International Conference on Computer Vision (ICCV), Kyoto, Japan, 2009. (Webpage + Code)