Attention Is All You Need
Railroad is not a Train: Saliency as Pseudo-pixel Supervision for Weakly Supervised Semantic Segmentation
Visualizing and Understanding Convolutional Networks
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
ImageNet Classification with Deep Convolutional Neural Networks
RefineNet: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation
Deep Face Recognition: A Survey
Path Aggregation Network for Instance Segmentation
A Gift from Knowledge Distillation: Fast Optimization, Network Minimization and Transfer Learning
Transformer: Attention is all you need
PolyTransform: Deep Polygon Transformer for Instance Segmentation
SSD:Single Shot MultiBox Detector