Date and Lecture | Topics | Readings
Week 1: 04/01/2025 (Tuesday)
Course overview and introduction
Two-class classification and multi-class classification
Probability theory (by Matthew Shum)
Introduction to probability by C.M. Grinstead and J.L. Snell
A few useful things to know about machine learning (Pedro Domingos)
04/03/2025 (Thursday)
Multi-class classification formulation
Pegasos, cutting plane algorithm
J. Friedman, T. Hastie, and R. Tibshirani, 1999, "Additive Logistic Regression: a Statistical View of Boosting".
K. Crammer and Y. Singer, 2001, "On the Algorithmic Implementation of Multiclass Kernel-based Vector Machines".
T. G. Dietterich and G. Bakiri, 1995. "Solving multiclass learning problems via error-correcting output codes".
Softmax function
V. Franc and S. Sonnenburg, 2008. "Optimized Cutting Plane Algorithm for Support Vector Machines".
S. Shalev-Shwartz, Y. Singer, N. Srebro, 2007. "Pegasos: Primal Estimated sub-GrAdient SOlver for SVM".
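The Pegasos paper above optimizes the SVM objective by stochastic sub-gradient descent. A minimal sketch of that update, with a made-up toy dataset and hyperparameters (an illustration, not code from the paper or the course):

```python
import numpy as np

def pegasos_train(X, y, lam=0.1, n_iters=1000, seed=0):
    """Pegasos: stochastic sub-gradient descent on
    lam/2 * ||w||^2 + (1/n) * sum_i max(0, 1 - y_i <w, x_i>)."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    w = np.zeros(d)
    for t in range(1, n_iters + 1):
        i = rng.integers(n)                  # pick one example at random
        eta = 1.0 / (lam * t)                # step size 1/(lambda * t)
        if y[i] * X[i].dot(w) < 1:           # hinge loss is active
            w = (1 - eta * lam) * w + eta * y[i] * X[i]
        else:                                # only the regularizer contributes
            w = (1 - eta * lam) * w
        norm = np.linalg.norm(w)             # optional projection step
        if norm > 1.0 / np.sqrt(lam):
            w *= (1.0 / np.sqrt(lam)) / norm
    return w

# toy usage: two separable Gaussian blobs with labels +1 / -1
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(+2, 1, (50, 2)), rng.normal(-2, 1, (50, 2))])
y = np.hstack([np.ones(50), -np.ones(50)])
w = pegasos_train(X, y)
print("training accuracy:", np.mean(np.sign(X.dot(w)) == y))
```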
Structured prediction
Softmax function and cross-entropy
G. Tsoumakas and I. Katakis, 2007. "Multi-label Classification: An Overview."
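Since the softmax function and cross-entropy appear as topics here, a small numerically stable reference implementation (illustrative only, not from the course materials):

```python
import numpy as np

def softmax(z):
    """Row-wise softmax with the max subtracted for numerical stability."""
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def cross_entropy(probs, labels):
    """Mean negative log-likelihood of the true class labels."""
    n = probs.shape[0]
    return -np.mean(np.log(probs[np.arange(n), labels] + 1e-12))

logits = np.array([[2.0, 0.5, -1.0],
                   [0.1, 0.2, 0.3]])
labels = np.array([0, 2])
p = softmax(logits)
print(p.sum(axis=1))             # each row sums to 1
print(cross_entropy(p, labels))  # scalar loss
```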
Structured prediction
I. Tsochantaridis, T. Joachims, T. Hofmann and Y. Altun, 2005. "Large Margin Methods for Structured and Interdependent Output Variables".
B. Taskar, C. Guestrin and D. Koller, 2003. "Max-Margin Markov Networks".
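The structured-SVM and max-margin Markov network readings above both rely on an efficient argmax over structured outputs; for linear-chain models that inference step is Viterbi decoding. A toy sketch with random score matrices (my own illustration, not code from either paper):

```python
import numpy as np

def viterbi(unary, pairwise):
    """Best label sequence under the sum of per-position scores unary[t, y]
    and transition scores pairwise[y_prev, y]."""
    T, K = unary.shape
    score = np.zeros((T, K))
    back = np.zeros((T, K), dtype=int)
    score[0] = unary[0]
    for t in range(1, T):
        cand = score[t - 1][:, None] + pairwise + unary[t][None, :]  # K x K
        back[t] = cand.argmax(axis=0)   # best previous label for each current label
        score[t] = cand.max(axis=0)
    path = [int(score[-1].argmax())]
    for t in range(T - 1, 0, -1):       # backtrack from the last position
        path.append(int(back[t][path[-1]]))
    return path[::-1]

# toy example: 5 positions, 3 labels, random scores
rng = np.random.default_rng(0)
print(viterbi(rng.normal(size=(5, 3)), rng.normal(size=(3, 3))))
```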
Auto-context, fixed-point model, graphical models and summary of structured prediction
J. Lafferty, A. McCallum, and F. Pereira, 2001. "Conditional random fields: Probabilistic models for segmenting and labeling sequence data".
Week 4: 04/22/2025 (Tuesday)
Auto-context, fixed-point model
Z. Tu and X. Bai, 2010. "Auto-context and Its Application to High-level Vision Tasks and 3D Brain Image Segmentation".
Q. Li, J. Wang, D. Wipf, and Z. Tu, "Fixed-Point Model for Structured Labeling", 2013.
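A rough sketch of the auto-context / fixed-point idea in the two papers above: repeatedly retrain a classifier whose input is the original features plus the previous round's predicted label probabilities at neighboring positions. The 1-D toy labeling problem and the use of scikit-learn's LogisticRegression are assumptions made for illustration, not the papers' implementations:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

def context_features(probs):
    """Predicted class probabilities of the left and right neighbors (1-D context)."""
    left = np.vstack([probs[:1], probs[:-1]])
    right = np.vstack([probs[1:], probs[-1:]])
    return np.hstack([left, right])

def train_auto_context(X, y, n_rounds=3, n_classes=2):
    """Train a cascade of classifiers, each seeing the previous round's output."""
    models = []
    probs = np.full((len(X), n_classes), 1.0 / n_classes)  # uniform prior at round 0
    for _ in range(n_rounds):
        feats = np.hstack([X, context_features(probs)])
        clf = LogisticRegression(max_iter=1000).fit(feats, y)
        probs = clf.predict_proba(feats)
        models.append(clf)
    return models

# toy 1-D labeling problem: noisy observations of a piecewise-constant label signal
rng = np.random.default_rng(0)
y = (np.arange(200) % 40 < 20).astype(int)
X = (y + rng.normal(0, 0.8, size=200)).reshape(-1, 1)
models = train_auto_context(X, y)
```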
Week 5: 04/29/2025 (Tuesday)
Hidden Markov model
John Jumper, et al., "Highly accurate protein structure prediction with AlphaFold", Nature, 2021.
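For the hidden Markov model topic, a minimal forward-algorithm sketch; the 2-state transition and emission matrices below are invented for illustration:

```python
import numpy as np

def hmm_forward(pi, A, B, obs):
    """Forward algorithm: returns log P(obs) under an HMM with initial
    distribution pi, transitions A[i, j] = P(z_t=j | z_{t-1}=i), and
    emissions B[i, k] = P(x_t=k | z_t=i). Rescaled to avoid underflow."""
    alpha = pi * B[:, obs[0]]
    log_prob = np.log(alpha.sum())
    alpha /= alpha.sum()
    for o in obs[1:]:
        alpha = (alpha @ A) * B[:, o]
        log_prob += np.log(alpha.sum())
        alpha /= alpha.sum()
    return log_prob

# toy 2-state, 3-symbol HMM
pi = np.array([0.6, 0.4])
A = np.array([[0.7, 0.3],
              [0.2, 0.8]])
B = np.array([[0.5, 0.4, 0.1],
              [0.1, 0.3, 0.6]])
print(hmm_forward(pi, A, B, obs=[0, 1, 2, 2, 0]))
```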
05/01/2025 (Thursday)
Recurrent neural networks
The Unreasonable Effectiveness of Recurrent Neural Networks (Andrej Karpathy, blog post).
"Finding structure in time", Jeff Elman.
"Long Short-Term Memory", Sepp Hochreiter, Jürgen Schmidhuber.
Week 6: 05/06/2025 (Tuesday)
Recurrent neural networks
" A Critical Review of Recurrent Neural Networks for Sequence Learning ", Zachary C. Lipton, John Berkowitz, Charles Elkan.
J Mao, W Xu, Y Yang, J Wang, Z Huang, A Yuille, " Deep captioning with multimodal recurrent neural networks (m-rnn) ", ICLR 2015.
K Xu, J Ba, R Kiros, K Cho, A Courville, R Salakhutdinov, R Zemel, Y Bengio, " Show, Attend and Tell: Neural Image Caption Generation with Visual Attention ", ICML 2015.
I Sutskever, O Vinyals, QV Le, " Sequence to sequence learning with neural networks ", NeurIPS 2014.
van den Oord, S Dieleman, H Zen, K Simonyan, O Vinyals, A Graves, N Kalchbrenner, A Senior, K Kavukcuoglu, " WaveNet:a generative model for raw audio ", arxiv 2016
05/08/2025 (Thursday)
Attention based models
Transformers, Graph neural networks
K. Xu, J. Ba, R. Kiros, K. Cho, A. Courville, R. Salakhutdinov, R. Zemel, Y. Bengio, "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention", ICML 2015.
A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Ł. Kaiser, and I. Polosukhin, "Attention Is All You Need", NeurIPS 2017.
J. Devlin, M.-W. Chang, K. Lee, and K. Toutanova, "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding", NAACL-HLT 2019.
Nicolas Carion, Francisco Massa, Gabriel Synnaeve, Nicolas Usunier, Alexander Kirillov, Sergey Zagoruyko, "End-to-End Object Detection with Transformers", ECCV 2020.
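A compact sketch of the scaled dot-product attention at the core of "Attention Is All You Need" (single head, no masking, random toy tensors; an illustration, not the paper's code):

```python
import numpy as np

def softmax(z, axis=-1):
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def scaled_dot_product_attention(Q, K, V):
    """Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)      # (n_q, n_k) similarity scores
    weights = softmax(scores, axis=-1)   # each row is a distribution over keys
    return weights @ V, weights

rng = np.random.default_rng(0)
Q = rng.normal(size=(4, 16))   # 4 queries, d_k = 16
K = rng.normal(size=(6, 16))   # 6 keys
V = rng.normal(size=(6, 32))   # 6 values, d_v = 32
out, w = scaled_dot_product_attention(Q, K, V)
print(out.shape, w.sum(axis=1))  # (4, 32); each row of weights sums to 1
```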
Transformers
Sparse Representations
B. Olshausen and D. Field, 1996. "Emergence of Simple-Cell Receptive Field Properties by Learning a Sparse Code for Natural Images", Nature.
EJ Candès, T Tao, 2006. "Near-optimal signal recovery from random projections: Universal encoding strategies?", IEEE Trans. on Information Theory.
EJ Candès, X Li, Y Ma, J Wright, 2011. "Robust principal component analysis?", Journal of the ACM.
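The sparse-representation readings above revolve around L1-regularized reconstruction. A minimal iterative soft-thresholding (ISTA) sketch for min_z 0.5*||x - Dz||^2 + lam*||z||_1, with a random toy dictionary (my own illustration, not code from these papers):

```python
import numpy as np

def soft_threshold(v, t):
    """Proximal operator of the L1 norm: shrink each entry toward zero by t."""
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def ista(x, D, lam=0.1, n_iters=200):
    """Iterative soft-thresholding for min_z 0.5*||x - D z||^2 + lam*||z||_1."""
    L = np.linalg.norm(D, 2) ** 2          # Lipschitz constant of the gradient
    z = np.zeros(D.shape[1])
    for _ in range(n_iters):
        grad = D.T @ (D @ z - x)           # gradient of the smooth term
        z = soft_threshold(z - grad / L, lam / L)
    return z

# toy problem: a 5-sparse code under a random 64 x 256 dictionary
rng = np.random.default_rng(0)
D = rng.normal(size=(64, 256)) / np.sqrt(64)
z_true = np.zeros(256)
z_true[rng.choice(256, 5, replace=False)] = rng.normal(size=5)
x = D @ z_true
z_hat = ista(x, D, lam=0.05)
print("nonzeros in estimate:", np.sum(np.abs(z_hat) > 1e-3))
```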
Week 8: 05/20/2025 (Tuesday)
Weakly-supervised learning
T. G. Dietterich, R. H. Lathrop, T. Lozano-Perez. "Solving the multiple instance problem with axis-parallel rectangles". Artificial Intelligence 1997.
C Zhang, JC Platt, PA Viola, "Multiple instance boosting for object detection", NeurIPS 2006.
05/22/2025 (Thursday)
Semi-supervised learning
X. Zhu, "Semi-supervised learning literature survey", technical report, 2005.
M Belkin, P Niyogi, V Sindhwani, "Manifold regularization: A geometric framework for learning from labeled and unlabeled examples", JMLR 2006.
Kihyuk Sohn, et al., "FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence", NeurIPS, 2020.
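A schematic of the confidence-thresholded pseudo-labeling step behind FixMatch-style consistency training, written directly on arrays of predicted probabilities; the threshold and toy numbers are illustrative, and the real method couples this step with weak/strong augmentation inside a deep network:

```python
import numpy as np

def pseudo_label_loss(weak_probs, strong_probs, threshold=0.95):
    """Cross-entropy of strongly-augmented predictions against hard pseudo-labels
    taken from weakly-augmented predictions, kept only where the model is confident."""
    conf = weak_probs.max(axis=1)
    labels = weak_probs.argmax(axis=1)        # hard pseudo-labels
    mask = conf >= threshold                  # keep only confident examples
    if not mask.any():
        return 0.0
    picked = strong_probs[mask, labels[mask]]
    return float(-np.mean(np.log(picked + 1e-12)))

# toy predictions for 4 unlabeled examples (rows sum to 1)
weak = np.array([[0.97, 0.02, 0.01],
                 [0.40, 0.35, 0.25],
                 [0.02, 0.96, 0.02],
                 [0.60, 0.30, 0.10]])
strong = np.array([[0.90, 0.05, 0.05],
                   [0.30, 0.40, 0.30],
                   [0.10, 0.85, 0.05],
                   [0.50, 0.30, 0.20]])
print(pseudo_label_loss(weak, strong))  # only rows 0 and 2 pass the 0.95 threshold
```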
Week 9: 05/27/2025 (Tuesday)
Slides (self-supervised)
Self-supervised learning
Kaiming He, Haoqi Fan, Yuxin Wu, Saining Xie, Ross Girshick, "Momentum Contrast for Unsupervised Visual Representation Learning", CVPR 2020.
Jean-Bastien Grill, et al., "Bootstrap your own latent: A new approach to self-supervised Learning", arXiv:2006.07733, 2020.
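A small sketch of the InfoNCE-style contrastive loss used in MoCo-flavored self-supervised learning: each query embedding should match its own positive key against the other keys in the batch. The embeddings below are random stand-ins, not features from a trained encoder:

```python
import numpy as np

def info_nce(queries, keys, temperature=0.07):
    """InfoNCE loss: queries[i] and keys[i] are two views of the same image;
    all other keys in the batch act as negatives."""
    q = queries / np.linalg.norm(queries, axis=1, keepdims=True)
    k = keys / np.linalg.norm(keys, axis=1, keepdims=True)
    logits = q @ k.T / temperature                  # (N, N) similarity matrix
    logits -= logits.max(axis=1, keepdims=True)     # numerical stability
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return float(-np.mean(np.diag(log_probs)))      # positives sit on the diagonal

rng = np.random.default_rng(0)
z1 = rng.normal(size=(8, 128))              # "query" embeddings of 8 images
z2 = z1 + 0.1 * rng.normal(size=(8, 128))   # slightly perturbed "key" views
print(info_nce(z1, z2))
```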
05/29/2025 (Thursday)
Slides (semi-supervised)
Generative modeling
Zhuowen Tu, "Learning Generative Models via Discriminative Approaches", CVPR, 2007.
Ian Goodfellow et al., "Generative adversarial networks", NeurIPS, 2014.
Jascha Sohl-Dickstein, Eric Weiss, Niru Maheswaranathan, Surya Ganguli, "Deep Unsupervised Learning using Nonequilibrium Thermodynamics", ICML 2015.
Week 10: 06/03/2025 (Tuesday)
Generative modeling
DP Kingma, M Welling, "Auto-Encoding Variational Bayes", ICLR 2014.
Diffusion Models
Large language models
Jacob Devlin, Ming-Wei Chang, Kenton Lee, Kristina Toutanova, "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding", NAACL-HLT 2019.
Tom B. Brown et al., "Language Models are Few-Shot Learners", NeurIPS, 2020.
Long Ouyang et al., "Training language models to follow instructions with human feedback", 2022.
Alec Radford et al., "Learning Transferable Visual Models From Natural Language Supervision", 2021.
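For the diffusion-model topic above, a brief sketch of the DDPM-style forward noising step q(x_t | x_0) and the noise-prediction target a denoiser would be trained on; the linear beta schedule and toy shapes are assumptions made for illustration:

```python
import numpy as np

def make_alpha_bar(T=1000, beta_start=1e-4, beta_end=0.02):
    """Cumulative products of (1 - beta_t) for a linear noise schedule."""
    betas = np.linspace(beta_start, beta_end, T)
    return np.cumprod(1.0 - betas)

def q_sample(x0, t, alpha_bar, rng):
    """Sample x_t ~ q(x_t | x_0) = N(sqrt(alpha_bar_t) x_0, (1 - alpha_bar_t) I)."""
    eps = rng.normal(size=x0.shape)
    xt = np.sqrt(alpha_bar[t]) * x0 + np.sqrt(1.0 - alpha_bar[t]) * eps
    return xt, eps   # (noisy sample, the noise a trained denoiser should predict)

rng = np.random.default_rng(0)
alpha_bar = make_alpha_bar()
x0 = rng.normal(size=(4, 4))           # toy "image"
xt, eps = q_sample(x0, t=500, alpha_bar=alpha_bar, rng=rng)
print(xt.shape)
```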