Over the years, through my research and coursework, I have collected notes on research papers. The notes aim to be concise and easy to read and understand. Suggestions are welcome :)
Prompt Injection Attacks and Defenses for LLM-based Agentic Systems
Related works [Notes]
Vision-Language Modeling:
VQA: Visual Question Answering [Notes]
Visual Dialog: Datasets and Models [Notes]
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks [Notes]
Flamingo: a Visual Language Model for Few-Shot Learning [Notes]
Analyzing the Behavior of Visual Question Answering Models [Notes]
High-Resolution Image Synthesis with Latent Diffusion Models [Notes]
Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding [Notes]
PaLM-E: An Embodied Multimodal Language Model [Notes]
DreamFusion: Text-to-3D using 2D Diffusion [Notes]
Continual Learning:
Selective Replay Enhances Learning in Online Continual Analogical Reasoning [Notes]
Lifelong Machine Learning with Deep Streaming Linear Discriminant Analysis [Notes]
Replay in Deep Learning: Current Approaches and Missing Biological Elements [Notes]
REMIND Your Neural Network to Prevent Catastrophic Forgetting [Notes]
Self-Supervised Training Enhances Online Continual Learning [Notes]
FearNet: Brain-Inspired Model for Incremental Learning [Notes]
Natural Language Processing: