Course Information
Course Title: CIS 6930 Special Topics in Large Language Models
Semester: Fall 2025
Lectures: Monday Wednesday Friday 11:45am - 12:35pm at CSE E222
Office Hour: Friday 1:00pm - 2:00pm
Course Description
Large Language Models (LLMs) provide the foundational technology behind recent breakthroughs in Natural Language Processing, enabling systems to and perform a wide range of language-driven tasks. This course provides a comprehensive introduction to the fundamentals in Large Language Models, covering their tokenization, model architecture, training methods, inference, fine-tuning, post-training techniques. We will also introduce recent advances in Large Language Models, including retrieval-augmented LLMs, knowledge-enhanced LLMs, reasoning and planning with LLMs, LLMs agent, and multi-modality LLMs. In addition, we will discuss the current limitations of Large Language Models, such as harm, hallucinations, and safety alignment concerns, and will explore potential mitigation strategies for addressing these issues.
Grading
Quizzes (25%): Tentative quiz dates are Sep 5th, Sep 19th, Oct 3rd, Oct 24th, Nov 7th, Nov 21st 2025
Mid-term Exam (35%): Oct 10th, 2025 11:45am - 12:35pm at CSE E222
Final Projects (40%): Project Proposal (10%) due Oct 5th 2025, In-class Presentation (15%), Final Project (15%) due Dec 5th 2025
Course Schedule (tentative)
Week # Topics
Week 1 Lectures Introduction
LLMs capabilities and limitations
Week 2 N-gram Language Model
LLMs Tokenization Algorithms, Byte-Pair Encoding, WordPiece, Unigram Tokenizer
Week 3 Neural Architectures for Language Model, Feedforward Neural Language Model
Recurrent Neural Network for Language Model, Long Short-Term Memory Model
Week 4 Sequence-to-Sequence Language Model
Attention Mechanism, Self-Attention, Multi-head Attention
Transformer Language Model
Week 5 Encoder Language Model, Encoder-Decoder Language Model, Decoder-only Language Model
Pre-Training of Large Language Models, Data Curation for LLMs
Week 6 Large Language Model Scaling Law and Emergent Behavior
Inference of Large Language Models, LLMs Decoding Strategies, Controllable Text Generation
Week 7 Fine-Tuning of Large Language Models
Parameter-Efficient Fine-Tuning of Large Language Models
Week 8 Post-Training of Large Language Models, In-context Learning, Instruction Tuning, Alignment
Reinforcement Learning in Large Language Models, Proximal Policy Optimization (PPO),
Direct Preference Optimization (DPO), Group Relative Policy Optimization (GRPO)
Week 9 Retrieval-augmented LLMs, Retrieval Augmentation Generation
Knowledge-enhanced LLMs, Knowledge Boundary, Knowledge Steering, Knowledge Editing etc.
Week 10 Reasoning and Planning in Large Language Models
LLMs Agent and Multi-Agent systems
Week 11 LLMs Hallucination Detection and Mitigation
LLMs Harm, Debias, Detoxification
Week 12 Large Multi-Modality Models, Vision-Language Foundation Models
Week 13 Students Projects Presentation
Week 14 Students Projects Presentation