🍁 2025 Fall
⏰ Tuesday/Thursday, 4:00-5:50 p.m.
📍 SAL 101
Complete videos for this class will be available on Brightspace for enrolled students.
Contact: Students should ask all course-related questions in the Ed forum, where you will also find announcements. You will find the course Ed on the course Brightspace page. For external enquiries, emergencies, or personal matters that you don't wish to put in a private Ed post, you can email us at csci544-25f@googlegroups.com. Please send all emails to this mailing list -- do not email the instructors directly. We will try to respond within 48-72 hours.
Instructor: Jieyu Zhao
Office Hour: 30mins after the class
TA: Rebecca Dorn
Office Hour: Thursdays 9:30-10:30 AM, Zoom Link Here
TA: Thomas Reeves
Office Hour:
TA: Sahana Ramnath
Office Hour: Mondays 11AM-12 noon, Zoom link here
TA: Pengda Xiang
Office Hour:
TA: Muzi Tao
Office Hour: Tuesday 3:00-4:00PM, Zoom Link Here
TA: Ziyi Liu
Office Hour: Friday 2:00-3:00PM, Zoom Link Here
Check Brightspace.
[08/13] I don't know when the grader positions for this class will be available.
This course covers both fundamental and cutting-edge topics in Natural Language Processing (NLP) with a focus on Language Models. Natural language processing (NLP) has been revolutionized by the advancement of large-scale language models, achieving state-of-the-art performance across a wide variety of tasks. This course will cover the fundamentals of language modeling and related topics in natural language processing, deep learning, and machine learning. Students will gain familiarity with the capabilities of large language models as well as get hands-on experience with building and evaluating small-scale language models. The class will also explore the real-world consequences of deploying language models, such as the ethics and harms associated with them.
[Tentatively for now]
Calendar and prespecified syllabus are subject to change. More details, e.g., reading materials and additional resources, will be added as the semester continues. All work (except the project final report) is due on the specified date by 11:59 pm PT.
There will be three components to course grades:
Homeworks (20%).
5% X 4: There will be four coding homework assignments based on the topics of the class.
Quizzes (10%).
2% X 5: Multiple-Choice Questions and Short Answers. Missed quizzes will receive a zero grade, and there will be no make-up quizzes.
Class Projects (45%).
Each student will do a group class project based on the topics covered in the class. Students will propose their own project, do the research and build a proof-of-concept, create a video demonstration of the proof-of-concept, and present the project in their report.
Proposal: 5%
Status Reports: 10%
Project Presentation: 10%
Final Write-up: 20%
Exams (25%)
Midterm (10%): The midterm exam will contain a mixture of multiple-choice and long-form questions, covering about the first half of the material covered in the class.
Final (15%): The final exam at the end of the semester, covering all of the material covered in the class, will contain a mixture of multiple-choice and long-form questions.
Grading inquiries and questions about the grading of the homework and the quizzes can be asked (to the TAs) within two weeks from the grading date (the date the grades are released). Grades will be available within 2-2.5 weeks after submission.
All written assignments related to the final project should use the standard *ACL paper submission template.
Students are allowed a maximum of 6 late days total for all assignments (but NOT the quiz sheets). You may use up to 3 late days per assignment. Using one late day for a project assignment involves each of the teammates using a late day each. Partial late days are not permitted. For every extra late day beyond the allowed late days, the student / team will lose 20% of the grade for the assignment.
Note: Please familiarize yourself with the academic policies and read the note about student well-being.
The following texts are useful, but none are required. All of them can be read free online.
Dan Jurafsky and James H. Martin. Speech and Language Processing (2024 pre-release)
Jacob Eisenstein. Natural Language Processing
Yoav Goldberg. A Primer on Neural Network Models for Natural Language Processing
Ian Goodfellow, Yoshua Bengio, and Aaron Courville. Deep Learning
Delip Rao and Brian McMahan. Natural Language Processing with PyTorch (requires Stanford login).
Lewis Tunstall, Leandro von Werra, and Thomas Wolf. Natural Language Processing with Transformers
If you have no background in neural networks but would like to take the course anyway, you might well find one of these books helpful to give you more background:
Michael A. Nielsen. Neural Networks and Deep Learning
Eugene Charniak. Introduction to Deep Learning