Fall 2020 / CSCE 771: Computer Processing of Natural Language
Quick Info - When and Where
Monday/Wednesday 3:55 pm – 5:10 pm
Virtual meeting link: BlackBoard Ultra (note the change)
Room: (Optional)
Instructor Information
Instructor: Biplav Srivastava
E-mail: biplav.s AT sc.edu
Office Hours: 1100-1200 MWF
Attendance Policy
Students are expected to attend lectures by listening to videos (asynchronous) or joining live classes using Blackboard Ultra (synchronous) or being present in class (if instructor allows and student elects to attend). They are expected to participate in quizzes, do their project and complete paper reading.
Suggested Reading
Details
Full details are available in syllabus here. Slides are below and code is here.
Bulletin Description
This will be an advanced course on computer-based processing of communication between people (languages) with a focus on text (NLP), and appreciation of multi-modal communication involving speech and images. The course will have lessons on parsing of input, representation of syntax and semantics, and analysis to derive insights. It will explain learning and reasoning based methods for extracting entities, disambiguating and linking them, and applying them for real-world problems. We will also discuss issues in building advanced systems for conversation (“chatbots”) and machine translation, and ethical concerns while testing and fielding them with people.
Prerequisites
Experience with a first course on Artificial Intelligence is desirable, example: CSCE 580. Students without this course may still enroll knowing that necessary AI background can be found in the standard AI book, “AI – A Modern Approach”; http://aima.cs.berkeley.edu/. Necessary material is in chapters 3,4,5 and 6.
Learning outcomes
L1: Appreciate diversity and similarity in natural languages – text, speech and visual; focus of course will, however, be text (NLP) and English
L2: Understand issues related to data and tools. Experiment design, Metrics for evaluation and to detect bias, Methods to build trust in processing – transparent assessment, Providing explanations for output
L3: Data processing: (a) Structured data representation from unstructured text; (b) Extract entities and relationships; (c) Extract contexts; (d) representation learning – word embedding
L4: AI methods in NLP: (a) Learning methods – including language models, (b) Reasoning, (c) Representation – knowledge graphs/ ontology
L5: NLP applications – (a) Document intelligence: sentiment, translation; (b) collaborative assistants
Grading Schema
Project: 50% for project report and code, plus 10% for elevator presentation to class
Analysis project OR
Dataset must be one from instructor given list
Use NLP methods to present new insights
Example: Analyze text of mask regulations in a US state
New method (Research) project
Problem to be discussed with instructor.
Example: Translate from a mixed language to standard English
Quiz: 20%
At least 4 based on preceding lectures
Exam: Paper reading and presentation to class: 20%
Read a paper accepted at a top NLP / AI conference: ACL 2019-2020, AAAI 2019-2020, IJCAI 2019-2020, NeurIPS 2019-2020
Present a 1-slide summary to class