Introduction to Data Management
Description: This course introduces the basics of modern data management. You will learn about schema design, data modeling, query languages, building database applications, transactions, and many other topics. Through lectures, sections, and assignments, you will learn how real-world data management systems work and how they touch many aspects of our daily lives.
Questions and Comments: If you have any questions or comments about lectures, sections, assignments, exams, or anything else, post on Piazza. This is the fastest way to get in contact with the 344 staff and your peers. You can post privately to just the staff, or anonymously to the whole class. Please note that when you sign up, Piazza offers an option (enabled by default) to share your contact information with companies for recruiting purposes. Opt out of this option if you wish. If you need to talk about anything personal or offline, see us during office hours or email the instructor (jleang @ cs) to set up a meeting. And we love feedback! If you have any concerns, comments, or suggestions, let us know through this anonymous survey.
For additional details about the class, please refer to the syllabus.
CSE 344 Staff
cse344-staff @ cs
Jonathan Leang (instructor)
jleang @ cs
OH: M, 10:30-11:30, CSE 216
W, 15:00-16:00, CSE 216
Shana Hutchison
shutchis @ cs
OH: Th, 10:30-11:30, Gates 150
Yisu "Remy" Wang
remywang @ cs
(grader)
Jack Khuu
jackkhuu @ cs
OH: Th, 17:00-18:00, Gates 121
Brendan Murphy
bsmurphy @ cs
OH: M, 12:00-13:00, 4th fl. breakout
Peter Li
pzli97 @ cs
OH: W, 12:30-13:30, CSE 021
Andrew Guterman
anguterm @ cs
OH: Tu, 12:00-13:15, Gates 121
Khang Phan
kphan000 @ cs
OH: M, 14:00-15:00, Gates 153
Amy Xu
amyxu @ cs
OH: F, 13:30-14:30, Gates 151
Allison Chou
aachou @ cs
(grader)
Calendar
(Lecture) MWF, 9:30-10:20, CSE2 G10 (cse344a_sp19 @ uw)
(Section AA w/ Shana and Khang) Th, 8:30-9:20, MGH 242 (cse344aa_sp19 @ uw)
(Section AB w/ Shana and Andrew) Th, 9:30-10:20, MGH 242 (cse344ab_sp19 @ uw)
(Section AC w/ Amy and Brendan) Th, 12:30-1:20, MGH 242 (cse344ac_sp19 @ uw)
(Section AD w/ Jack and Peter) Th, 1:30-2:20, MGH 234 (cse344ad_sp19 @ uw)
Lecture Resources:
- April 1 - Introduction
- April 3 - Joins
- April 5 - Aggregates
- April 8 - Subqueries
- SQL and RA query reference
- April 10 - ER Diagrams <annotated>
- April 12 - Review <annotated>
- April 15 - Design Theory <annotated>
- April 17 - Management <annotated>
- April 19 - Cost Estimation <annotated>
- April 22 - Index Tuning <annotated>
- April 24 - Serializability <annotated>
- April 26 - Locks <annotated>
- April 29 - Isolation Levels <see panopto for annotations>
- April 29 - Review (Solutions)
- May 1 - Parallel Databases <annotated>
- May 3 - Spark
- May 8 - Datalog <annotated>
- May 10 - NoSQL <annotated>
- May 13 - Wide-Column Store <annotated>
- May 15 - NoSQL Internals <annotated>
- May 17 - BigTable <annotated> <demo code>
- May 20 - Semi-Structured Data <annotated>
- May 22 - SQL++ <annotated>
- May 24 - More SQL++ <annotated>
- May 27 - Memorial Day <annotated>
- May 29 - Stream Fundamentals <annotated>
- May 31 - Stream Timing <annotated>
- June 3 - Stream Infrastructure
- June 5 - Stream Flink-Kafka API <demo code>
- June 7 - Victory Lap
Section Resources:
- April 4 - Slides, Optional Worksheet, Optional Worksheet Solution
- April 11 - Slides, Worksheet, Worksheet Solutions
- April 18 - Schema Design Notes, Slides, Worksheet, Worksheet Solutions
- April 25 - Slides, Worksheet, Worksheet Solutions, Cost Estimation Guide
- May 2 - Slides, Worksheet, Worksheet Solutions
- May 9 - Slides, Worksheet, Worksheet Solutions
- May 16 - Slides
- May 23 - Slides, Worksheet, Worksheet Solutions
Assignments
Homework
- Pre-344 Survey
- Security and Privacy Code of Ethics (Due 4/7, 11:59 PM)
- HW1, starter code (Due 4/16, 11:59 PM)
- Additional Administrivia
- HW2, starter files (Due 4/26 11:59 PM)
- HW3, starter code (Due 5/14 11:59 PM)
- HW4, starter code (Due 5/31 11:59 PM)
- Extra Credit HW5, starter code (Due 6/12 8:30 AM, no late days)
Labs
- Lab 1, starter code (Due 4/30, 11:59 PM)
- Lab 2, starter code (Due 6/4 11:59 PM)
Exams
There will be a midterm and a final for this course. You may have handwritten notes for both exams. For the midterm you will be allowed 1 sheet of notes (front and back). For the final you will be allowed 2 sheets of notes (front and back). The midterm will be held on May 6th (Monday) during lecture time at 9:30 AM. The final is scheduled for June 12th (Wednesday) at 8:30 AM.
Midterm exam resources
Final exam resources
Other Resources
Textbooks
- Database Systems: The Complete Book (2nd Edition) <errata>
- Fundamentals of Database Systems (7th Edition)
Related Groups
- UW Database Group
- UW eScience Institute
- SIGMOD (Special Interest Group on Management of Data)
- PODS (Symposium on Principles of Database Systems)
- CIDR (Conference on Innovative Data Systems Research)
- VLDB (Very Large Data Base Endowment Inc.)