Home

eThis is a list of the topics that we will tentatively be discussing each day of the semester. I will add links from each one to the slides that I use during the lecture. Those should be present the night before the lecture at the latest. Also listed are the readings/videos for each day and what is due on those days.

DateTopicVideosDue
8-24Course Introduction
 
8-29Scala and Collection Refresher plus sbt and plottingFull Playlist 
8-31Parallel Collections
 
9-5Spark, Distributed Programming, and RDDs
Data Set Report #1
9-7Problem Solving (Problems)
Quiz #1 (Ch. 1-3)
9-12Special RDDs

9-14Problem Solving (Problems)

9-19SQL Refresher
Quiz #2 (Ch 4-5)
9-21Problem Solving (Problems)

9-26Spark SQL Basics and DataFrames

9-28Problem Solving (Problems)

10-3Datasets and UDFs

10-5Problem Solving (Problems)

10-10MLLib Introduction/Regression Analysis
Quiz #3 (Ch. 6-7)
10-12Problem Solving (Problems)

10-17DPS (No class)

10-19Work Day

10-24Classification

10-26Problem Solving (Problems)

10-31Collaborative Filtering
Quiz #4 (Ch. 8-9)
11-2Problem Solving (Problems)

11-7GraphX

11-9Problem Solving (Problems)
Final Project Ideas
11-14Clustering
Quiz #5 (Ch. 10-11)
11-16Problem Solving (Problems)

11-21Project Work Day

11-23Thanksgiving (No Class)

11-28Deep Learning

11-30Problem Solving (Problems)
Quiz #6 (Ch. 12-13)
12-5Conclusions
Data Set Report #2
12-14, noonFinal Project Presentations