SD201: Mining of Massive Datasets, Fall 2018

Lectures

Lecture 1a: Introduction to Data Mining and Big Data

Labs

Lab on 26/09/18. Exercises are not going to be evaluated, no report or solution is asked Ex1: Python and PageRank Ex2: Clustering
Lab 24/10/2018 on Spark (not evaluated)


Material

Tutorial on Python
Exercises

Announcements
26/09 the rooms booked for all remaining TP sessions have been specified on synapses
03/10 Solutions for the lab on PageRank and clustering have been posted
05/10 Some exercises have been posted (check the section "Material"). No more exercise will be posted. 
19/10 Fixed typo on slides Lec6a (evaluation of a classifier, leave-one-out)
22/10 All the material for the lab session on 24/10 has been posted. The lab will not be evaluated
28/10 No notes/documents are allowed during the final exam. Students can bring their own calculators (only some simple calculations will be required).