This course is an introduction to data analysis and prediction techniques and tools.
List of topics
Introduction to Statistical Learning
Linear Regression, Logistic Regression, Linear Discriminant Analysis
Cross-validation, Bootstrap
Model selection & Regularization (Ridge and Lasso)
Dimensionality reduction methods
Non-linear Models (Polynomial regression, Splines, Generalized additive models)
Tree-based methods (Trees, Bagging, Random Forests, Boosting)
Support Vector Machines
Neural Networks
Unsupervised Learning (Clustering Methods)
Textbook
An Introduction to Statistical Learning: With Applications in R, Gareth James, Daniela Witten, Trevor Hastie, and Robert Tibshirani
The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Trevor Hastie and Robert TibshiraniÂ
Prerequisites
PH1700-Intermediate Biostatistics and good practical R programming skills